Details
Join the data revolution. Companies are searching for data scientists. This specialized field demands multiple skills not easy to obtain through conventional curricula. Introduce yourself to the basics of data science and leave armed with practical experience
extracting value from big data. #uwdatasci
Commerce and research are being transformed by data-driven discovery and prediction. Skills required for data analytics at massive levels – scalable data management on and off the cloud, parallel algorithms, statistical modeling, and proficiency with a complex
ecosystem of tools and platforms – span a variety of disciplines and are not easy to obtain through conventional curricula. Tour the basic techniques of data science, including both SQL and NoSQL solutions for massive data management (e.g., MapReduce and contemporaries),
algorithms for data mining (e.g., clustering and association rule mining), and basic statistical modeling (e.g., linear and non-linear regression).
DMCC is pleased to offer our intermediate Hadoop class:
Overview to Big Data, Hadoop Architecture, MapReduce Framework, A typical Hadoop Cluster, Data Loading into HDFS, Hadoop Cluster Administrator: Roles and Responsibilities Hadoop server roles and their usage, Rack Awareness, Anatomy of Write and Read, Replication
Pipeline, Data Processing, Hadoop Installation and Initial Configuration, Deploying Hadoop in pseudo-distributed mode, deploying a multi-node Hadoop cluster, Installing Hadoop Clients Planning the Hadoop Cluster, Cluster Size, Hardware and Software considerations,
Managing and Scheduling Jobs, types of schedulers in Hadoop, Configuring the schedulers and run MapReduce jobs, Cluster Monitoring and Troubleshooting. Configure Rack awareness, Setting up Hadoop Backup, whitelist and blacklist data nodes in a cluster, setup
quota's, upgrade Hadoop cluster, copy data across clusters using distcp, Diagnostics and Recovery, Cluster Maintenance. Configuring Secondary NameNode, Hadoop 2.0, YARN framework, MRv2, Hadoop 2.0 Cluster setup, Deploying Hadoop 2.0 in pseudo-distributed mode,
deploying a multi-node Hadoop 2.0 cluster.
DMCC is pleased to offer our Big Data Advance Hadoop class:
Configuring HDFS Federation, Basics of Hadoop Platform Security, Securing the Platform, Configuring Kerberos. Oozie, Hcatalog/Hive Administration, HBase Architecture, HBase setup, HBase and Hive Integration, HBase performance optimization. Understanding the
Problem, Plan, Design, and Create a Hadoop Cluster for a Real World Use Case, Setup and Configure commonly used Hadoop ecosystem components such as Pig and Hive, Configure Ganglia on the Hadoop cluster and trouble shoot the common Cluster Problems
Don Mills Career College for Health, Business and Technology is committed to presenting a challenging curriculum, implementing to the best practices for our students in a nurturing setting. We take pride on delivering a balanced program where students can achieve great level of success in their education and career field. Don Mills Career College for Health, Business and Technology promotes active learning through experienced faculties. Our aim is to cultivate the learning skills of students and to help them to be effective and independent learners. We provide individual assistance that who seeks improvement of their learning skills by monitoring their academic progress. Don Mills Career College for Health, Business and Technology provides a diversity of assistance for academic program learning such as individual, small group, group study and training sessions. We also support and assist students by providing student services. As part of our service we offer Career Assistance, Practicum Assistance, Home Work Assistance, Special Communication Classes, Computer Facility, and Workshops or Seminars to our students. The students also have an opportunity to be a member of the college and to be part of the Don Mills Career College Association. ...