The ability to collect and store huge amounts of diverse data necessitates the development and use of new techniques and methodologies for processing and analyzing big data. This course provides comprehensive coverage of a number of technologies at the foundation of the Big Data movement, with a special focus on the Hadoop architecture and its ecosystem of tools. Students who complete this course will understand the architecture of Hadoop clusters at both the hardware and system software levels. Students will learn to apply Hadoop and related Big Data technologies in developing analytics and solving the types of problems faced by enterprises today. The course strongly emphasizes the implementation of big data routines in Java and Python.