Introduction to Big Data Hadoop. Understanding HDFS & MapReduce
Introducing Big Data and Hadoop: what Big Data is and where Hadoop fits in; the two core components of the Hadoop ecosystem, namely MapReduce and HDFS; an in-depth look at the Hadoop Distributed File System (replication, block size, Secondary NameNode, High Availability); an in-depth look at YARN (ResourceManager, NodeManager).
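
To make the block size and replication topics concrete, here is a minimal Python sketch of how they determine a file's footprint in HDFS. The function name `hdfs_footprint` is hypothetical and used only for illustration. The defaults of 128 MB blocks and a replication factor of 3 are the usual Hadoop 2+ defaults, but both are configurable per cluster, so treat them as assumptions.

```python
import math

def hdfs_footprint(file_size_mb, block_size_mb=128, replication=3):
    """Estimate how HDFS would split and replicate a file (illustrative only)."""
    # A file is split into fixed-size blocks; the last block may be partly filled.
    blocks = math.ceil(file_size_mb / block_size_mb)
    # Every block is stored `replication` times on different DataNodes.
    block_replicas = blocks * replication
    # A partly filled last block only occupies its actual size on disk,
    # so raw storage is simply the file size times the replication factor.
    raw_storage_mb = file_size_mb * replication
    return blocks, block_replicas, raw_storage_mb

# A 1 GB file with the assumed defaults.
print(hdfs_footprint(1024))  # (8, 24, 3072)
```

The NameNode tracks metadata per block, which is one reason many small files are more expensive for it than a few large ones.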

Deep Dive in MapReduce
Learning how MapReduce works, understanding the map and reduce stages in MR, and the terminology used in MR such as Input Format, Output Format, Partitioners, Combiners, Shuffle and Sort.
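
The map, shuffle and sort, and reduce stages can be illustrated with a small word count written in plain Python. This is a single-process sketch of the data flow, not the Hadoop API: the function names `map_phase`, `shuffle_and_sort`, `reduce_phase`, and `word_count` are hypothetical, and a real job would run the mappers and reducers in parallel across the cluster.

```python
from collections import defaultdict

def map_phase(line):
    # Map: emit one (word, 1) pair per word in the input line.
    return [(word.lower(), 1) for word in line.split()]

def shuffle_and_sort(pairs):
    # Shuffle: group all values that share a key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    # Sort: hand the groups to the reducer in key order.
    return sorted(groups.items())

def reduce_phase(key, values):
    # Reduce: collapse the values for one key into a single result.
    return key, sum(values)

def word_count(lines):
    mapped = [pair for line in lines for pair in map_phase(line)]
    return [reduce_phase(key, values) for key, values in shuffle_and_sort(mapped)]

result = word_count(["big data hadoop", "hadoop hdfs", "big hadoop"])
print(result)  # [('big', 2), ('data', 1), ('hadoop', 3), ('hdfs', 1)]
```

In a real job, a combiner would apply the same summation on each mapper's local output before the shuffle, and a partitioner would decide which reducer receives each key.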

Introduction to Hive
Introducing Hadoop Hive, the detailed architecture of Hive, comparing Hive with Pig and RDBMS, working with Hive Query Language, creating databases and tables, Group By and other clauses, the different types of Hive tables, HCatalog, storing Hive results, Hive partitioning and buckets.

Advanced Hive & Impala
Indexing in Hive, the map-side join in Hive, working with complex data types, Hive user-defined functions, introduction to Impala, comparing Hive with Impala, the detailed architecture of Impala.

Introduction to Pig
Introduction to Apache Pig, its various features, the data types and schema in Pig, the functions available in Pig, Pig bags, tuples and fields.

Flume, Sqoop & HBase
Introduction to Apache Sqoop, overview, importing and exporting data, performance improvement with Sqoop, Sqoop limitations, introduction to Flume and understanding the architecture of Flume, what HBase is, and the CAP theorem.

Writing Spark Applications using Scala
Using Scala to write Apache Spark applications, a detailed study of Scala, the need for Scala, the concept of object-oriented programming, executing Scala code, class concepts in Scala such as getters, setters, constructors, abstract classes, extending objects and overriding methods, Java and Scala interoperability, the concept of functional programming and anonymous functions, the Bobsrockets package, comparing mutable and immutable collections.

Spark framework
Apache Spark in detail, its various features, comparison with Hadoop, the various Spark components, combining HDFS with Spark, Scalding, introduction to Scala, the importance of Scala and RDD.

Hadoop Administration – Multi Node Cluster Setup using Amazon EC2
Creating a four-node Hadoop cluster setup, running MapReduce jobs on the Hadoop cluster, successfully running the MapReduce code, working with the Cloudera Manager setup.

Hadoop Administration – Cluster Configuration
An overview of Hadoop configuration, the importance of the Hadoop configuration file, the various configuration parameters and values, the HDFS parameters and MapReduce parameters, setting up the Hadoop environment, the Include and Exclude configuration files, the administration and maintenance of the NameNode and DataNode directory structures and files, what a file system image is, understanding the edit log.

