Exist Management LLC (ExistBI) design the advanced big data training course exercises to give a high level overview of the capabilities for the Big Data Platform and Ecosystem. No programming experience is required for this training, but a background in programming can help with faster knowledge ingestion.

Prerequisites:

  • None.

Duration:

  • 4 days.

Agenda

Module 1: Introduction to Big Data (Lectures only)

  • Hadoop overview and Ecosystem
  • Delivering business benefit from Big Data
  • Storing & analyzing data in Big Data environment (HDFS, HBase, Hive, Impala, Solr)
  • Important building blocks for Big Data platform (Flume, Kafka, Pig, Hive, Hbase, Impala, Solr)

Module 2: Apache Hadoop architecture (Lectures + Hands-on)

  • Apache Hadoop, YARN and HDFS in more detail
  • Proper cluster configuration and deployment
  • Management and monitoring tools
  • Best practices for maintaining Apache Hadoop in Production
  • Installing and managing other Apache Hadoop projects

Module 3: Ingestion, and processing data with Hadoop tools (Lectures + Hands-on)

  • Deep dive into different Hadoop components: Sqoop, Flume, HDFS, HBase, Hive, Impala, Solr
  • Use Sqoop and Flume to ingest data
  • Relational Data Analysis within Big Data platform
  • Modeling structured data as tables in Impala and Hive

Module 4: Big Data analytics (Lectures + Hands-on)

  • Defining Big Data analytics
  • Difference between batch processing and real-time data processing
  • Handling streaming data
  • Data Visualization
  • Big Data Cases – Industry specific (Telco, Banking, Retail …)
  • Big Data Exploration
  • Enhanced 360o View of the Customer Security Intelligence Extension
  • Operations Analysis
  • Data Warehouse Modernization and many more
  • Defining Big Data Strategy
Print Friendly, PDF & Email