Exist Management LLC (ExistBI) are a leading Big Data and Business Intelligence training Company with offices in the US, Canada, UK and European Union. We have designed a fantastic Advanced Big Data training with hands-on advanced big data exercises to give you a strong understanding and high-level overview of the capabilities for the Big Data Platform / Ecosystem. No programming experience is required for this training, but a background in programming can help you on-board the information a little quicker.
Prerequisites:
- None.
Duration:
- 4 days.
Agenda
Module 1: Introduction to Big Data (Lectures only)
- Hadoop overview and Ecosystem
- Delivering business benefit from Big Data
- Storing & analyzing data in Big Data environment (HDFS, HBase, Hive, Impala, Solr)
- Important building blocks for Big Data platform (Flume, Kafka, Pig, Hive, Hbase, Impala, Solr)
Module 2: Apache Hadoop architecture (Lectures + Hands-on)
- Apache Hadoop, YARN and HDFS in more detail
- Proper cluster configuration and deployment
- Management and monitoring tools
- Best practices for maintaining Apache Hadoop in Production
- Installing and managing other Apache Hadoop projects
Module 3: Ingestion, and processing data with Hadoop tools (Lectures + Hands-on)
- Deep dive into different Hadoop components: Sqoop, Flume, HDFS, HBase, Hive, Impala, Solr
- Use Sqoop and Flume to ingest data
- Relational Data Analysis within Big Data platform
- Modeling structured data as tables in Impala and Hive
Module 4: Big Data analytics (Lectures + Hands-on)
- Defining Big Data analytics
- Difference between batch processing and real-time data processing
- Handling streaming data
- Data Visualization
- Big Data Cases – Industry specific (Telco, Banking, Retail …)
- Big Data Exploration
- Enhanced 360o View of the Customer Security Intelligence Extension
- Operations Analysis
- Data Warehouse Modernization and many more
- Defining Big Data Strategy