Course Description
This 3-day course gives you a better understanding of Big Data topics, with a focus on Hadoop. It covers the ins and outs of Big Data to clarify whether it is a buzzword, a catch-phrase, or something useful in your daily business. It also describes when a system is considered a Big Data system, what the architecture of a typical Big Data system looks like, what the ecosystem is, and who the key players in the Big Data space are. Is Big Data defined by data volume or by the underlying technology? Does it replace existing technologies, or does it enhance them by working alongside them? The course also covers how to implement Hadoop jobs to extract business value from large and varied data sets, and how to develop queries that simplify data analysis (with Pig, Hive, Cassandra and Impala).
Course Modules
- Introduction to Big Data (definition through 3V, 4V, 6V…)
- Hadoop overview and Ecosystem
- Delivering business benefit from Big Data
- Integrating Big Data with traditional data
- Storing & analyzing data in a Big Data environment
- Overview of Big Data stores and Data models: key value, graph, document, column-family
- Deep dive into storage components: HDFS, HBase, Hive, Cassandra and Impala
- Use cases of different storage components
- Comparing selected Big Data storage components to Traditional Databases
- Relational Data Analysis within a Big Data platform
- Limitations and Future Directions for storage components
Testimonials
“The trainer's clear and obvious enthusiasm for number crunching, analytics, and teaching others is infectious. He doesn’t waste time, shows exactly what you need to know and is genuinely hilarious.
Every one of my employees had tons of positive stuff to say.”
- Benjamin G, MXSG Analysis and Integration Chief, US Air Force
“Tomi was great. He was very detailed and made sure I fully understood the section before moving on. There are so many functions, so the Informatica materials and lab books are super helpful.”
- Leroy Smith, ETL Developer, US Army
“The SAP BOE310 Business Intelligence Platform: Administration and Security training was well presented and the trainer was very patient with us. Now signed up to the BOE320 Administering Servers training.”
- Yocelyn Robles, Platform Engineer, Franchise Tax Board