Course Description
ExistBI provides this 3-day Data Engineering Integration course to assist Developers learn to accelerate Data Engineering Integration through mass ingestion, incremental loads, transformations, processing of complex files, creating dynamic mappings, and integrating data science using Python. Optimize the Data Engineering system performance through monitoring, troubleshooting, and best practices while gaining an understanding of how to reuse application logic for Data Engineering use cases.
Course Outcomes
At the end of the course, learners will be able to:
- Mass ingest data to Hive and HDFS
- Perform incremental loads in Mass Ingestion
- Perform initial and incremental loads
- Integrate with relational databases using SQOOP
- Perform transformations across various engines
- Execute a mapping using JDBC in Spark mode
- Perform stateful computing and windowing
- Process complex files
- Parse hierarchical data on Spark engine
- Run profiles and choose sampling options on Spark engine
- Execute Dynamic Mappings
- Create Audits on Mappings
- Monitor logs using REST Operations Hub
- Monitor logs using Log Aggregation and troubleshoot
- Run mappings in Databricks environment
- Create mappings to access Delta Lake tables
- Tune performances of Spark and Databricks jobs
Course Summary
| Next Public Course Dates | |
| Prerequisites | |
| Duration |
|
| Available Formats |
|
| Audience |
|
Course Modules
Informatica Data Engineering Management Overview
- Data Engineering concepts
- Data Engineering Management features
- Benefits of Data Engineering Management
- Data Engineering Management architecture
- Data Engineering Management developer tasks
- Data Engineering Integration 10.4 new features
Testimonials
Thoroughly enjoyed the training. The trainer was fantastic! It is rare but always an awesome experience when a trainer is also an experienced practioner with a breadth of knowledge and hands on experience… even well beyond the subject matter at hand. I had the feeling that the trainer could have answered in detail any question we might have had related to not only BDM but Hadoop and other relevant big data topics as well. Time well spent and I hope to encounter Tomi again.
- Rick Kirk, CTO, Alliant Energy / Ernst & Young
“The Informatica Cloud Data Integration for Developers training class was excellent, the teacher was very knowledgeable and the materials were very useful”
- James Martin, Data Engineer, RoyOMartin
“Tomi was great. He was very detailed and made sure I fully understood the section before moving on. There are so many functions, so the Informatica materials and lab books are super helpful.”
- Leroy Smith, ETL Developer, US Army

























