Course Description

This unique 4-day IBM InfoSphere Advanced DataStage bootcamp introduces advanced job development techniques in DataStage. It is for experienced developers who want training in more advanced techniques and an understanding of the parallel framework architecture. The course combines InfoSphere Advanced DataStage – Parallel Framework (3 days) and InfoSphere DataStage – Advanced Data Processing (2 days). Materials and an environment for hands-on labs are provided.

Course Outcomes

At the end of the course, learners will be able to:

  • Describe the parallel processing architecture and development and runtime environments
  • Describe the compile process and the runtime job execution process
  • Describe how partitioning and collecting work in the parallel framework
  • Describe sorting and buffering in the parallel framework and optimization techniques
  • Describe and work with parallel framework data types
  • Create reusable job components
  • Use loop processing in a Transformer stage
  • Process groups in a Transformer stage
  • Extend the functionality of DataStage by building custom stages and creating new Transformer functions
  • Use Connector stages to read from and write to relational tables and handle errors in Connector stages
  • Process XML data in DataStage jobs using the XML stage
  • Design a job that processes a star schema database with Type 1 and Type 2 slowly changing dimensions
  • List job and stage best practices

Course Summary

Next Public Course Dates

UK & Europe:

  • 18 - 21 November 2025
  • 2 - 5 December 2025
  • 16 - 19 December 2025

US & Canada:

  • 9 - 12 December 2025
  • 13 - 16 January 2026
  • 27 - 30 January 2026

Prerequisites
  • You should complete our IBM InfoSphere DataStage Essentials training course and have at least one year of experience developing parallel jobs using DataStage.
Duration
  • 4 Days
Available Formats
  • Public Virtual Live Instructor-Led
  • Private Virtual Live Instructor-Led
  • Private Onsite
Audience
  • Experienced Developers

Course Modules

  • Module 1: Introduction to the Parallel Framework Architecture
  • Module 2: Compilation and Execution
  • Module 3: Partitioning and Collecting Data
  • Module 4: Sorting Data
  • Module 5: Buffering in Parallel Jobs
  • Module 6: Parallel Framework Data Types
  • Module 7: Reusable Components
  • Module 8: Advanced Transformer Logic
  • Module 9: Extending the Functionality of Parallel Jobs
  • Module 10: Accessing Databases (start if there is time)
  • Module 11: Processing XML Data
  • Module 12: Slowly Changing Dimensions Stages
  • Module 13: Best Practices

Testimonials

The course was comprehensive and interactive, and the trainer really understood the content. I have been able to implement much of what I learned into my daily work activities and have saved a lot of time.

★★★★★

- Susan Medina, Advocate Health Care

Absolutely loved the enthusiasm and appreciate the knowledge he brought to class!!!

★★★★★

- Shelly Fruits, KPERS

The trainer was INCREDIBLE. He was extremely passionate, made sure to consistently ask if anybody needed help, logged on early to answer any questions, and was an overall great human being.

★★★★★

- Salvatore, Hilton Grand Vacations
