Course Description

This unique 4-day IBM InfoSphere Advanced Datastage bootcamp training is designed to introduce advanced job development techniques in DataStage. This advanced course is for experienced developers seeking training in more advanced techniques and who seek an understanding of the parallel framework architecture. This course combines InfoSphere Advanced DataStage – Parallel Framework (3-days) and InfoSphere DataStage – Advanced Data Processing (2-days). Materials and environment for hands-on labs provided.

Course Outcomes

At the end of the course, learners will be able to:

  • Describe the parallel processing architecture and development and runtime environments
  • Describe the compile process and the runtime job execution process
  • Describe how partitioning and collection works in the parallel framework
  • Describe sorting and buffering in the parallel framework and optimization techniques
  • Describe and work with parallel framework data types
  • Create reusable job components
  • Use loop processing in a Transformer stage
  • Process groups in a Transformer stage
  • Extend the functionality of DataStage by building custom stages and creating new Transformer functions
  • Use Connector stages to read and write from relational tables and handle errors in Connector stages
  • Process XML data in DataStage jobs using the XML stage
  • Design a job that processes a star schema database with Type 1 and Type 2 slowly changing dimensions
  • List job and stage best practices

Course Summary

Next Public Course Dates

UK & Europe:

  • 17 - 20 September 2024
  • 1 - 4 October 2024
  • 15 - 18 October 2024

US & Canada:

  • 10 - 13 September 2024
  • 24 - 27 September 2024
  • 8 - 11 October 2024

More dates

Prerequisites
  • You should complete our IBM InfoSphere DataStage Essentials training course and have at least one year of experience developing parallel jobs using DataStage.
Duration
  • 4 Days
Available Formats
  • Public Virtual Live Instructor-Led
  • Private Virtual Live Instructor-Led
  • Private Onsite
Audience
  • Experienced Developers

Course Modules

  • Module 1: Introduction to the Parallel Framework Architecture
  • Module 2: Compilation and Execution
  • Module 3: Partitioning and Collecting Data
  • Module 4: Sorting Data
  • Module 5: Buffering in Parallel Jobs
  • Module 6: Parallel Framework Data Types
  • Module 7: Reusable components
  • Module 8: Advanced Transformer Logic
  • Module 9: Extending the Functionality of Parallel Jobs
  • Module 10: Accessing Databases (start if there is time)
  • Module 11: Processing XML Data
  • Module 12: Slowly Changing Dimensions Stages
  • Module 13: Best Practices

Upcoming Public Virtual Live Instructor-led Course Dates

UK & Europe:

US & Canada:

    To discuss your project requirements, send us a message

      For a free assessment, quick quote or training information, send us a message

        To book this course, please fill in your details and submit the form.

          To book this course, please fill in your details and submit the form.

            To discuss your training requirements or book a class, drop us a line