DSXchange Learning Center
DataStage Training Professionals
On-site Training Classes

DSXchange Learning Center on-site classes are instructor-led classes designed to be delivered at your business location or a nearby training facility. The training room should be equipped with one PC per student and connectivity to a DataStage server.

For more information or to schedule an on-site class at your business, click here.

Advanced Parallel Job Techniques

Duration: Variable (1 to 5 days)

Objectives:
Having completed this class the student will be able to:

  • configure partitioning, sorting and buffering optimally
  • make use of built-in reporting and troubleshooting tools
  • create jobs that work with vectors and subrecord types
  • implement change detection and surrogate key management
  • employ parallelism in Enterprise stage types

Contents:
The Advanced Parallel Job Techniques class is built from the following thirteen modules and can be delivered in one to five days. The first four modules must be included and together require one day. The remaining nine modules are optional; any that are taken must follow the order specified, and each requires one half-day. All thirteen modules can be delivered in five teaching days.

  • Review of the parallel execution framework (architecture)
  • Partitioning and collecting techniques
  • Sorting techniques
  • Buffering
  • Managing resource allocation and performance tuning
  • DB2 Enterprise stage
  • Oracle Enterprise stage
  • Teradata stages
  • Monitoring: the OSH and the Score
  • Vectors and subrecords; handling structured and raw data
  • Change detection techniques (the vectors and subrecords module must also be selected)
  • Surrogate key management in a parallel environment
  • Troubleshooting and debugging

For more information click here.

DataStage Fundamentals
Server Edition


Duration: Four days

Objectives:
Having completed this class the student will be able to:

  • use DataStage tools to construct ETL tasks using DataStage server jobs
  • compile and execute those tasks
  • construct a hierarchy of control for those tasks

Contents:

  • Beginning DataStage - what DataStage is, how it works, what it is not
  • Metadata - classes of metadata used by ETL developers
  • Administrator client - setting/inspecting project-wide defaults
  • Developer's Toolkit - introduction to the client tools that a developer uses
  • Repository Manager - metadata creation, import, export and management
  • Developing Jobs - standard, structured technique for constructing DataStage jobs
  • Designer - editing jobs, job parameters, stages, links and their properties
  • Transformer stage - the workhorse of server jobs; lookups, stage variables, constraints, derivations
  • Hashed Files - populating and using hashed files
  • Some other active stages - Aggregator, Sort, Pivot
  • Director - execution and inspection/review tool
  • Job Sequences - using a GUI to construct control hierarchies

For more information click here.

DataStage® BASIC Programming

Duration: Four days

Objectives:
Having completed this training the student will be able to:

  • construct and validate DataStage BASIC expressions (using the Expression Editor where relevant)
  • provide parameter or argument values in job sequences
  • make flow-control decisions in job sequences
  • implement custom triggers in job sequences
  • constrain output links in BASIC Transformer stages
  • derive output column values in BASIC Transformer stages
  • initialize and derive BASIC Transformer stage variables
  • create, compile, test and troubleshoot routines that can be invoked from expressions or from job sequences
  • create before/after subroutines that can be invoked from server jobs and their active stages

Contents:

  • Introduction to the Language
  • Expression Editor
  • Expression Elements
  • Character Strings
  • Dynamic Arrays
  • Dates and Times
  • Comparison and Boolean Expressions
  • Routines: Programming Constructs
  • Transform Functions
  • Before/After Subroutines
  • Job Control Routines
  • Hashed File I/O
  • Sequential File I/O
  • Troubleshooting Techniques
  • DataStage API
  • BASIC SQL Client Interface

For more information click here.


DataStage Fundamentals
Enterprise Edition


Duration: Four days

Objectives:
Having completed this class the student will be able to:

  • use DataStage tools to construct ETL tasks using DataStage parallel jobs
  • compile and execute those tasks
  • construct a hierarchy of control for those tasks

Contents:

  • Beginning DataStage - what DataStage is, how it works, what it is not
  • Architectures - design-time, compile-time and run-time
  • Developer's Toolkit - introduction to the client tools that a developer uses
  • Parallelism Concepts - pipeline and partition parallelism and how each is implemented in parallel jobs
  • Configuration - how parallel execution is controlled
  • Repository Manager - metadata creation, import, export and management
  • Creating a Job - standard, structured technique for constructing a DataStage job
  • Designer - editing jobs, job parameters, stages, links and their properties
  • Data Sets and File Sets - parallel data storage; creation, use and management
  • Director - execution and inspection/review tool
  • Sequential Files - processing data in text files of various kinds
  • Combining Data - horizontal combination (lookup, join, merge, funnel) and vertical combination (aggregation)
  • Transforming Data - Modify and Transformer stage types
  • Accessing Tables - Enterprise, API and bulk loader stage types
  • Job Sequences - using a GUI to construct control hierarchies

For more information click here.