DATASTAGE

 

 

 

FACULTY : KUMAR

 

24HOURS LAB/UNLIMITED WI-FI

DURATION : 30 Hr

 

DEMO DATE:

 

IBM Web Sphere Data stage and Quality Stage –Version 8.0

Unit-1:Data ware house Fundamentals

An Introduction to Data Wareousing-purpose of Data Warehouse-Data Warehouse Architecture-Operational Data Store-OLTP Vs Warehouse Applications-Data marts Vs Data Warehouses-Data Warehouse Life  Cycle-Metadata Management

Unit-2:Data Modeling

Introduction to Data Modeling-Entity Relationship model(E-R model)-Data Modeling for Data Warehouse-Dimensions and fact tables-Star schema amnd Snowflake schemas-Coverage Tables-Fact less Tables-What to look for in Modeling tools

Unit-3:ETL Design

Introduction to extraction, transformation and loading – Types of ETL Tools – What to look for in ETL Tools – Key Tools in the market – ETL Trends & New solution Options

 Unit-4: Data Stage Installation

DataStage Installation – Prerequisites to install Data Stage – Installation Process

Unit-5: Introduction Data Stage Version 8.0

Data Stage Introduction – IBM Information Server Architecture – Data Stage with in the IBM Information Server Architecture – Data Stage Components – Data Stage Main Functions – Client Components

Unit-6: Data Stage Administrator

Data Stage Project Administrator – Editing Project & Adding Projects – Deleting Projects, Cleaning up Project Files – global Variable Setting – Environment Management – Auto Purging – Runt Time Column Propogation (RCP) – Enable Remote Execution of Parallel Jobs – Add check Points for Sequencer – NLS Configuration – Generated OSH(Orchestra Engine) – System Formats like Date, TimeStamp – Project – Version Details                                                                                                   

Unit-7: Data Stage Director

Introduction to Data Stage Director – Validating Data Stage Jobs – Execting Data Stage Jobs – Job Execution Status – Monitoring a Job – Job Log View – Job Scheduling – Creating Batches – Scheduling Batches

Unit-8: Data Stage Designer

Itroduction to Data Stage Designer – Importance of paralellism – Pipeline Paralellism -  Partitioning & Collecting – Symetric Multi Processing(SMP) – Massively Parallel Processing(MRP) – Partition technique – Data Stage Repositiory – Palette – Passive and active Stages – Job Design Over View – Designer Work Area – Annotatations – Creating Jobs, Deleting Jobs – Parameter Passing – Compiling Jobs – Batch Compiling – Validating Jobs – Importing Flat File Definitions – Managing The Meta Data Environment – Dataset  Management – Deletion Of Data Set – Routines – Arguments Passign to Routine  Importing Jobs – Exportig Jobs (Backup)

Unti-9: Working with Parallel Job Stages

Database Stages                               Oracle - DB2 - Teradata - ODBC - Sql Server Stage      FileStages                Sequential File – CFF – Dataset – Fileset – Lookup Fileset          Processing Stages      Copy – Filter – Funnel – Sort – Remove Duplicate – Aggregator – Modify – Compress  - Expand – Decode – Encode – switch – Pivot Stage – Lookup – Join –Merge – Difference Between Lookup, Join & Merge – Change Apply – compare – Difference – Surrogate Key Generator – Transformer                    Debug Stages            Head – Tail – Peak – Column Generator – Row Generator – Write Range Map    Real Time stages        XML Input – XML Output – XML Transformer           Local & Shared Containers          Routine Creation      

Unit-10: Advanced Stages in Parallel Jobs(Version 8.0)

Range Look Process – Surrogate Key Generator stage – slowly changing dimension stage – iway stage – SFTP stage – java plugin – job performance analysis – resource estimation – slowly changing dimension implementation – local and shared containers – erformance tuning

Unit-11: Data Stage Server Jobs

Server jobs stages         Oracle – unit12 job sequencers    DB2 – Teradata – ODBC – Sql server stage – sequencial file – FTP Stage – command stage – hash file stage – inter process – link collector – link partitioner – sort – aggregator – transformer

Unit-12: Job Sequencers

Arrange job activities in sequence – triggers in sequencer – restability – recoverability – notification activity – terminator activity – wait for file activity – start look activity – execute command activity – nested condition activity – routine activity – exceptio handling activity – user variable activity – endloop activity – adding check points

Unit-13: information analyser

IBM WebSphere Information analyser overview – data profiling process – column analysiss – primary key analysis – foreign key analysis – cross – domain analysis – baseline anlysis – analysis result publication – deleting analysis – results – baseline analysis reports – cross – domain analysis reports – primary key reports – foreign key analysis reports

Unit-14: websphere quality stage

Bout data quality – datastage quality stages – investigate stage -  standardise stage match frequency stage – unduplicate match stage -  reference match stage – survive stage

Unit-15: IBM Information server Administration guide

IBM WebSphere data stage administration – opening the IBM information server web console – setting up a project in the console – customising the project dash board – setting up security – creating users in the console – assigning security roles to users and grouping schedules -  managing liescence – managing active sessios – managing logs – managing schedules – backing up & Restoring IBM Information Server