ETL Testing

img

Course Information

  • Course Price $250
  • Total Students 800+
  • Course Duration 4 Weeks

Description

ETL (Extract-Transform-Load) Testing is used to check the accuracy of the loaded data after the transformation from one source to the destination. It is also for verifying the data at the multiple stages of the transmission. ETL Testing Training in Chennai is essential to learn as it helps to compare large numbers of data between miscellaneous data sources. Automating ETL Testing is now on-demand testing technology in the market considering its 100% accuracy on data validation. ETL Testing is involved in validating the data when the data migration or data integration happening in any big companies with the help of data-centric process. It helps to ensure the transferred data has been loaded into the destination from the source successfully as expected.

Benefits

  • Metadata Testing which involves the verification of the table definition
  • Data Completeness Testing used to compare and validate the counts and aggregates of data which is transferred from the source to the target (ex: sum, avg, min, max)
  • Data Quality Testing is to check the accuracy of the data with the use of data profiling concept
  • Data Transformation Testing involves in the midst of the processing with the approaches of White Box and Block Box Testing.
  • ETL Regression Testing verifies the output produced by ETL is same for the given input and validate any changes to be made.
  • Reference Data Testing checks the data values which is inside the database (ex: country_code) whether it is as per the syntax.
  • Incremental ETL Testing verifies the updates from the source system are loaded into the destination system properly. It also checks the duplication of data, dimension of data and the values of data.
  • ETL Integration Testing involves four steps which are Setup test data, Execute ETL Process, View the data in destination system and Validation of data and the application to perform End-to-End testing completion
  • ETL Performance Testing is to check the performance of all the processes which are involved in whole ETL Testing

Syllabus

DWH: Data Ware Housing Concepts
  • What is Data Warehouse?
  • Need of Data Warehouse
  • Introduction to OLTP, ETL and OLAP Systems
  • Difference between OLTP and OLAP
  • Data Warehouse Architecture
  • Data Marts
  • ODS [Operational Data Store]
  • Dimensional Modelling
  • Difference between relation and dimensional modelling
  • Star Schema and Snowflake Schema
  • What is fact table
  • What is the Dimension table
  • Normalization and De-Normalization
ETL Testing
  • ETL architecture.
  • What is ETL and importance of ETL testing
  • How DWH ETL Testing is different from the Application Testing
  • SDLC/STLC in the ETL Projects (ex: V Model, Water fall model)
Challenges in DWH ETL Testing compare to other testing
  • Incompatible and duplicate data.
  • Loss of data during ETL process.
  • Testers have no privileges to execute ETL jobs by their own.
  • Volume and complexity of data is very huge.
  • Fault in business process and procedures.
  • Trouble acquiring and building test data.
ETL Testing Work flow activities involved
  • Analyze and interpret business requirements/ workflows to Create estimations
  • Approve requirements and prepare the Test plan for the system testing
  • Prepare the test cases with the help of design documents provided by the
  • developer team
  • Execute system testing and integration testing.
  • Best practices to Create quality documentations (Test plans, Test Scripts, and Test closure summaries)
  • How to detect the bugs in the ETL testing
  • How to report the bugs in the ETL testing
  • How to co-ordinate with the developer team for resolving the defects
Types of ETL Testing
  • Data completeness.
  • Data transformation.
  • Data quality.
  • Performance and scalability.
  • Integration testing.
  • User-acceptance testing.
  • SQL Queries for ETL Testing
  • Incremental load testing
  • Initial Load / Full load testing
Different ETL tools available in the market
  • Informatica
  • Ab Initio
  • IBM Data stage
Power Center Components
  • Designer
  • Repository Manager
  • Workflow Manager
  • Workflow Monitor
  • Power Center Admin Console
Informatica Concepts and Overview
  • Informatica Architecture.
Sources
  • Working with relational Sources
  • Working with Flat Files
Targets
  • Working with Relational Targets
  • Working with Flat file Targets
Transformations Active and Passive Transformations
  • Expression
  • þÿLookup Different types of lookup Caches
  • Sequence Generator
  • Filter
  • Joi
  • Stored Procedure
Slowly Changing Dimension
  • SCD Type1
  • SCD Type2
  • Date, Flag and Version
  • SCD Type3
Workflow Manger
  • Creating Reusable tasks
  • Workflows, Worklets & Sessions
  • Tasks
  • Session
  • Decision task
  • Control Task
  • Event wait task
Timer task
  • Monitoring workflows and debugging errors
  • Indirect Loading
  • Constraint based load ordering
  • Target Load plan
  • Worklet
  • Migration ?ML migration and Folder Copy.
  • Scheduling Workflow
  • Parameter and variables
  • XML Source, Target, and Transformations
Performance Tuning
  • Pipeline Partition
  • Dynamic Partition
  • Pushdown optimization
  • Preparation of Test Cases
  • Executing Test case
  • Preparing Sample data
  • Data validation in Source and target
  • Load and performance testing
  • Unit testing Procedures.
  • Error handling procedures.
LETFIX Technologies
LETFIX Technologies