ETL (Ab Initio)
Course number: CGIETL40
Ab Initio is a business intelligence platform comprising six data processing products: Co>Operating System, the Component Library, Graphical Development Environment (GDE), Enterprise Meta>Environment (EME), Data Profiler, and Conduct>It. It is a GUI-based parallel processing tool for ETL data management and analysis.
Prerequisites
- An Associate’s Degree and two years of experience in computer programming
Target Audience
This course is intended primarily for ETL administrators and for ETL and BI developers.
Certification
COMNet Group Certification
Exam
COMNet Group Exam
Accreditation
COMNet Group Accreditation
Course Outline
- Ab Initio Product Architecture
- Client-Server Communication
- Ab Initio – Process Flow
- Co>Operating System
- Graphical Development Environment
- The Graph Model
- Parts of Typical Graph
- Layout and Phase of Graph
- Anatomy of a Running Job
- Data Manipulation Language
- Built-in Functions
- Lookup File
- Transform Functions
- Reformat
- Filter by Expression
- Redefine Format
- Replicate
- Aggregate
- Sort
- Join
- Dedup Sorted
- Multifiles
- Parallelism Concepts
- Component Parallelism
- Pipeline Parallelism
- Data Parallelism
- Partitioning Components
- Broadcast
- Partition by Key
- Partition by Expression
- Partition by Round-Robin
- Partition by Range
- Partition by Percentage
- Partition with Load Balance
- Gather
- Concatenate
- Interleave
- Merge
- Environment Set up
- Sandbox & graph parameters
- Private and Public Sandbox
- Parameters sourcing technique
- Input Table
- Output Table
- Update Table
- Truncate Table
- Run SQL
- DBC File
- Normalize
- Denormalize Sorted
- Meta Pivot
- Multi Reformat
- Inflate
- Deflate
- Assign Keys
- Scan
- Sort within Groups
- Match Sorted
- Rollup
- Conditional Components
- Vectors
- Custom Layouts
- Control Statements (if, for, while, etc.)
- Performance Tuning Tips
- Check-in
- Check-out
- Tagging
- MFS Commands
- Debugging
- Features of EME
- Sandbox and projects
- Checking out graphs, files and projects
- Checking in projects, sandboxes, graphs and files
- Dependency analysis
- Using the web interface to access the EME datastore
- Reports, Versioning and Tagging
- Air commands