ETL is short for 'Extract, Transform, and Load' – and it's the overall process businesses use to combine disparate data sources for analysis.
The ETL process consists of the following steps:
Data gathered from external sources
Data changed to fit required standard
Converted data inserted into DW
- Extract data from sources
- Load into stages tables
- Audit reports
- Clean up
Sometimes those steps are supervised and performed indirectly, which is very time-consuming and might prove to be inaccurate. ETL tools make the entire process more reliable.
The purpose of this website is to provide detailed information about both commercial and open source ETL vendors. Some tools are relatively expensive, or some might be be too complex, if you don't want to transform a large amount of information, are not interested in using many sources, or do not need sophisticated features.
Begin with defining the business requirements, then consider the technical aspects, and finally, choose the right ETL tool. Refer to the ETL wiki to help you become more informed when choosing a technology or provider.
Providers of ETL Tools
The following is a list of the most popular commercial and open-source ETL tools.
Comercial ETL tools:
- IBM Infosphere DataStage
- Informatica PowerCenter
- Oracle Warehouse Builder (OWB)
- Oracle Data Integrator (ODI)
- SAS ETL Studio
- Business Objects Data Integrator (BODI)
- Microsoft SQL Server Integration Services (SSIS)
- Ab Initio
Freeware, open source ETL tools:
- Pentaho Data Integration (Kettle)
- Talend Integrator Suite
- Jasper ETL