Abstract
Extract, Transform and Load (ETL) is a fundamental process used to populate a data warehouse. It involves extracting data from various sources, transforming the data according to business requirements and loading it into target data structures. During the transform phase, a well-designed ETL system should also enforce data quality and consistency and conform the data so that data from different source systems can be integrated. Once the data is loaded into the target systems in a presentation-ready format, end users can run queries against it to generate reports that help them make better business decisions. Even though the ETL process consumes roughly 70% of computing resources, it is hardly visible to end users [5]. The objective of this project is to create a website that contains courseware on the ETL process and a web-based ETL tool. The courseware can be accessed by anyone with internet access and is intended for a wide audience, from beginners to experienced users. The website is built using HTML, PHP, Korn shell scripts and MySQL. The ETL tool is also web-based and free to use for anyone with internet access; however, guests have limited access to the tool, while registered users have complete access. Using this tool, data can be extracted from text files and MySQL tables, combined and loaded into MySQL tables. Before the data is loaded into the target MySQL tables, various transformations can be applied to it according to business requirements. The tool is developed using HTML, PHP, SQL and Korn shell scripts.