ETL stands for Extract, Transform and Load. It refers to a trio of processes which are required to move the raw data from its source to a data warehouse, a business intelligence system, or a big data platform.
- Extract: This step involves accessing the data from all the Storage Systems like RDBMS, Excel files, XML files, flat files etc.
- Transform: In this step, entire data is analyzed and various functions are applied on it to transform that into the required format.
- Load: In this step, the processed data, i.e. the extracted and transformed data, is then loaded to a target data repository which usually is the database, by utilizing minimal resources.