Disadvantages of AWS Glue
- Amount of Work Involved: AWS Glue does not cover every ETL need out of the box. Customizing it to our requirements takes experienced engineers and a considerable amount of work.
- Platform Compatibility: AWS Glue is built specifically for the AWS ecosystem and its related services, so it does not integrate easily with technologies outside AWS.
- Limited Data Sources: It supports only a limited set of data sources, such as Amazon S3 and JDBC-accessible databases.
- High Skillset Requirement: AWS Glue is a serverless service and still a relatively new technology, so the skillset required to implement and operate it is high.
Introduction To AWS Glue ETL
The Extract, Transform, Load (ETL) process is designed to move data from its source database into a data warehouse. However, the challenges and complexity of ETL can make it hard to implement successfully across all of our enterprise data. For this reason, Amazon introduced AWS Glue.
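To make the three ETL stages concrete, here is a minimal sketch in plain Python. It does not use AWS Glue. The sample data, the `orders` table, and the function names are all hypothetical and chosen only for illustration, and an in-memory SQLite database stands in for the data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source data; in practice this would come from a source database or file.
SOURCE_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,3.25
3,carol,
"""


def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim and lowercase names, cast types, drop rows with no amount."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["customer"].strip().lower(), float(amount))
        )
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(text, conn):
    """Run all three stages and return (row count, total amount) from the target."""
    load(transform(extract(text)), conn)
    return conn.execute("SELECT COUNT(*), SUM(amount) FROM orders").fetchone()


if __name__ == "__main__":
    warehouse = sqlite3.connect(":memory:")  # stand-in for a real warehouse
    print(run_etl(SOURCE_CSV, warehouse))
```

Even in this toy version, the transform step has to make decisions about bad records and type conversions. That is the kind of complexity that grows quickly with real enterprise data.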
AWS Glue is a fully managed ETL (Extract, Transform, Load) service that makes it simple and cost-effective to categorize our data, clean it, enrich it, and move it reliably between various data stores. It consists of a central metadata repository known as the AWS Glue Data Catalog, an ETL engine that automatically generates Python or Scala code, and a flexible scheduler that handles dependency resolution, job monitoring, and retries. AWS Glue is serverless, which means that there is no infrastructure to set up or manage.
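As a rough illustration of how the ETL engine and the scheduler fit together, the sketch below assembles the parameters for a Glue job and a scheduled trigger, as they might be passed to boto3's `create_job` and `create_trigger` calls. The job name, role ARN, script location, cron expression, worker settings, and helper function names are all assumptions made for this example, not values from a real account.

```python
def build_job_definition(name, role_arn, script_location):
    """Return keyword arguments for boto3's glue.create_job (illustrative values)."""
    return {
        "Name": name,
        "Role": role_arn,
        "Command": {
            "Name": "glueetl",  # Spark-based ETL job type
            "ScriptLocation": script_location,
            "PythonVersion": "3",
        },
        "GlueVersion": "4.0",  # assumed version; pick what your account supports
        "WorkerType": "G.1X",
        "NumberOfWorkers": 2,
    }


def build_schedule_trigger(name, job_name, cron):
    """Return keyword arguments for boto3's glue.create_trigger."""
    return {
        "Name": name,
        "Type": "SCHEDULED",
        "Schedule": cron,
        "Actions": [{"JobName": job_name}],
        "StartOnCreation": True,
    }


def register(job_kwargs, trigger_kwargs):
    """Create the job and trigger in AWS; needs boto3 and valid credentials."""
    import boto3  # imported here so the builders above work without it

    glue = boto3.client("glue")
    glue.create_job(**job_kwargs)
    glue.create_trigger(**trigger_kwargs)


# Hypothetical names and locations, for illustration only.
job = build_job_definition(
    "nightly-orders-etl",
    "arn:aws:iam::123456789012:role/ExampleGlueRole",
    "s3://example-bucket/scripts/orders_etl.py",
)
trigger = build_schedule_trigger(
    "nightly-orders-trigger", job["Name"], "cron(0 2 * * ? *)"
)
```

Calling `register(job, trigger)` would create both resources. No servers are provisioned in this flow: the job definition only states how many workers to use, and Glue allocates them when the trigger fires.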