Overview
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.
Key Features
- Serverless ETL: No infrastructure to provision or manage
- Data Catalog: Centralized metadata repository for all data assets
- Crawlers: Automatic schema discovery and cataloging
- Spark-Based: Built on Apache Spark for distributed processing