AWS Glue, the serverless ETL service of AWS, supports two types of jobs: Spark and Python shell. In this article, we'll focus on Python shell jobs and explain how you can make optimal use of your S3 Data Lake using Athena within Python shell jobs.
Modern Data Platform
Chances are you have recently heard a lot about data mesh, a decentralized approach to sharing, accessing, and managing analytical data. So, let's dive into a practical example to help you understand what a data mesh stands for.
Data lakehouses are the talk of the town when it comes to data architecture. But why is that? And why is that happening right now? Let's take a refreshing dive into the history of data warehouses, data lakes, and data lakehouses.
Before you can generate insights from your data, you need to move those data from an operational to an analytical environment - a process commonly referred to as data ingestion. An event-driven architecture provides an elegant way to achieve a process marked by timeliness, performance, and cost-effectiveness.