AWS Analytics Bytes: Building Transactional Data Lakes on AWS | Amazon Web Services

Welcome to the fourth episode of Analytics Bytes: A video series where we explore common Startups analytics use cases and how to architect for them. This video describes the need for transactional data lake and explores how to run successful transactional data lakes on AWS. It explains the open source frameworks that integrate with AWS that achieve the transactional behavior on the data lake.

Transactional Data Lakes provide lower ETL time resulting in the end consumer being able to access data with low latency. This enables businesses and their consumers to successfully implement near real time data analytics.

How Nerd Wallet Uses AWS and Apache Hudi to build a serverless, real-time analytics platform:

Process Apache Hudi, Delta Lake, Apache Iceberg Datasets at Scale part 1: AWS Glue Studio Notebook:

Process Apache Hudi, Delta Lake, Apache Iceberg Datasets at Scale part 2: Using AWS Glue Studio Visual Editor:

Crawl Delta Lake tables using AWS Glue crawlers:

To get hands on with Transactional Data Lakes –

Transactional & Mutable Data Lakes on AWS:

Modern Data Lake Storage Layers:

More AWS videos –
More AWS events videos –

Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers — including the fastest-growing startups, largest enterprises, and leading government agencies — are using AWS to lower costs, become more agile, and innovate faster.

#AWSStartups #datalake #ApacheHudi #ApacheIceberg #DeltaLake #AWS #AmazonWebServices #CloudComputing

Duration: 00:11:52
Publisher: Amazon Web Services
You can watch this video also at the source.