Welcome to the fourth episode of Analytics Bytes, a video series where we explore common startup analytics use cases and how to architect for them. This video describes the need for transactional data lakes and explores how to run them successfully on AWS. It also explains the open source frameworks that integrate with AWS to bring transactional behavior to the data lake.
Transactional data lakes reduce ETL time, so end consumers can access data with low latency. This enables businesses and their consumers to implement near-real-time data analytics.
How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platform: https://bit.ly/3SkpNC8
Process Apache Hudi, Delta Lake, Apache Iceberg Datasets at Scale part 1: AWS Glue Studio Notebook: https://bit.ly/3RnzBdb
Process Apache Hudi, Delta Lake, Apache Iceberg Datasets at Scale part 2: Using AWS Glue Studio Visual Editor: https://bit.ly/3SmFf0u
Crawl Delta Lake tables using AWS Glue crawlers: https://bit.ly/3DWuldw
To get hands-on with transactional data lakes –
Transactional & Mutable Data Lakes on AWS: https://bit.ly/3fszaAS
Modern Data Lake Storage Layers: https://bit.ly/3fl9kyJ
More AWS videos – http://bit.ly/2O3zS75
More AWS events videos – http://bit.ly/316g9t4
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers — including the fastest-growing startups, largest enterprises, and leading government agencies — are using AWS to lower costs, become more agile, and innovate faster.
#AWSStartups #datalake #ApacheHudi #ApacheIceberg #DeltaLake #AWS #AmazonWebServices #CloudComputing
Publisher: Amazon Web Services