AWS Summit Series 2016 | Santa Clara – Best Practices for Using Apache Spark on AWS

Learn More –

– – – – – – – – – – – – – – – – – – – – – –
Organizations need to perform increasingly complex analysis on data — streaming analytics, ad-hoc querying, and predictive analytics — in order to get better customer insights and actionable business intelligence. Apache Spark has recently emerged as the framework of choice to address many of these challenges.

In this session, we show you how to use Apache Spark on AWS to implement and scale common big data use cases such as real-time data processing, interactive data science, predictive analytics, and more. We will talk about common architectures, best practices to quickly create Spark clusters using Amazon EMR, and ways to integrate Spark with other big data services in AWS.

Learning Objectives: Learn why Spark is great for ad-hoc interactive analysis and real-time stream processing.How to deploy and tune scalable clusters running Spark on Amazon EMR. How to use EMR File System (EMRFS) with Spark to query data directly in Amazon S3. Common architectures to leverage Spark with Amazon DynamoDB, Amazon Redshift, Amazon Kinesis, and more.

Duration: 53:1
Publisher: Amazon Web Services
You can watch this video also at the source.

Inxy Hosting CDN Marketplace