logo-ontapCloud Volumes ONTAP

Big data with Amazon EMR

Managing short-term and long-term analytics workloads.

Start Free Trial

CHALLENGE

Complicated sizing and increased costs

Amazon EMR provides a variety of capabilities that eliminate some of the complexities surrounding analytics workloads management. However, some considerable challenges still remain. Sizing an EMR cluster that is going to run on a dynamic data set of a growing size, dealing with the capacity limits of Amazon EBS volumes, the HDFS 3x replication factor, and added Amazon S3 charges for API calls all introduce additional complexity and costs.

SOLUTION

Decouple EMR from native storage services

With the help of the NetApp In-Place Analytics Module, EMR clusters can get access to data managed by Cloud Volumes ONTAP using NFS. With Cloud Volumes ONTAP’s NAS capabilities, you get the advantages of EMR without having to consume additional storage, replicate existing data sets, or make endless of API calls to Amazon S3, saving significant costs and operational efforts.

How it works

how-it-works

  • 1

    Create NFS volumes for big data

  • 2

    Install NetApp In-Place Analytics Module (NIPAM)

  • 3

    Mount NFS volumes to EMR clusters

Benefits
  • cost

    Cost

    Significant savings by reducing the amount of EMR clusters, eliminating the need for three HDFS copies and the substantial amount of S3 API calls.

  • performance

    Performance

    NetApp data cloning technology allows you to instantly deploy volume clones on which you can run variations of analytics while keeping your main volumes dedicated to production workloads.

  • protection

    Protection

    Robust data reliability with NetApp Snapshots, data replication, and high availability pair deployments.

Pricing

Get Block and File Storage for the price of Object Storage.

See Full Pricing

How to get started

Select cloud to get started with