• HOME
  • CATEGORIES

    • CATEGORIES

    • Browse All Categories
  • FOR VENDORS

    • FOR VENDORS

    • Log In to Vendor Portal
    • Get Started
  • REVIEWS

    • REVIEWS

    • Write a Review
    • Product Reviews
    • Vendor Directory
    • Product Comparisons
  • GARTNER PEER COMMUNITY™
  • GARTNER.COM
  • Community GuidelinesListing GuidelinesBrowse VendorsRules of EngagementFAQPrivacyTerms of Service
    ©2026 Gartner, Inc. and/or its affiliates.
    All rights reserved.
  • Categories

    • Loading categories...

      Browse All Categories

      Loading markets...

  • For Vendors

    • Log In to Vendor Portal 

    • Get Started 

  • Write a Review

Join / Sign In
  1. Home
  2. /
  3. Amazon EMR
Logo of Amazon EMR

Amazon EMR

byAmazon Web Services (AWS)
in Hadoop Distributions
4.4

Overview

Product Information on Amazon EMR

Updated 13th October 2025

What is Amazon EMR?

Amazon EMR is a software for processing and analyzing large datasets using open-source tools such as Apache Spark, Apache Hadoop, and Presto. The software enables users to run distributed data processing workloads on scalable cloud infrastructure, automating provisioning and configuration of cluster resources. It supports a range of data analytics tasks including batch processing, machine learning workflows, and interactive SQL queries. Amazon EMR software is designed to address challenges related to managing big data environments, helping organizations reduce operational overhead and optimize resource usage for analytics and business intelligence initiatives.

Amazon EMR Pricing

Amazon EMR is a software that uses a pay-as-you-go pricing model based on the compute and storage resources consumed by clusters. Charges are determined by factors such as instance type, instance count, region, and additional features. There are no upfront costs or long-term commitments, and pricing details vary depending on configuration and usage duration.

Overall experience with Amazon EMR

Senior Director Of Technology
3B - 10B USD, IT Services
FAVORABLE

“ Amazon EMR’s Impact on Cost Savings and Reporting Service Performance Improvements”

5.0
Jul 17, 2025
My organization has utilized Amazon EMR for about 45 days, and the overall experience has been great. We chose EMR due to our existing Amazon Heavy infrastructure, evaluating AWS solutions over external vendors. EMR provided more flexibility in cost management than auto-scaling groups, especially with its dynamic handling of node counts and mixing on-demand and spot instances. EMR addressed key business pain points: Firstly, it delivered significant cost savings. We reduced daily costs by 40-45% for a service previously spending on 100% on-demand instances, by implementing an 80/20 distribution of spot to on-demand nodes. This projects potential savings per month if all our Hadoop loads transition to EMR. Other teams have also started using EMR for heavy Hadoop loads due to its cost optimization. Secondly, EMR vastly improved our heavy reporting service for customers, which previously suffered from report queuing and throttling during peak morning hours. Our report response time improved dramatically, from approximately 30 mins to 2 mins. The system spins up additional nodes when reports are triggered and shuts them down during lean periods, preventing cost incurrence. This directly led to the retention of a significant FMCG giant customer in the US, a contract that could have resulted in over a 5% loss of our overall revenue, with minimal deployment effort. From an onboarding perspective, my prior experience with EMR made the process straightforward. AWS offers extensive use cases, design diagrams, and paid support typically responds within 2-24 hours. Customization was not a major concern, as EMR’s features and documentation meet most industry needs. EMR efficiently handles large-scale data processing, with configurable node limits and built-in fault tolerance. A caution regarding performance: for very large node during peak hours, it's advised to maintain at least a 50/50 split between spot and on-demand instances to avoid availability issues.
Data Analyst
250M - 500M USD, Retail
CRITICAL

“Awesome product but needs some changes”

3.0
Dec 2, 2025
good, but can be better with more features and controls for engineers

About Company

Company Description

Updated 6th March 2025

Amazon Web Services (AWS), established in 2006, is focused on providing essential infrastructure services to businesses globally in the form of cloud computing. The key advantage offered through cloud computing, particularly via AWS, is its capacity to shift fixed infrastructure expenses into flexible costs. Businesses have been able to forgo extensive planning and procurement of servers and other Information Technology (IT) resources, owing to AWS. AWS seeks to provide businesses with prompt and cost-effective access to resources using Amazon's expertise and economies of scale, as and when their business requires. Currently, AWS offers a robust, scalable, economic infrastructure platform on the cloud powering an extensive array of businesses worldwide. It operates across numerous industries with data center locations in various parts of the globe including U.S., Europe, Singapore, and Japan.

Company Details

Updated 23rd December 2024
Company type
Public
Year Founded
2006
Head office location
Seattle, United States
Number of employees
10001+
Website
http://aws.amazon.com

Do You Manage Peer Insights at Amazon Web Services (AWS)?

Access Vendor Portal to update and manage your profile.

Key Insights

A Snapshot of What Matters - Based on Validated User Reviews

Reviewer Insights for: Amazon EMR
Performance of Amazon EMR Across Market Features

Amazon EMR Likes & Dislikes

Like

The most significant advantage of Amazon EMR is its exceptional cost management capabilities. EMR's unique architecture with core nodes (24/7) and transient task nodes that automatically turn on and off is unparalleled by other providers, who typically only offer all-on-demand or all-spot auto-scaling groups. Another highly valued feature is the flexibility in choosing instance families. For example, within the R5 family, we can select from R5, R5A, R5G (graviton, which saves cost), or R5D (memory-optimized). The ability to use multiple node types within a family and assign weightage (e.g., prioritizing cost-saving Graviton instances) provides unmatched flexibility tailored to our specific use case. This also extends to time-dependent scaling, allowing us to allocate more on-demand instances during peak US hours and shift to more spot instances during India's non-peak hours, resulting in significant daily cost savings. Furthermore, EMR's seamless integration with other AWS services, such as Amazon S3, is a major benefit. Our underlying data layer is S3, and EMR provides an intuitive, built-in integration that largely eliminates the need for custom code, relying instead on simple dropdown configurations. It also integrates easily with open-source coding platforms we utilize. Finally, EMR prioritizes customer experience and ensures operational resilience. If spot machines are unavailable during peak times, EMR automatically waits for a configured period (e.g., 5 minutes) and then provisions on-demand instances to prevent customer impact. While this may temporarily increase cost, nodes automatically shut down when the load subsides. EMR also boasts built-in fault tolerance, automatically spinning up a new node if one goes down, ensuring continuous availability and preventing downtime.

Like

hands on, configurability

Like

Great high compute platform

Dislike

My primary concern with Amazon EMR is a significant limitation in its configuration flexibility: the inability to change the instance family of a cluster once it has been created from the UI. If an incorrect or higher-cost instance family is selected during initial cluster setup, there is no direct way to modify it. To rectify such an error, one must terminate the existing cluster, create a clone, make the necessary changes, and then redeploy. This process inevitably leads to downtime or necessitates incurring additional cost by running a parallel cluster until the new one is operational. This is a major pain point, as I've observed similar feedback online from other customers. While EMR allows for editing parameters like the number of spot or on-demand nodes, the fundamental instance family cannot be altered post-implementation. Most other AWS features offer an edit option, and its absence for EMR cluster instance families is a notable drawback that, if resolved, would significantly enhance the product's usability and flexibility for many users.

Dislike

finding failure is hard

Dislike

not much i can think off.

Top Amazon EMR Alternatives

Logo of Amazon Athena
1. Amazon Athena
4.4
(90 Ratings)
Logo of HPE Data Fabric Software
2. HPE Data Fabric Software
4.3
(37 Ratings)
Logo of Azure Data Lake Store
3. Azure Data Lake Store
4.3
(26 Ratings)
View All Alternatives

Peer Discussions

Amazon EMR Reviews and Ratings

4.4

(60 Ratings)

Rating Distribution

5 Star
42%
4 Star
52%
3 Star
7%
2 Star
0%
1 Star
0%
Why ratings and reviews count differ?

Customer Experience

Evaluation & Contracting

4.3

Integration & Deployment

4.4

Service & Support

4.3

Product Capabilities

4.4

Filter Reviews
Sort By:
Most helpful
Last 12 Months
Star Rating
Reviewer Type
Reviewer's Company Size
Reviewer's Industry
Reviewer's Region
Reviewer's Job Function
  • Senior Director Of Technology
    1B-10B USD
    IT Services
    Review Source

    Amazon EMR’s Impact on Cost Savings and Reporting Service Performance Improvements

    5.0
    Jul 17, 2025
    My organization has utilized Amazon EMR for about 45 days, and the overall experience has been great. We chose EMR due to our existing Amazon Heavy infrastructure, evaluating AWS solutions over external vendors. EMR provided more flexibility in cost management than auto-scaling groups, especially with its dynamic handling of node counts and mixing on-demand and spot instances. EMR addressed key business pain points: Firstly, it delivered significant cost savings. We reduced daily costs by 40-45% for a service previously spending on 100% on-demand instances, by implementing an 80/20 distribution of spot to on-demand nodes. This projects potential savings per month if all our Hadoop loads transition to EMR. Other teams have also started using EMR for heavy Hadoop loads due to its cost optimization. Secondly, EMR vastly improved our heavy reporting service for customers, which previously suffered from report queuing and throttling during peak morning hours. Our report response time improved dramatically, from approximately 30 mins to 2 mins. The system spins up additional nodes when reports are triggered and shuts them down during lean periods, preventing cost incurrence. This directly led to the retention of a significant FMCG giant customer in the US, a contract that could have resulted in over a 5% loss of our overall revenue, with minimal deployment effort. From an onboarding perspective, my prior experience with EMR made the process straightforward. AWS offers extensive use cases, design diagrams, and paid support typically responds within 2-24 hours. Customization was not a major concern, as EMR’s features and documentation meet most industry needs. EMR efficiently handles large-scale data processing, with configurable node limits and built-in fault tolerance. A caution regarding performance: for very large node during peak hours, it's advised to maintain at least a 50/50 split between spot and on-demand instances to avoid availability issues.
  • Director of Engineering
    50M-1B USD
    Banking
    Review Source

    EMR Delivers Strong Compute Capacity for Pyspark-Based Data Processing Tasks

    5.0
    Dec 1, 2025
    EMR provides high compute capacity for our data processing needs. We have several models being executed on EMR using Pyspark
  • VP, Data and Analytics
    Gov't/PS/Ed
    Government
    Review Source

    Platform Utilizes Map Reduce Components and Supports Spot CPU Usage

    4.0
    Dec 1, 2025
    the platform is consist of standard of map reduce components and can use spot CPU
  • Engineer
    <50M USD
    Banking
    Review Source

    Comprehensive Spark Configurations Available Through Flexible API Calls in This Product

    5.0
    Nov 30, 2025
    very complete, i can work better spark configs with api calls than in other services
  • Data Analyst
    50M-1B USD
    Retail
    Review Source

    Awesome product but needs some changes

    3.0
    Dec 2, 2025
    good, but can be better with more features and controls for engineers
...
Showing Result 1-5 of 67

Recommended Gartner Research

  • Market Guide for Hadoop Distributions

Gartner Peer Insights content consists of the opinions of individual end users based on their own experiences, and should not be construed as statements of fact, nor do they represent the views of Gartner or its affiliates. Gartner does not endorse any vendor, product or service depicted in this content nor makes any warranties, expressed or implied, with respect to this content, about its accuracy or completeness, including any warranties of merchantability or fitness for a particular purpose.

This site is protected by hCaptcha and its Privacy Policy and Terms of Use apply.


Software reviews and ratings for EMMS, BI, CRM, MDM, analytics, security and other platforms - Peer Insights by Gartner
Community GuidelinesListing GuidelinesBrowse VendorsRules of EngagementFAQsPrivacyTerms of Use

©2026 Gartner, Inc. and/or its affiliates.

All rights reserved.