Hadoop Distributions

Hadoop Distributions Reviews and Ratings

What are Hadoop Distributions?

Hadoop distributions provide scalable, distributed computing over data held in on-premises and cloud-based file stores. They are commercially packaged and supported editions of open-source Apache Hadoop-related projects, and they give access to applications, query and reporting tools, machine learning, and data management infrastructure components.

First introduced as collections of components for any use case, distributions are now often delivered as part of a specific solution for data lakes, machine learning or other uses. From that starting point they grow into expanded roles, competing with both older technologies such as database management systems (DBMSs) and newer ones such as Apache Spark.

Gartner Client Insights

Market Guide for Hadoop Distributions

Popular Product Comparisons

Amazon EMR vs Google Cloud Platform
Amazon EMR vs Azure Data Lake Store
Azure Data Lake Store vs Google Cloud Platform
