DB-EnginesInfluxDB: Focus on building software with an easy-to-use serverless, scalable time series platformEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Greenplum vs. Hive vs. Impala

System Properties Comparison Greenplum vs. Hive vs. Impala

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameGreenplum  Xexclude from comparisonHive  Xexclude from comparisonImpala  Xexclude from comparison
DescriptionAnalytic Database platform built on PostgreSQL. Full name is Pivotal Greenplum Database infoA logical database in Greenplum is an array of individual PostgreSQL databases working together to present a single database image.data warehouse software for querying and managing large distributed datasets, built on HadoopAnalytic DBMS for Hadoop
Primary database modelRelational DBMSRelational DBMSRelational DBMS
Secondary database modelsDocument store
Spatial DBMS
Document store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score10.86
Rank#47  Overall
#29  Relational DBMS
Score70.91
Rank#17  Overall
#11  Relational DBMS
Score18.95
Rank#37  Overall
#23  Relational DBMS
Websitegreenplum.orghive.apache.orgwww.cloudera.com/­products/­open-source/­apache-hadoop/­impala.html
Technical documentationdocs.greenplum.orgcwiki.apache.org/­confluence/­display/­Hive/­Homedocs.cloudera.com/­documentation/­enterprise/­latest/­topics/­impala.html
DeveloperPivotal Software Inc.Apache Software Foundation infoinitially developed by FacebookCloudera
Initial release200520122013
Current release6.7.1, April 20203.1.3, April 20224.1.0, June 2022
License infoCommercial or Open SourceOpen Source infoApache 2.0Open Source infoApache Version 2Open Source infoApache Version 2
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageJavaC++
Server operating systemsLinuxAll OS with a Java VMLinux
Data schemeyesyesyes
Typing infopredefined data types such as float or dateyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.yes infosince Version 4.2no
Secondary indexesyesyesyes
SQL infoSupport of SQLyesSQL-like DML and DDL statementsSQL-like DML and DDL statements
APIs and other access methodsJDBC
ODBC
JDBC
ODBC
Thrift
JDBC
ODBC
Supported programming languagesC
Java
Perl
Python
R
C++
Java
PHP
Python
All languages supporting JDBC/ODBC
Server-side scripts infoStored proceduresyesyes infouser defined functions and integration of map-reduceyes infouser defined functions and integration of map-reduce
Triggersyesnono
Partitioning methods infoMethods for storing different data on different nodesShardingShardingSharding
Replication methods infoMethods for redundantly storing data on multiple nodesSource-replica replicationselectable replication factorselectable replication factor
MapReduce infoOffers an API for user-defined Map/Reduce methodsyesyes infoquery execution via MapReduceyes infoquery execution via MapReduce
Consistency concepts infoMethods to ensure consistency in a distributed systemImmediate ConsistencyEventual ConsistencyEventual Consistency
Foreign keys infoReferential integrityyesnono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of dataACIDnono
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nono
User concepts infoAccess controlfine grained access rights according to SQL-standardAccess rights for users, groups and rolesAccess rights for users, groups and roles infobased on Apache Sentry and Kerberos

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
GreenplumHiveImpala
DB-Engines blog posts

Why is Hadoop not listed in the DB-Engines Ranking?
13 May 2013, Paul Andlinger

show all

Recent citations in the news

Greenplum 6 ventures outside the analytic box
19 March 2019, ZDNet

EMC and Greenplum Dress Elephant for IT Parade
8 December 2011, WIRED

What Is Greenplum Database? All You Need To Know
1 November 2022, scalegrid.io

VMware Greenplum® 7 Beta Documentation
16 December 2022, docs.vmware.com

Define a Location To Connect to a Greenplum Database
28 February 2023, support.fivetran.com

provided by Google News

What is Big Data : How it works, Characteristics, Benefits and more
22 March 2023, ETCIO

What Is Apache Hive? (Definition, Benefits, Challenges)
19 December 2022, Built In

Iceberg Data Services Emerge from Tabular, Dremio
1 March 2023, Datanami

What Is a Data Engineer? Salary, Responsibilities, & Roadmap
7 March 2023, Unite.AI

Sqoop Import and Export Tutorial - A Beginner’s Guide
1 March 2023, hackernoon.com

provided by Google News

Pentaho adds Amazon Redshift, Cloudera Impala to stable of data sources
17 February 2015, SiliconANGLE News

Cloudera Boosts Hadoop App Development On Impala
10 November 2014, InformationWeek

Man Busts Out of Google, Rebuilds Top-Secret Query Machine
24 October 2012, WIRED

Apache Impala 4 Supports Operator Multi-Threading
29 July 2021, iProgrammer

Impala vs Hive: Difference between Sql on Hadoop components
2 February 2023, projectpro.io

provided by Google News

Job opportunities

Greenplum Database Administrator
Morgan Stanley, Alpharetta, GA

Data Engineer-297818
Grainger, Lake Forest, IL

Quality Assurance Testers
Focus IT Services, Niskayuna, NY

Jr. Big Data Engineer - 114065
i28 technologies corporation, Wellston, OH

GCP Druid Sr. Engineer #23-00028
Abode Techzone LLC, Alamo, TX

Azure DataLake Analyst
Infinity Quest, Schaumburg, IL

Data Visualization Analyst
Apogee Integration LLC, Chantilly, VA

Data Visualization Analyst
Apogee Integration, LLC, Chantilly, VA

SCON_ETL Developer
Accenture, St. Louis, MO

Planning Engineer II
Lumen, Remote

Data Engineer (Remote)
Analytica, Washington, DC

jobs by Indeed



Share this page

Featured Products

Cassandra Forward online event

Want to level up your Cassandra game?
If you missed the event or would like to re-watch a session, replays are available now. Watch now!

AllegroGraph logo

Graph Database Leader for AI Knowledge Graph Applications - The Most Secure Graph Database Available.
Free Download

MariaDB logo

SkySQL, the ultimate
MariaDB cloud, is here.

Get started with SkySQL today!

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Redis logo

The world’s most loved real‑time data platform.
Try free

Present your product here