DB-EnginesExtremeDB: the mission critical dbmsEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Cassandra vs. Impala vs. Riak KV

System Properties Comparison Cassandra vs. Impala vs. Riak KV

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameCassandra  Xexclude from comparisonImpala  Xexclude from comparisonRiak KV  Xexclude from comparison
DescriptionWide-column store based on ideas of BigTable and DynamoDB infoOptimized for write accessAnalytic DBMS for HadoopDistributed, fault tolerant key-value store
Primary database modelWide column storeRelational DBMSKey-value store infowith links between data sets and object tags for the creation of secondary indexes
Secondary database modelsDocument store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score119.11
Rank#11  Overall
#1  Wide column stores
Score17.55
Rank#40  Overall
#25  Relational DBMS
Score6.25
Rank#69  Overall
#10  Key-value stores
Websitecassandra.apache.orgwww.cloudera.com/­products/­open-source/­apache-hadoop/­impala.htmlriak.com/­products/­riak-kv
Technical documentationcassandra.apache.org/­doc/­latestdocs.cloudera.com/­documentation/­enterprise/­latest/­topics/­impala.html
DeveloperApache Software Foundation infoApache top level project, originally developped by FacebookClouderaBasho Technologies
Initial release200820132009
Current release4.0.6, August 20224.0.0, July 20212.1.0, April 2015
License infoCommercial or Open SourceOpen Source infoApache version 2Open Source infoApache Version 2Open Source infoApache version 2, commercial enterprise edition
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
  • Astra DB: Multi-cloud DBaaS built on Apache Cassandra.
  • Aiven for Apache Cassandra: Fully managed, open source NoSQL database specifically designed to be highly available, performant, and scalable.
Implementation languageJavaC++Erlang
Server operating systemsBSD
Linux
OS X
Windows
LinuxLinux
OS X
Data schemeschema-freeyesschema-free
Typing infopredefined data types such as float or dateyesyesno
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono
Secondary indexesrestricted infoonly equality queries, not always the best performing solutionyesrestricted
SQL infoSupport of SQLSQL-like SELECT, DML and DDL statements (CQL)SQL-like DML and DDL statementsno
APIs and other access methodsProprietary protocol infoCQL (Cassandra Query Language, an SQL-like language)
Thrift
JDBC
ODBC
HTTP API
Native Erlang Interface
Supported programming languagesC#
C++
Clojure
Erlang
Go
Haskell
Java
JavaScript infoNode.js
Perl
PHP
Python
Ruby
Scala
All languages supporting JDBC/ODBCC infounofficial client library
C#
C++ infounofficial client library
Clojure infounofficial client library
Dart infounofficial client library
Erlang
Go infounofficial client library
Groovy infounofficial client library
Haskell infounofficial client library
Java
JavaScript infounofficial client library
Lisp infounofficial client library
Perl infounofficial client library
PHP
Python
Ruby
Scala infounofficial client library
Smalltalk infounofficial client library
Server-side scripts infoStored proceduresnoyes infouser defined functions and integration of map-reduceJavaScript and Erlang
Triggersyesnoyes infopre-commit hooks and post-commit hooks
Partitioning methods infoMethods for storing different data on different nodesSharding infono "single point of failure"ShardingSharding infono "single point of failure"
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factor infoRepresentation of geographical distribution of servers is possibleselectable replication factorselectable replication factor
MapReduce infoOffers an API for user-defined Map/Reduce methodsyesyes infoquery execution via MapReduceyes
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency
Immediate Consistency infocan be individually decided for each write operation
Eventual ConsistencyEventual Consistency
Foreign keys infoReferential integritynonono infolinks between data sets can be stored
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datano infoAtomicity and isolation are supported for single operationsnono
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nono
User concepts infoAccess controlAccess rights for users can be defined per objectAccess rights for users, groups and roles infobased on Apache Sentry and Kerberosno
More information provided by the system vendor
CassandraImpalaRiak KV
Specific characteristicsApache Cassandra is the leading NoSQL, distributed database management system, well...
» more
Competitive advantagesNo single point of failure ensures 100% availability . Operational simplicity for...
» more
Typical application scenariosInternet of Things (IOT), fraud detection applications, recommendation engines, product...
» more
Key customersApple, Netflix, Uber, ING,, Intuit,Fidelity, NY Times, Outbrain, BazaarVoice, Best...
» more
Market metricsCassandra is used by 40% of the Fortune 100.
» more
Licensing and pricing modelsApache license  Pricing for commercial distributions provided by DataStax and available...
» more
News

Blockchain (Data) without the Pain
26 September 2022

Hacking your Emotions & Communicating in the Time of High Stakes with Jennifer Edwards
22 September 2022

Building Invincible Apps with Temporal & Astra DB
20 September 2022

A New Product Documentation Experience
14 September 2022

Knowing When to Take a Risk and When to Opt Out with Kelly Battles
7 September 2022

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesAiven for Apache Cassandra: A truly distributed database that can handle large volumes of writes.
» more

CData: Connect to Big Data & NoSQL through standard Drivers.
» more

Instaclustr: Hosted & Managed Apache Cassandra as a Service
» more

DataStax Cassandra: Zero touch operations, configuration and management with true autoscaling
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
CassandraImpalaRiak KV
DB-Engines blog posts

Cassandra keeps climbing the ranks of the DB-Engines Ranking
3 May 2016, Matthias Gelbmann

Oracle is the DBMS of the Year
5 January 2016, Paul Andlinger, Matthias Gelbmann

Winners, losers and an attractive newcomer in Novembers DB-Engines ranking
2 November 2015, Paul Andlinger

show all

Conferences and events

Cassandra Day London
London, UK, 11 October 2022

Recent citations in the news

Why Choose a NoSQL Database? There Are Many Great Reasons
23 September 2022, thenewstack.io

4 Common Questions We Hear about Apache Cassandra
14 September 2022, thenewstack.io

Major Database Security Threats & How You Can Prevent Them
26 September 2022, tripwire.com

Relational to NoSQL at Enterprise Scale: Lessons from Amazon
9 September 2022, thenewstack.io

DataStax Continues Strong Momentum in Q2 FY23
30 August 2022, Datanami

provided by Google News

Apache Impala 4 Supports Operator Multi-Threading
29 July 2021, iProgrammer

Pentaho adds Amazon Redshift, Cloudera Impala to stable of data sources
17 February 2015, SiliconANGLE News

Man Busts Out of Google, Rebuilds Top-Secret Query Machine
24 October 2012, WIRED

Cloudera's Analytic Database Enables Unrivaled Elastic
22 September 2016, GlobeNewswire

Unravel Data Adds Native Support for Impala and Kafka
29 June 2017, insideBIGDATA

provided by Google News

Is Riak A Good NoSQL Database Option?
1 July 2019, Analytics India Magazine

Basho, Maker of Riak NoSQL Database, Raises $25M
13 January 2015, Data Center Knowledge

Riak NoSQL Database: Use Cases and Best Practices
23 December 2011, InfoQ.com

Database Variants and Which One is Right for Your Enterprise
19 August 2022, Database Trends and Applications

Riak TS for time series analysis at scale
29 September 2016, Opensource.com

provided by Google News

Job opportunities

Database Administrator (Cassandra)
EasyPost, Remote

Database/Devops Engineer - Cassandra+ DB techs+ No SQL
Urpan Technologies, Austin, TX

Database Engineer - Cassandra
Rockstar Games New York & New England, Andover, MA

Senior Database Administrator (Cassandra)
EasyPost, Remote

Database Administrator
Verinovum, Inc., Remote

Planning Engineer II
Lumen, Remote

Sr. ETL Test Lead
Bank of America, Addison, TX

Business Intelligence Developer - Treasury and Liquidity
Barclays, Whippany, NJ

Software Engineer Specialist
FIS Global, Cincinnati, OH

Sr Planning Engineer
Lumen, Remote

Data Engineer II
GM Financial, Remote

Software Engineer II
Spireon, Remote

Software Engineer II
Spireon, Irving, TX

Software Developer
Soothsayer Analytics, Lansing, MI

Senior Data Engineer
GM Financial, Arlington, TX

jobs by Indeed



Share this page

Featured Products

MariaDB logo

SkySQL, the ultimate
MariaDB cloud, is here.

Get started with SkySQL today!

AllegroGraph logo

Graph Database Leader for AI Knowledge Graph Applications - The Most Secure Graph Database Available.
Free Download

The definitive guide for Cassandra

Imagine What You Could Do if Scalability Wasn‘t a Problem!
Download the Cassandra e-book for free!

Neo4j logo

Neo4j NODES 2022
Free online conference for developers and data scientists.
November 16/17 2022.
Register now for free!

Redis logo

The world’s most loved real‑time data platform.
Try free

Present your product here