DB-Engines: Knowledge Base of Relational and NoSQL Database Management Systems, provided by Redgate Software


System Properties Comparison Apache Cassandra vs. Apache Impala vs. Apache Kylin vs. HarperDB


Editorial information provided by DB-Engines
Name: Apache Cassandra; Apache Impala; Apache Kylin; HarperDB
Description: Cassandra: Wide-column store based on ideas of BigTable and DynamoDB (optimized for write access); Impala: Analytic DBMS for Hadoop; Kylin: A distributed analytics engine for big data, providing a SQL interface and multi-dimensional analysis (OLAP) and leveraging the Hadoop stack; HarperDB: HarperDB takes the "stack" out of "tech stack" by combining an ultra-fast document-style data store, in-memory cache, real-time message broker, and your application components into a single distributed technology.
Primary database model: Cassandra: Wide column store; Impala: Relational DBMS; Kylin: Relational DBMS; HarperDB: Document store
Secondary database models: Cassandra: Vector DBMS (starting with release V5); Impala: Document store
DB-Engines Ranking (measures the popularity of database management systems):
  Cassandra: score 98.94; rank #11 overall, #1 among wide column stores
  Impala: score 10.63; rank #40 overall, #24 among relational DBMS
  Kylin: score 1.14; rank #171 overall, #77 among relational DBMS
  HarperDB: score 0.55; rank #243 overall, #38 among document stores
Website: Cassandra: cassandra.apache.org; Impala: impala.apache.org; Kylin: kylin.apache.org; HarperDB: www.harperdb.io
Technical documentation: Cassandra: cassandra.apache.org/doc/latest; Impala: impala.apache.org/impala-docs.html; Kylin: kylin.apache.org/docs; HarperDB: docs.harperdb.io/docs
Developer: Cassandra: Apache Software Foundation (Apache top-level project, originally developed by Facebook); Impala: Apache Software Foundation (Apache top-level project, originally developed by Cloudera); Kylin: Apache Software Foundation, originally contributed by eBay Inc; HarperDB: HarperDB
Initial release: Cassandra: 2008; Impala: 2013; Kylin: 2015; HarperDB: 2017
Current release: Cassandra: 5.0-rc1, July 2024; Impala: 4.1.0, June 2022; Kylin: 3.1.0, July 2020; HarperDB: 3.1, August 2021
License (commercial or open source): Cassandra: Open Source (Apache Version 2); Impala: Open Source (Apache Version 2); Kylin: Open Source (Apache Version 2.0); HarperDB: commercial (free community edition available)
Cloud-based only (only available as a cloud service): Cassandra: no; Impala: no; Kylin: no; HarperDB: no
DBaaS offerings (Database as a Service; sponsored links): Cassandra: Astra DB, a multi-cloud DBaaS built on Apache Cassandra
Implementation language: Cassandra: Java; Impala: C++; Kylin: Java; HarperDB: Node.js
Server operating systems: Cassandra: BSD, Linux, OS X, Windows; Impala: Linux; Kylin: Linux; HarperDB: Linux, OS X
Data scheme: Cassandra: schema-free; Impala: yes; Kylin: yes; HarperDB: dynamic schema
Typing (predefined data types such as float or date): Cassandra: yes; Impala: yes; Kylin: yes; HarperDB: yes (JSON data types)
XML support (some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT): Cassandra: no; Impala: no; Kylin: no; HarperDB: no
Secondary indexes: Cassandra: restricted (only equality queries, not always the best performing solution); Impala: yes; Kylin: yes; HarperDB: yes
SQL (support of SQL): Cassandra: SQL-like SELECT, DML and DDL statements (CQL); Impala: SQL-like DML and DDL statements; Kylin: ANSI SQL for queries (using Apache Calcite); HarperDB: SQL-like data manipulation statements
APIs and other access methods: Cassandra: Proprietary protocol (CQL, the Cassandra Query Language, an SQL-like language), Thrift; Impala: JDBC, ODBC; Kylin: JDBC, ODBC, RESTful HTTP API; HarperDB: JDBC, ODBC, React Hooks, RESTful HTTP/JSON API, WebSocket
Supported programming languages: Cassandra: C#, C++, Clojure, Erlang, Go, Haskell, Java, JavaScript (Node.js), Perl, PHP, Python, Ruby, Scala; Impala: All languages supporting JDBC/ODBC; HarperDB: .Net, C, C#, C++, ColdFusion, D, Dart, Delphi, Erlang, Go, Haskell, Java, JavaScript (Node.js), Lisp, MatLab, Objective C, Perl, PHP, PowerShell, Prolog, Python, R, Ruby, Rust, Scala, Swift
Server-side scripts (stored procedures): Cassandra: no; Impala: yes (user defined functions and integration of map-reduce); HarperDB: Custom Functions (since release 3.1)
Triggers: Cassandra: yes; Impala: no; HarperDB: no
Partitioning methods (methods for storing different data on different nodes): Cassandra: Sharding (no "single point of failure"); Impala: Sharding; HarperDB: A table resides as a whole on one (or more) nodes in a cluster
Replication methods (methods for redundantly storing data on multiple nodes): Cassandra: selectable replication factor (representation of geographical distribution of servers is possible); Impala: selectable replication factor; HarperDB: yes (the nodes on which a table resides can be defined)
MapReduce (offers an API for user-defined Map/Reduce methods): Cassandra: yes; Impala: yes (query execution via MapReduce); Kylin: yes; HarperDB: no
Consistency concepts (methods to ensure consistency in a distributed system): Cassandra: Eventual Consistency, Immediate Consistency (can be individually decided for each write operation); Impala: Eventual Consistency; Kylin: Immediate Consistency; HarperDB: Immediate Consistency
Foreign keys (referential integrity): Cassandra: no; Impala: no; HarperDB: no
Transaction concepts (support to ensure data integrity after non-atomic manipulations of data): Cassandra: no (atomicity and isolation are supported for single operations); Impala: no; HarperDB: Atomic execution of specific operations
Concurrency (support for concurrent manipulation of data): Cassandra: yes; Impala: yes; Kylin: yes; HarperDB: yes
Durability (support for making data persistent): Cassandra: yes; Impala: yes; Kylin: yes; HarperDB: yes, using LMDB
In-memory capabilities (is there an option to define some or all structures to be held in-memory only): Cassandra: no; Impala: no; Kylin: no; HarperDB: yes
User concepts (access control): Cassandra: Access rights for users can be defined per object; Impala: Access rights for users, groups and roles (based on Apache Sentry and Kerberos); HarperDB: Access rights for users and roles
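
The consistency entry for Cassandra says that eventual or immediate consistency can be chosen per operation. A common rule of thumb for replicated stores of this kind is that a read is guaranteed to see the latest acknowledged write when the number of replicas written plus the number of replicas read exceeds the replication factor. The sketch below only illustrates that arithmetic. The function names are hypothetical and are not part of Cassandra or any driver, and the rule is a simplification that ignores failure handling and repair.

```python
def quorum(replication_factor: int) -> int:
    """Smallest majority of replicas for a given replication factor."""
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    return replication_factor // 2 + 1


def read_sees_latest_write(replication_factor: int,
                           write_replicas: int,
                           read_replicas: int) -> bool:
    """Return True when every read replica set must overlap every write set.

    Hypothetical helper: applies the rule W + R > RF, where W is the number
    of replicas that acknowledge a write and R the number consulted by a read.
    """
    for count in (write_replicas, read_replicas):
        if not 1 <= count <= replication_factor:
            raise ValueError("replica counts must be between 1 and the replication factor")
    return write_replicas + read_replicas > replication_factor


if __name__ == "__main__":
    rf = 3
    # Quorum writes and quorum reads overlap in at least one replica.
    print(read_sees_latest_write(rf, quorum(rf), quorum(rf)))
    # Single-replica writes and reads may miss each other (eventual consistency).
    print(read_sees_latest_write(rf, 1, 1))
```

With a replication factor of 3, writing to and reading from a majority (2 replicas each) satisfies the rule, while writing to and reading from a single replica does not.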
More information provided by the system vendor
Apache Cassandra, Apache Impala, Apache Kylin, HarperDB
Specific characteristics: Apache Cassandra is the leading NoSQL, distributed database management system, well...
Competitive advantages: No single point of failure ensures 100% availability. Operational simplicity for...
Typical application scenarios: Internet of Things (IoT), fraud detection applications, recommendation engines, product...
Key customers: Apple, Netflix, Uber, ING, Intuit, Fidelity, NY Times, Outbrain, BazaarVoice, Best...
Market metrics: Apache Cassandra is used by 40% of the Fortune 100.
Licensing and pricing models: Apache license. Pricing for commercial distributions provided by DataStax and available...
News

How to Create Vector Embeddings in Node.js
25 September 2024

Glean and DataStax Partner to Help Developers Harness Powerful Enterprise Search
23 September 2024

DataStax Named a Leader in The Forrester Wave™: Vector Databases, Q3 2024
23 September 2024

How to Build a Notion Clone with Astra DB and Mongoose
20 September 2024

How to Monitor DataStax-Powered RAG Applications with Fiddler
19 September 2024



More resources
Apache Cassandra, Apache Impala, Apache Kylin, HarperDB
Recent citations in the news

Apache Cassandra:

New Apache Cassandra 5.0 gives open source NoSQL database a scalability and performance boost
6 September 2024, VentureBeat

Apache Cassandra introduces version 5.0 of open-source NoSQL database
9 September 2024, Techzine Europe

Migrate Cassandra workloads to Amazon Keyspaces using CQLReplicator
4 September 2024, AWS Blog

Cassandra redesigns indexing, storage management for 5.0 release
10 September 2024, The Register

How Uber ensures Apache Cassandra®’s tolerance for single-zone failure
20 June 2024, Uber

provided by Google News

Apache Impala:

Apache Impala 4 Supports Operator Multi-Threading
29 July 2021, iProgrammer

Apache Impala becomes Top-Level Project
28 November 2017, SDTimes.com

Cloudera Bringing Impala to AWS Cloud
28 November 2017, Datanami

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop
12 March 2017, Uber

Updates & Upserts in Hadoop Ecosystem with Apache Kudu
27 October 2017, KDnuggets

provided by Google News

Apache Kylin:

Overhauling Apache Kylin for the cloud
18 November 2021, InfoWorld

Migrating from ClickHouse to Apache Doris: Boosting OLAP Performance
9 October 2023, hackernoon.com

eBay’s Kylin Becomes a Top-Level Apache Open Source Project
9 December 2015, eBay Inc.

Distributed OLAPer Kyligence accelerates core engine, adds real-time data support
10 August 2021, Blocks & Files

How Kyligence Cloud uses Amazon EMR Serverless to simplify OLAP
9 November 2022, AWS Blog

provided by Google News

HarperDB:

HarperDB Attracts Significant Growth Investment from Serent Capital
12 June 2024, Business Wire

Defense Innovation Unit Issues Success Memo to HarperDB for its Common Operation Database Prototype
11 September 2024, PR Newswire

DoD Successfully Deploys Commercial AI Solutions for COD Project
10 September 2024, Defense Innovation Unit

Serent Capital invests in software firm HarperDB
12 June 2024, PE Hub

HarperDB Receives Growth Investment from Serent Capital
12 June 2024, FinSMEs

provided by Google News


