DB-EnginesextremeDB - solve IoT connectivity disruptionsEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by Redgate Software

DBMS > Apache HBase vs. Apache Spark (SQL) vs. ClickHouse

System Properties Comparison Apache HBase vs. Apache Spark (SQL) vs. ClickHouse

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache HBase  Xexclude from comparisonApache Spark (SQL)  Xexclude from comparisonClickHouse  Xexclude from comparison
DescriptionWide-column store based on Apache Hadoop and on concepts of BigTableApache Spark SQL is a component on top of 'Spark Core' for structured data processingA high-performance, column-oriented SQL DBMS for online analytical processing (OLAP) that uses all available system resources to their full potential to process each analytical query as fast as possible. It is available as both an open-source software and a cloud offering.
Primary database modelWide column storeRelational DBMSRelational DBMS
Secondary database modelsTime Series DBMS
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score22.62
Rank#27  Overall
#3  Wide column stores
Score21.62
Rank#29  Overall
#18  Relational DBMS
Score18.77
Rank#31  Overall
#19  Relational DBMS
Websitehbase.apache.orgspark.apache.org/­sqlclickhouse.com
Technical documentationhbase.apache.org/­book.htmlspark.apache.org/­docs/­latest/­sql-programming-guide.htmlclickhouse.com/­docs
DeveloperApache Software Foundation infoApache top-level project, originally developed by PowersetApache Software FoundationClickhouse Inc.
Initial release200820142016
Current release2.3.4, January 20213.5.0 ( 2.13), September 2023v24.6.2.17-stable, July 2024
License infoCommercial or Open SourceOpen Source infoApache version 2Open Source infoApache 2.0Open Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageJavaScalaC++
Server operating systemsLinux
Unix
Windows infousing Cygwin
Linux
OS X
Windows
FreeBSD
Linux
macOS
Data schemeschema-free, schema definition possibleyesyes
Typing infopredefined data types such as float or dateoptions to bring your own types, AVROyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono
Secondary indexesnonoyes
SQL infoSupport of SQLnoSQL-like DML and DDL statementsClose to ANSI SQL (SQL/JSON + extensions)
APIs and other access methodsJava API
RESTful HTTP API
Thrift
JDBC
ODBC
gRPC
HTTP REST
JDBC
MySQL wire protocol
ODBC
PostgreSQL wire protocol
Proprietary protocol
Supported programming languagesC
C#
C++
Groovy
Java
PHP
Python
Scala
Java
Python
R
Scala
C# info3rd party library
C++
Elixir info3rd party library
Go info3rd party library
Java info3rd party library
JavaScript (Node.js) info3rd party library
Kotlin info3rd party library
Nim info3rd party library
Perl info3rd party library
PHP info3rd party library
Python info3rd party library
R info3rd party library
Ruby info3rd party library
Rust
Scala info3rd party library
Server-side scripts infoStored proceduresyes infoCoprocessors in Javanoyes
Triggersyesnono
Partitioning methods infoMethods for storing different data on different nodesShardingyes, utilizing Spark Corekey based and custom
Replication methods infoMethods for redundantly storing data on multiple nodesMulti-source replication
Source-replica replication
noneAsynchronous and synchronous physical replication; geographically distributed replicas; support for object storages.
MapReduce infoOffers an API for user-defined Map/Reduce methodsyesno
Consistency concepts infoMethods to ensure consistency in a distributed systemImmediate Consistency or Eventual ConsistencyImmediate Consistency
Foreign keys infoReferential integritynonono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of dataSingle row ACID (across millions of columns)nono
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.yesnoyes
User concepts infoAccess controlAccess Control Lists (ACL) for RBAC, integration with Apache Ranger for RBAC & ABACnoAccess rights for users and roles. Column and row based policies. Quotas and resource limits. Pluggable authentication with LDAP and Kerberos. Password based, X.509 certificate, and SSH key authentication.

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache HBaseApache Spark (SQL)ClickHouse
DB-Engines blog posts

DB-Engines shares Q1 2025 database industry rankings and top climbers: Snowflake and PostgreSQL trending
1 May 2025, DB-Engines

show all

Recent citations in the news

Apache HBase online migration to Amazon EMR | Amazon Web Services
23 October 2024, Amazon Web Services

Implement Amazon EMR HBase Graceful Scaling | Amazon Web Services
18 March 2025, Amazon Web Services

How to optimize Hbase for the Cloud [Tutorial]
22 October 2024, Packt

Cost and Performance Advantages of Replacing HBase with TencentDB TDSQL TDStore Engine in Historical Data Scenarios
22 April 2025, thewire.in

Cost and Performance Advantages of Replacing HBase…
22 April 2025, aapnews.aap.com.au

provided by Google News

Introducing AWS Glue 5.0 for Apache Spark
4 December 2024, Amazon Web Services (AWS)

Scala vs Python for Apache Spark: An In-depth Comparison With Use Cases For Each
21 April 2025, Simplilearn.com

How to run Pandas code on Spark
25 January 2025, Theodo Data & AI

The 6 Best Apache Spark Courses on Udemy to Consider for 2025
1 January 2025, solutionsreview.com

18 top big data tools and technologies to know about in 2025
22 January 2025, TechTarget

provided by Google News

Snowflake Challenger ClickHouse Targets $6 Billion Valuation
9 May 2025, The Information

Wiz Research Uncovers Exposed DeepSeek Database Leaking Sensitive Information, Including Chat History
29 January 2025, wiz.io

Bcachefs, Btrfs, EXT4, F2FS & XFS File-System Performance On Linux 6.15
10 May 2025, Phoronix

DeepSeek Exposed Database Leaks Sensitive Data
30 January 2025, Infosecurity Magazine

DeepSeek AI Database Exposed: Over 1 Million Log Lines, Secret Keys Leaked
30 January 2025, The Hacker News

provided by Google News



Share this page

Featured Products

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

SingleStore logo

The data platform to build your intelligent applications.
Try it free.

Present your product here