

System Properties Comparison HBase vs. Kdb+ vs. Spark SQL vs. STSdb


Editorial information provided by DB-Engines
Name: HBase, Kdb+, Spark SQL, STSdb
Description:
  HBase: Wide-column store based on Apache Hadoop and on concepts of BigTable
  Kdb+: High performance Time Series DBMS
  Spark SQL: Spark SQL is a component on top of 'Spark Core' for structured data processing
  STSdb: Key-Value Store with special method for indexing (optimized for high performance using a special indexing method)
Primary database model:
  HBase: Wide column store
  Kdb+: Time Series DBMS
  Spark SQL: Relational DBMS
  STSdb: Key-value store
Secondary database models:
  Relational DBMS
DB-Engines Ranking (measures the popularity of database management systems):
  HBase: score 43.19; rank #26 overall, #2 among wide column stores
  Kdb+: score 8.98; rank #54 overall, #2 among Time Series DBMS
  Spark SQL: score 22.86; rank #35 overall, #21 among relational DBMS
  STSdb: score 0.01; rank #345 overall, #53 among key-value stores
Website:
  HBase: hbase.apache.org
  Kdb+: kx.com
  Spark SQL: spark.apache.org/sql
  STSdb: github.com/STSSoft/STSdb4
Technical documentation:
  HBase: hbase.apache.org
  Kdb+: code.kx.com
  Spark SQL: spark.apache.org/docs/latest/sql-programming-guide.html
Developer:
  HBase: Apache Software Foundation (Apache top-level project, originally developed by Powerset)
  Kdb+: Kx Systems, a division of First Derivatives plc
  Spark SQL: Apache Software Foundation
  STSdb: STS Soft SC
Initial release:
  HBase: 2008
  Kdb+: 2000 (kdb was released 2000, kdb+ in 2003)
  Spark SQL: 2014
  STSdb: 2011
Current release:
  HBase: 2.3.4, January 2021
  Kdb+: 3.6, May 2018
  Spark SQL: 3.2.0, October 2021
  STSdb: 4.0.8, September 2015
License (commercial or open source):
  HBase: Open Source (Apache version 2)
  Kdb+: commercial (free 32-bit version)
  Spark SQL: Open Source (Apache 2.0)
  STSdb: Open Source (GPLv2, commercial license available)
Cloud-based only (only available as a cloud service):
  HBase: no
  Kdb+: no
  Spark SQL: no
  STSdb: no
Implementation language:
  HBase: Java
  Kdb+: q
  Spark SQL: Scala
  STSdb: C#
Server operating systems:
  HBase: Linux, Unix, Windows (using Cygwin)
  Kdb+: Linux, OS X, Solaris, Windows
  Spark SQL: Linux, OS X, Windows
  STSdb: Windows
Data scheme:
  HBase: schema-free, schema definition possible
  Kdb+: yes
  Spark SQL: yes
  STSdb: yes
Typing (predefined data types such as float or date):
  HBase: options to bring your own types, AVRO
  Kdb+: yes
  Spark SQL: yes
  STSdb: yes (primitive types and user defined types (classes))
XML support (some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT):
  no; yes; no
Secondary indexes:
  HBase: no
  Kdb+: yes (table attribute 'grouped')
  Spark SQL: no
  STSdb: no
SQL (support of SQL):
  HBase: no
  Kdb+: SQL-like query language (q)
  Spark SQL: SQL-like DML and DDL statements
  STSdb: no
APIs and other access methods:
  HBase: Java API, RESTful HTTP API, Thrift
  Kdb+: HTTP API, JDBC, Jupyter, Kafka, ODBC, WebSocket
  Spark SQL: JDBC, ODBC
  STSdb: .NET Client API
Supported programming languages:
  HBase: C, C#, C++, Groovy, Java, PHP, Python, Scala
  Kdb+: C, C#, C++, Go, J, Java, JavaScript, Lua, MatLab, Perl, PHP, Python, R, Scala
  Spark SQL: Java, Python, R, Scala
  STSdb: C#, Java
Server-side scripts (stored procedures):
  HBase: yes (Coprocessors in Java)
  Kdb+: user defined functions
  Spark SQL: no
  STSdb: no
Triggers:
  HBase: yes
  Kdb+: yes (with views)
  Spark SQL: no
  STSdb: no
Partitioning methods (methods for storing different data on different nodes):
  HBase: Sharding
  Kdb+: horizontal partitioning
  Spark SQL: yes, utilizing Spark Core
  STSdb: none
Replication methods (methods for redundantly storing data on multiple nodes):
  HBase: Multi-source replication, Source-replica replication
  Kdb+: Source-replica replication
  Spark SQL: none
  STSdb: none
MapReduce (offers an API for user-defined Map/Reduce methods):
  yes; no (similar paradigm used for internal processing); no
Consistency concepts (methods to ensure consistency in a distributed system):
  Immediate Consistency or Eventual Consistency; Immediate Consistency
Foreign keys (referential integrity):
  HBase: no
  Kdb+: yes
  Spark SQL: no
  STSdb: no
Transaction concepts (support to ensure data integrity after non-atomic manipulations of data):
  HBase: Single row ACID (across millions of columns)
  Kdb+: no
  Spark SQL: no
  STSdb: no
Concurrency (support for concurrent manipulation of data):
  HBase: yes
  Kdb+: yes
  Spark SQL: yes
  STSdb: yes
Durability (support for making data persistent):
  HBase: yes
  Kdb+: yes
  Spark SQL: yes
  STSdb: yes
In-memory capabilities (is there an option to define some or all structures to be held in-memory only):
  yes; yes; no
User concepts (access control):
  HBase: Access Control Lists (ACL) for RBAC, integration with Apache Ranger for RBAC & ABAC
  Kdb+: rights management via user accounts
  Spark SQL: no
  STSdb: no
More information provided by the system vendor
HBase, Kdb+, Spark SQL, STSdb
Specific characteristics: Apache HBase is the leading NoSQL, distributed database management system, well suited...
Competitive advantages: No single point of failure ensures very high availability with multiple customers...
Typical application scenarios: Internet of Things (IOT), fraud detection applications, recommendation engines, product...
Key customers: Apple, Salesforce, Cerner, Allegis Group, Bloomberg, Airtel, Thomson Reuters, Dish,...
Market metrics: #1 largest NoSQL database by revenue
Licensing and pricing models: Apache license. Pricing for commercial distribution provided by Cloudera and available...


More resources
HBase, Kdb+, Spark SQL, STSdb
DB-Engines blog posts

Cloudera's HBase PaaS offering now supports Complex Transactions
11 August 2021,  Krishna Maheshwari (sponsor) 

Why is Hadoop not listed in the DB-Engines Ranking?
13 May 2013, Paul Andlinger

Time Series DBMS are the database category with the fastest increase in popularity
4 July 2016, Matthias Gelbmann

Recent citations in the news

Review: HBase is massively scalable -- and hugely complex
31 March 2014, InfoWorld

HBase: The database big data left behind
6 May 2016, InfoWorld

The Apache Software Foundation Announces the 10th Anniversary of Apache® HBase
13 May 2020, GlobeNewswire

Big Data Analytics Market Worth $638.66 Billion, Globally, by 2028 at 15.3% CAGR - Exclusive Report by The Insight Partners
25 May 2022, PR Newswire

Overhauling Apache Kylin for the cloud
18 November 2021, InfoWorld

provided by Google News

KX Welcomes New Languages to Speedy Analytics Database
2 November 2021, Datanami

How time series platforms unlock the potential of IoT: Join theCUBE for May 17 event
10 May 2022, SiliconANGLE News

The coding language you can learn in months for top finance jobs
26 July 2021, eFinancialCareers

Treliant and KX Build Alliance Combining Advanced Real-Time Analytics Services and Consultancy - Benzinga
10 May 2022, Benzinga

Storage news ticker – December 9 – Blocks and Files
9 December 2021, Blocks and Files

Apache Spark vs Apache Hadoop: Compare data science tools
26 May 2022, TechRepublic

Top online courses to learn Apache Spark
5 May 2022, Analytics India Magazine

Inside the Modern Data Stack
23 May 2022, Datanami

Software Architectural Patterns in Data Engineering | by Kunal Sharma | Expedia Group Technology | May, 2022
24 May 2022, Medium

Spark Gets Closer Hooks to Pandas, SQL with Version 3.2
26 October 2021, Datanami
