DB-EnginesExtremeDB for everyone with an RTOSEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Hive vs. Kdb vs. Oracle Berkeley DB vs. Spark SQL

System Properties Comparison Hive vs. Kdb vs. Oracle Berkeley DB vs. Spark SQL

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameHive  Xexclude from comparisonKdb  Xexclude from comparisonOracle Berkeley DB  Xexclude from comparisonSpark SQL  Xexclude from comparison
Descriptiondata warehouse software for querying and managing large distributed datasets, built on HadoopHigh performance Time Series DBMSWidely used in-process key-value storeSpark SQL is a component on top of 'Spark Core' for structured data processing
Primary database modelRelational DBMSTime Series DBMS
Vector DBMS
Key-value store infosupports sorted and unsorted key sets
Native XML DBMS infoin the Oracle Berkeley DB XML version
Relational DBMS
Secondary database modelsRelational DBMS
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score59.76
Rank#18  Overall
#12  Relational DBMS
Score7.71
Rank#49  Overall
#2  Time Series DBMS
#1  Vector DBMS
Score2.01
Rank#126  Overall
#21  Key-value stores
#3  Native XML DBMS
Score18.04
Rank#33  Overall
#20  Relational DBMS
Websitehive.apache.orgkx.comwww.oracle.com/­database/­technologies/­related/­berkeleydb.htmlspark.apache.org/­sql
Technical documentationcwiki.apache.org/­confluence/­display/­Hive/­Homecode.kx.comdocs.oracle.com/­cd/­E17076_05/­html/­index.htmlspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperApache Software Foundation infoinitially developed by FacebookKx Systems, a division of First Derivatives plcOracle infooriginally developed by Sleepycat, which was acquired by OracleApache Software Foundation
Initial release20122000 infokdb was released 2000, kdb+ in 200319942014
Current release3.1.3, April 20223.6, May 201818.1.40, May 20203.5.0 ( 2.13), September 2023
License infoCommercial or Open SourceOpen Source infoApache Version 2commercial infofree 32-bit versionOpen Source infocommercial license availableOpen Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenononono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageJavaqC, Java, C++ (depending on the Berkeley DB edition)Scala
Server operating systemsAll OS with a Java VMLinux
OS X
Solaris
Windows
AIX
Android
FreeBSD
iOS
Linux
OS X
Solaris
VxWorks
Windows
Linux
OS X
Windows
Data schemeyesyesschema-freeyes
Typing infopredefined data types such as float or dateyesyesnoyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.yesyes infoonly with the Berkeley DB XML editionno
Secondary indexesyesyes infotable attribute 'grouped'yesno
SQL infoSupport of SQLSQL-like DML and DDL statementsSQL-like query language (q)yes infoSQL interfaced based on SQLite is availableSQL-like DML and DDL statements
APIs and other access methodsJDBC
ODBC
Thrift
HTTP API
JDBC
Jupyter
Kafka
ODBC
WebSocket
JDBC
ODBC
Supported programming languagesC++
Java
PHP
Python
C
C#
C++
Go
J
Java
JavaScript
Lua
MatLab
Perl
PHP
Python
R
Scala
.Net infoFigaro is a .Net framework assembly that extends Berkeley DB XML into an embeddable database engine for .NET
others infoThird-party libraries to manipulate Berkeley DB files are available for many languages
C
C#
C++
Java
JavaScript (Node.js) info3rd party binding
Perl
Python
Tcl
Java
Python
R
Scala
Server-side scripts infoStored proceduresyes infouser defined functions and integration of map-reduceuser defined functionsnono
Triggersnoyes infowith viewsyes infoonly for the SQL APIno
Partitioning methods infoMethods for storing different data on different nodesShardinghorizontal partitioningnoneyes, utilizing Spark Core
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factorSource-replica replicationSource-replica replicationnone
MapReduce infoOffers an API for user-defined Map/Reduce methodsyes infoquery execution via MapReduceno infosimilar paradigm used for internal processingno
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual ConsistencyImmediate Consistency
Foreign keys infoReferential integritynoyesnono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonoACIDno
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.yesyesno
User concepts infoAccess controlAccess rights for users, groups and rolesrights management via user accountsnono
More information provided by the system vendor
HiveKdbOracle Berkeley DBSpark SQL
Specific characteristicsIntegrated columnar database & programming system for streaming, real time and historical...
» more
Competitive advantagesprovides seamless scalability; runs on industry standard server platforms; is top-ranked...
» more
Typical application scenariostick database streaming sensor data massive intelligence applications oil and gas...
» more
Key customersGoldman Sachs Morgan Stanley Merrill Lynch J.P. Morgan Deutsche Bank IEX Securities...
» more
Market metricskdb+ performance and reliability proven by our customers in critical infrastructure...
» more
Licensing and pricing modelsupon request
» more

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
HiveKdbOracle Berkeley DBSpark SQL
DB-Engines blog posts

Why is Hadoop not listed in the DB-Engines Ranking?
13 May 2013, Paul Andlinger

show all

Recent citations in the news

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore ...
10 June 2024, AWS Blog

Apache Software Foundation Announces Apache Hive 4.0
30 April 2024, Datanami

18 Top Big Data Tools and Technologies to Know About in 2024
24 January 2024, TechTarget

ASF Unveils the Next Evolution of Big Data Processing With the Launch of Hive 4.0
2 May 2024, Datanami

Run Apache Hive workloads using Spark SQL with Amazon EMR on EKS | Amazon Web Services
18 October 2023, AWS Blog

provided by Google News

Turbocharging the Engine: KX Unleashes AI-First Transformation with kdb+
28 February 2024, Business Wire

McLaren Applied and KX partner to enhance ATLAS software analytics capabilities
9 August 2023, Professional Motorsport World

Introducing Amazon FinSpace with Managed kdb Insights, a fully managed analytics engine, commonly used by capital ...
18 May 2023, AWS Blog

KX ANNOUNCES KDB INSIGHTS AS FULLY MANAGED SERVICE ON AMAZON FINSPACE
18 May 2023, PR Newswire

KX Brings the Power and Performance of kdb+ to Python Developers with PyKX
7 June 2023, Datanami

provided by Google News

ACM recognizes far-reaching technical achievements with special awards
26 May 2021, EurekAlert

Margo Seltzer Named ACM Athena Lecturer for Technical and Mentoring Contributions
26 April 2023, Datanami

Oracle buys Sleepycat Software
14 February 2006, MarketWatch

Margo I. Seltzer | Berkman Klein Center
18 August 2020, Berkman Klein Center

How to store financial market data for backtesting
26 January 2019, Towards Data Science

provided by Google News

Use Amazon Athena with Spark SQL for your open-source transactional table formats | Amazon Web Services
24 January 2024, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
1 April 2024, Simplilearn

Performance Insights from Sigma Rule Detections in Spark Streaming
1 June 2024, Towards Data Science

Simba Technologies(R) Introduces New, Powerful JDBC Driver With SQL Connector for Apache Spark(TM)
17 March 2024, Yahoo Singapore News

provided by Google News



Share this page

Featured Products

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Present your product here