DB-Engines: Knowledge Base of Relational and NoSQL Database Management Systems, provided by solid IT

System Properties Comparison Apache Druid vs. Databricks vs. Datastax Enterprise vs. Yanza

Editorial information provided by DB-Engines
Name: Apache Druid, Databricks, Datastax Enterprise, Yanza
Yanza seems to be discontinued. Therefore it is excluded from the DB-Engines Ranking.
Description
  Apache Druid: Open-source analytics data store designed for sub-second OLAP queries on high dimensionality and high cardinality data
  Databricks: The Databricks Lakehouse Platform combines elements of data lakes and data warehouses to provide a unified view onto structured and unstructured data. It is based on Apache Spark.
  Datastax Enterprise: DataStax Enterprise (DSE) is the always-on, scalable data platform built on Apache Cassandra and designed for hybrid Cloud. DSE integrates graph, search, analytics, administration, developer tooling, and monitoring into a unified platform.
  Yanza: Time Series DBMS for IoT Applications
Primary database model
  Apache Druid: Relational DBMS, Time Series DBMS
  Databricks: Document store, Relational DBMS
  Datastax Enterprise: Wide column store
  Yanza: Time Series DBMS
Secondary database models
  Datastax Enterprise: Document store, Graph DBMS, Spatial DBMS, Search engine, Vector DBMS
DB-Engines Ranking (measures the popularity of database management systems)
  Apache Druid: Score 3.34; Rank #88 Overall, #48 Relational DBMS, #7 Time Series DBMS
  Databricks: Score 78.61; Rank #15 Overall, #2 Document stores, #10 Relational DBMS
  Datastax Enterprise: Score 5.80; Rank #60 Overall, #4 Wide column stores
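Website
  Apache Druid: druid.apache.org
  Databricks: www.databricks.com
  Datastax Enterprise: www.datastax.com/products/datastax-enterprise
  Yanza: yanza.com
Technical documentation
  Apache Druid: druid.apache.org/docs/latest/design
  Databricks: docs.databricks.com
  Datastax Enterprise: docs.datastax.com
Developer
  Apache Druid: Apache Software Foundation and contributors
  Databricks: Databricks
  Datastax Enterprise: DataStax
  Yanza: Yanza
Initial release
  Apache Druid: 2012
  Databricks: 2013
  Datastax Enterprise: 2011
  Yanza: 2015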
Current release
  Apache Druid: 29.0.1, April 2024
  Datastax Enterprise: 6.8, April 2020
License (Commercial or Open Source)
  Apache Druid: Open Source (Apache license v2)
  Databricks: commercial
  Datastax Enterprise: commercial
  Yanza: commercial (free version available)
Cloud-based only (Only available as a cloud service)
  Apache Druid: no
  Databricks: yes
  Datastax Enterprise: no
  Yanza: no (but mainly used as a service provided by Yanza)
DBaaS offerings (Database as a Service; sponsored links)
  Datastax Astra DB: Astra DB simplifies cloud-native Cassandra application development for your apps, microservices and functions. Deploy in minutes on AWS, Google Cloud, Azure, and have it managed for you by the experts, with serverless, pay-as-you-go pricing.
Implementation language
  Apache Druid: Java
  Datastax Enterprise: Java
Server operating systems
  Apache Druid: Linux, OS X, Unix
  Databricks: hosted
  Datastax Enterprise: Linux, OS X
  Yanza: Windows
Data scheme
  Apache Druid: yes (schema-less columns are supported)
  Databricks: Flexible Schema (defined schema, partial schema, schema free)
  Datastax Enterprise: schema-free
  Yanza: schema-free
Typing (predefined data types such as float or date): yes, yes, no
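XML support (Some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT)
  Apache Druid: no
  Databricks: yes
  Datastax Enterprise: no
  Yanza: no
Secondary indexes
  Apache Druid: yes
  Databricks: yes
  Datastax Enterprise: yes
  Yanza: no
SQL (Support of SQL)
  Apache Druid: SQL for querying
  Databricks: with Databricks SQL
  Datastax Enterprise: SQL-like DML and DDL statements (CQL); Spark SQL
  Yanza: no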
APIs and other access methods
  Apache Druid: JDBC, RESTful HTTP/JSON API
  Databricks: JDBC, ODBC, RESTful HTTP API
  Datastax Enterprise: Proprietary protocol (CQL, Cassandra Query Language), TinkerPop Gremlin (with DSE Graph)
  Yanza: HTTP API
Supported programming languages
  Apache Druid: Clojure, JavaScript, PHP, Python, R, Ruby, Scala
  Databricks: Python, R, Scala
  Datastax Enterprise: C, C#, C++, Java, JavaScript (Node.js), PHP, Python, Ruby
  Yanza: any language that supports HTTP calls
Server-side scripts (Stored procedures)
  Apache Druid: no
  Databricks: user defined functions and aggregates
  Datastax Enterprise: no
  Yanza: no
Triggers: no, yes, yes (Timer and event based)
Partitioning methods (Methods for storing different data on different nodes)
  Apache Druid: Sharding (manual/auto, time-based)
  Datastax Enterprise: Sharding (no "single point of failure")
  Yanza: none
Replication methods (Methods for redundantly storing data on multiple nodes)
  Apache Druid: yes, via HDFS, S3 or other storage engines
  Databricks: yes
  Datastax Enterprise: configurable replication factor, datacenter aware, advanced replication for edge computing
  Yanza: none
MapReduce (Offers an API for user-defined Map/Reduce methods): no, yes, no
Consistency concepts (Methods to ensure consistency in a distributed system)
  Apache Druid: Immediate Consistency
  Databricks: Immediate Consistency
  Datastax Enterprise: Immediate Consistency, Tunable Consistency (consistency level can be individually decided with each write operation)
  Yanza: Immediate Consistency
Foreign keys (Referential integrity): no, no, no
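Transaction concepts (Support to ensure data integrity after non-atomic manipulations of data)
  Apache Druid: no
  Databricks: ACID
  Datastax Enterprise: no (Atomicity and isolation are supported for single operations)
  Yanza: no
Concurrency (Support for concurrent manipulation of data)
  Apache Druid: yes
  Databricks: yes
  Datastax Enterprise: yes
  Yanza: yes
Durability (Support for making data persistent)
  Apache Druid: yes
  Databricks: yes
  Datastax Enterprise: yes
  Yanza: yes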
In-memory capabilities (Is there an option to define some or all structures to be held in-memory only): no, no, yes
User concepts (Access control)
  Apache Druid: RBAC using LDAP or Druid internals for users and groups for read/write by datasource and system
  Datastax Enterprise: Access rights for users can be defined per object
  Yanza: no
More information provided by the system vendor
Apache Druid, Databricks, Datastax Enterprise, Yanza
Specific characteristics
  Supported database models: In addition to the Document store and Relational DBMS...
  DataStax Enterprise is scale-out data infrastructure for enterprises that need to...
Competitive advantages
  Supporting the following application requirements: Zero downtime - Built on Apache...
Typical application scenarios
  Applications that must be massively and linearly scalable with 100% uptime and able...
Key customers
  Capital One, Cisco, Comcast, eBay, McDonald's, Microsoft, Safeway, Sony, UBS, and...
Market metrics
  Among the Forbes 100 Most Innovative Companies, DataStax is trusted by 5 of the top...
Licensing and pricing models
  Annual subscription

More resources
Apache Druid, Databricks, Datastax Enterprise, Yanza
DB-Engines blog posts

PostgreSQL is the DBMS of the Year 2023
2 January 2024, Matthias Gelbmann, Paul Andlinger

Recent citations in the news

Apache Druid Wins Best Big Data Product in the 2023 BigDATAwire Readers' Choice Awards
26 January 2024, Datanami

'Lucifer' Botnet Turns Up the Heat on Apache Hadoop Servers
21 February 2024, Dark Reading

New DDoS malware Attacking Apache big-data stack, Hadoop, & Druid Servers
26 February 2024, GBHackers

Imply Data gives Apache Druid schema auto-discover capability
6 June 2023, SiliconANGLE News

Imply Announces Automatic Schema Discovery for Apache Druid, Reinforcing Druid's Leadership for Real-Time ...
6 June 2023, Business Wire

provided by Google News

Databricks is expanding the scope of its AI investments with second VC fund
21 May 2024, Fortune

5. Databricks
14 May 2024, CNBC

Microsoft Fabric adds real-time intelligence, workload development kit
21 May 2024, InfoWorld

Nvidia, Databricks Sued in Latest AI Copyright Class Actions
3 May 2024, Bloomberg Law

This Is the Platform Nancy Pelosi Used to Make Her Private Investment in Databricks
9 May 2024, Yahoo Finance

provided by Google News

DataStax previews new Hyper Converged Data Platform for enterprise AI
15 May 2024, VentureBeat

DataStax Launches New Hyper-Converged Data Platform Giving Enterprises the Complete Modern Data Center Suite ...
15 May 2024, Business Wire

DataStax Rolls Out Vector Search for Astra DB to Support Gen AI
19 July 2023, EnterpriseAI

DataStax announces vector search capabilities in its on-prem Apache Cassandra database
8 August 2023, SDTimes.com

DataStax and LlamaIndex Partner to Make Building RAG Applications Easier than Ever for GenAI Developers
20 February 2024, Business Wire

provided by Google News