DBMS > Apache Druid vs. MongoDB vs. Spark SQL
System Properties Comparison Apache Druid vs. MongoDB vs. Spark SQL
Please select another system to include it in the comparison.
|Editorial information provided by DB-Engines|
|Name||Apache Druid Xexclude from comparison||MongoDB Xexclude from comparison||Spark SQL Xexclude from comparison|
|Description||Open-source analytics data store designed for sub-second OLAP queries on high dimensionality and high cardinality data||One of the most popular document stores available both as a fully managed cloud service and for deployment on self-managed infrastructure||Spark SQL is a component on top of 'Spark Core' for structured data processing|
|Primary database model||Relational DBMS|
Time Series DBMS
|Document store||Relational DBMS|
|Secondary database models||Spatial DBMS|
Search engine integrated Lucene index, currently in MongoDB Atlas only.
Time Series DBMS Time Series Collections introduced in Release 5.0
|Developer||Apache Software Foundation and contributors||MongoDB, Inc||Apache Software Foundation|
|Current release||25.0.0, January 2023||6.0.1, August 2022||3.3.0 ( 2.13), June 2022|
|License Commercial or Open Source||Open Source Apache license v2||Open Source MongoDB Inc.'s Server Side Public License v1. Prior versions were published under GNU AGPL v3.0. Commercial licenses are also available.||Open Source Apache 2.0|
|Cloud-based only Only available as a cloud service||no||no MongoDB available as DBaaS (MongoDB Atlas)||no|
|DBaaS offerings (sponsored links) Database as a Service|
Providers of DBaaS offerings, please contact us to be listed.
|ScaleGrid for MongoDB Database: Fully managed hosting for MongoDB Database on a wide variety of cloud providers and On-Premises. Automate your management, scaling and backups through one centralized platform.|
|Server operating systems||Linux|
|Data scheme||yes schema-less columns are supported||schema-free Although schema-free, documents of the same collection often follow the same structure. Optionally impose all or part of a schema by defining a JSON schema.||yes|
|Typing predefined data types such as float or date||yes||yes string, integer, double, decimal, boolean, date, object_id, geospatial||yes|
|XML support Some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.||no||no|
|SQL Support of SQL||SQL for querying||Read-only SQL queries via the MongoDB Connector for BI||SQL-like DML and DDL statements|
|APIs and other access methods||JDBC|
RESTful HTTP/JSON API
|proprietary protocol using JSON||JDBC|
|Supported programming languages||Clojure|
|Actionscript unofficial driver|
Clojure unofficial driver
ColdFusion unofficial driver
D unofficial driver
Dart unofficial driver
Delphi unofficial driver
Groovy unofficial driver
Lisp unofficial driver
Lua unofficial driver
MatLab unofficial driver
PowerShell unofficial driver
Prolog unofficial driver
R unofficial driver
Smalltalk unofficial driver
|Triggers||no||yes in MongoDB Atlas only||no|
|Partitioning methods Methods for storing different data on different nodes||Sharding manual/auto, time-based||Sharding Partitioned by hashed, ranged, or zoned sharding keys. Live resharding allows users to change their shard keys as an online operation with zero downtime.||yes, utilizing Spark Core|
|Replication methods Methods for redundantly storing data on multiple nodes||yes, via HDFS, S3 or other storage engines||Multi-Source deployments with MongoDB Atlas Global Clusters|
|MapReduce Offers an API for user-defined Map/Reduce methods||no||yes|
|Consistency concepts Methods to ensure consistency in a distributed system||Immediate Consistency||Eventual Consistency|
Immediate Consistency can be individually decided for each write operation
|Foreign keys Referential integrity||no||no typically not used, however similar functionality with DBRef possible||no|
|Transaction concepts Support to ensure data integrity after non-atomic manipulations of data||no||Multi-document ACID Transactions with snapshot isolation||no|
|Concurrency Support for concurrent manipulation of data||yes||yes||yes|
|Durability Support for making data persistent||yes||yes optional, enabled by default||yes|
|In-memory capabilities Is there an option to define some or all structures to be held in-memory only.||no||yes In-memory storage engine introduced with MongoDB version 3.2||no|
|User concepts Access control||RBAC using LDAP or Druid internals for users and groups for read/write by datasource and system||Access rights for users and roles||no|
More information provided by the system vendor
We invite representatives of system vendors to contact us for updating and extending the system information,
|Related products and services|
|3rd parties||CData: Connect to Big Data & NoSQL through standard Drivers.|
Percona: Database problems? Not on your watch.
Databases run better with Percona.
Studio 3T: The world's favorite IDE for working with MongoDB
Navicat for MongoDB gives you a highly effective GUI interface for MongoDB database management, administration and development.
We invite representatives of vendors of related products to contact us for presenting information about their offerings here.
|Apache Druid||MongoDB||Spark SQL|
|DB-Engines blog posts|
Snowflake is the DBMS of the Year 2021
|Recent citations in the news|
Apache Druid 25.0 Delivers Multi-Stage Query Engine and ...
Stream Big, Think Bigger: Analyze Streaming Data at Scale
As CIO budgets tighten, Apache Superset looks like a Big Data winner
Imply Announces Major Open Source Contribution for Apache Druid ...
Apache Druid’s Role in Modern Data Analytics
provided by Google News
MongoDB, Inc. (NASDAQ:MDB) Shares Acquired by Bank Julius ...
Why Datadog, MongoDB, and CrowdStrike Were Sinking Today
Cloud-Computing Stocks Rally as Microsoft Results Bring Relief ...
Premarket Mover: Mongodb Inc (MDB) Down 2.44%
Satori Expands Support to NoSQL Databases, Streamlines Secure ...
provided by Google News
Apache® Kyuubi Becomes Top-Level Project
A Deep Dive into Custom Spark Transformers for ML Pipelines
Data Engineer (Apache Airflow, Hive, Spark, SQL, AWS)
Accelerating SQL Queries on a Modern Real-Time Database
Data chess game: Databricks vs. Snowflake, part 1
provided by Google News
Kubernetes Admin-Apache Druid cluster
DevOps Engineer Fully Remote
Database Performance Consultant - MongoDB
Scala/Spark Data Engineer
Entry Level Data Engineer with BigData - 1
Big Data Support Engineer (Spark, SQL)
AWS DATA ENGINEER_(Scala/Spark)
Senior Member of Technical Staff
Share this page