DB-EnginesextremeDB - Data management wherever you need itEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by Redgate Software

DBMS > Apache Spark (SQL) vs. EJDB vs. Microsoft Azure Cosmos DB

System Properties Comparison Apache Spark (SQL) vs. EJDB vs. Microsoft Azure Cosmos DB

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache Spark (SQL)  Xexclude from comparisonEJDB  Xexclude from comparisonMicrosoft Azure Cosmos DB infoformer name was Azure DocumentDB  Xexclude from comparison
DescriptionApache Spark SQL is a component on top of 'Spark Core' for structured data processingEmbeddable document-store database library with JSON representation of queries (in MongoDB style)Globally distributed, horizontally scalable, multi-model database service
Primary database modelRelational DBMSDocument storeDocument store
Graph DBMS
Key-value store
Wide column store
Secondary database modelsSpatial DBMS
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score16.73
Rank#35  Overall
#20  Relational DBMS
Score0.14
Rank#331  Overall
#46  Document stores
Score24.50
Rank#27  Overall
#4  Document stores
#2  Graph DBMS
#3  Key-value stores
#3  Wide column stores
Websitespark.apache.org/­sqlgithub.com/­Softmotions/­ejdbazure.microsoft.com/­services/­cosmos-db
Technical documentationspark.apache.org/­docs/­latest/­sql-programming-guide.htmlgithub.com/­Softmotions/­ejdb/­blob/­master/­README.mdlearn.microsoft.com/­azure/­cosmos-db
DeveloperApache Software FoundationSoftmotionsMicrosoft
Initial release201420122014
Current release3.5.0 ( 2.13), September 2023
License infoCommercial or Open SourceOpen Source infoApache 2.0Open Source infoGPLv2commercial
Cloud-based only infoOnly available as a cloud servicenonoyes
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageScalaC
Server operating systemsLinux
OS X
Windows
server-lesshosted
Data schemeyesschema-freeschema-free
Typing infopredefined data types such as float or dateyesyes infostring, integer, double, bool, date, object_idyes infoJSON types
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.no
Secondary indexesnonoyes infoAll properties auto-indexed by default
SQL infoSupport of SQLSQL-like DML and DDL statementsnoSQL-like query language
APIs and other access methodsJDBC
ODBC
in-process shared libraryDocumentDB API
Graph API (Gremlin)
MongoDB API
RESTful HTTP API
Table API
Supported programming languagesJava
Python
R
Scala
Actionscript
C
C#
C++
Go
Java
JavaScript (Node.js)
Lua
Objective-C
Pike
Python
Ruby
.Net
C#
Java
JavaScript
JavaScript (Node.js)
MongoDB client drivers written for various programming languages
Python
Server-side scripts infoStored proceduresnonoJavaScript
TriggersnonoJavaScript
Partitioning methods infoMethods for storing different data on different nodesyes, utilizing Spark CorenoneSharding infoImplicit feature of the cloud service
Replication methods infoMethods for redundantly storing data on multiple nodesnonenoneyes infoImplicit feature of the cloud service
MapReduce infoOffers an API for user-defined Map/Reduce methodsnowith Hadoop integration infoIntegration with Hadoop/HDInsight on Azure*
Consistency concepts infoMethods to ensure consistency in a distributed systemBounded Staleness
Consistent Prefix
Eventual Consistency
Immediate Consistency infoConsistency level configurable on request level
Session Consistency
Foreign keys infoReferential integritynono infotypically not needed, however similar functionality with collection joins possibleno
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonoMulti-item ACID transactions with snapshot isolation within a partition
Concurrency infoSupport for concurrent manipulation of datayesyes infoRead/Write Lockingyes
Durability infoSupport for making data persistentyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.no
User concepts infoAccess controlnonoAccess rights can be defined down to the item level

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesCData: Connect to Big Data & NoSQL through standard Drivers.
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache Spark (SQL)EJDBMicrosoft Azure Cosmos DB infoformer name was Azure DocumentDB
Recent citations in the news

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
30 September 2024, Simplilearn

Amazon EMR 7.1 runtime for Apache Spark and Iceberg can run Spark workloads 2.7 times faster than Apache Spark 3.5.1 and Iceberg 1.5.2
26 August 2024, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Apache Hadoop and Apache Spark for Big Data Analysis
7 May 2024, Towards Data Science

Build Spark Structured Streaming applications with the open source connector for Amazon Kinesis Data Streams
24 May 2024, AWS Blog

provided by Google News

Start your AI journey with Microsoft Azure Cosmos DB—compete for $10K
9 May 2024, azure.microsoft.com

Public Preview: DiskANN vector indexing and search in Azure Cosmos DB NoSQL
21 May 2024, azure.microsoft.com

Public preview: Filtered vector search in vCore-based Azure Cosmos DB for MongoDB
24 April 2024, azure.microsoft.com

Public Preview: vCore-based Azure Cosmos DB for MongoDB cross-region disaster recovery (DR)
21 May 2024, azure.microsoft.com

General availability: Azure Cosmos DB API for MongoDB RU supports version 5.0 and 6.0
8 May 2024, azure.microsoft.com

provided by Google News



Share this page

Featured Products

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

SingleStore logo

The data platform to build your intelligent applications.
Try it free.

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Present your product here