DB-EnginesExtremeDB: mitigate connectivity issues in a DBMSEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Cassandra vs. Citus vs. Spark SQL vs. Yaacomo

System Properties Comparison Cassandra vs. Citus vs. Spark SQL vs. Yaacomo

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameCassandra  Xexclude from comparisonCitus  Xexclude from comparisonSpark SQL  Xexclude from comparisonYaacomo  Xexclude from comparison
Yaacomo seems to be discontinued and is removed from the DB-Engines ranking
DescriptionWide-column store based on ideas of BigTable and DynamoDB infoOptimized for write accessScalable hybrid operational and analytics RDBMS for big data use cases based on PostgreSQLSpark SQL is a component on top of 'Spark Core' for structured data processingOpenCL based in-memory RDBMS, designed for efficiently utilizing the hardware via parallel computing
Primary database modelWide column storeRelational DBMSRelational DBMSRelational DBMS
Secondary database modelsVector DBMS infostarting with release V5Document store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score98.83
Rank#12  Overall
#1  Wide column stores
Score2.15
Rank#117  Overall
#56  Relational DBMS
Score18.04
Rank#33  Overall
#20  Relational DBMS
Websitecassandra.apache.orgwww.citusdata.comspark.apache.org/­sqlyaacomo.com
Technical documentationcassandra.apache.org/­doc/­latestdocs.citusdata.comspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperApache Software Foundation infoApache top level project, originally developped by FacebookApache Software FoundationQ2WEB GmbH
Initial release2008201020142009
Current release4.1.3, July 20238.1, December 20183.5.0 ( 2.13), September 2023
License infoCommercial or Open SourceOpen Source infoApache version 2Open Source infoAGPL, commercial license also availableOpen Source infoApache 2.0commercial
Cloud-based only infoOnly available as a cloud servicenononono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Astra DB: Multi-cloud DBaaS built on Apache Cassandra.
Implementation languageJavaCScala
Server operating systemsBSD
Linux
OS X
Windows
LinuxLinux
OS X
Windows
Android
Linux
Windows
Data schemeschema-freeyesyesyes
Typing infopredefined data types such as float or dateyesyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.noyes infospecific XML type available, but no XML query functionalitynono
Secondary indexesrestricted infoonly equality queries, not always the best performing solutionyesnoyes
SQL infoSupport of SQLSQL-like SELECT, DML and DDL statements (CQL)yes infostandard, with numerous extensionsSQL-like DML and DDL statementsyes
APIs and other access methodsProprietary protocol infoCQL (Cassandra Query Language, an SQL-like language)
Thrift
ADO.NET
JDBC
native C library
ODBC
streaming API for large objects
JDBC
ODBC
JDBC
ODBC
Supported programming languagesC#
C++
Clojure
Erlang
Go
Haskell
Java
JavaScript infoNode.js
Perl
PHP
Python
Ruby
Scala
.Net
C
C++
Delphi
Java
JavaScript (Node.js)
Perl
PHP
Python
Tcl
Java
Python
R
Scala
Server-side scripts infoStored proceduresnouser defined functions inforealized in proprietary language PL/pgSQL or with common languages like Perl, Python, Tcl etc.no
Triggersyesyesnoyes
Partitioning methods infoMethods for storing different data on different nodesSharding infono "single point of failure"Shardingyes, utilizing Spark Corehorizontal partitioning
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factor infoRepresentation of geographical distribution of servers is possibleSource-replica replication infoother methods possible by using 3rd party extensionsnoneSource-replica replication
MapReduce infoOffers an API for user-defined Map/Reduce methodsyesnono
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency
Immediate Consistency infocan be individually decided for each write operation
Immediate ConsistencyImmediate Consistency
Foreign keys infoReferential integritynoyesnoyes
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datano infoAtomicity and isolation are supported for single operationsACIDnoACID
Concurrency infoSupport for concurrent manipulation of datayesyesyesyes
Durability infoSupport for making data persistentyesyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nononoyes
User concepts infoAccess controlAccess rights for users can be defined per objectfine grained access rights according to SQL-standardnofine grained access rights according to SQL-standard
More information provided by the system vendor
CassandraCitusSpark SQLYaacomo
Specific characteristicsApache Cassandra is the leading NoSQL, distributed database management system, well...
» more
Competitive advantagesNo single point of failure ensures 100% availability . Operational simplicity for...
» more
Typical application scenariosInternet of Things (IOT), fraud detection applications, recommendation engines, product...
» more
Key customersApple, Netflix, Uber, ING,, Intuit,Fidelity, NY Times, Outbrain, BazaarVoice, Best...
» more
Market metricsCassandra is used by 40% of the Fortune 100.
» more
Licensing and pricing modelsApache license  Pricing for commercial distributions provided by DataStax and available...
» more
News

An Introduction to David Jones-Gilardi, Developer Relations
11 June 2024

Introducing Tejas Kumar, Developer Relations Engineer
7 June 2024

Indexing All of Wikipedia on a Laptop
5 June 2024

An Introduction to Phil Nash, Developer Relations
3 June 2024

Focus on Creativity, Not Clusters: DataStax Mission Control in Action!
30 May 2024

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesCassandra Forward event: Want to level up your Cassandra game? Watch now replays of the March 2023 sessions.
» more

CData: Connect to Big Data & NoSQL through standard Drivers.
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
CassandraCitusSpark SQLYaacomo
DB-Engines blog posts

Cassandra keeps climbing the ranks of the DB-Engines Ranking
3 May 2016, Matthias Gelbmann

Oracle is the DBMS of the Year
5 January 2016, Paul Andlinger, Matthias Gelbmann

Winners, losers and an attractive newcomer in Novembers DB-Engines ranking
2 November 2015, Paul Andlinger

show all

Recent citations in the news

ScyllaDB 6.0 advances open source database scalability
12 June 2024, VentureBeat

Here's What to Know About Apache Cassandra 5.0
15 April 2024, hackernoon.com

Microsoft extends life support for aging Apache Cassandra 3.11 database
10 October 2023, The Register

How Zalando migrated their shopping carts to Amazon DynamoDB from Apache Cassandra | Amazon Web Services
11 October 2023, AWS Blog

Microsoft Build 2024: Cosmos DB for NoSQL gets vector search
21 May 2024, InfoWorld

provided by Google News

Ubicloud wants to build an open source alternative to AWS
5 March 2024, TechCrunch

Ubicloud reels in $16M for its open-source cloud platform
5 March 2024, SiliconANGLE News

Microsoft acquires Citus Data, re-affirming its commitment to Open Source and accelerating Azure PostgreSQL ...
24 January 2019, blogs.microsoft.com

Distributed PostgreSQL Benchmarks: Azure Cosmos DB, CockroachDB, and YugabyteDB
8 July 2023, InfoQ.com

Microsoft Benchmarks Distributed PostgreSQL DBs
10 July 2023, Datanami

provided by Google News

Run Apache Hive workloads using Spark SQL with Amazon EMR on EKS | Amazon Web Services
18 October 2023, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Performance Insights from Sigma Rule Detections in Spark Streaming
1 June 2024, Towards Data Science

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
1 April 2024, Simplilearn

Simba Technologies(R) Introduces New, Powerful JDBC Driver With SQL Connector for Apache Spark(TM)
17 March 2024, Yahoo Singapore News

provided by Google News



Share this page

Featured Products

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Present your product here