DB-EnginesInfluxDB: Focus on building software with an easy-to-use serverless, scalable time series platformEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Dolt vs. Microsoft Azure Data Explorer vs. ScyllaDB vs. Sequoiadb vs. Spark SQL

System Properties Comparison Dolt vs. Microsoft Azure Data Explorer vs. ScyllaDB vs. Sequoiadb vs. Spark SQL

Editorial information provided by DB-Engines
NameDolt  Xexclude from comparisonMicrosoft Azure Data Explorer  Xexclude from comparisonScyllaDB  Xexclude from comparisonSequoiadb  Xexclude from comparisonSpark SQL  Xexclude from comparison
DescriptionA MySQL compatible DBMS with Git-like versioning of data and schemaFully managed big data interactive analytics platformCassandra and DynamoDB compatible wide column storeNewSQL database with distributed OLTP and SQLSpark SQL is a component on top of 'Spark Core' for structured data processing
Primary database modelRelational DBMSRelational DBMS infocolumn orientedWide column storeDocument store
Relational DBMS
Relational DBMS
Secondary database modelsDocument storeDocument store infoIf a column is of type dynamic docs.microsoft.com/­en-us/­azure/­kusto/­query/­scalar-data-types/­dynamic then it's possible to add arbitrary JSON documents in this cell
Event Store infothis is the general usage pattern at Microsoft. Billing, Logs, Telemetry events are stored in ADX and the state of an individual entity is defined by the arg_max(timestamps)
Spatial DBMS
Search engine infosupport for complex search expressions docs.microsoft.com/­en-us/­azure/­kusto/­query/­parseoperator FTS, Geospatial docs.microsoft.com/­en-us/­azure/­kusto/­query/­geo-point-to-geohash-function distributed search -> ADX acts as a distributed search engine
Time Series DBMS infosee docs.microsoft.com/­en-us/­azure/­data-explorer/­time-series-analysis
Key-value store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score0.96
Rank#193  Overall
#90  Relational DBMS
Score4.38
Rank#77  Overall
#41  Relational DBMS
Score4.75
Rank#68  Overall
#5  Wide column stores
Score0.45
Rank#261  Overall
#41  Document stores
#122  Relational DBMS
Score18.96
Rank#33  Overall
#20  Relational DBMS
Websitegithub.com/­dolthub/­dolt
www.dolthub.com
azure.microsoft.com/­services/­data-explorerwww.scylladb.comwww.sequoiadb.comspark.apache.org/­sql
Technical documentationdocs.dolthub.comdocs.microsoft.com/­en-us/­azure/­data-explorerdocs.scylladb.comwww.sequoiadb.com/­en/­index.php?m=Files&a=indexspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperDoltHub IncMicrosoftScyllaDBSequoiadb Ltd.Apache Software Foundation
Initial release20182019201520132014
Current releasecloud service with continuous releasesScyllaDB Open Source 5.4.1, January 20243.5.0 ( 2.13), September 2023
License infoCommercial or Open SourceOpen Source infoApache Version 2.0commercialOpen Source infoOpen Source (AGPL), commercial license availableOpen Source infoServer: AGPL; Client: Apache V2Open Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenoyesnonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Scylla Cloud: Create real-time applications that run at global scale with Scylla Cloud, the industry’s most powerful NoSQL DBaaS
Implementation languageGoC++C++Scala
Server operating systemsLinux
macOS
Windows
hostedLinuxLinuxLinux
OS X
Windows
Data schemeyesFixed schema with schema-less datatypes (dynamic)schema-freeschema-freeyes
Typing infopredefined data types such as float or dateyesyes infobool, datetime, dynamic, guid, int, long, real, string, timespan, double: docs.microsoft.com/­en-us/­azure/­kusto/­query/­scalar-data-typesyesyes infooid, date, timestamp, binary, regexyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.noyesnonono
Secondary indexesyesall fields are automatically indexedyes infocluster global secondary indicesyesno
SQL infoSupport of SQLyesKusto Query Language (KQL), SQL subsetSQL-like DML and DDL statements (CQL)SQL-like query languageSQL-like DML and DDL statements
APIs and other access methodsCLI Client
HTTP REST
Microsoft SQL Server communication protocol (MS-TDS)
RESTful HTTP API
Proprietary protocol (CQL) infocompatible with CQL (Cassandra Query Language, an SQL-like language)
RESTful HTTP API (DynamoDB compatible)
Thrift
proprietary protocol using JSONJDBC
ODBC
Supported programming languagesAda
C
C#
C++
D
Delphi
Eiffel
Erlang
Haskell
Java
JavaScript (Node.js)
Objective-C
OCaml
Perl
PHP
Python
Ruby
Scheme
Tcl
.Net
Go
Java
JavaScript (Node.js)
PowerShell
Python
R
For CQL interface: C#, C++, Clojure, Erlang, Go, Haskell, Java, JavaScript, Node.js, Perl, PHP, Python, Ruby, Rust, Scala
For DynamoDB interface: .Net, ColdFusion, Erlang, Groovy, Java, JavaScript, Perl, PHP, Python, Ruby
.Net
C++
Java
PHP
Python
Java
Python
R
Scala
Server-side scripts infoStored proceduresyes infocurrently in alpha releaseYes, possible languages: KQL, Python, Ryes, LuaJavaScriptno
Triggersyesyes infosee docs.microsoft.com/­en-us/­azure/­kusto/­management/­updatepolicynonono
Partitioning methods infoMethods for storing different data on different nodesnoneSharding infoImplicit feature of the cloud serviceShardingShardingyes, utilizing Spark Core
Replication methods infoMethods for redundantly storing data on multiple nodesA database can be cloned to multiple locations and be used there in isolation. Data/schema changes can be pushed/pulled explicitly between locations.yes infoImplicit feature of the cloud service. Replication either local, cross-facility or geo-redundant.selectable replication factor infoRepresentation of geographical distribution of servers is possibleSource-replica replicationnone
MapReduce infoOffers an API for user-defined Map/Reduce methodsnoSpark connector (open source): github.com/­Azure/­azure-kusto-sparknono
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency
Immediate Consistency
Eventual Consistency
Tunable Consistency infocan be individually decided for each write operation
Eventual Consistency
Foreign keys infoReferential integrityyesnononono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of dataACIDnono infoAtomicity and isolation are supported for single operationsDocument is locked during a transactionno
Concurrency infoSupport for concurrent manipulation of datayesyesyesyesyes
Durability infoSupport for making data persistentyesyesyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.noyes infoin-memory tablesnono
User concepts infoAccess controlOnly one user is configurable, and must be specified in the config file at startupAzure Active Directory AuthenticationAccess rights for users can be defined per objectsimple password-based access controlno
More information provided by the system vendor
DoltMicrosoft Azure Data ExplorerScyllaDBSequoiadbSpark SQL
Specific characteristicsScyllaDB is engineered to deliver predictable performance at scale. It’s adopted...
» more
Competitive advantagesHighly-performant (efficiently utilizes full resources of a node and network; millions...
» more
Typical application scenariosScyllaDB is ideal for applications that require high throughput and low latency at...
» more
Key customersDiscord, Epic Games, Expedia, Zillow, Comcast, Disney+ Hotstar, Samsung, ShareChat,...
» more
Market metricsScyllaDB typically offers ~75% total cost of ownership savings, with ~5X higher throughput...
» more
Licensing and pricing modelsScyllaDB Open Source - free open source software (AGPL) ScyllaDB Enterprise - subscription-based...
» more

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
DoltMicrosoft Azure Data ExplorerScyllaDBSequoiadbSpark SQL
Recent citations in the news

Top Data Version Control Tools for Machine Learning Research in 2023
24 July 2023, MarkTechPost

Dolt, a Relational Database with Git-Like Cloning Features
19 August 2020, The New Stack

Data Versioning at Scale: Chaos and Chaos Management
10 February 2023, InfoQ.com

Radar Trends to Watch: July 2022 – O'Reilly
5 July 2022, oreilly.com

Are you still not using Version Control for Data?
11 April 2020, Towards Data Science

provided by Google News

Azure Data Explorer: Log and telemetry analytics benchmark
16 August 2022, Microsoft

Providing modern data transfer and storage service at Microsoft with Microsoft Azure - Inside Track Blog
13 July 2023, Microsoft

Controlling costs in Azure Data Explorer using down-sampling and aggregation
11 February 2019, Microsoft

Microsoft Introduces Azure Integration Environments and Business Process Tracking in Public Preview
23 November 2023, InfoQ.com

Individually great, collectively unmatched: Announcing updates to 3 great Azure Data Services
7 February 2019, Microsoft

provided by Google News

Sleeping at Scale - Delivering 10k Timers per Second per Node with Rust, Tokio, Kafka, and Scylla
26 April 2024, InfoQ.com

ScyllaDB on AWS is a NoSQL Database Built for Gigabyte-to-Petabyte Scale | Amazon Web Services
6 January 2023, AWS Blog

Scylla Eyes Cassandra's NoSQL Workloads
13 February 2018, Datanami

ScyllaDB Database Review | eWeek
21 August 2018, eWeek

ScyllaDB Launches Scylla Cloud Database as a Service
14 April 2019, insideBIGDATA

provided by Google News

Run Apache Hive workloads using Spark SQL with Amazon EMR on EKS | Amazon Web Services
18 October 2023, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
1 April 2024, Simplilearn

Performant IPv4 Range Spark Joins | by Jean-Claude Cote
24 January 2024, Towards Data Science

18 Top Big Data Tools and Technologies to Know About in 2024
24 January 2024, TechTarget

provided by Google News



Share this page

Featured Products

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

SingleStore logo

The database to transact, analyze and contextualize your data in real time.
Try it today.

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Present your product here