DB-EnginesExtremeDB: mitigate connectivity issues in a DBMSEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Apache Drill vs. DuckDB vs. Spark SQL

System Properties Comparison Apache Drill vs. DuckDB vs. Spark SQL

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache Drill  Xexclude from comparisonDuckDB  Xexclude from comparisonSpark SQL  Xexclude from comparison
DescriptionSchema-free SQL Query Engine for Hadoop, NoSQL and Cloud StorageAn embeddable, in-process, column-oriented SQL OLAP RDBMSSpark SQL is a component on top of 'Spark Core' for structured data processing
Primary database modelDocument store
Relational DBMS
Relational DBMSRelational DBMS
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score2.02
Rank#124  Overall
#22  Document stores
#59  Relational DBMS
Score4.63
Rank#69  Overall
#37  Relational DBMS
Score18.04
Rank#33  Overall
#20  Relational DBMS
Websitedrill.apache.orgduckdb.orgspark.apache.org/­sql
Technical documentationdrill.apache.org/­docsduckdb.org/­docsspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperApache Software FoundationApache Software Foundation
Initial release201220182014
Current release1.20.3, January 20231.0.0, June 20243.5.0 ( 2.13), September 2023
License infoCommercial or Open SourceOpen Source infoApache Version 2Open Source infoMIT LicenseOpen Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageC++Scala
Server operating systemsLinux
OS X
Windows
server-lessLinux
OS X
Windows
Data schemeschema-freeyesyes
Typing infopredefined data types such as float or dateyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono
Secondary indexesnoyesno
SQL infoSupport of SQLSQL SELECT statement is SQL:2003 compliantyesSQL-like DML and DDL statements
APIs and other access methodsJDBC
ODBC
RESTful HTTP API
Arrow Database Connectivity (ADBC)
CLI Client
JDBC
ODBC
JDBC
ODBC
Supported programming languagesC++C
C# info3rd party driver
C++
Crystal info3rd party driver
Go info3rd party driver
Java
Lisp info3rd party driver
Python
R
Ruby info3rd party driver
Rust
Swift
Zig info3rd party driver
Java
Python
R
Scala
Server-side scripts infoStored proceduresuser defined functionsnono
Triggersnonono
Partitioning methods infoMethods for storing different data on different nodesShardingnoneyes, utilizing Spark Core
Replication methods infoMethods for redundantly storing data on multiple nodesnonenone
MapReduce infoOffers an API for user-defined Map/Reduce methodsyesno
Consistency concepts infoMethods to ensure consistency in a distributed systemnoneImmediate Consistency
Foreign keys infoReferential integritynonono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanoACIDno
Concurrency infoSupport for concurrent manipulation of datayesyes, multi-version concurrency control (MVCC)yes
Durability infoSupport for making data persistentDepending on the underlying data sourceyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.Depending on the underlying data sourceyesno
User concepts infoAccess controlDepending on the underlying data sourcenono

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache DrillDuckDBSpark SQL
Recent citations in the news

Apache Drill vs. Apache Spark — Which SQL query engine is better for you?
23 September 2019, Towards Data Science

Apache Drill case study: A tutorial on processing CSV files
9 June 2016, TheServerSide.com

Apache Drill Poised to Crack Tough Data Challenges
19 May 2015, Datanami

Apache Drill Eliminates ETL, Data Transformation for MapR Database
11 April 2016, The New Stack

Drill Mines Diverse Data Sets, Google Style
20 May 2015, The Next Platform

provided by Google News

MotherDuck Announces General Availability; Brings Simplicity and Power of DuckDB in a Serverless Data Warehouse
11 June 2024, PR Newswire

DuckDB: The tiny but powerful analytics database
15 May 2024, InfoWorld

DuckDB promises greater stability with 1.0 release
5 June 2024, The Register

My First Billion (of Rows) in DuckDB | by João Pedro | May, 2024
1 May 2024, Towards Data Science

DuckDB: In-Process Python Analytics for Not-Quite-Big Data
31 May 2024, The New Stack

provided by Google News

Use Amazon Athena with Spark SQL for your open-source transactional table formats | Amazon Web Services
24 January 2024, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
1 April 2024, Simplilearn

Performant IPv4 Range Spark Joins | by Jean-Claude Cote
24 January 2024, Towards Data Science

Simba Technologies(R) Introduces New, Powerful JDBC Driver With SQL Connector for Apache Spark(TM)
17 March 2024, Yahoo Singapore News

provided by Google News



Share this page

Featured Products

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Present your product here