System Properties Comparison Apache Spark (SQL) vs. Databricks vs. DataFS vs. Trino

Editorial information provided by DB-Engines

Name

Description

Apache Spark SQL is a component on top of 'Spark Core' for structured data processing

The Databricks Lakehouse Platform combines elements of data lakes and data warehouses to provide a unified view onto structured and unstructured data. It is based on Apache Spark.

All data is stored inside objects which are linked by so-called link attributes. Objects consist of classes which can be extended and de-extended at runtime. Graphs can be defined with a struct.

Fast distributed SQL query engine for big data analytics. Forked from Presto and originally named PrestoSQL

Primary database model

Relational DBMS

Document store
Relational DBMS

Object oriented DBMS

Relational DBMS

Secondary database models

Graph DBMS

Document store
Key-value store
Spatial DBMS
Search engine
Time Series DBMS
Wide column store

DB-Engines Ranking measures the popularity of database management systems

Apache Spark (SQL): Score 21.62; Rank #29 Overall, #18 Relational DBMS

Databricks: Score 102.66; Rank #12 Overall, #2 Document stores, #8 Relational DBMS

DataFS: Score 0.00; Rank #385 Overall, #21 Object oriented DBMS

Trino: Score 5.34; Rank #60 Overall, #34 Relational DBMS

Website

Technical documentation

spark.apache.org/docs/latest/sql-programming-guide.html

docs.databricks.com

dev.mobiland.com/Overview.xsp

trino.io/broadcast
trino.io/docs/current

Developer

Apache Software Foundation

Databricks

Mobiland AG

Trino Software Foundation

Initial release

2014

2013

2018

2012 (rebranded from PrestoSQL in 2020)

Current release

3.5.0 ( 2.13), September 2023

1.1.263, October 2022

License

Commercial or Open Source

Open Source

Apache 2.0

commercial

Open Source

Apache Version 2.0

Cloud-based only

Only available as a cloud service

yes

Implementation language

Scala

Java

Server operating systems

Linux
OS X
Windows

hosted

Windows

Linux
macOS (for development)

Data scheme

yes

Flexible Schema (defined schema, partial schema, schema free)

Classes, Structs, and Lists are written in proprietary DataTypeDefinitionLanguage (.dtdl) and Objects consisting of those are written in proprietary DataAccessDefinitionLanguage (.dadl)

yes

Typing

predefined data types such as float or date

yes

XML support

Some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.

yes

Secondary indexes

yes

depending on connected data-source

SQL

Support of SQL

SQL-like DML and DDL statements

with Databricks SQL

yes

APIs and other access methods

JDBC
ODBC

JDBC
ODBC
RESTful HTTP API

.NET Client API
Proprietary client DLL
WinRT client

JDBC
RESTful HTTP API
Trino CLI

Supported programming languages

Java
Python
R
Scala

Python
R
Scala

.NET
C
C#
C++
VB.NET

Go
Java
JavaScript (Node.js)
Python
R
Ruby

Server-side scripts

Stored procedures

user defined functions and aggregates

yes, depending on connected data-source

Triggers

no, except callback-events from server when changes happened

Partitioning methods

Methods for storing different data on different nodes

yes, utilizing Spark Core

Proprietary Sharding system

depending on connected data-source

Replication methods

Methods for redundantly storing data on multiple nodes

none

yes

depending on connected data-source

MapReduce

Offers an API for user-defined Map/Reduce methods

Consistency concepts

Methods to ensure consistency in a distributed system

Immediate Consistency

depending on connected data-source

Foreign keys

Referential integrity

yes

Transaction concepts

Support to ensure data integrity after non-atomic manipulations of data

ACID

depending on connected data-source

Concurrency

Support for concurrent manipulation of data

yes

Durability

Support for making data persistent

yes

depending on connected data-source

In-memory capabilities

Is there an option to define some or all structures to be held in-memory only?

User concepts

Access control

Windows-Profile

SQL standard access control
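
The "RESTful HTTP API" entry listed for Trino under "APIs and other access methods" can be illustrated with a minimal sketch. It assumes a coordinator reachable at http://localhost:8080 and a user called "analyst"; both are placeholders, not values from this page. The helper name build_trino_statement_request is hypothetical. The sketch only builds the request and does not send it; a real client would submit it, then follow the nextUri links in the JSON responses until the result set is complete.

```python
import urllib.request


def build_trino_statement_request(base_url: str, user: str, sql: str) -> urllib.request.Request:
    """Build (but do not send) a request that submits SQL to a Trino coordinator.

    Hypothetical helper for illustration: base_url and user are assumptions.
    The statement text is sent as the POST body to the /v1/statement endpoint,
    and the X-Trino-User header identifies the submitting user.
    """
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/statement",
        data=sql.encode("utf-8"),
        headers={"X-Trino-User": user},
        method="POST",
    )


# Placeholder coordinator address and user name (assumptions).
req = build_trino_statement_request("http://localhost:8080", "analyst", "SELECT 1")
print(req.get_method(), req.full_url)
```

In practice, the JDBC driver or the Trino CLI listed in the same row wraps this protocol, so issuing raw HTTP calls is rarely necessary.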

More information provided by the system vendor

News

73: Wrapping Trino packages with a bow
9 April 2025

Core Principles and Design Practices of OLAP Engines
27 March 2025

72: Keeping the lake clean
17 March 2025

Twenty four
3 March 2025

71: Fake it real good
27 February 2025

More resources

DB-Engines blog posts

DB-Engines shares Q1 2025 database industry rankings and top climbers: Snowflake and PostgreSQL trending
1 May 2025, DB-Engines

PostgreSQL is the DBMS of the Year 2023
2 January 2024, Matthias Gelbmann, Paul Andlinger

Recent citations in the news

Introducing AWS Glue 5.0 for Apache Spark
4 December 2024, Amazon Web Services (AWS)

Scala vs Python for Apache Spark: An In-depth Comparison With Use Cases For Each
21 April 2025, Simplilearn.com

How to run Pandas code on Spark
25 January 2025, Theodo Data & AI

The 6 Best Apache Spark Courses on Udemy to Consider for 2025
1 January 2025, solutionsreview.com

18 top big data tools and technologies to know about in 2025
22 January 2025, TechTarget

Exclusive | Databricks to Buy Startup Neon for $1 Billion
14 May 2025, WSJ

Databricks more than quadruples footprint in Seattle's West8
14 May 2025, The Business Journals

Databricks takes aim at marketers with new platform for data and AI
14 May 2025, MarTech

Databricks Agrees to Acquire Neon to Deliver Serverless Postgres for Developers + AI Agents
14 May 2025, PR Newswire

Databricks Is On An M&A Roll With $1B Neon Acquisition
14 May 2025, Crunchbase News

A look at Presto, Trino SQL query engines
9 August 2022, TechTarget

How to Deploy MinIO and Trino with Kubernetes
23 May 2024, HackerNoon

The Perfect AI Storage: Trino From Facebook And Iceberg From Netflix?
30 April 2024, The Next Platform

Query big data with resilience using Trino in Amazon EMR with Amazon EC2 Spot Instances for less cost
4 October 2023, Amazon Web Services (AWS)

Trino turns 10: Starburst celebrates a decade of its open source query engine
11 August 2022, VentureBeat
