DB-EnginesextremeDB - solve IoT connectivity disruptionsEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by Redgate Software

DBMS > Apache Spark (SQL) vs. Databricks vs. DataFS vs. Trino

System Properties Comparison Apache Spark (SQL) vs. Databricks vs. DataFS vs. Trino

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache Spark (SQL)  Xexclude from comparisonDatabricks  Xexclude from comparisonDataFS  Xexclude from comparisonTrino  Xexclude from comparison
DescriptionApache Spark SQL is a component on top of 'Spark Core' for structured data processingThe Databricks Lakehouse Platform combines elements of data lakes and data warehouses to provide a unified view onto structured and unstructured data. It is based on Apache Spark.All data is stored inside objects which are linked by so-called link attributes. Objects consist of classes which can be extended and de-extended at runtime. Graphs can be defined with a struct.Fast distributed SQL query engine for big data analytics. Forked from Presto and originally named PrestoSQL
Primary database modelRelational DBMSDocument store
Relational DBMS
Object oriented DBMSRelational DBMS
Secondary database modelsGraph DBMSDocument store
Key-value store
Spatial DBMS
Search engine
Time Series DBMS
Wide column store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score21.62
Rank#29  Overall
#18  Relational DBMS
Score102.66
Rank#12  Overall
#2  Document stores
#8  Relational DBMS
Score0.00
Rank#385  Overall
#21  Object oriented DBMS
Score5.34
Rank#60  Overall
#34  Relational DBMS
Websitespark.apache.org/­sqlwww.databricks.comnewdatabase.comtrino.io
Technical documentationspark.apache.org/­docs/­latest/­sql-programming-guide.htmldocs.databricks.comdev.mobiland.com/­Overview.xsptrino.io/­broadcast
trino.io/­docs/­current
DeveloperApache Software FoundationDatabricksMobiland AGTrino Software Foundation
Initial release2014201320182012 info2020 rebranded from PrestoSQL
Current release3.5.0 ( 2.13), September 20231.1.263, October 2022
License infoCommercial or Open SourceOpen Source infoApache 2.0commercialcommercialOpen Source infoApache Version 2.0
Cloud-based only infoOnly available as a cloud servicenoyesnono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageScalaJava
Server operating systemsLinux
OS X
Windows
hostedWindowsLinux
macOS infofor devlopment
Data schemeyesFlexible Schema (defined schema, partial schema, schema free)Classes, Structs, and Lists are written in proprietary DataTypeDefinitionLanguage (.dtdl) and Objects consisting of those are written in proprietary DataAccessDefinitionLanguage (.dadl)yes
Typing infopredefined data types such as float or dateyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.noyesnono
Secondary indexesnoyesnodepending on connected data-source
SQL infoSupport of SQLSQL-like DML and DDL statementswith Databricks SQLnoyes
APIs and other access methodsJDBC
ODBC
JDBC
ODBC
RESTful HTTP API
.NET Client API
Proprietary client DLL
WinRT client
JDBC
RESTful HTTP API
Trino CLI
Supported programming languagesJava
Python
R
Scala
Python
R
Scala
.Net
C
C#
C++
VB.Net
Go
Java
JavaScript (Node.js)
Python
R
Ruby
Server-side scripts infoStored proceduresnouser defined functions and aggregatesyes, depending on connected data-source
Triggersnono, except callback-events from server when changes happenedno
Partitioning methods infoMethods for storing different data on different nodesyes, utilizing Spark CoreProprietary Sharding systemdepending on connected data-source
Replication methods infoMethods for redundantly storing data on multiple nodesnoneyesdepending on connected data-source
MapReduce infoOffers an API for user-defined Map/Reduce methodsnono
Consistency concepts infoMethods to ensure consistency in a distributed systemImmediate ConsistencyImmediate Consistencydepending on connected data-source
Foreign keys infoReferential integritynoyesno
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanoACIDACIDdepending on connected data-source
Concurrency infoSupport for concurrent manipulation of datayesyesyesyes
Durability infoSupport for making data persistentyesyesyesdepending on connected data-source
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nonono
User concepts infoAccess controlnoWindows-ProfileSQL standard access control
More information provided by the system vendor
Apache Spark (SQL)DatabricksDataFSTrino
News

73: Wrapping Trino packages with a bow
9 April 2025

Core Principles and Design Practices of OLAP Engines
27 March 2025

72: Keeping the lake clean
17 March 2025

Twenty four
3 March 2025

71: Fake it real good
27 February 2025

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache Spark (SQL)DatabricksDataFSTrino
DB-Engines blog posts

DB-Engines shares Q1 2025 database industry rankings and top climbers: Snowflake and PostgreSQL trending
1 May 2025, DB-Engines

PostgreSQL is the DBMS of the Year 2023
2 January 2024, Matthias Gelbmann, Paul Andlinger

show all

Recent citations in the news

Introducing AWS Glue 5.0 for Apache Spark
4 December 2024, Amazon Web Services (AWS)

Scala vs Python for Apache Spark: An In-depth Comparison With Use Cases For Each
21 April 2025, Simplilearn.com

How to run Pandas code on Spark
25 January 2025, Theodo Data & AI

The 6 Best Apache Spark Courses on Udemy to Consider for 2025
1 January 2025, solutionsreview.com

18 top big data tools and technologies to know about in 2025
22 January 2025, TechTarget

provided by Google News

Exclusive | Databricks to Buy Startup Neon for $1 Billion
14 May 2025, WSJ

Databricks more than quadruples footprint in Seattle's West8
14 May 2025, The Business Journals

Databricks takes aim at marketers with new platform for data and AI
14 May 2025, MarTech

Databricks Agrees to Acquire Neon to Deliver Serverless Postgres for Developers + AI Agents
14 May 2025, PR Newswire

Databricks Is On An M&A Roll With $1B Neon Acquisition
14 May 2025, Crunchbase News

provided by Google News

A look at Presto, Trino SQL query engines
9 August 2022, TechTarget

How to Deploy MinIO and Trino with Kubernetes
23 May 2024, HackerNoon

The Perfect AI Storage: Trino From Facebook And Iceberg From Netflix?
30 April 2024, The Next Platform

Query big data with resilience using Trino in Amazon EMR with Amazon EC2 Spot Instances for less cost
4 October 2023, Amazon Web Services (AWS)

Trino turns 10: Starburst celebrates a decade of its open source query engine
11 August 2022, VentureBeat

provided by Google News



Share this page

Featured Products

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

SingleStore logo

The data platform to build your intelligent applications.
Try it free.

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Present your product here