DB-Engines: Knowledge Base of Relational and NoSQL Database Management Systems, provided by solid IT

System Properties Comparison Apache Impala vs. BaseX vs. ClickHouse vs. Databricks

Editorial information provided by DB-Engines
Name: Apache Impala | BaseX | ClickHouse | Databricks
Description: Analytic DBMS for Hadoop | Light-weight native XML DBMS with support for XQuery 3.0 and an interactive GUI | A high-performance, column-oriented SQL DBMS for online analytical processing (OLAP) that uses all available system resources to process each analytical query as fast as possible. It is available both as open-source software and as a cloud offering. | The Databricks Lakehouse Platform combines elements of data lakes and data warehouses to provide a unified view of structured and unstructured data. It is based on Apache Spark.
Primary database model: Relational DBMS | Native XML DBMS | Relational DBMS | Document store, Relational DBMS
Secondary database models: Document store | Time Series DBMS
DB-Engines Ranking (measures the popularity of database management systems)
Score: 12.45 | 1.84 | 15.55 | 81.08
Rank: #40 Overall, #24 Relational DBMS | #135 Overall, #4 Native XML DBMS | #38 Overall, #23 Relational DBMS | #15 Overall, #2 Document stores, #10 Relational DBMS
Website: impala.apache.org | basex.org | clickhouse.com | www.databricks.com
Technical documentation: impala.apache.org/impala-docs.html | docs.basex.org | clickhouse.com/docs | docs.databricks.com
Developer: Apache Software Foundation (Apache top-level project, originally developed by Cloudera) | BaseX GmbH | Clickhouse Inc. | Databricks
Initial release: 2013 | 2007 | 2016 | 2013
Current release: 4.1.0, June 2022 | 11.0, June 2024 | v24.4.2.141-stable, June 2024
License (commercial or open source): Open Source (Apache Version 2) | Open Source (BSD license) | Open Source (Apache 2.0) | commercial
Cloud-based only (only available as a cloud service): no | no | no | yes
DBaaS offerings (Database as a Service; sponsored links):
  • Aiven for Clickhouse: Managed cloud data warehousing with high-speed analytics.
  • ClickHouse Cloud: Get the performance you love from open source ClickHouse in a serverless offering that takes care of the details so you can spend more time getting insight out of the fastest database on earth.
  • DoubleCloud: Fully managed ClickHouse alongside best-in-class managed open-source services to build analytics at scale.
Implementation language: C++ | Java | C++
Server operating systems: Linux | Linux, OS X, Windows | FreeBSD, Linux, macOS | hosted
Data scheme: yes | schema-free | yes | Flexible Schema (defined schema, partial schema, schema free)
Typing (predefined data types such as float or date): yes | no (XQuery supports types) | yes
XML support (some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT): no | no | yes
Secondary indexes: yes | yes | yes | yes
SQL (support of SQL): SQL-like DML and DDL statements | no | Close to ANSI SQL (SQL/JSON + extensions) | with Databricks SQL
APIs and other access methods: JDBC, ODBC | Java API, RESTful HTTP API, RESTXQ, WebDAV, XML:DB, XQJ | gRPC, HTTP REST, JDBC, MySQL wire protocol, ODBC, PostgreSQL wire protocol, Proprietary protocol | JDBC, ODBC, RESTful HTTP API
Supported programming languages: All languages supporting JDBC/ODBC | Actionscript, C, C#, Haskell, Java, JavaScript (Node.js), Lisp, Perl, PHP, Python, Qt, Rebol, Ruby, Scala, Visual Basic | C# (3rd party library), C++, Elixir (3rd party library), Go (3rd party library), Java (3rd party library), JavaScript (Node.js) (3rd party library), Kotlin (3rd party library), Nim (3rd party library), Perl (3rd party library), PHP (3rd party library), Python (3rd party library), R (3rd party library), Ruby (3rd party library), Rust, Scala (3rd party library) | Python, R, Scala
Server-side scripts (stored procedures): yes (user defined functions and integration of map-reduce) | yes | yes | user defined functions and aggregates
Triggers: no | yes (via events) | no
Partitioning methods (methods for storing different data on different nodes): Sharding | none | key based and custom
Replication methods (methods for redundantly storing data on multiple nodes): selectable replication factor | none | Asynchronous and synchronous physical replication; geographically distributed replicas; support for object storages. | yes
MapReduce (offers an API for user-defined Map/Reduce methods): yes (query execution via MapReduce) | no | no
Consistency concepts (methods to ensure consistency in a distributed system): Eventual Consistency | Immediate Consistency | Immediate Consistency
Foreign keys (referential integrity): no | no | no
Transaction concepts (support to ensure data integrity after non-atomic manipulations of data): no | multiple readers, single writer | no | ACID
Concurrency (support for concurrent manipulation of data): yes | yes | yes | yes
Durability (support for making data persistent): yes | yes | yes | yes
In-memory capabilities (whether there is an option to define some or all structures to be held in-memory only): no | yes | no
User concepts (access control): Access rights for users, groups and roles (based on Apache Sentry and Kerberos) | Users with fine-grained authorization concept on 4 levels | Access rights for users and roles. Column and row based policies. Quotas and resource limits. Pluggable authentication with LDAP and Kerberos. Password based, X.509 certificate, and SSH key authentication.
More information provided by the system vendor
Apache Impala | BaseX | ClickHouse | Databricks
Specific characteristics: Supported database models: In addition to the Document store and Relational DBMS...

More resources
Apache Impala | BaseX | ClickHouse | Databricks
DB-Engines blog posts

PostgreSQL is the DBMS of the Year 2023
2 January 2024, Matthias Gelbmann, Paul Andlinger

Recent citations in the news

Apache Impala becomes Top-Level Project
28 November 2017, SDTimes.com

Cloudera Bringing Impala to AWS Cloud
28 November 2017, Datanami

Apache Doris just 'graduated': Why care about this SQL data warehouse
24 June 2022, InfoWorld

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop
12 March 2017, Uber

Updates & Upserts in Hadoop Ecosystem with Apache Kudu
27 October 2017, KDnuggets

provided by Google News

XML Injection Attacks: What to Know About XPath, XQuery, XXE & More
18 May 2022, Hashed Out by The SSL Store™

9 Skills You Need to Become a Data Engineer
2 November 2022, KDnuggets

Xml Databases Software Market Thriving at a Tremendous Growth – TIMC
16 June 2024, TIMC

provided by Google News

ClickHouse Cloud & Amazon S3 Express One Zone: Making a blazing fast analytical database even faster | Amazon ...
28 November 2023, AWS Blog

Why Clickhouse Should Be Your Next Database
6 July 2023, The New Stack

Intel Xeon 6766E/6780E Sierra Forest vs. Ampere Altra Performance & Power Efficiency Review
5 June 2024, Phoronix

A 1000x Faster Database Solution: ClickHouse’s Aaron Katz
1 November 2023, GrowthCap

From Open Source to SaaS: the Journey of ClickHouse
16 January 2024, InfoQ.com

provided by Google News

Databricks is Taking the Ultimate Risk of Building 'USB for AI' – AIM
15 June 2024, Analytics India Magazine

The Three Big Announcements by Databricks AI Team in June 2024
17 June 2024, MarkTechPost

Databricks launches LakeFlow to help its customers build their data pipelines
12 June 2024, TechCrunch

Databricks tells investors annualized revenue will reach $2.4 billion at midway point of year
13 June 2024, CNBC

Databricks open-sources Unity Catalog, challenging Snowflake on interoperability for data workloads
12 June 2024, VentureBeat

provided by Google News


