DB-EnginesExtremeDB for everyone with an RTOSEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Apache Impala vs. Apache Kylin vs. ArcadeDB vs. Snowflake vs. Sphinx

System Properties Comparison Apache Impala vs. Apache Kylin vs. ArcadeDB vs. Snowflake vs. Sphinx

Editorial information provided by DB-Engines
NameApache Impala  Xexclude from comparisonApache Kylin  Xexclude from comparisonArcadeDB  Xexclude from comparisonSnowflake  Xexclude from comparisonSphinx  Xexclude from comparison
DescriptionAnalytic DBMS for HadoopA distributed analytics engine for big data, providing a SQL interface and multi-dimensional analysis (OLAP) and leveraging the Hadoop stackFast and scalable multi-model DBMS, originally forked from OrientDB but most of the code has been rewrittenCloud-based data warehousing service for structured and semi-structured dataOpen source search engine for searching in data from different sources, e.g. relational databases
Primary database modelRelational DBMSRelational DBMSDocument store
Graph DBMS
Key-value store
Time Series DBMS infoin next version
Relational DBMSSearch engine
Secondary database modelsDocument store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score13.77
Rank#40  Overall
#24  Relational DBMS
Score1.18
Rank#172  Overall
#79  Relational DBMS
Score0.02
Rank#366  Overall
#50  Document stores
#38  Graph DBMS
#53  Key-value stores
#36  Time Series DBMS
Score121.33
Rank#9  Overall
#6  Relational DBMS
Score5.98
Rank#56  Overall
#5  Search engines
Websiteimpala.apache.orgkylin.apache.orgarcadedb.comwww.snowflake.comsphinxsearch.com
Technical documentationimpala.apache.org/­impala-docs.htmlkylin.apache.org/­docsdocs.arcadedb.comdocs.snowflake.net/­manuals/­index.htmlsphinxsearch.com/­docs
DeveloperApache Software Foundation infoApache top-level project, originally developed by ClouderaApache Software Foundation, originally contributed from eBay IncArcade DataSnowflake Computing Inc.Sphinx Technologies Inc.
Initial release20132015202120142001
Current release4.1.0, June 20223.1.0, July 2020September 20213.5.1, February 2023
License infoCommercial or Open SourceOpen Source infoApache Version 2Open Source infoApache Version 2.0Open Source infoApache Version 2.0commercialOpen Source infoGPL version 2, commercial licence available
Cloud-based only infoOnly available as a cloud servicenononoyesno
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageC++JavaJavaC++
Server operating systemsLinuxLinuxAll OS with a Java VMhostedFreeBSD
Linux
NetBSD
OS X
Solaris
Windows
Data schemeyesyesschema-freeyes infosupport of semi-structured data formats (JSON, XML, Avro)yes
Typing infopredefined data types such as float or dateyesyesyesyesno
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nononoyes
Secondary indexesyesyesyesyes infofull-text index on all search fields
SQL infoSupport of SQLSQL-like DML and DDL statementsANSI SQL for queries (using Apache Calcite)SQL-like query language, no joinsyesSQL-like query language (SphinxQL)
APIs and other access methodsJDBC
ODBC
JDBC
ODBC
RESTful HTTP API
JDBC
MongoDB API
OpenCypher
PostgreSQL wire protocol
Redis API
RESTful HTTP/JSON API
TinkerPop Gremlin
CLI Client
JDBC
ODBC
Proprietary protocol
Supported programming languagesAll languages supporting JDBC/ODBCJavaJavaScript (Node.js)
Python
C++ infounofficial client library
Java
Perl infounofficial client library
PHP
Python
Ruby infounofficial client library
Server-side scripts infoStored proceduresyes infouser defined functions and integration of map-reduceuser defined functionsno
Triggersnono infosimilar concept for controling cloud resourcesno
Partitioning methods infoMethods for storing different data on different nodesShardingyesSharding infoPartitioning is done manually, search queries against distributed index is supported
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factorSource-replica replicationyesnone
MapReduce infoOffers an API for user-defined Map/Reduce methodsyes infoquery execution via MapReduceyesnonono
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual ConsistencyImmediate ConsistencyImmediate ConsistencyImmediate Consistency
Foreign keys infoReferential integritynoyes inforelationship in graphsyesno
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanoACIDACIDno
Concurrency infoSupport for concurrent manipulation of datayesyesyesyesyes
Durability infoSupport for making data persistentyesyesyesyesyes infoThe original contents of fields are not stored in the Sphinx index.
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nonono
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and KerberosUsers with fine-grained authorization concept, user roles and pluggable authenticationno

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesCData: Connect to Big Data & NoSQL through standard Drivers.
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache ImpalaApache KylinArcadeDBSnowflakeSphinx
DB-Engines blog posts

Snowflake is the DBMS of the Year 2022, defending the title from last year
3 January 2023, Matthias Gelbmann, Paul Andlinger

Snowflake is the DBMS of the Year 2021
3 January 2022, Paul Andlinger, Matthias Gelbmann

show all

The DB-Engines ranking includes now search engines
4 February 2013, Paul Andlinger

show all

Recent citations in the news

Apache Impala 4 Supports Operator Multi-Threading
29 July 2021, iProgrammer

Apache Impala becomes Top-Level Project
28 November 2017, SDTimes.com

Cloudera Bringing Impala to AWS Cloud
28 November 2017, Datanami

Apache Doris just 'graduated': Why care about this SQL data warehouse
24 June 2022, InfoWorld

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop
12 March 2017, Uber

provided by Google News

Snowflake Stock: Is It A Buy Right Now? Here's What Earnings, SNOW Stock Chart Show
6 May 2024, Investor's Business Daily

Snowflake Unveils the Future of Enterprise AI, Apps, and Data at Sixth-Annual Data Cloud Summit
8 May 2024, Business Wire

Snowflake Data Clean Rooms Democratize Secure Data Sharing Across Clouds
24 April 2024, Acceleration Economy

Snowflake’s Data Clean Room promises to ease analysis of PII data
28 March 2024, InfoWorld

Infosys at Snowflake Data Cloud Summit 2024
3 May 2024, Infosys

provided by Google News

Switching From Sphinx to MkDocs Documentation — What Did I Gain and Lose
2 February 2024, Towards Data Science

Manticore is a Faster Alternative to Elasticsearch in C++
25 July 2022, hackernoon.com

Perplexity AI: From Its Use To Operation, Everything You Need To Know About Googles Newest Challenger
11 January 2024, Free Press Journal

The Pirate Bay was recently down for over a week due to a DDoS attack
29 October 2019, The Hacker News

How to Build 600+ Links in One Month
4 September 2020, Search Engine Journal

provided by Google News



Share this page

Featured Products

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

SingleStore logo

Database for your real-time AI and Analytics Apps.
Try it today.

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Present your product here