DB-EnginesExtremeDB for everyone with an RTOSEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Apache Impala vs. ArangoDB vs. Sphinx vs. YTsaurus

System Properties Comparison Apache Impala vs. ArangoDB vs. Sphinx vs. YTsaurus

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache Impala  Xexclude from comparisonArangoDB  Xexclude from comparisonSphinx  Xexclude from comparisonYTsaurus  Xexclude from comparison
DescriptionAnalytic DBMS for HadoopNative multi-model DBMS for graph, document, key/value and search. All in one engine and accessible with one query language.Open source search engine for searching in data from different sources, e.g. relational databasesYTsaurus is an open source platform for distributed storage and processing.
Primary database modelRelational DBMSDocument store
Graph DBMS
Key-value store
Search engine
Search engineDocument store
Key-value store
Secondary database modelsDocument store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score13.77
Rank#40  Overall
#24  Relational DBMS
Score3.32
Rank#90  Overall
#15  Document stores
#5  Graph DBMS
#12  Key-value stores
#10  Search engines
Score5.98
Rank#56  Overall
#5  Search engines
Score0.11
Rank#340  Overall
#45  Document stores
#50  Key-value stores
Websiteimpala.apache.orgarangodb.comsphinxsearch.comytsaurus.tech
Technical documentationimpala.apache.org/­impala-docs.htmldocs.arangodb.comsphinxsearch.com/­docsytsaurus.tech/­docs/­en
Social network pagesLinkedInTwitterFacebookYouTubeInstagram
DeveloperApache Software Foundation infoApache top-level project, originally developed by ClouderaArangoDB Inc.Sphinx Technologies Inc.Yandex
Initial release2013201220012023
Current release4.1.0, June 20223.11.5, November 20233.5.1, February 2023
License infoCommercial or Open SourceOpen Source infoApache Version 2Open Source infoApache Version 2; Commercial license (Enterprise) availableOpen Source infoGPL version 2, commercial licence availableOpen Source infoApache License, Version 2.0
Cloud-based only infoOnly available as a cloud servicenononono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
ArangoDB Cloud –The Managed Cloud Service of ArangoDB. Provides fully managed, and monitored cluster deployments of any size, with enterprise-grade security. Get started for free and continue for as little as $0,21/hour.
Implementation languageC++C++C++C++
Server operating systemsLinuxLinux
OS X
Windows
FreeBSD
Linux
NetBSD
OS X
Solaris
Windows
Ubuntu
Data schemeyesschema-free infoautomatically recognizes schema within a collectionyes
Typing infopredefined data types such as float or dateyesyes infostring, double, boolean, list, hashnoyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nono
Secondary indexesyesyesyes infofull-text index on all search fields
SQL infoSupport of SQLSQL-like DML and DDL statementsnoSQL-like query language (SphinxQL)YQL, an SQL-based language, is supported
APIs and other access methodsJDBC
ODBC
AQL
Foxx Framework
Graph API (Gremlin)
GraphQL query language
HTTP API
Java & SpringData
JSON style queries
VelocyPack/VelocyStream
Proprietary protocol
Supported programming languagesAll languages supporting JDBC/ODBCC#
C++
Clojure
Elixir
Go
Java
JavaScript (Node.js)
PHP
Python
R
Rust
C++ infounofficial client library
Java
Perl infounofficial client library
PHP
Python
Ruby infounofficial client library
C++
Go
Java
JavaScript
Python
Server-side scripts infoStored proceduresyes infouser defined functions and integration of map-reduceJavaScriptno
Triggersnonono
Partitioning methods infoMethods for storing different data on different nodesShardingSharding infosince version 2.0Sharding infoPartitioning is done manually, search queries against distributed index is supportedSharding
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factorSource-replica replication with configurable replication factornoneyes
MapReduce infoOffers an API for user-defined Map/Reduce methodsyes infoquery execution via MapReduceno infocan be done with stored procedures in JavaScriptnoyes
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual ConsistencyEventual Consistency infoconfigurable per collection or per write
Immediate Consistency
OneShard (highly available, fault-tolerant deployment mode with ACID semantics)
Foreign keys infoReferential integritynoyes inforelationships in graphsno
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanoACIDnoACID
Concurrency infoSupport for concurrent manipulation of datayesyesyesyes
Durability infoSupport for making data persistentyesyesyes infoThe original contents of fields are not stored in the Sphinx index.yes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.no
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and KerberosyesnoAccess Control Lists
More information provided by the system vendor
Apache ImpalaArangoDBSphinxYTsaurus
Specific characteristicsGraph and Beyond. With more than 11,000 stargazers on GitHub, ArangoDB is the leading...
» more
Competitive advantagesConsolidation: As a native multi-model database, can be used as a full blown document...
» more
Typical application scenariosNative multi-model in ArangoDB is being used for a broad range of projects across...
» more
Key customersCisco, Barclays, Refinitive, Siemens Mentor, Kabbage, Liaison, Douglas, MakeMyTrip,...
» more
Market metricsArangoDB is the leading native multi-model database with over 11,000 stargazers on...
» more
Licensing and pricing modelsVery permissive Apache 2 License for Community Edition & commercial licenses are...
» more

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache ImpalaArangoDBSphinxYTsaurus
DB-Engines blog posts

The Weight of Relational Databases: Time for Multi-Model?
29 August 2017, Luca Olivari (guest author)

show all

The DB-Engines ranking includes now search engines
4 February 2013, Paul Andlinger

show all

Recent citations in the news

Apache Impala becomes Top-Level Project
28 November 2017, SDTimes.com

Cloudera Bringing Impala to AWS Cloud
28 November 2017, Datanami

Apache Doris just 'graduated': Why care about this SQL data warehouse
24 June 2022, InfoWorld

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop
12 March 2017, Uber

Updates & Upserts in Hadoop Ecosystem with Apache Kudu
27 October 2017, KDnuggets

provided by Google News

ArangoGraphML: Simplifying the Power of Graph Machine Learning
11 October 2023, Datanami

GQL: A New ISO Standard for Querying Graph Databases
29 April 2024, The New Stack

ArangoDB Announces Release of ArangoDB 3.11 for Search, Graph and Analytics - High-Performance Computing ...
30 May 2023, insideHPC

How to Build Knowledge Graph Enhanced Chatbot with ChatGPT and ArangoDB
30 June 2023, DataDrivenInvestor

ArangoDB expands scope of graph database platform
6 October 2022, TechTarget

provided by Google News

Switching From Sphinx to MkDocs Documentation — What Did I Gain and Lose
2 February 2024, Towards Data Science

Manticore is a Faster Alternative to Elasticsearch in C++
25 July 2022, hackernoon.com

Perplexity AI: From Its Use To Operation, Everything You Need To Know About Googles Newest Challenger
11 January 2024, Free Press Journal

The Pirate Bay was recently down for over a week due to a DDoS attack
29 October 2019, The Hacker News

How to Build 600+ Links in One Month
4 September 2020, Search Engine Journal

provided by Google News

«Яндекс» открыл исходный код платформы для работы с большими данными YTsaurus
20 March 2023, ServerNews

«Яндекс» опубликовал исходный код платформы для работы с большими данными YTsaurus
21 March 2023, Rozetked.me

«Яндекс» начал приём заявок на гранты для опенсорс-проектов в рамках Yandex Open Source
27 November 2023, 3DNews

provided by Google News



Share this page

Featured Products

SingleStore logo

The database to transact, analyze and contextualize your data in real time.
Try it today.

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Present your product here