DB-EnginesExtremeDB: the mission critical dbmsEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Elasticsearch vs. Impala vs. OrigoDB vs. Spark SQL

System Properties Comparison Elasticsearch vs. Impala vs. OrigoDB vs. Spark SQL

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameElasticsearch  Xexclude from comparisonImpala  Xexclude from comparisonOrigoDB  Xexclude from comparisonSpark SQL  Xexclude from comparison
DescriptionA distributed, RESTful modern search and analytics engine based on Apache Lucene infoElasticsearch lets you perform and combine many types of searches such as structured, unstructured, geo, and metricAnalytic DBMS for HadoopA fully ACID in-memory object graph databaseSpark SQL is a component on top of 'Spark Core' for structured data processing
Primary database modelSearch engineRelational DBMSDocument store
Object oriented DBMS
Relational DBMS
Secondary database modelsDocument store
Spatial DBMS
Document store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score144.93
Rank#8  Overall
#1  Search engines
Score17.82
Rank#39  Overall
#24  Relational DBMS
Score0.15
Rank#345  Overall
#50  Document stores
#19  Object oriented DBMS
Score20.62
Rank#36  Overall
#22  Relational DBMS
Websitewww.elastic.co/­elasticsearchwww.cloudera.com/­products/­open-source/­apache-hadoop/­impala.htmlorigodb.comspark.apache.org/­sql
Technical documentationwww.elastic.co/­guide/­en/­elasticsearch/­reference/­current/­index.htmldocs.cloudera.com/­documentation/­enterprise/­latest/­topics/­impala.htmlorigodb.com/­docsspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperElasticClouderaRobert Friberg et alApache Software Foundation
Initial release201020132009 infounder the name LiveDB2014
Current release7.8.0, June 20204.1.0, June 20223.3.0 ( 2.13), June 2022
License infoCommercial or Open SourceOpen Source infoElastic LicenseOpen Source infoApache Version 2Open SourceOpen Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenononono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageJavaC++C#Scala
Server operating systemsAll OS with a Java VMLinuxLinux
Windows
Linux
OS X
Windows
Data schemeschema-free infoFlexible type definitions. Once a type is defined, it is persistentyesyesyes
Typing infopredefined data types such as float or dateyesyesUser defined using .NET types and collectionsyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono infocan be achieved using .NETno
Secondary indexesyes infoAll search fields are automatically indexedyesyesno
SQL infoSupport of SQLSQL-like query languageSQL-like DML and DDL statementsnoSQL-like DML and DDL statements
APIs and other access methodsJava API
RESTful HTTP/JSON API
JDBC
ODBC
.NET Client API
HTTP API
LINQ
JDBC
ODBC
Supported programming languages.Net
Groovy
Community Contributed Clients
Java
JavaScript
Perl
PHP
Python
Ruby
All languages supporting JDBC/ODBC.NetJava
Python
R
Scala
Server-side scripts infoStored proceduresyesyes infouser defined functions and integration of map-reduceyesno
Triggersyes infoby using the 'percolation' featurenoyes infoDomain Eventsno
Partitioning methods infoMethods for storing different data on different nodesShardingShardinghorizontal partitioning infoclient side managed; servers are not synchronizedyes, utilizing Spark Core
Replication methods infoMethods for redundantly storing data on multiple nodesyesselectable replication factorSource-replica replicationnone
MapReduce infoOffers an API for user-defined Map/Reduce methodsES-Hadoop Connectoryes infoquery execution via MapReduceno
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency infoSynchronous doc based replication. Get by ID may show delays up to 1 sec. Configurable write consistency: one, quorum, allEventual Consistency
Foreign keys infoReferential integritynonodepending on modelno
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonoACIDno
Concurrency infoSupport for concurrent manipulation of datayesyesyesyes
Durability infoSupport for making data persistentyesyesyes infoWrite ahead logyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.Memcached and Redis integrationnoyesno
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and KerberosRole based authorizationno

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
ElasticsearchImpalaOrigoDBSpark SQL
DB-Engines blog posts

PostgreSQL is the DBMS of the Year 2017
2 January 2018, Paul Andlinger, Matthias Gelbmann

Elasticsearch moved into the top 10 most popular database management systems
3 July 2017, Matthias Gelbmann

MySQL, PostgreSQL and Redis are the winners of the March ranking
2 March 2016, Paul Andlinger

show all

Recent citations in the news

Searching for search: UK Gov wants to dump GOV.UK Elasticsearch
17 November 2022, The Stack

Elastic Reports Second Quarter Fiscal 2023 Financial Results
30 November 2022, Business Wire

Elasticsearch Scroll API vs Search After with PIT?
15 November 2022, Medium

Will Elastic N.V. (ESTC) be Able to Sustain Recession?
29 November 2022, Yahoo Finance

Elastic Named a Major Player in the 2022 IDC MarketScape Worldwide SIEM
6 December 2022, Business Wire

provided by Google News

Cloudera Boosts Hadoop App Development On Impala
10 November 2014, InformationWeek

Cloudera aims to bring real-time queries to Hadoop, big data
24 October 2012, ZDNet

Apache Impala 4 Supports Operator Multi-Threading
29 July 2021, iProgrammer

Man Busts Out of Google, Rebuilds Top-Secret Query Machine
24 October 2012, WIRED

How to add a data source to Redash
5 April 2022, TechRepublic

provided by Google News

Mobilize.Net Announces PySpark (Spark Python) to Snowpark Code Conversion Tool
16 November 2022, PR Newswire

Senior Big Data Engineer - Remote in Weehawken, NJ, USA - Apply Today!
2 December 2022, EPAM

Department for International Trade to develop global supply chain map
6 December 2022, UKAuthority.com

Tellius Announces Latest Version of Its Decision Intelligence Platform
9 November 2022, Datanami

A Deep Dive into Custom Spark Transformers for ML Pipelines
27 July 2022, CrowdStrike

provided by Google News

Job opportunities

Mid-Level Java / Elasticsearch Developer (Remote)
cloudteam, Jacksonville, FL

Backend Engineer
Sporty Group, Remote

Remote Jr. Java|Apache Camel Software Engineer
Datagrate, Inc., Remote

Data Engineer
PeakMetrics, Los Angeles, CA

Site Reliability Engineer
Conservice, River Heights, UT

Software Engineer Specialist
Humanity, Cincinnati, OH

Software Engineer Specialist
FIS Global, Cincinnati, OH

Sr. ETL Test Lead
Bank of America, Charlotte, NC

Federal - Data Engineer
Accenture, Cleveland, OH

Federal - Data Engineer
Accenture, Phoenix, AZ

Python/Spark/SQL Data Egnineer
Zettalogix, Iselin, NJ

Entry Level Data Engineer with BigData - 1
I28 Technologies Corporation, Remote

Spark-SQL Developer with Java - Plano, TX (Day 1 Onsite)
PRISMITCORP, Plano, TX

Data Scientist..
Cyber Resource, Houston, TX

Network Data Scientist
Altana AI, Brooklyn, NY

jobs by Indeed



Share this page

Featured Products

The definitive guide for Cassandra

Imagine What You Could Do if Scalability Wasn‘t a Problem!
Download the Cassandra e-book for free!

Vertica logo

Vertica Accelerator. The fastest analytics and machine learning, delivered as SaaS, with automated setup, administration, and management. Free trial.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Redis logo

The world’s most loved real‑time data platform.
Try free

MariaDB logo

SkySQL, the ultimate
MariaDB cloud, is here.

Get started with SkySQL today!

Present your product here