DB-EnginesInfluxDB: Focus on building software with an easy-to-use serverless, scalable time series platformEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Elasticsearch vs. Impala vs. Spark SQL

System Properties Comparison Elasticsearch vs. Impala vs. Spark SQL

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameElasticsearch  Xexclude from comparisonImpala  Xexclude from comparisonSpark SQL  Xexclude from comparison
DescriptionA distributed, RESTful modern search and analytics engine based on Apache Lucene infoElasticsearch lets you perform and combine many types of searches such as structured, unstructured, geo, and metricAnalytic DBMS for HadoopSpark SQL is a component on top of 'Spark Core' for structured data processing
Primary database modelSearch engineRelational DBMSRelational DBMS
Secondary database modelsDocument store
Spatial DBMS
Document store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score155.08
Rank#8  Overall
#1  Search engines
Score17.56
Rank#40  Overall
#25  Relational DBMS
Score23.72
Rank#35  Overall
#21  Relational DBMS
Websitewww.elastic.co/­elasticsearchwww.cloudera.com/­products/­open-source/­apache-hadoop/­impala.htmlspark.apache.org/­sql
Technical documentationwww.elastic.co/­guide/­en/­elasticsearch/­reference/­current/­index.htmldocs.cloudera.com/­documentation/­enterprise/­latest/­topics/­impala.htmlspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperElasticClouderaApache Software Foundation
Initial release201020132014
Current release7.8.0, June 20204.0.0, July 20213.3.0, June 2022
License infoCommercial or Open SourceOpen Source infoElastic LicenseOpen Source infoApache Version 2Open Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageJavaC++Scala
Server operating systemsAll OS with a Java VMLinuxLinux
OS X
Windows
Data schemeschema-free infoFlexible type definitions. Once a type is defined, it is persistentyesyes
Typing infopredefined data types such as float or dateyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono
Secondary indexesyes infoAll search fields are automatically indexedyesno
SQL infoSupport of SQLSQL-like query languageSQL-like DML and DDL statementsSQL-like DML and DDL statements
APIs and other access methodsJava API
RESTful HTTP/JSON API
JDBC
ODBC
JDBC
ODBC
Supported programming languages.Net
Groovy
Community Contributed Clients
Java
JavaScript
Perl
PHP
Python
Ruby
All languages supporting JDBC/ODBCJava
Python
R
Scala
Server-side scripts infoStored proceduresyesyes infouser defined functions and integration of map-reduceno
Triggersyes infoby using the 'percolation' featurenono
Partitioning methods infoMethods for storing different data on different nodesShardingShardingyes, utilizing Spark Core
Replication methods infoMethods for redundantly storing data on multiple nodesyesselectable replication factornone
MapReduce infoOffers an API for user-defined Map/Reduce methodsES-Hadoop Connectoryes infoquery execution via MapReduce
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency infoSynchronous doc based replication. Get by ID may show delays up to 1 sec. Configurable write consistency: one, quorum, allEventual Consistency
Foreign keys infoReferential integritynonono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonono
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.Memcached and Redis integrationnono
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and Kerberosno

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesSQLFlow: Provides a visual representation of the overall flow of data. Automated SQL data lineage analysis across Databases, ETL, Business Intelligence, Cloud and Hadoop environments by parsing SQL Script and stored procedure.
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
ElasticsearchImpalaSpark SQL
DB-Engines blog posts

PostgreSQL is the DBMS of the Year 2017
2 January 2018, Paul Andlinger, Matthias Gelbmann

Elasticsearch moved into the top 10 most popular database management systems
3 July 2017, Matthias Gelbmann

MySQL, PostgreSQL and Redis are the winners of the March ranking
2 March 2016, Paul Andlinger

show all

Recent citations in the news

One year of OpenSearch: Grading AWS’ open source effort
18 August 2022, TechRepublic

5 Best Browsers that Support Kibana [Ranked by Compatibility]
4 August 2022, WindowsReport.com

Supercloud simplifies cloud architecture while leveling up automation
16 August 2022, SiliconANGLE News

What to Expect at the 2022 ElasticON; San Francisco, D.C., and NYC
12 August 2022, Solutions Review

How MongoDB's Atlas Helped Amadeus Reengineer a Crucial App
18 August 2022, thenewstack.io

provided by Google News

Cloudera Boosts Hadoop App Development On Impala
10 November 2014, InformationWeek

Man Busts Out of Google, Rebuilds Top-Secret Query Machine
24 October 2012, WIRED

How to deploy the Redash data visualization dashboard with the help of Docker
23 March 2022, News Azi

How to add a data source to Redash
5 April 2022, TechRepublic

Unravel Data Adds Native Support for Impala and Kafka
29 June 2017, insideBIGDATA

provided by Google News

Data chess game: Databricks vs. Snowflake, part 1
25 July 2022, VentureBeat

A Complete Guide to PySpark Data Frames
12 July 2022, Built In

Top 10 Big Data Architect Jobs to Apply for in August 2022
2 August 2022, Analytics Insight

Spark Gets Closer Hooks to Pandas, SQL with Version 3.2
26 October 2021, Datanami

Novel tuning method for Spark SQL applications
16 June 2022, Tech Xplore

provided by Google News

Job opportunities

Elasticsearch Developer
Pomeroy, Remote

Sr. DBA-MongoDB/Elasticsearch
Autofi, Remote

Elasticsearch SME
NR Consulting LLC, San Jose, CA

Full Stack Developer
Esolvit Inc.,, Amigo, WV

ELK Administrator - Remote
Compugra Systems Inc, Remote

Data Engineer
Analytica, Remote

Software Engineer Specialist
FIS Global, Cincinnati, OH

Senior Software Engineer
FIS Global, Cincinnati, OH

Business Intelligence Developer - Treasury and Liquidity
Barclays, Whippany, NJ

Planning Engineer II
Lumen, Remote

Data Engineer
Intone Networks, New York, NY

Principal Data Architect, Data Lake & Analytics
Amazon Dev Center U.S., Inc., Kentucky

Data Engineer
Intone Networks, Houston, TX

Sr. Data Architect, Data Lake & Analytics - Nationwide Opportunities
Amazon Web Services, Inc., West Virginia

Bigdata with Pyspark
Virtusa, Irving, TX

jobs by Indeed



Share this page

Featured Products

Redis logo

The world’s most loved real‑time data platform.
Try free

MariaDB logo

SkySQL, the ultimate
MariaDB cloud, is here.

Get started with SkySQL today!

The definitive guide for Cassandra

Imagine What You Could Do if Scalability Wasn‘t a Problem!
Download the Cassandra e-book for free!

AllegroGraph logo

Graph Database Leader for AI Knowledge Graph Applications - The Most Secure Graph Database Available.
Free Download

Neo4j logo

The right graph database.
Not a graph add-on.
Free forever.
Start solving today

Present your product here