DB-EnginesextremeDB - solve IoT connectivity disruptionsEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by Redgate Software

DBMS > Apache Impala vs. Apache Spark (SQL) vs. FeatureBase

System Properties Comparison Apache Impala vs. Apache Spark (SQL) vs. FeatureBase

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache Impala  Xexclude from comparisonApache Spark (SQL)  Xexclude from comparisonFeatureBase  Xexclude from comparison
DescriptionAnalytic DBMS for HadoopApache Spark SQL is a component on top of 'Spark Core' for structured data processingReal-time database platform that powers real-time analytics and machine learning applications by simultaneously executing low-latency, high-throughput, and highly concurrent workloads.
Primary database modelRelational DBMSRelational DBMSRelational DBMS
Secondary database modelsDocument store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score11.22
Rank#41  Overall
#25  Relational DBMS
Score20.40
Rank#29  Overall
#18  Relational DBMS
Score0.22
Rank#300  Overall
#135  Relational DBMS
Websiteimpala.apache.orgspark.apache.org/­sqlwww.featurebase.com
Technical documentationimpala.apache.org/­impala-docs.htmlspark.apache.org/­docs/­latest/­sql-programming-guide.htmldocs.featurebase.com
DeveloperApache Software Foundation infoApache top-level project, originally developed by ClouderaApache Software FoundationMolecula and Pilosa Open Source Contributors
Initial release201320142017
Current release4.1.0, June 20223.5.0 ( 2.13), September 20232022, May 2022
License infoCommercial or Open SourceOpen Source infoApache Version 2Open Source infoApache 2.0commercial
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageC++ScalaGo
Server operating systemsLinuxLinux
OS X
Windows
Linux
macOS
Data schemeyesyesyes
Typing infopredefined data types such as float or dateyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono
Secondary indexesyesnono
SQL infoSupport of SQLSQL-like DML and DDL statementsSQL-like DML and DDL statementsSQL queries
APIs and other access methodsJDBC
ODBC
JDBC
ODBC
gRPC
JDBC
Kafka Connector
ODBC
Supported programming languagesAll languages supporting JDBC/ODBCJava
Python
R
Scala
Java
Python
Server-side scripts infoStored proceduresyes infouser defined functions and integration of map-reduceno
Triggersnonono
Partitioning methods infoMethods for storing different data on different nodesShardingyes, utilizing Spark CoreSharding
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factornoneyes
MapReduce infoOffers an API for user-defined Map/Reduce methodsyes infoquery execution via MapReduce
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency
Foreign keys infoReferential integritynonoyes
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonoyes
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyes, using Linux fsync
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nonoyes
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and Kerberosno

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache ImpalaApache Spark (SQL)FeatureBase
Recent citations in the news

Apache Impala becomes Top-Level Project
28 November 2017, SD Times

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop
12 March 2017, Uber

Cloudera brings Apache Iceberg data lake format to its Data Platform
30 June 2022, SiliconANGLE

Apache Doris just ‘graduated’: Why care about this SQL data warehouse
24 June 2022, InfoWorld

Apache Iceberg is now available on the Cloudera Data Platform
4 July 2022, techzine.eu

provided by Google News

Scala vs Python for Apache Spark: An In-depth Comparison With Use Cases For Each
21 April 2025, Simplilearn.com

Introducing AWS Glue 5.0 for Apache Spark
4 December 2024, Amazon Web Services (AWS)

Docker + Spark on Kubernetes: Build Tiny Custom Executors in Minutes in 2025 | by Aleksei Aleinikov | Apr, 2025
21 April 2025, DataDrivenInvestor

How to run Pandas code on Spark
25 January 2025, Theodo Data & AI

The 6 Best Apache Spark Courses on Udemy to Consider for 2025
1 January 2025, solutionsreview.com

provided by Google News

Former Silicon Labs CEO Tyson Tuttle launches AI startup Circuit, acquires Austin startup Molecula
23 July 2024, The Business Journals

(PDF) Deciphering the Anti‐Diabetic Potential of Gymnema Sylvestre Using Integrated Computer‐Aided Drug Design and Network Pharmacology
15 January 2025, ResearchGate

Pilosa: A Scalable High Performance Bitmap Database Index
17 June 2019, HackerNoon

32 Data and Analytics Startups That Will Go Big, According to VCs
28 September 2021, Business Insider

provided by Google News



Share this page

Featured Products

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

SingleStore logo

The data platform to build your intelligent applications.
Try it free.

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Present your product here