DB-EnginesInfluxDB: Focus on building software with an easy-to-use serverless, scalable time series platformEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Impala vs. Kinetica vs. Snowflake vs. Spark SQL

System Properties Comparison Impala vs. Kinetica vs. Snowflake vs. Spark SQL

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameImpala  Xexclude from comparisonKinetica  Xexclude from comparisonSnowflake  Xexclude from comparisonSpark SQL  Xexclude from comparison
DescriptionAnalytic DBMS for HadoopFully vectorized database across both GPUs and CPUsCloud-based data warehousing service for structured and semi-structured dataSpark SQL is a component on top of 'Spark Core' for structured data processing
Primary database modelRelational DBMSRelational DBMSRelational DBMSRelational DBMS
Secondary database modelsDocument store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score18.71
Rank#38  Overall
#23  Relational DBMS
Score0.74
Rank#202  Overall
#99  Relational DBMS
Score71.03
Rank#17  Overall
#11  Relational DBMS
Score22.97
Rank#34  Overall
#20  Relational DBMS
Websitewww.cloudera.com/­products/­open-source/­apache-hadoop/­impala.htmlwww.kinetica.comwww.snowflake.comspark.apache.org/­sql
Technical documentationdocs.cloudera.com/­documentation/­enterprise/­latest/­topics/­impala.htmlwww.kinetica.com/­docsdocs.snowflake.net/­manuals/­index.htmlspark.apache.org/­docs/­latest/­sql-programming-guide.html
DeveloperClouderaKineticaSnowflake Computing Inc.Apache Software Foundation
Initial release2013201220142014
Current release3.4.0, April 20207.1, August 20213.2.0, October 2021
License infoCommercial or Open SourceOpen Source infoApache Version 2commercialcommercialOpen Source infoApache 2.0
Cloud-based only infoOnly available as a cloud servicenonoyesno
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageC++C, C++Scala
Server operating systemsLinuxLinuxhostedLinux
OS X
Windows
Data schemeyesyesyes infosupport of semi-structured data formats (JSON, XML, Avro)yes
Typing infopredefined data types such as float or dateyesyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonoyesno
Secondary indexesyesyesno
SQL infoSupport of SQLSQL-like DML and DDL statementsSQL-like DML and DDL statementsyesSQL-like DML and DDL statements
APIs and other access methodsJDBC
ODBC
JDBC
ODBC
RESTful HTTP API
CLI Client
JDBC
ODBC
JDBC
ODBC
Supported programming languagesAll languages supporting JDBC/ODBCC++
Java
JavaScript (Node.js)
Python
JavaScript (Node.js)
Python
Java
Python
R
Scala
Server-side scripts infoStored proceduresyes infouser defined functions and integration of map-reduceuser defined functionsuser defined functionsno
Triggersnoyes infotriggers when inserted values for one or more columns fall within a specified rangeno infosimilar concept for controling cloud resourcesno
Partitioning methods infoMethods for storing different data on different nodesShardingShardingyesyes, utilizing Spark Core
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factorSource-replica replicationyesnone
MapReduce infoOffers an API for user-defined Map/Reduce methodsyes infoquery execution via MapReducenono
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual ConsistencyImmediate Consistency or Eventual Consistency depending on configurationImmediate Consistency
Foreign keys infoReferential integritynoyesyesno
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonoACIDno
Concurrency infoSupport for concurrent manipulation of datayesyesyesyes
Durability infoSupport for making data persistentyesyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.noyes infoGPU vRAM or System RAMnono
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and KerberosAccess rights for users and roles on table levelUsers with fine-grained authorization concept, user roles and pluggable authenticationno
More information provided by the system vendor
ImpalaKineticaSnowflakeSpark SQL
Specific characteristicsNative and fully vectorized database across both GPUs and CPUs, with memory first...
» more
Competitive advantagesBest in class geospatial and temporal analytics. Lockless architecture provides real-time...
» more
Typical application scenariosAny application requiring (1) real time insights, (2) time series analysis (3) geospatial...
» more
Key customersCitibank, US Air Force, Softbank, OVO, Telkomsel, USPS, 2 of the top 3 US Telcos,...
» more
Market metricsThe largest IoT deployment in the world, NORAD, runs on Kinetica.
» more
Licensing and pricing modelsAlways Free Tier in the Cloud, Pay-As-You-Go Consumption Based Pricing, and Portable...
» more

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesCData: Connect to Big Data & NoSQL through standard Drivers.
» more

SQLFlow: Provides a visual representation of the overall flow of data. Automated SQL data lineage analysis across Databases, ETL, Business Intelligence, Cloud and Hadoop environments by parsing SQL Script and stored procedure.
» more

DBHawk: Secure access to SQL, NoSQL and Cloud databases with an all-in-one solution.
» more
SQLFlow: Provides a visual representation of the overall flow of data. Automated SQL data lineage analysis across Databases, ETL, Business Intelligence, Cloud and Hadoop environments by parsing SQL Script and stored procedure.
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
ImpalaKineticaSnowflakeSpark SQL
Recent citations in the news

Cloudera Boosts Hadoop App Development On Impala
10 November 2014, InformationWeek

Cloudera’s Impala brings Hadoop to SQL and BI
25 October 2012, ZDNet

Man Busts Out of Google, Rebuilds Top-Secret Query Machine
24 October 2012, Wired

Cloudera aims to bring real-time queries to Hadoop, big data
24 October 2012, ZDNet

Unravel Data Adds Native Support for Impala and Kafka
29 June 2017, insideBIGDATA

provided by Google News

USD 361.56 million growth for GPU Database Market from 2020 to 2025 |Evolving Opportunities with BlazingSQL Inc., Brytlyt Ltd. & Hetero DB Co. Ltd.|17000+ Technavio Reports
26 November 2021, Kilgore News Herald

Kinetica Pushes Vectorization to the Edge with Microsoft Azure Stack HCI
5 November 2021, AiThority

Kinetica Database Now on Azure - insideHPC
14 October 2021, insideHPC

Kinetica Collaborates with NVIDIA to Serve Real-time Data-driven Insights
6 November 2019, MarTech Advisor

Preventing the Next 9/11 Goal of NORAD's New Streaming Data Warehouse
30 August 2021, Datanami

provided by Google News

Snowflake stock soars after software sales more than double
1 December 2021, MarketWatch

Trifacta Launches The Data Engineering Cloud For Snowflake
2 December 2021, AiThority

Data Specialist Rostovtseva Joins Snowflake in Sales Engineer Role
2 December 2021, Techwire.net

Immuta Joins the Data Governance Accelerated Program from Snowflake
24 November 2021, Database Trends and Applications

Announcing Okera for Snowflake, a SaaS Solution to Deliver Universal Data Authorization on Snowflake's Data Cloud
23 November 2021, PRNewswire

provided by Google News

Apache Spark Brings Pandas API with Version 3.2
4 November 2021, InfoQ.com

Compare Hadoop vs. Spark vs. Kafka for your big data strategy
15 November 2021, TechTarget

How Much Do Data Scientists Make?
1 December 2021, iLounge

2015 - Spark Takes the Big Data World by Storm
1 July 2021, Datanami

Microsoft : Accelerate big data analytics with Spark 3.0 connector for SQL Server—now generally available
13 July 2021, marketscreener.com

provided by Google News

Job opportunities

Tableau Developer
Capgemini, Chicago, IL

Senior Software Engineer
Lumen, Broomfield, CO

Tableau Administrator
MAXIMUS, Remote

Data Engineer
Harmonia Holdings Group, LLC, Washington, DC

Business Intelligence Developer-Tableau Centric
Early Warning Services, Scottsdale, AZ

SQL Developer -Entry Level-
PCS Global Tech, Seattle, WA

Snowflake Database Administrator
ZLLIUS INC., Remote

Entry Level / Free Bootcamp - Data Engineer / Data Analyst (STEM)
PCS Global Tech, San Diego, CA

Database Administrator
spar information systems, Los Angeles, CA

Snowflake Database Administrator
Merchants Fleet, New Hampshire

Python/Spark/SQL Data Egnineer
Zettalogix, Iselin, NJ

Analytics Consulting
Experian, East Hampton, NY

Analytics Consulting
Experian, Costa Mesa, CA

Sr. PL/SQL Developer ( with Hadoop )
Tektree, San Jose, CA

Data Scientist: 21-07353
Akraya Inc., California

jobs by Indeed



Share this page

Featured Products

Couchbase logo

The modern database for enterprise applications. Build fast. Scale Big. Save more.
Get started today.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Datastax Astra logo

Build cloud-native apps fast with Astra, the open-source, multi-cloud stack for
modern data apps.
Get started with 80GB free.

MariaDB logo

SkySQL, the ultimate
MariaDB cloud, is here.

Get started with SkySQL today!

AllegroGraph logo

Graph Database Leader for AI Knowledge Graph Applications - The Most Secure Graph Database Available.
Free Download

Present your product here