DB-Engines: Knowledge Base of Relational and NoSQL Database Management Systems, provided by solid IT

System Properties Comparison Google BigQuery vs. Impala vs. Spark SQL

Editorial information provided by DB-Engines
Name | Google BigQuery | Impala | Spark SQL
Description | Large scale data warehouse service with append-only tables | Analytic DBMS for Hadoop | Spark SQL is a component on top of 'Spark Core' for structured data processing
Primary database model | Relational DBMS | Relational DBMS | Relational DBMS
Secondary database models |  | Document store |
DB-Engines Ranking score (measures the popularity of database management systems) | 56.57 | 18.51 | 18.74
Rank, overall | #20 | #37 | #36
Rank, relational DBMS | #14 | #23 | #22
Website | cloud.google.com/bigquery | www.cloudera.com/products/open-source/apache-hadoop/impala.html | spark.apache.org/sql
Technical documentation | cloud.google.com/bigquery/docs | docs.cloudera.com/documentation/enterprise/latest/topics/impala.html | spark.apache.org/docs/latest/sql-programming-guide.html
Developer | Google | Cloudera | Apache Software Foundation
Initial release | 2010 | 2013 | 2014
Current release |  | 4.1.0, June 2022 | 3.4.1 (2.13), June 2023
License (commercial or open source) | commercial | Open Source (Apache Version 2) | Open Source (Apache 2.0)
Cloud-based only (only available as a cloud service) | yes | no | no
Implementation language |  | C++ | Scala
Server operating systems | hosted | Linux | Linux, OS X, Windows
Data scheme | yes | yes | yes
Typing (predefined data types such as float or date) | yes | yes | yes
XML support (some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT) | no | no | no
Secondary indexes | no | yes | no
SQL (support of SQL) | yes | SQL-like DML and DDL statements | SQL-like DML and DDL statements
APIs and other access methods | RESTful HTTP/JSON API | JDBC, ODBC | JDBC, ODBC
Supported programming languages | .Net, Java, JavaScript, Objective-C, PHP, Python, Ruby | All languages supporting JDBC/ODBC | Java, Python, R, Scala
Server-side scripts (stored procedures) | user defined functions (in JavaScript) | yes (user defined functions and integration of map-reduce) | no
Triggers | no | no | no
Partitioning methods (methods for storing different data on different nodes) | none | Sharding | yes, utilizing Spark Core
Replication methods (methods for redundantly storing data on multiple nodes) |  | selectable replication factor | none
MapReduce (offers an API for user-defined Map/Reduce methods) | no | yes (query execution via MapReduce) |
Consistency concepts (methods to ensure consistency in a distributed system) | Immediate Consistency | Eventual Consistency |
Foreign keys (referential integrity) | no | no | no
Transaction concepts (support to ensure data integrity after non-atomic manipulations of data) | no (since BigQuery is designed for querying data) | no | no
Concurrency (support for concurrent manipulation of data) | yes | yes | yes
Durability (support for making data persistent) | yes | yes | yes
In-memory capabilities (is there an option to define some or all structures to be held in-memory only) | no | no | no
User concepts (access control) | Access privileges (owner, writer, reader) on dataset, table or view level, via Google Cloud Identity & Access Management (IAM) | Access rights for users, groups and roles, based on Apache Sentry and Kerberos | no
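
The comparison lists a RESTful HTTP/JSON API as BigQuery's access method, while Impala and Spark SQL are reached through JDBC/ODBC. As a rough illustration of what the REST route can look like, the sketch below builds, but deliberately does not send, a request to BigQuery's synchronous query endpoint using only the Python standard library. The helper name `build_query_request`, the project ID, and the token value are hypothetical placeholders; in practice an OAuth 2.0 access token is required and most users would reach for Google's client libraries instead.

```python
import json
import urllib.request

# BigQuery REST endpoint for synchronous queries (the jobs.query method).
BIGQUERY_QUERY_URL = (
    "https://bigquery.googleapis.com/bigquery/v2/projects/{project_id}/queries"
)


def build_query_request(
    project_id: str, sql: str, access_token: str
) -> urllib.request.Request:
    """Build (but do not send) a POST request that would run `sql` in BigQuery.

    `project_id` and `access_token` are caller-supplied; this hypothetical
    helper only assembles the URL, headers, and JSON body.
    """
    body = json.dumps({"query": sql, "useLegacySql": False}).encode("utf-8")
    return urllib.request.Request(
        url=BIGQUERY_QUERY_URL.format(project_id=project_id),
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder values for illustration only; nothing is sent over the network.
    req = build_query_request("my-project", "SELECT 1 AS x", "PLACEHOLDER_TOKEN")
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would actually issue the query; keeping construction separate from sending makes the request easy to inspect before any credentials are involved.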
