DB-Engines: Knowledge Base of Relational and NoSQL Database Management Systems, provided by Redgate Software


System Properties Comparison Apache Spark (SQL) vs. Google Cloud Bigtable vs. Riak KV


Editorial information provided by DB-Engines
| Name | Apache Spark (SQL) | Google Cloud Bigtable | Riak KV |
| --- | --- | --- | --- |
| Description | Apache Spark SQL is a component on top of 'Spark Core' for structured data processing | Google's NoSQL Big Data database service. It's the same database that powers many core Google services, including Search, Analytics, Maps, and Gmail. | Distributed, fault tolerant key-value store |
| Primary database model | Relational DBMS | Key-value store; Wide column store | Key-value store (with links between data sets and object tags for the creation of secondary indexes) |
| DB-Engines Ranking (measures the popularity of database management systems): Score | 21.62 | 2.66 | 3.21 |
| DB-Engines Ranking: Rank | #29 Overall; #18 Relational DBMS | #92 Overall; #15 Key-value stores; #6 Wide column stores | #82 Overall; #11 Key-value stores |
| Website | spark.apache.org/sql | cloud.google.com/bigtable | |
| Technical documentation | spark.apache.org/docs/latest/sql-programming-guide.html | cloud.google.com/bigtable/docs | www.tiot.jp/riak-docs/riak/kv/latest |
| Developer | Apache Software Foundation | Google | OpenSource, formerly Basho Technologies |
| Initial release | 2014 | 2015 | 2009 |
| Current release | 3.5.0 (2.13), September 2023 | | 3.2.0, December 2022 |
| License (commercial or open source) | Open Source (Apache 2.0) | commercial | Open Source (Apache version 2, commercial enterprise edition) |
| Cloud-based only (only available as a cloud service) | no | yes | no |
| Implementation language | Scala | | Erlang |
| Server operating systems | Linux; OS X; Windows | hosted | Linux; OS X |
| Data scheme | yes | schema-free | schema-free |
| Typing (predefined data types such as float or date) | yes | no | no |
| XML support (some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT) | no | no | no |
| Secondary indexes | no | no | restricted |
| SQL (support of SQL) | SQL-like DML and DDL statements | no | no |
| APIs and other access methods | JDBC; ODBC | gRPC (using protocol buffers) API; HappyBase (Python library); HBase compatible API (Java) | HTTP API; Native Erlang Interface |
| Supported programming languages | Java; Python; R; Scala | C#; C++; Go; Java; JavaScript (Node.js); Python | C (unofficial client library); C#; C++ (unofficial); Clojure (unofficial); Dart (unofficial); Erlang; Go (unofficial); Groovy (unofficial); Haskell (unofficial); Java; JavaScript (unofficial); Lisp (unofficial); Perl (unofficial); PHP; Python; Ruby; Scala (unofficial); Smalltalk (unofficial) |
| Server-side scripts (stored procedures) | no | no | Erlang |
| Triggers | no | no | yes (pre-commit hooks and post-commit hooks) |
| Partitioning methods (methods for storing different data on different nodes) | yes, utilizing Spark Core | Sharding | Sharding (no "single point of failure") |
| Replication methods (methods for redundantly storing data on multiple nodes) | none | Internal replication in Colossus, and regional replication between two clusters in different zones | selectable replication factor |
| MapReduce (offers an API for user-defined Map/Reduce methods) | | yes | yes |
| Consistency concepts (methods to ensure consistency in a distributed system) | | Immediate consistency (for a single cluster), Eventual consistency (for two or more replicated clusters) | Eventual Consistency |
| Foreign keys (referential integrity) | no | no | no (links between data sets can be stored) |
| Transaction concepts (support to ensure data integrity after non-atomic manipulations of data) | no | Atomic single-row operations | no |
| Concurrency (support for concurrent manipulation of data) | yes | yes | yes |
| Durability (support for making data persistent) | yes | yes | yes |
| In-memory capabilities (is there an option to define some or all structures to be held in-memory only) | no | no | |
| User concepts (access control) | no | Access rights for users, groups and roles based on Google Cloud Identity and Access Management (IAM) | yes, using Riak Security |
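
The Riak KV entries "selectable replication factor" and "Eventual Consistency" are related, and a small sketch may help show how. The Python below is a minimal illustration, assuming Dynamo-style quorum settings: `n` copies of each key, `r` replicas that must answer a read, and `w` replicas that must acknowledge a write. The class `QuorumConfig` and its method names are hypothetical and are not part of any Riak client library. The sketch also ignores sloppy quorums, hinted handoff, and concurrent writers, all of which affect real behaviour.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QuorumConfig:
    """Hypothetical holder for Dynamo-style replication settings."""

    n: int  # replication factor: number of copies kept of each key
    r: int  # replicas that must answer before a read returns
    w: int  # replicas that must acknowledge before a write returns

    def __post_init__(self) -> None:
        # Reject settings that could never be satisfied by n replicas.
        if not (1 <= self.r <= self.n and 1 <= self.w <= self.n):
            raise ValueError("r and w must lie between 1 and n")

    def read_overlaps_write(self) -> bool:
        # When r + w > n, every read set shares at least one replica
        # with every write set, so a read can observe the latest
        # acknowledged write. Otherwise a read may return stale data.
        return self.r + self.w > self.n

    def write_failures_tolerated(self) -> int:
        # Number of replicas that may be unreachable while a write
        # can still collect w acknowledgements.
        return self.n - self.w


if __name__ == "__main__":
    for cfg in (QuorumConfig(3, 2, 2), QuorumConfig(3, 1, 1)):
        print(cfg, cfg.read_overlaps_write(), cfg.write_failures_tolerated())
```

With three replicas, `r = w = 2` makes read and write sets overlap while still tolerating one unreachable replica per write; `r = w = 1` favours availability and latency, at the cost that a read may miss a recent write until replicas converge.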


More resources
Recent citations in the news (provided by Google News)

Apache Spark (SQL)

Introducing AWS Glue 5.0 for Apache Spark
4 December 2024, Amazon Web Services (AWS)

Scala vs Python for Apache Spark: An In-depth Comparison With Use Cases For Each
21 April 2025, Simplilearn.com

How to run Pandas code on Spark
25 January 2025, Theodo Data & AI

The 6 Best Apache Spark Courses on Udemy to Consider for 2025
1 January 2025, solutionsreview.com

18 top big data tools and technologies to know about in 2025
22 January 2025, TechTarget

Google Cloud Bigtable

What’s next for Google Cloud databases? AI inside SQL and more
9 April 2025, VentureBeat

Google introduces Bigtable SQL access and Spanner's new AI-ready features
1 August 2024, ZDNet

Google Cloud adds graph processing to Spanner, SQL support to Bigtable
1 August 2024, InfoWorld

Google Introduces Autoscaling for Cloud Bigtable for Optimizing Costs
31 January 2022, infoq.com

Google scales up Cloud Bigtable NoSQL database
27 January 2022, TechTarget

Riak KV

Basho Revamps Riak Open-Source Database
22 September 2023, Information Week

A Critique of Resizable Hash Tables: Riak Core & Random Slicing
26 August 2018, infoq.com

NoSQL pioneer Basho stamps its mark on time stamp data with Riak TS
6 October 2015, theregister.com

Enterprise NoSQL Database for the IoT Becomes Open Source
12 May 2016, Engineering.com

Overview of Riak, Redis and Voldermort features
2 April 2021, ResearchGate


