DB-EnginesInfluxDB: Focus on building software with an easy-to-use serverless, scalable time series platformEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Amazon CloudSearch vs. DuckDB vs. Spark SQL vs. Tarantool

System Properties Comparison Amazon CloudSearch vs. DuckDB vs. Spark SQL vs. Tarantool

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameAmazon CloudSearch  Xexclude from comparisonDuckDB  Xexclude from comparisonSpark SQL  Xexclude from comparisonTarantool  Xexclude from comparison
DescriptionA hosted search engine service by Amazon with the data stored in Amazons cloudAn embeddable, in-process, column-oriented SQL OLAP RDBMSSpark SQL is a component on top of 'Spark Core' for structured data processingIn-memory computing platform with a flexible data schema for efficiently building high-performance applications
Primary database modelSearch engineRelational DBMSRelational DBMSDocument store
Key-value store
Relational DBMS
Secondary database modelsSpatial DBMS infowith Tarantool/GIS extension
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score1.81
Rank#137  Overall
#12  Search engines
Score4.63
Rank#69  Overall
#37  Relational DBMS
Score18.04
Rank#33  Overall
#20  Relational DBMS
Score1.67
Rank#143  Overall
#25  Document stores
#25  Key-value stores
#65  Relational DBMS
Websiteaws.amazon.com/­cloudsearchduckdb.orgspark.apache.org/­sqlwww.tarantool.io
Technical documentationdocs.aws.amazon.com/­cloudsearchduckdb.org/­docsspark.apache.org/­docs/­latest/­sql-programming-guide.htmlwww.tarantool.io/­en/­doc
DeveloperAmazonApache Software FoundationVK
Initial release2012201820142008
Current release1.0.0, June 20243.5.0 ( 2.13), September 20232.10.0, May 2022
License infoCommercial or Open SourcecommercialOpen Source infoMIT LicenseOpen Source infoApache 2.0Open Source infoBSD-2, source-available extensions (modules), commercial licenses for Tarantool Enterprise
Cloud-based only infoOnly available as a cloud serviceyesnonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageC++ScalaC and C++
Server operating systemshostedserver-lessLinux
OS X
Windows
BSD
Linux
macOS
Data schemeyesyesyesFlexible data schema: relational definition for tables with ability to store json-like documents in columns
Typing infopredefined data types such as float or dateyesyesyesstring, double, decimal, uuid, integer, blob, boolean, datetime
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonono
Secondary indexesyes infoall search fields are automatically indexedyesnoyes
SQL infoSupport of SQLnoyesSQL-like DML and DDL statementsFull-featured ANSI SQL support
APIs and other access methodsHTTP APIArrow Database Connectivity (ADBC)
CLI Client
JDBC
ODBC
JDBC
ODBC
Open binary protocol
Supported programming languagesC
C# info3rd party driver
C++
Crystal info3rd party driver
Go info3rd party driver
Java
Lisp info3rd party driver
Python
R
Ruby info3rd party driver
Rust
Swift
Zig info3rd party driver
Java
Python
R
Scala
C
C#
C++
Erlang
Go
Java
JavaScript
Lua
Perl
PHP
Python
Rust
Server-side scripts infoStored proceduresnononoLua, C and SQL stored procedures
Triggersnononoyes, before/after data modification events, on replication events, client session events
Partitioning methods infoMethods for storing different data on different nodesyes infoautomatic partitioning across Amazon Search Instance as requirednoneyes, utilizing Spark CoreSharding, partitioned with virtual buckets by user defined affinity key. Live resharding for scale up and scale down without maintenance downtime.
Replication methods infoMethods for redundantly storing data on multiple nodesyes infomanaged transparently by AWSnonenoneAsynchronous replication with multi-master option
Configurable replication topology (full-mesh, chain, star)
Synchronous quorum replication (with Raft)
MapReduce infoOffers an API for user-defined Map/Reduce methodsnono
Consistency concepts infoMethods to ensure consistency in a distributed systemImmediate ConsistencyCasual consistency across sharding partitions
Eventual consistency within replicaset partition infowhen using asyncronous replication
Immediate Consistency within single instance
Sequential consistency including linearizable read within replicaset partition infowhen using Raft
Foreign keys infoReferential integritynononoyes
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanoACIDnoACID, with serializable isolation and linearizable read (within partition); Configurable MVCC (within partition); No cross-shard distributed transactions
Concurrency infoSupport for concurrent manipulation of datayesyes, multi-version concurrency control (MVCC)yesyes, cooperative multitasking
Durability infoSupport for making data persistentyesyesyesyes, write ahead logging
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.yesnoyes, full featured in-memory storage engine with persistence
User concepts infoAccess controlauthentication via encrypted signaturesnonoAccess Control Lists
Mutual TLS authentication for Tarantol Enterprise
Password based authentication
Role-based access control (RBAC) and LDAP for Tarantol Enterprise
Users and Roles

More information provided by the system vendor

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Amazon CloudSearchDuckDBSpark SQLTarantool
DB-Engines blog posts

Amazon - the rising star in the DBMS market
3 August 2015, Matthias Gelbmann

The DB-Engines ranking includes now search engines
4 February 2013, Paul Andlinger

show all

Data processing speed and reliability: in-memory synchronous replication
9 November 2021,  Vladimir Perepelytsya, Tarantool (sponsor) 

show all

Recent citations in the news

Amazon CloudSearch – Start Searching in One Hour for Less Than $100 / Month | Amazon Web Services
12 April 2012, AWS Blog

Searching CloudTrail Logs Easily with Amazon CloudSearch | AWS Startups Blog
21 October 2014, AWS Blog

AWS, Microsoft and Google should retire these cloud services
2 June 2020, TechTarget

CloudSearch Update – Price Reduction, Hebrew & Japanese Support, Partitioning, CloudTrail | Amazon Web Services
19 November 2014, AWS Blog

Serverless Reference Architectures with AWS Lambda
10 May 2016, All Things Distributed

provided by Google News

MotherDuck Announces General Availability; Brings Simplicity and Power of DuckDB in a Serverless Data Warehouse
11 June 2024, PR Newswire

DuckDB: The tiny but powerful analytics database
15 May 2024, InfoWorld

DuckDB promises greater stability with 1.0 release
5 June 2024, The Register

DuckDB: In-Process Python Analytics for Not-Quite-Big Data
31 May 2024, The New Stack

DuckDB Walks to the Beat of Its Own Analytics Drum
5 March 2024, Datanami

provided by Google News

Use Amazon Athena with Spark SQL for your open-source transactional table formats | Amazon Web Services
24 January 2024, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
1 April 2024, Simplilearn

Performance Insights from Sigma Rule Detections in Spark Streaming
1 June 2024, Towards Data Science

Simba Technologies(R) Introduces New, Powerful JDBC Driver With SQL Connector for Apache Spark(TM)
17 March 2024, Yahoo Singapore News

provided by Google News

Deploying Tarantool Cartridge applications with zero effort (Part 1)
16 December 2019, Хабр

VShard — horizontal scaling in Tarantool
7 March 2019, Хабр

Accelerating PHP connectors for Tarantool using Async, Swoole, and Parallel
18 December 2019, Хабр

provided by Google News



Share this page

Featured Products

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

Present your product here