DB-EnginesInfluxDB: Focus on building software with an easy-to-use serverless, scalable time series platformEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Apache Impala vs. Kdb vs. ScyllaDB vs. Weaviate

System Properties Comparison Apache Impala vs. Kdb vs. ScyllaDB vs. Weaviate

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameApache Impala  Xexclude from comparisonKdb  Xexclude from comparisonScyllaDB  Xexclude from comparisonWeaviate  Xexclude from comparison
DescriptionAnalytic DBMS for HadoopHigh performance Time Series DBMSCassandra and DynamoDB compatible wide column storeAn AI-native realtime vector database engine that integrates scalable machine learning models.
Primary database modelRelational DBMSTime Series DBMS
Vector DBMS
Wide column storeVector DBMS
Secondary database modelsDocument storeRelational DBMSKey-value store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score13.77
Rank#40  Overall
#24  Relational DBMS
Score7.55
Rank#53  Overall
#3  Time Series DBMS
#1  Vector DBMS
Score4.75
Rank#68  Overall
#5  Wide column stores
Score1.73
Rank#143  Overall
#5  Vector DBMS
Websiteimpala.apache.orgkx.comwww.scylladb.comgithub.com/­weaviate/­weaviate
weaviate.io
Technical documentationimpala.apache.org/­impala-docs.htmlcode.kx.comdocs.scylladb.comweaviate.io/­developers/­weaviate
DeveloperApache Software Foundation infoApache top-level project, originally developed by ClouderaKx Systems, a division of First Derivatives plcScyllaDBWeaviate B.V.
Initial release20132000 infokdb was released 2000, kdb+ in 200320152019
Current release4.1.0, June 20223.6, May 2018ScyllaDB Open Source 5.4.1, January 20241.19, May 2023
License infoCommercial or Open SourceOpen Source infoApache Version 2commercial infofree 32-bit versionOpen Source infoOpen Source (AGPL), commercial license availableOpen Source infocommercial license available with Weaviate Enterprise
Cloud-based only infoOnly available as a cloud servicenononono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Scylla Cloud: Create real-time applications that run at global scale with Scylla Cloud, the industry’s most powerful NoSQL DBaaS
Implementation languageC++qC++Go
Server operating systemsLinuxLinux
OS X
Solaris
Windows
Linux
Data schemeyesyesschema-freeyes, maps to GraphQL interface
Typing infopredefined data types such as float or dateyesyesyesyes infostring, int, float, geo point, date, cross reference, fuzzy references
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.noyesnono
Secondary indexesyesyes infotable attribute 'grouped'yes infocluster global secondary indicesyes infoall data objects are indexed in a semantic vector space (the Contextionary), all primitive fields are indexed
SQL infoSupport of SQLSQL-like DML and DDL statementsSQL-like query language (q)SQL-like DML and DDL statements (CQL)GraphQL is used as query language
APIs and other access methodsJDBC
ODBC
HTTP API
JDBC
Jupyter
Kafka
ODBC
WebSocket
Proprietary protocol (CQL) infocompatible with CQL (Cassandra Query Language, an SQL-like language)
RESTful HTTP API (DynamoDB compatible)
Thrift
GraphQL query language
RESTful HTTP/JSON API
Supported programming languagesAll languages supporting JDBC/ODBCC
C#
C++
Go
J
Java
JavaScript
Lua
MatLab
Perl
PHP
Python
R
Scala
For CQL interface: C#, C++, Clojure, Erlang, Go, Haskell, Java, JavaScript, Node.js, Perl, PHP, Python, Ruby, Rust, Scala
For DynamoDB interface: .Net, ColdFusion, Erlang, Groovy, Java, JavaScript, Perl, PHP, Python, Ruby
JavaScript / TypeScript
Python
Server-side scripts infoStored proceduresyes infouser defined functions and integration of map-reduceuser defined functionsyes, Luano
Triggersnoyes infowith viewsnono
Partitioning methods infoMethods for storing different data on different nodesShardinghorizontal partitioningShardingSharding
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factorSource-replica replicationselectable replication factor infoRepresentation of geographical distribution of servers is possibleyes
MapReduce infoOffers an API for user-defined Map/Reduce methodsyes infoquery execution via MapReduceno infosimilar paradigm used for internal processingnono
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual ConsistencyImmediate ConsistencyEventual Consistency
Tunable Consistency infocan be individually decided for each write operation
Eventual Consistency
Foreign keys infoReferential integritynoyesnono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datanonono infoAtomicity and isolation are supported for single operationsno
Concurrency infoSupport for concurrent manipulation of datayesyesyesyes
Durability infoSupport for making data persistentyesyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.noyesyes infoin-memory tablesyes
User concepts infoAccess controlAccess rights for users, groups and roles infobased on Apache Sentry and Kerberosrights management via user accountsAccess rights for users can be defined per objectAPI Keys
OpenID Connect Discovery
More information provided by the system vendor
Apache ImpalaKdbScyllaDBWeaviate
Specific characteristicsIntegrated columnar database & programming system for streaming, real time and historical...
» more
ScyllaDB is engineered to deliver predictable performance at scale. It’s adopted...
» more
Weaviate is an open source vector database that is robust, scalable, cloud-native,...
» more
Competitive advantagesprovides seamless scalability; runs on industry standard server platforms; is top-ranked...
» more
Highly-performant (efficiently utilizes full resources of a node and network; millions...
» more
Flexible deployment - Free, open source or fully-managed cloud vector database service...
» more
Typical application scenariostick database streaming sensor data massive intelligence applications oil and gas...
» more
ScyllaDB is ideal for applications that require high throughput and low latency at...
» more
As a database supporting the development of generative AI and semantic search applications...
» more
Key customersGoldman Sachs Morgan Stanley Merrill Lynch J.P. Morgan Deutsche Bank IEX Securities...
» more
Discord, Epic Games, Expedia, Zillow, Comcast, Disney+ Hotstar, Samsung, ShareChat,...
» more
All companies that have data. ​
» more
Market metricskdb+ performance and reliability proven by our customers in critical infrastructure...
» more
ScyllaDB typically offers ~75% total cost of ownership savings, with ~5X higher throughput...
» more
As of mid 2023: Over 2 million open source downloads 3500+ Weaviate Slack community...
» more
Licensing and pricing modelsupon request
» more
ScyllaDB Open Source - free open source software (AGPL) ScyllaDB Enterprise - subscription-based...
» more
Weaviate is open-source, and free to use. Weaviate is also available as a fully managed...
» more

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
Apache ImpalaKdbScyllaDBWeaviate
DB-Engines blog posts

Weaviate, an ANN Database with CRUD support
2 February 2021,  Etienne Dilocker, SeMI Technologies (sponsor) 

show all

Recent citations in the news

Cloudera creates observability tool to help enterprises manage cloud costs
6 June 2023, SiliconANGLE News

Apache Impala 4 Supports Operator Multi-Threading
29 July 2021, iProgrammer

Apache Impala becomes Top-Level Project
28 November 2017, SDTimes.com

Cloudera Bringing Impala to AWS Cloud
28 November 2017, Datanami

Apache Doris just 'graduated': Why care about this SQL data warehouse
24 June 2022, InfoWorld

provided by Google News

Turbocharging the Engine: KX Unleashes AI-First Transformation with kdb+
28 February 2024, Business Wire

KX JOINS SNOWFLAKE PARTNER NETWORK
27 June 2023, PR Newswire

McLaren Applied and KX partner to enhance ATLAS software analytics capabilities
9 August 2023, Professional Motorsport World

Introducing Amazon FinSpace with Managed kdb Insights, a fully managed analytics engine, commonly used by capital ...
18 May 2023, AWS Blog

KX Brings the Power and Performance of Kdb+ to Python Developers With PyKX
8 June 2023, AiThority

provided by Google News

ScyllaDB on AWS is a NoSQL Database Built for Gigabyte-to-Petabyte Scale | Amazon Web Services
6 January 2023, AWS Blog

Scylla Eyes Cassandra's NoSQL Workloads
13 February 2018, Datanami

Scylla vs Cassandra: Performance Comparison - DataScienceCentral.com
9 January 2020, Data Science Central

ScyllaDB Database Review | eWeek
21 August 2018, eWeek

Scylla review: Apache Cassandra supercharged
18 December 2019, InfoWorld

provided by Google News

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector ...
24 January 2024, AWS Blog

Weaviate Partners with Snowflake to Bring Secure GenAI to Snowpark Container Services
8 February 2024, Datanami

Weaviate Partners with Snowflake to Bring Secure GenAI to Snowpark Container Services
9 February 2024, AiThority

Foley Represents Cortical Ventures in $50M Series B Round for Weaviate
17 December 2023, Foley & Lardner LLP

Getting Started with Weaviate: A Beginner's Guide to Search with Vector Databases
18 July 2023, Towards Data Science

provided by Google News



Share this page

Featured Products

RaimaDB logo

RaimaDB, embedded database for mission-critical applications. When performance, footprint and reliability matters.
Try RaimaDB for free.

Neo4j logo

See for yourself how a graph database can make your life easier.
Use Neo4j online for free.

SingleStore logo

The database to transact, analyze and contextualize your data in real time.
Try it today.

Datastax Astra logo

Bring all your data to Generative AI applications with vector search enabled by the most scalable
vector database available.
Try for Free

Milvus logo

Vector database designed for GenAI, fully equipped for enterprise implementation.
Try Managed Milvus for Free

Present your product here