System Properties Comparison Hive vs. Solr vs. Spark SQL vs. Sphinx vs. Splice Machine

Editorial information provided by DB-Engines

Name

Hive

Solr

Spark SQL

Sphinx

Splice Machine

Description

data warehouse software for querying and managing large distributed datasets, built on Hadoop

A widely used distributed, scalable search engine based on Apache Lucene

Spark SQL is a component on top of 'Spark Core' for structured data processing

Open source search engine for searching in data from different sources, e.g. relational databases

Open-Source SQL RDBMS for Operational and Analytical use cases with native Machine Learning, powered by Hadoop and Spark

Primary database model

Relational DBMS

Search engine

Relational DBMS

Search engine

Relational DBMS

Secondary database models

Spatial DBMS

DB-Engines Ranking (measures the popularity of database management systems)

Hive: score 59.76, rank #18 overall, #12 among relational DBMS

Solr: score 41.02, rank #24 overall, #3 among search engines

Spark SQL: score 18.04, rank #33 overall, #20 among relational DBMS

Sphinx: score 5.95, rank #55 overall, #5 among search engines

Splice Machine: score 0.54, rank #252 overall, #115 among relational DBMS

Website

Technical documentation

cwiki.apache.org/confluence/display/Hive/Home

solr.apache.org/resources.html

spark.apache.org/docs/latest/sql-programming-guide.html

sphinxsearch.com/docs

splicemachine.com/how-it-works

Developer

Apache Software Foundation (initially developed by Facebook)

Apache Software Foundation

Sphinx Technologies Inc.

Splice Machine

Initial release

2012

2006

2014

2001

2014

Current release

3.1.3, April 2022

9.6.1, May 2024

3.5.0 ( 2.13), September 2023

3.5.1, February 2023

3.1, March 2021

License

Commercial or Open Source

Open Source

Apache Version 2

Open Source

Apache Version 2

Open Source

Apache 2.0

Open Source

GPL version 2, commercial licence available

Open Source

AGPL 3.0, commercial license available

Cloud-based only

Only available as a cloud service

Implementation language

Java

Scala

C++

Java

Server operating systems

All OS with a Java VM

runs as a servlet in servlet container (e.g. Tomcat, Jetty is included)

Linux
OS X
Windows

FreeBSD
Linux
NetBSD
OS X
Solaris
Windows

Linux
OS X
Solaris
Windows

Data scheme

yes

Dynamic Fields enable on-the-fly addition of new fields

yes

Typing

predefined data types such as float or date

yes

supports customizable data types and automatic typing

yes

XML support

Some form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.

yes

Secondary indexes

yes

All search fields are automatically indexed

yes

full-text index on all search fields

yes

SQL

Support of SQL

SQL-like DML and DDL statements

Solr Parallel SQL Interface

SQL-like DML and DDL statements

SQL-like query language (SphinxQL)

yes

APIs and other access methods

JDBC
ODBC
Thrift

Java API
RESTful HTTP/JSON API

JDBC
ODBC

Proprietary protocol

JDBC
Native Spark Datasource
ODBC

Supported programming languages

C++
Java
PHP
Python

.Net
Erlang
Java
JavaScript
any language that supports sockets and either XML or JSON
Perl
PHP
Python
Ruby
Scala

Java
Python
R
Scala

C++ (unofficial client library)
Java
Perl (unofficial client library)
PHP
Python
Ruby (unofficial client library)

C#
C++
Java
JavaScript (Node.js)
Python
R
Scala

Server-side scripts

Stored procedures

yes

user defined functions and integration of map-reduce

Java plugins

yes

Java

Triggers

yes

User configurable commands triggered on index changes

yes

Partitioning methods

Methods for storing different data on different nodes

Sharding

yes, utilizing Spark Core

Sharding

Partitioning is done manually; search queries against a distributed index are supported

Shared Nothing Auto-Sharding, Columnar Partitioning

Replication methods

Methods for redundantly storing data on multiple nodes

selectable replication factor

yes

none

Multi-source replication
Source-replica replication
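
As a rough illustration of the "Sharding" and "selectable replication factor" entries above, the following Python sketch hashes a record key to a shard and then assigns that shard to several nodes. It is a generic, simplified model and not the placement logic of any of the five systems compared here; the function names, node names, and parameters are hypothetical.

```python
import hashlib


def shard_for(key: str, num_shards: int) -> int:
    """Map a key to a shard using a stable hash (Python's built-in hash() is salted per process)."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_shards


def replica_nodes(shard: int, nodes: list[str], replication_factor: int) -> list[str]:
    """Pick `replication_factor` distinct nodes for a shard by walking the node list in order."""
    if replication_factor > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    return [nodes[(shard + i) % len(nodes)] for i in range(replication_factor)]


def place(key: str, nodes: list[str], num_shards: int, replication_factor: int):
    """Return the shard for a key and the (hypothetical) nodes holding copies of that shard."""
    shard = shard_for(key, num_shards)
    return shard, replica_nodes(shard, nodes, replication_factor)
```

Calling `place("doc-42", ["node-a", "node-b", "node-c", "node-d"], num_shards=8, replication_factor=3)` returns one shard number and three distinct node names. Real systems typically add rebalancing and failure handling on top of a scheme like this.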

MapReduce

Offers an API for user-defined Map/Reduce methods

yes

query execution via MapReduce

spark-solr: github.com/lucidworks/spark-solr and streaming expressions to reduce

Yes, via Full Spark Integration

Consistency concepts

Methods to ensure consistency in a distributed system

Eventual Consistency

Immediate Consistency

Foreign keys

Referential integrity

yes

Transaction concepts

Support to ensure data integrity after non-atomic manipulations of data

optimistic locking

ACID
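
The "optimistic locking" entry above can be illustrated with a small Python sketch: every record carries a version number, and a write succeeds only if the writer supplies the version it last read. This is a generic in-memory model under assumed names (`VersionedStore`, `VersionConflict`), not the API of any system in this comparison.

```python
class VersionConflict(Exception):
    """Raised when a write is based on a stale version of a record."""


class VersionedStore:
    """Toy key-value store that guards writes with per-record version numbers."""

    def __init__(self):
        self._data = {}  # key -> (version, value)

    def read(self, key):
        """Return (version, value); a key that was never written reads as version 0."""
        return self._data.get(key, (0, None))

    def write(self, key, value, expected_version):
        """Store value only if the record is still at expected_version; return the new version."""
        current_version, _ = self.read(key)
        if current_version != expected_version:
            raise VersionConflict(
                f"{key}: expected version {expected_version}, found {current_version}"
            )
        new_version = current_version + 1
        self._data[key] = (new_version, value)
        return new_version
```

No lock is held between the read and the write; a writer whose version is stale gets a `VersionConflict` and is expected to re-read and retry.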

Concurrency

Support for concurrent manipulation of data

yes

yes, multi-version concurrency control (MVCC)

Durability

Support for making data persistent

yes

The original contents of fields are not stored in the Sphinx index.

yes

In-memory capabilities

Is there an option to define some or all structures to be held in-memory only?

yes

User concepts

Access control

Access rights for users, groups and roles

yes

Access rights for users, groups and roles according to SQL-standard

More resources

DB-Engines blog posts

Why is Hadoop not listed in the DB-Engines Ranking?
13 May 2013, Paul Andlinger

Elasticsearch replaced Solr as the most popular search engine
12 January 2016, Paul Andlinger

Enterprise Search Engines almost double their popularity in the last 12 months
2 July 2014, Paul Andlinger

The DB-Engines ranking includes now search engines
4 February 2013, Paul Andlinger

Recent citations in the news

Apache Software Foundation Announces Apache Hive 4.0
30 April 2024, Datanami

Design a data mesh pattern for Amazon EMR-based data lakes using AWS Lake Formation with Hive metastore ...
10 June 2024, AWS Blog

18 Top Big Data Tools and Technologies to Know About in 2024
24 January 2024, TechTarget

ASF Unveils the Next Evolution of Big Data Processing With the Launch of Hive 4.0
2 May 2024, Datanami

Run Apache Hive workloads using Spark SQL with Amazon EMR on EKS | Amazon Web Services
18 October 2023, AWS Blog

provided by Google News

Use Amazon Athena with Spark SQL for your open-source transactional table formats | Amazon Web Services
24 January 2024, AWS Blog

What is Apache Spark? The big data platform that crushed Hadoop
3 April 2024, InfoWorld

Cracking the Apache Spark Interview: 80+ Top Questions and Answers for 2024
1 April 2024, Simplilearn

Performant IPv4 Range Spark Joins | by Jean-Claude Cote
24 January 2024, Towards Data Science

Simba Technologies(R) Introduces New, Powerful JDBC Driver With SQL Connector for Apache Spark(TM)
17 March 2024, Yahoo Singapore News

Switching From Sphinx to MkDocs Documentation — What Did I Gain and Lose
2 February 2024, Towards Data Science

5 Powerful Alternatives to Elasticsearch
25 April 2024, Insider Monkey

Manticore is a Faster Alternative to Elasticsearch in C++
25 July 2022, hackernoon.com

Royal Mail stamp prices could rise, warns Czech Sphinx
3 June 2024, Proactive Investors UK

Perplexity AI: From Its Use To Operation, Everything You Need To Know About Google's Newest Challenger
11 January 2024, Free Press Journal

Machine learning data pipeline outfit Splice Machine files for insolvency
26 August 2021, The Register

Splice Machine Launches the Splice Machine Feature Store to Simplify Feature Engineering and Democratize Machine ...
19 January 2021, PR Newswire

Splice Machine Launches Feature Store to Simplify Feature Engineering
19 January 2021, Datanami

Real-time machine learning with Splice Machine's ML Manager
17 April 2019, TechTarget

How To Axe Db2 But Keep Your Code
10 March 2020, Towards Data Science
