DB-EnginesCrateDB bannerEnglish
Deutsch
Knowledge Base of Relational and NoSQL Database Management Systemsprovided by solid IT

DBMS > Cassandra vs. Impala vs. MarkLogic

System Properties Comparison Cassandra vs. Impala vs. MarkLogic

Please select another system to include it in the comparison.

Editorial information provided by DB-Engines
NameCassandra  Xexclude from comparisonImpala  Xexclude from comparisonMarkLogic  Xexclude from comparison
DescriptionWide-column store based on ideas of BigTable and DynamoDB infoOptimized for write accessAnalytic DBMS for HadoopOperational and transactional Enterprise NoSQL database
Primary database modelWide column storeRelational DBMSDocument store
Native XML DBMS
RDF store infoas of version 7
Search engine
Secondary database modelsDocument store
DB-Engines Ranking infomeasures the popularity of database management systemsranking trend
Trend Chart
Score123.22
Rank#10  Overall
#1  Wide column stores
Score14.57
Rank#37  Overall
#23  Relational DBMS
Score13.06
Rank#38  Overall
#6  Document stores
#1  Native XML DBMS
#1  RDF stores
#4  Search engines
Websitecassandra.apache.orgwww.cloudera.com/­products/­open-source/­apache-hadoop/­impala.htmlwww.marklogic.com
Technical documentationcassandra.apache.org/­doc/­latestwww.cloudera.com/­documentation/­enterprise/­latest/­topics/­impala.htmldocs.marklogic.com
DeveloperApache Software Foundation infoApache top level project, originally developped by FacebookClouderaMarkLogic Corp.
Initial release200820132001
Current release3.11.4, February 20193.3.0, August 20199.0, 2017
License infoCommercial or Open SourceOpen Source infoApache version 2Open Source infoApache Version 2commercial inforestricted free version is available
Cloud-based only infoOnly available as a cloud servicenonono
DBaaS offerings (sponsored links) infoDatabase as a Service

Providers of DBaaS offerings, please contact us to be listed.
Implementation languageJavaC++C++
Server operating systemsBSD
Linux
OS X
Windows
LinuxLinux
OS X
Windows
Data schemeschema-freeyesschema-free infoSchema can be enforced
Typing infopredefined data types such as float or dateyesyesyes
XML support infoSome form of processing data in XML format, e.g. support for XML data structures, and/or support for XPath, XQuery or XSLT.nonoyes
Secondary indexesrestricted infoonly equality queries, not always the best performing solutionyesyes
SQL infoSupport of SQLSQL-like SELECT, DML and DDL statements (CQL)SQL-like DML and DDL statementsyes infoSQL92
APIs and other access methodsProprietary protocol infoCQL (Cassandra Query Language, an SQL-like language)
Thrift
JDBC
ODBC
Java API
Node.js Client API
ODBC
proprietary Optic API infoProprietary Query API, introduced with version 9
RESTful HTTP API
SPARQL
WebDAV
XDBC
XQuery
XSLT
Supported programming languagesC#
C++
Clojure
Erlang
Go
Haskell
Java
JavaScript infoNode.js
Perl
PHP
Python
Ruby
Scala
All languages supporting JDBC/ODBCC
C#
C++
Java
JavaScript (Node.js)
Perl
PHP
Python
Ruby
Server-side scripts infoStored proceduresnoyes infouser defined functions and integration of map-reduceyes infovia XQuery or JavaScript
Triggersyesnoyes
Partitioning methods infoMethods for storing different data on different nodesSharding infono "single point of failure"ShardingSharding
Replication methods infoMethods for redundantly storing data on multiple nodesselectable replication factor infoRepresentation of geographical distribution of servers is possibleselectable replication factoryes
MapReduce infoOffers an API for user-defined Map/Reduce methodsyesyes infoquery execution via MapReduceyes infovia Hadoop Connector, HDFS Direct Access and in-database MapReduce jobs
Consistency concepts infoMethods to ensure consistency in a distributed systemEventual Consistency
Immediate Consistency infocan be individually decided for each write operation
Eventual ConsistencyImmediate Consistency
Foreign keys infoReferential integritynonono
Transaction concepts infoSupport to ensure data integrity after non-atomic manipulations of datano infoAtomicity and isolation are supported for single operationsnoACID infocan act as a resource manager in an XA/JTA transaction
Concurrency infoSupport for concurrent manipulation of datayesyesyes
Durability infoSupport for making data persistentyesyesyes
In-memory capabilities infoIs there an option to define some or all structures to be held in-memory only.nonoyes, with Range Indexes
User concepts infoAccess controlAccess rights for users can be defined per objectAccess rights for users, groups and roles infobased on Apache Sentry and KerberosRole-based access control at the document and subdocument levels
More information provided by the system vendor
CassandraImpalaMarkLogic
Specific characteristicsApache Cassandra is the leading NoSQL, distributed database management system, well...
» more
The MarkLogic Multi-Model Database provides the simplest way to integrate data from...
» more
Competitive advantagesNo single point of failure ensures 100% availability . Operational simplicity for...
» more
The main benefit of MarkLogic is simplicity. By making things simpler, MarkLogic...
» more
Typical application scenariosInternet of Things (IOT), fraud detection applications, recommendation engines, product...
» more
MarkLogic’s main use case is the Operational Data Hub . A Data Hub is useful for...
» more
Key customersApple, Netflix, Uber, ING,, Intuit,Fidelity, NY Times, Outbrain, BazaarVoice, Best...
» more
MarkLogic has proven results managing mission-critical data with over 1,000 global...
» more
Market metricsCassandra is used by 40% of the Fortune 100.
» more
MarkLogic is well known within the Fortune 1000, with a strong presence in financial...
» more
Licensing and pricing modelsApache license  Pricing for commercial distributions provided by DataStax and available...
» more
MarkLogic provides a variety of licenses and cloud services: MarkLogic Data Hub Service...
» more

We invite representatives of system vendors to contact us for updating and extending the system information,
and for displaying vendor-provided information such as key customers, competitive advantages and market metrics.

Related products and services
3rd partiesDataStax Enterprise: Apache Cassandra for enterprises.
» more

CData: Connect to Big Data & NoSQL through standard Drivers.
» more

Instaclustr: Fully Hosted and Managed Apache Cassandra
» more

We invite representatives of vendors of related products to contact us for presenting information about their offerings here.

More resources
CassandraImpalaMarkLogic
DB-Engines blog posts

Cassandra keeps climbing the ranks of the DB-Engines Ranking
3 May 2016, Matthias Gelbmann

Oracle is the DBMS of the Year
5 January 2016, Paul Andlinger, Matthias Gelbmann

Winners, losers and an attractive newcomer in Novembers DB-Engines ranking
2 November 2015, Paul Andlinger

show all

Conferences and events

DataStax Accelerate
San Diego, California, USA, 11-13 May 2020

Recent citations in the news

Introducing Jakarta NoSQL
15 October 2019, InfoQ.com

DataStax Grants Early Access to CDC Connector for Kafka
10 October 2019, Database Trends and Applications

DataStax Announces Change Data Capture (CDC) Connector for Apache Kafka®
30 September 2019, Valdosta Daily Times

DataStax offers bidirectional data dexterity for Apache Kafka
6 October 2019, ComputerWeekly.com

A Fourth Amendment Framework for Voiceprint Database Searches
17 October 2019, Just Security

provided by Google News

Cloudera Boosts Hadoop App Development On Impala
10 November 2014, InformationWeek

Cloudera's a data warehouse player now
28 August 2018, ZDNet

Cloudera says Impala is faster than Hive, which isn't saying much
13 January 2014, GigaOM

Cloudera’s Impala brings Hadoop to SQL and BI
25 October 2012, ZDNet

Apache Impala gets top-level status as open source Hadoop tool
1 December 2017, TechTarget

provided by Google News

Schiphol Airport Selects MarkLogic for Operational Flight Database
24 September 2019, Business Wire

Schiphol Airport chooses software firm MarkLogic in digital push
25 September 2019, Airport Technology

Q&A: Brigham Bechtel talks big data and communications
26 September 2019, Army Technology

HITS Fall: Effective Data Governance is Crucial for Entertainment Companies, MarkLogic Says
11 October 2019, Media & Entertainment Services Alliance M&E Daily Newsletter

More Financial Services Firms to Invest in Data Management to Improve Investment Decisions
8 October 2019, Business Wire

provided by Google News

Job opportunities

Data Architect (Remote)
First San Francisco Partners, Remote

Data Architect
LendingPoint Consolidated Inc, Kennesaw, GA

Cassandra Database Engineer
Synchronoss, Bridgewater, NJ

Data Administrator
DSD Partners Inc, Midlothian, VA

Database Engineer
Geologics Corporation, Richardson, TX

Big Data Architect - Cloudera, Impala
ITL USA, Richardson, TX

Cloud Admin
Avani Technology Solutions Inc, Orange, CA

Business Intelligence Developer
Early Warning Services, Scottsdale, AZ

Big Data Lead
Brillio, Santa Clara, CA

Architect, GeForce NOW - Cloud
NVIDIA, Santa Clara, CA

Publishing Analyst
ASTM International, Conshohocken, PA

Analyst - Artificial Intelligence & Semantics
Morgan Stanley, Baltimore, MD

Associate - Artificial Intelligence & Semantics
Morgan Stanley, New York, NY

ETL Software Developer
Cleared Solutions Inc., Reston, VA

Metadata Specialist
International Baccalaureate, Washington, DC

jobs by Indeed




Share this page

Featured Products

Neo4j logo

Get your free copy of the new O'Reilly book Graph Algorithms with 20+ examples for
machine learning, graph analytics and more.

Redis logo

Hosted, serverless DBaaS
in 3 steps.

30MB Free!
Start now.

Couchbase logo

SQL + JSON + NoSQL.
Power, flexibility & scale.
All open source.
Get started now.

RavenDB logo

Setup a fully managed RavenDB Cloud Database in minutes. Enjoy hosting, management, backups all in one place.
Grab a Free Instance

Present your product here