Coursera Learner working on a presentation with Coursera logo and
Coursera Learner working on a presentation with Coursera logo and

Apache Cassandra may be a highly scalable, high-performance distributed database designed to handle large amounts of knowledge across many commodity servers, providing high availability with no single point of failure. it’s a kind of NoSQL database. allow us to first understand what a NoSQL database does.


A NoSQL database (sometimes called as Not Only SQL) may be a database that gives a mechanism to store and retrieve data aside from the tabular relations utilized in relational databases. These databases are schema-free, support easy replication, have simple API, eventually consistent, and may handle huge amounts of knowledge .

The primary objective of a NoSQL database is to possess

simplicity of design,

horizontal scaling, and

finer control over availability.

NoSql databases use different data structures compared to relational databases. It makes some operations faster in NoSQL. The suitability of a given NoSQL database depends on the matter it must solve.

NoSQL vs. electronic database

The following table lists the points that differentiate a electronic database from a NoSQL database.

Relational Database NoSql Database

Supports powerful command language. Supports very simple command language.

It has a hard and fast schema. No fixed schema.

Follows ACID (Atomicity, Consistency, Isolation, and Durability). It’s only “eventually consistent”.

Supports transactions. doesn’t support transactions.

Besides Cassandra, we’ve the subsequent NoSQL databases that are quite popular −

Apache HBase − HBase is an open source, non-relational, distributed database modeled after Google’s BigTable and is written in Java. it’s developed as a neighborhood of Apache Hadoop project and runs on top of HDFS, providing BigTable-like capabilities for Hadoop.

MongoDB − MongoDB may be a cross-platform document-oriented database system that avoids using the normal table-based electronic database structure in favor of JSON-like documents with dynamic schemas making the mixing of knowledge in certain sorts of applications easier and faster.

What is Apache Cassandra?

Apache Cassandra is an open source, distributed and decentralized/distributed storage system (database), for managing very large amounts of structured data opened up across the planet . It provides highly available service with no single point of failure.

Listed below are a number of the notable points of Apache Cassandra −

It is scalable, fault-tolerant, and consistent.

It is a column-oriented database.

Its distribution design is predicated on Amazon’s Dynamo and its data model on Google’s Bigtable.

Created at Facebook, it differs sharply from electronic database management systems.

Cassandra implements a Dynamo-style replication model with no single point of failure, but adds a more powerful “column family” data model.

Cassandra is getting used by a number of the most important companies like Facebook, Twitter, Cisco, Rackspace, ebay, Twitter, Netflix, and more.

Features of Cassandra

Cassandra has become so popular due to its outstanding technical features. Given below are a number of the features of Cassandra:

Elastic scalability − Cassandra is very scalable; it allows to feature more hardware to accommodate more customers and more data as per requirement.

Always on architecture − Cassandra has no single point of failure and it’s continuously available for business-critical applications that can’t afford a failure.

Fast linear-scale performance − Cassandra is linearly scalable, i.e., it increases your throughput as you increase the amount of nodes within the cluster. Therefore it maintains a fast reaction time .

Flexible data storage − Cassandra accommodates all possible data formats including: structured, semi-structured, and unstructured. It can dynamically accommodate changes to your data structures consistent with your need.

Easy data distribution − Cassandra provides the pliability to distribute data where you would like by replicating data across multiple data centers.

Transaction support − Cassandra supports properties like Atomicity, Consistency, Isolation, and sturdiness (ACID).

Fast writes − Cassandra was designed to run on cheap commodity hardware. It performs blazingly fast writes and may store many terabytes of knowledge , without sacrificing the read efficiency.