The Apache Cassandra Project develops a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model.
Cassandra was open sourced by Facebook in 2008, and is now developed by Apache committers and contributors from many companies.
Features
Proven
Cassandra is in use at Digg, Facebook, Twitter, Reddit, Rackspace, Cloudkick, Cisco, SimpleGeo, Ooyala, OpenX, and more companies that have large, active data sets. The largest production cluster has over 100 TB of data in over 150 machines.
Fault Tolerant
Data is automatically replicated to multiple nodes for fault-tolerance. Replication across multiple data centers is supported. Failed nodes can be replaced with no downtime.
Decentralized
Every node in the cluster is identical. There are no network bottlenecks. There are no single points of failure.
You're in Control
Choose between synchronous or asynchronous replication for each update. Highly available asynchronus operations are optimized with features like Hinted Handoff and Read Repair.
Rich Data Model
Allows efficient use for many applications beyond simple key/value.
Elastic
Read and write throughput both increase linearly as new machines are added, with no downtime or interruption to applications.
Durable
Cassandra is suitable for applications that can't afford to lose data, even when an entire data center goes down.
Professionally Supported
Cassandra support contracts and services are available from Riptano.
Leave a Comment