• For Home
    • Mobile Storage
      • iXpand™ Memory Case | SanDisk
      • Ultra Dual Drive USB Type-C | SanDisk
      • iXpand Flash Drive for iPhone and iPad | SanDisk
        • iXpand Flash Drive | SanDisk
      • Ultra Dual USB Drive 3.0 | SanDisk
      • Connect Wireless Stick | SanDisk
      • Ultra USB Type-C Flash Drive | SanDisk
      • Ultra Dual Drive m3.0 | SanDisk
    • Cards & Readers
      • SD Cards
        • ExtremePro microSD
        • Extreme PRO SDHC/SDXC UHS-I Memory Card | SanDisk
        • Extreme PLUS SDHC/SDXC UHS-I Memory Card | SanDisk
        • Extreme SDHC/SDXC UHS-I Memory Card | SanDisk
        • Ultra PLUS SDHC/SDXC Memory Card | SanDisk
        • Ultra SDHC/SDXC Memory Card 80MB/s read speed | SanDisk
        • SDHC/SDXC Memory Card | SanDisk
      • microSD Cards | SanDisk
        • SanDisk Extreme PRO microSD UHS-I CARDS
        • Extreme PLUS microSD UHS-I Card | SanDisk
        • EXTREME microSDXC and microSDHC UHS-I CARDS | SanDisk
        • Extreme microSD Card | SanDisk
        • Extreme microSD for Action Cameras | SanDisk
        • Ultra PLUS microSDXC | SanDisk
        • Ultra microSD 95MB/s read speed | SanDisk
        • Ultra microSD UHS-I Card for Cameras | SanDisk
        • microSD Card | SanDisk
        • microSD High Endurance Video Monitoring Card | SanDisk
        • Ultra microSD UHS-I Card 48MB/s read speed | SanDisk
        • Ultra microSD for Smartphone | SanDisk
        • Ultra PLUS microSD UHS-I Card for Cameras | SanDisk
        • Extreme PRO microSDXC UHS-II Card | SanDisk
      • Compact Flash | SanDisk
        • Extreme PRO CompactFlash Memory Card | SanDisk
        • Extreme CompactFlash Memory Card | SanDisk
        • Ultra CompactFlash Memory Card | SanDisk
      • CFast | SanDisk
        • Extreme PRO CFast 2D Memory Card | SanDisk
      • Memory Card Readers | SanDisk
        • Extreme PRO CFast 2.0 Reader/Writer | SanDisk
        • ImageMate All-in-One USB 3.0 Reader | SanDisk
        • MobileMate Duo Adapter and Reader | SanDisk
        • MobileMate USB Reader | SanDisk
        • Extreme PRO SD UHS-II Card Reader/Writer | SanDisk
    • USB Flash
      • SanDisk Extreme Go USB 3.1 Flash Drive
      • SanDisk Extreme PRO USB 3.1 Solid State Flash Drive
      • Cruzer Dial USB Flash Drive
      • SanDisk Ultra Flair USB 3.0 Flash Drive
      • SanDisk Ultra USB 3.0 Flash Drive
      • SanDisk Ultra Fit USB 3.0 Flash Drive
      • Cruzer Force USB Flash Drive | SanDisk
      • Cruzer Glide USB Flash Drive
      • Cruzer Orbit USB Flash Drive
      • Cruzer Switch USB Flash Drive
      • Cruzer U USB Flash Drive
      • Cruzer Blade USB Flash Drive
      • Cruzer Edge USB Flash Drive
      • Cruzer Fit USB Flash Drive
    • SSD
      • SanDisk Extreme 900 Portable SSD
      • SanDisk Extreme 510 Portable SSD
      • SanDisk Extreme 500 Portable SSD
      • SanDisk Ultra II mSATA SSD
      • SanDisk Extreme Pro SSD
      • SanDisk Ultra II SSD
      • SanDisk SSD Plus
      • SanDisk SSD Notebook Upgrade Tool Kit
      • SanDisk Ultra 3D SSD
      • Ultra 3D SSD Redirect
    • MP3 Players
      • Clip Sport MP3 Player | SanDisk
      • Clip Jam MP3 Player | SanDisk
      • Clip Sport Plus MP3 Player | SanDisk
    • Extreme Team
      • Steve McCurry
      • Akito Mizutani
      • Walter Iooss
      • Elizabeth Kreutz
      • Tom Bol
      • Lucas Gilman
      • Marcelo Maragni
      • Jeff Lewis
      • Corey Rich
      • Christian Pondella
      • Caio Guatelli
      • Terrell Lloyd
      • Michael Grecco
      • Akash Das
      • Kaustav Saikia
      • Amy Tierney
      • Alex Liu
      • Scott A. Woodward
      • Steve Simon
      • Joseph Peter
      • Kike Calvo
      • C.S. Ling
      • Peter Eastway
      • Cliff Mautner
      • Bambi Cantrell
      • Bob and Dawn Davis
      • Zach & Jody Gray
      • Marcus Bell
      • Sam Nicholson
      • Richard Finn Gregory
      • Vincent Laforet
      • Sebastien Devaud
      • Paolo Baccolo
      • Phil Coates
      • David Newton
      • Daniel Fox
      • Fred Pompermayer
      • Christian Nørgaard
      • Kirill Umrikhin
      • Matthew Jordan Smith
      • Matteo Cappè
      • Richard Walch
      • Marcel Laemmerhirt
      • Dom Daher
      • Joao Carlos
      • Claudia Goetzelmann
      • Patrick Bellair
      • Wendell Phillips
      • Tyler Stableford
      • Jonathan & Angela Scott
      • Ellen Anon
      • Daisy Gilardini
      • Georg Tappeiner
      • George Karbus
      • Renan Ozturk
      • Lars Schwellnus
      • Tim Laman
      • Sean Scott
      • Nick Didlick
      • Dave Black
    • Stories
    • Where to Buy
  • For Business
    • Data Center
      • Products
        • Flash Devices (not a page)
          • PCIe Flash (not a page)
          • SSDs (not a page)
        • Flash Software (not a page)
          • Open Source
      • Solutions
        • Database
        • Virtualization
        • Big Data
        • Cloud Solutions
      • Partners
      • Resource Library
        • Case Studies
        • Data Sheets
        • Overviews
        • Report
        • Articles
        • Video
        • Solution Briefs
          • All-Flash Storage System for Ceph and OpenStack
          • Microsoft Data Warehouse Fast Track Reference Architectures
          • Increase Hyper-V VM Density and Performance While Saving on Software License Cost
          • Red Hat Ceph Storage on the InfiniFlash All-flash Storage Platform
          • The Solution for Cassandra at Scale | SanDisk
          • Storage for MongoDB Analytics
          • Optimizing Ceph Deployments Solution Brief
      • Blog
    • Computing
      • X400
      • Z400s
    • Partners
      • Data Center Partners
        • Cisco
        • Dell
        • Fujitsu
        • HP
        • Infortrend
        • Promise Technology
        • Lenovo
        • Tegile
        • CloudByte
        • Nexenta
        • Maxta
        • Stratoscale
        • Red Hat
        • DataCore
        • Formation
        • Supermicro
        • Elastifile
        • Inspur
        • Milestone
        • QCT
        • SIOS
        • SoftNAS
        • Actifio
        • Hyve
      • Business Channel Portal
    • Professional Photo & Video
  • OEM Design
    • Data Center
    • Computing
    • Mobile
    • Commercial
    • Industrial
      • iNAND
      • Industrial Cards
      • OEM Cards
    • Automotive
      • Automotive iNAND
    • Connected Home
    • Blog
      • Future Proof Storage
  • About SanDisk
    • Company
      • History of Innovation
      • The SanDisk Advantage
        • Business
        • Consumer
      • Ventures
    • Media Center
      • Press Releases
        • 2017
          • SanDisk® Launches its Fastest, High-Capacity USB Flash Drive Ever
          • SanDisk® Unveils World’s First microSD Card Designed to Deliver a New Dimension of Mobile Application Performance
          • Western Digital Transforms the Mobile Experience with New ‘Smart’ iNAND 7350 Storage Solution Built on 3D NAND
          • SanDisk® Teams with Leading Manufacturers to Offer Consumers a Faster Mobile App Experience
          • SanDisk Now Offers 256GB of Extra Storage for iPhone and iPad
          • Western Digital Introduces iNand 7250a Embedded Storage Device For Evolving Data Demands Of Connected Automotive Technologies
          • Western Digital To Deliver World’s First Client Solid State Drives With 64-layer 3d Nand Technology
      • Awards & Accolades
      • Events
        • CloudOpen North America
        • Microsoft World Partner Conference
        • International Super Computing
        • Flash Memory Summit
        • Intel Developer Forum 2015
        • VMworld US
        • LinuxCon North America
        • SpiceWorld
        • VMworld Europe
        • GTEC
        • Oracle OpenWorld
        • HP Discover London
        • Gartner Data Center Conference
        • Photo Plus Expo 2015
        • SQL PASS Summit 2015
        • TigerDirect Innovation – IT Conference and Expo
        • Mobile World Conference
        • Embedded World
        • Open Compute
        • Linux Open Vault Conference
        • LinuxCon North America 2016
        • Embedded World
        • Mobile World Congress
        • SQL PASS Summit
        • Supercomputing16
        • Western Digital: The Future of Data
      • Media Assets
        • Product
        • Corporate
          • Management
      • Insight Report
    • Careers
      • Why SanDisk
        • Compensation and Benefits
        • Equal Employment Opportunity
      • University Hires
      • Career Development
    • Corporate Responsibility
      • Social
      • Environmental
      • Community
      • Scholars
        • 2014 Cal Berkeley Graduate Engineering School Commencement Speech
      • Supplier Management
    • Investor Relations
    • Contact Us
  • Support
    • Co-Browse
    • iXpand Flash Drive Compatible
    • ixpandmemorycase
    • Ultra +Cloud USB
    • gaming cyoa sweepstakes
    • iXpand Compatibility
  • For Home
  • For Business
  • OEM Design
  • About SanDisk
  • Support
 
  • Data Center
    • Computing
    • Partners
    • Professional Photo & Video
  • Products
  • Solutions
  • Partners
  • Resource Library
  • Blog

Optimizing Apache Cassandra™ Database at Scale Solution Brief

The Solution for Cassandra at Scale

HGST has a long history of producing the largest and most reliable hard disk drives (HDDs) in the world. Couple those hard drives with the broad SanDisk-brand portfolio of flash storage technologies and you get a solution that enables Cassandra architects to achieve optimal Cassandra clusters. While modern multi-core servers allow parallel execution of multiple threads, Cassandra was not originally written to fully exploit them, which often leaves cores idling, waiting for data from storage. Ultimately, the low utilization causes server sprawl and wasteful spending of IT budgets.

Apache Cassandra™ database at scale can use both the cost-effective capacity of HGST-brand Ultrastar® helium hard drives and the density and performance capabilities of SanDisk®-brand solidstate drives (SSDs) to fully exploit flash and modern servers and to provide optimal performance and consolidation.

What SanDisk- and HGST-brand drives can do for Cassandra:

  • Store more data in the same or smaller footprint
  • Minimize query and data operation times on critical datasets
  • Customer-optimized price and performance
  • Reduce or eliminate JBODs and controllers

About Apache Cassandra

Cassandra is an open source NoSQL database written in Java and specifically optimized to be scalable, decentralized, fault tolerant, and, above all, performant. It is used at some of the web’s largest properties and throughout financial services and other industries as a repository of record with multiple petabytes of data under its control.

As a NoSQL database, Cassandra was built from the ground up for scale-out architectures. Instead of investing in a large, centralized database server with massive amounts of storage and memory capacity, architects can deploy more modestly configured servers to perform the same types of operations and guarantee the same levels of uptime and data reliability.

Scale-out provides a powerful method for increasing database performance and capacity. Need more compute power? Add servers to distribute the workload. Need additional storage capacity? Add servers and rebalance. Yet all of these server additions, if not properly managed and minimized, can lead to a classic case of server sprawl with massive operational expenses from large and underutilized server farms.

Avoiding Cassandra Server Sprawl for Capacity

As described above, there are basically two reasons to add servers to a cluster: to expand capacity or to increase performance. Let’s examine how HGST helium hard drives can help minimize the need for additional servers for petabyte-scale capacities.

Pain Point: Database Server Sprawl, Underutilized CPU

It is an axiom in the computer industry that data always grows to fill available space. This is a good problem to have because additional data enables Cassandra to perform deeper analytics and extract higher value insights from data. However, it can lead to adding servers simply for their storage, effectively wasting the initial cost of the rest of the server and its ongoing power, cooling, and maintenance.

Pain Point: Fixed Rack-Space, Increasing Database Size

In cases where your application is data-limited and not server-computelimited, it can make sense to scale up your scale-out storage. HGSTbrand Ultrastar® helium hard drives, in announced capacities of up to 12TB in an industry-standard 3.5 inch form factor, are offered with a choice of SAS or SATA interface. By loading 4 drives in a single rack unit server, nearly 50TB of raw storage and compute can exist in such a server, providing an optimal balance between capacity and compute for less-frequently accessed data.

Avoiding Cassandra Server Sprawl for Performance

SanDisk SSDs are built for performance. Depending on your performance needs, multiple SATA or SAS interfaced SSDs can reduce the I/O wait times dramatically when compared with traditional storage solutions, leading to higher CPU utilization and a decrease in server sprawl.

Pain Point: Database Overhead is Slowing Queries

When a Cassandra cluster is slow to return a response, the cause could be a bottleneck on the underlying storage. Cassandra has an on-disk data format, the SSTable, which is efficient for additions but needs occasional compaction (or garbage collection) as items are updated. When this compaction takes place, one or more SSTables are consolidated and written into a new file. This process takes I/O performance away from the rest of the application, which is especially troublesome for high-write workloads. Even database reads can be stuck in the I/O queue behind these operations, which means that query performance can drop, sometimes dramatically, while actual server CPU usage will be minimal. A SAS SSD, such as the HGST-brand Ultrastar SS200 with its SAS interface and tuning for a mixed read/write workload, can help alleviate this bottleneck and maintain query performance during background operations.

Pain Point: Power Users Demand More Speed

For the absolute highest performance needs, the SanDisk brand also includes SSDs that completely skip the traditional storage stack by using NVM Express™ (NVMe), a direct-to- CPU attachment technology based on PCI Express that delivers dramatically lower I/O operation latencies than SATA or SAS.

Summary

Cassandra can be a powerful tool to store and extract value from massive amounts of data. However, like any scale-out tool, it needs to be applied carefully and thoughtfully, or it can result in a massive server sprawl and associated headaches.

For the largest Cassandra databases, adding HGST Ultrastar helium HDDs in industry-standard, fully serviceable chassis is ideal. This solution provides high capacity and good performance in a small footprint, and it enables the construction of cost-effective, massive clusters.

Ideal candidates for SanDisk SSDs are Cassandra databases in which queries take too long to return data or in which applications are not meeting their SLAs. In these cases, SanDisk SSDs, potentially in a directto-CPU connected NVMe interface form factor, may dramatically reduce query response times and allow you to maintain or reduce your server footprint at massively increased query performance.

  SanDisk SSD SanDisk SSD HGST SSD HGST SSD
Pain Point CloudSpeed™ SATA SkyHawk™ NVMe Ultrastar® SS200 SAS Ultrastar® Helium SATA/SAS
Database server sprawl, underutilized CPU ★ ★     ★ ★ ★
Fixed rack space, increasing database size ★ ★ ★     ★ ★ ★
Database overhead is slowing queries ★ ★ ★ ★ ★ ★  
Power users demand more speed ★ ★ ★ ★ ★ ★  
  Legend:    ★ Good       ★★ Better        ★★★ Best
Optimizing Apache Cassandra™ Database at Scale Solution Brief
Download

READY TO FLASH FORWARD?

Whether you’re a Fortune 500 or five person startup, SanDisk has solutions that will help you get the most out of your infrastructure.

VIA
EMAIL

Go ahead, ask us some questions and we'll get back to you with answers.

Let's Talk
800.578.6007

Don't wait, let's just talk now and start building the perfect flash solution.

Global Contact

Find contact information for offices all over the world.

SALES INQUIRIES

Whether you'd like to ask a few initial questions or are ready to discuss a SanDisk solution tailored to your organizations's needs, the SanDisk sales team is standing by to help.

We're happy to answer your questions, so please fill out the form below so we can get started. If you need to talk to the sales team immediately, please phone: 800.578.6007

Field cannot be empty.
Field cannot be empty.
Enter a valid email address.
Field can only contain numbers.
Field cannot be empty.
Field cannot be empty.
Field cannot be empty.
Field cannot be empty.
Field cannot be empty.
Field cannot be empty.

Please indicate your areas of interest:

You must choose an option.

Questions or comments:

Privacy Policy

Thank you. We have received your request.

Site Menu

  • For Home
    • Mobile Storage
    • Cards & Readers
    • USB
    • SSD
    • MP3 Players
  • For Business
    • Data Center
    • Computing
    • Partners
    • Pro Photo & Video
  • OEM Design
    • Mobile
    • Computing
    • Automotive
    • Connected Home
    • Industrial & Other
    • Data Center
  • Media Center
    • Press Releases
    • Awards and Accolades
    • Media Resources
  • Careers
    • Why SanDisk?
    • Apply Now
  • Investor Relations
  • Contact Us

Support

  • Business Support
  • Retail Support

Quick Links

  • IT Blog
  • Channel Partner Portal
  • SanDisk Stories
  • Extreme Team

IT Blog

Post Thumbnail

{{feed.title}} | {{getFormattedDate(feed.publishedDate) | date:''}}

Read More »
  • Legal
  • Terms of Use
  • Trademarks
  • Privacy
  • California Supply Chains Act
  • Your CA Privacy Rights
  • About Ads & Cookies
  •  
SanDisk
© 2017 Western Digital Corporation or its affiliates. All rights reserved. Western Digital Technologies, Inc. is the seller of record and licensee in the Americas of SanDisk® products.