Cassandra Backup, Restore, and Disaster Recovery Essentials Quiz Quiz

Challenge your knowledge of backup, restore, and disaster recovery processes in Cassandra databases. Understand best practices, key concepts, and essential techniques to ensure data protection and resilience in distributed systems.

  1. Backup Types Recognition

    Which type of backup in Cassandra captures the entire content of a data set at a specific point in time?

    1. Differential backup
    2. Snapshot schedule
    3. Full backup
    4. Incremental restore

    Explanation: A full backup copies all data and provides a complete set of the stored information at a specific moment. In contrast, a differential backup only captures changes since the last full backup, not the entire dataset. Incremental restore is not a backup type; it refers to the restoration process using incremental data. Snapshot schedule refers to a plan for backups, not the backup type itself.

  2. Snapshot Usage

    When performing a backup in Cassandra, what does creating a snapshot primarily do?

    1. Deletes outdated SSTables
    2. Encrypts all data in place
    3. Restores corrupted partitions
    4. Copies data files without interruption

    Explanation: Creating a snapshot in Cassandra makes hard links or copies of SSTables, which preserves the current data state with minimal impact on running operations. It does not delete any files or restore data. Encryption is not the main purpose of snapshots, making the other options incorrect.

  3. Restore Process Basics

    What is the first essential step when restoring a table from backup in Cassandra after data loss?

    1. Change all cluster tokens
    2. Run a full cluster repair
    3. Directly copy snapshot to another node
    4. Stop the relevant node

    Explanation: You must stop the related node to avoid conflicts and ensure a clean restore operation. Copying snapshots without stopping the node risks data inconsistency. Changing all cluster tokens is not needed for restoration. Running a full cluster repair is done after restoring, not before.

  4. Incremental Backup Understanding

    What is the main advantage of using incremental backups in Cassandra?

    1. Snapshots are deleted after each backup
    2. Only new or updated data is backed up
    3. Backups remove all old data
    4. Database performance decreases

    Explanation: Incremental backups record only the new or changed SSTables since the last backup, saving time and storage. They do not decrease performance as much as full backups might. Backups do not delete old data, and snapshots are not automatically deleted after every operation.

  5. Disaster Recovery Objective

    In Cassandra disaster recovery, what does RTO (Recovery Time Objective) define?

    1. The size of the backup files
    2. Amount of data lost post-disaster
    3. Maximum allowable downtime before service restoration
    4. Period between incremental backups

    Explanation: RTO refers to the duration within which service must be restored after a disaster. It does not measure data loss; that is RPO (Recovery Point Objective). Period between backups and the backup file size are unrelated to RTO, so the other options are less accurate.

  6. Regular Backup Schedule

    Why is setting a regular backup schedule critical for Cassandra clusters?

    1. Ensures recent data is always available for restore
    2. Disables recovery logs
    3. Reduces data size automatically
    4. Improves network speed

    Explanation: Regular backups make sure current data is consistently saved, providing the ability to restore the most up-to-date information after failures. Network speed is unrelated to backup schedules. Backups do not automatically shrink data size, nor do they disable recovery logs.

  7. Restoration and Consistency

    After restoring backups to a Cassandra node, which operation is commonly performed to ensure data consistency across replicas?

    1. Delete commit logs
    2. Reset all tokens
    3. Run a repair
    4. Upgrade Java version

    Explanation: Running a repair ensures that restored data is consistent and synchronized across all cluster replicas. Resetting tokens and upgrading software are not regular parts of restoration. Deleting commit logs may lead to data loss and is not standard after a restore.

  8. Backup Storage Location

    Why should Cassandra backup files be stored in multiple, separate locations?

    1. To reduce disk fragmentation
    2. To improve query performance
    3. To lower CPU usage
    4. To avoid single points of failure

    Explanation: Storing backups in diverse locations ensures data survivability if one site is compromised or fails. Storage strategies don't directly impact disk fragmentation, query efficiency, or CPU usage. Only the correct answer enhances disaster recovery resilience.

  9. Commit Logs in Recovery

    What role do commit logs play during data recovery in Cassandra?

    1. They handle user authentication
    2. They replay recent unflushed data
    3. They automatically create full backups
    4. They store backup schedules

    Explanation: Commit logs contain unflushed data changes used to recover recent writes that may not be present in SSTables. They don't create backups, hold schedules, or manage authentication. This makes them vital in ensuring data safety following a crash.

  10. Disaster Simulation for Testing

    What is the primary purpose of conducting disaster recovery drills in Cassandra environments?

    1. To permanently delete obsolete backups
    2. To automatically prune tables
    3. To update user roles
    4. To verify restoration procedures work as expected

    Explanation: Simulated drills confirm that backup and restoration processes are reliable and efficient in real-world failures. Drills do not prune tables by themselves, delete old backups automatically, or change user roles. The primary focus is on organizational preparedness.