Skip to the content.

High Availability and Disaster Recovery

Capacity Planning

Critical strategic planning effort that ensures an organization is adequately equipped to meet any future demands at the right time and in the most cost-effective manner.

Main aspects of an organization’s capacity:

High Availability

The ability of a service to be continuously available by minimizing downtime to the lowest amount possible.

Uptime

The number of minutes or hours that a system remains online over a given period.

Load Balancing

The process of distributing workloads across multiple computing resources.

Session Persistence

Round-robin

Least Connections

Weighted Value

Clustering

Multiple computers and multiple storage devices are grouped together to work as a single system.

Disaster Recovery

Redundancy

Design systems with duplicate components for backup in case of failure.

Failover

Failover involves establishing an alternate location, such as a secondary data center or cloud infrastructure, where critical business functions can continue in case the primary site becomes unavailable due to a disaster or incident.

The Goal of Disaster Recovery

Disaster Recovery Sites

Disaster Recovery Tests

Disaster Recovery Plan

Organizations often maintain diverse documents for different audiences in their Disaster Recovery Planning (DRP) efforts. These include:

Disaster Recovery in the Real World

Ensure critical systems are formally identified and regularly test backups, as incidents may go undetected for extended periods.

Sample Scenario

At a hospital in Los Angeles, it took 260 days (about 8 and a half months) to discover that there was a compromise.

Financial Department is rarely involved

A company’s financial department is rarely involved in a disaster recovery plan, except when the issue at hand is directly connected to company finances (see Chapter 2 ISC2 Study Guide, module 3, under Components of a Disaster Recovery Plan).

Data Redundancy through RAID

RAID (Redundant Array of Independent Disks) combines multiple physical disks into a single logical unit to improve performance, reliability, and data redundancy.

RAID Classifications

There are multiple ways to setup RAID disks, depending on the level of resilience that we will want to aim for.

Failure-resistant

Fault-resistant

Disaster-resistant

RAID 0

RAID 1

RAID 5

RAID 6

RAID 10

Data Backups

Considerations

Data Backup involves creating duplicate copies of critical data and storing them off-site. These backups serve as a safeguard in case the primary data becomes unavailable due to a disaster or incident.

Data Recovery Process

  1. Selection of Backup
    • Identify the most recent and relevant backup.
    • Ensure backup integrity and completeness.
    • Choose the appropriate backup type (full, incremental, differential).
  2. Initiating the Recovery Process
    • Load the backup data onto the recovery system.
    • Follow the recovery protocol for the specific backup software.
    • Monitor the progress to identify any issues early.
  3. Data Validation
    • Verify the completeness of the recovered data.
    • Check for any corruption or missing files.
    • Compare the restored data with the original backup.
  4. Testing and Validation
    • Test the recovered data in a controlled environment.
    • Ensure all applications and services function correctly.
    • Confirm data integrity and usability.
  5. Documentation and Reporting
    • Record the recovery process steps and outcomes.
    • Document any issues encountered and their resolutions.
    • Generate a comprehensive report for stakeholders.
  6. Notification
    • Inform relevant parties of the recovery completion.
    • Provide details on the restored data and any discrepancies.
    • Communicate any follow-up actions required.

Data Backup Types

BC vs DR

Business Continuity (BC) plans

Disaster Recovery (DR) plans


Back to main page