Continuous Availability
Encyclopedia
Continuous Availability is an approach to computer system and application design that protects users against downtime, whatever the cause and ensures that users remain connected to their documents, data files and business applications. Continuous availability describes the information technology methods to ensure business continuity
Business continuity
Business continuity is the activity performed by an organization to ensure that critical business functions will be available to customers, suppliers, regulators, and other entities that must have access to those functions. These activities include many daily chores such as project management,...

.

In early days of computing, availability was not considered business critical. With the increasing use of mobile computing
Mobile computing
Mobile computing is a form of human–computer interaction by which a computer is expected to be transported during normal usage. Mobile computing has three aspects: mobile communication, mobile hardware, and mobile software...

, global access to online business transactions and business-to-business communication, continuous availability is increasingly important based on the need to support customer access to information systems.

Solutions to continuous availability exists in different forms and implementations depending on the software and hardware manufacturer. The goal of the discipline is to reduce the user or business application downtime, which can have a severe impact on business operations. Inevitably, such downtime can lead to loss of productivity, loss of revenue, customer dissatisfaction and ultimately can damage a companies reputation.

Degrees of Availability

The terms high availability
High availability
High availability is a system design approach and associated service implementation that ensures a prearranged level of operational performance will be met during a contractual measurement period....

, continuous operation, and continuous availability are generally used to express how available a system is. The following is a definition of each of these terms.

High availability
refers to the ability to avoid unplanned outages by eliminating single points of failure.
This is a measure of the reliability of the hardware, operating system, middleware, and database manager software.
Another measure of high availability is the ability to minimize the effect of an unplanned outage by masking the outage from the end users.
This can be accomplished by providing redundancy or quickly restarting failed components.

Availability is usually expressed as a percentage of uptime in a given year:
Availability Downtime per year
99.9% 8.76 hours
99.99% 1 hour
99.999% 5 minutes

When defining such a percentage it needs to be specified if it applies to the hardware, the IT infrastructure or the business application on top.

Continuous operation refers to the ability to avoid planned outages.
For continuous operation there must be ways to perform necessary administrative work, like hardware and software maintenance while the business application remains available to the end users. This is accomplished by providing multiple servers and switching end users to an available server at times when one server is made unavailable.
Note that a system running in continuous operation is not necessarily operating with high availability because an excessive number of unplanned outages could compromise this.

Continuous availability combines the characteristics of high availability and continuous operation to provide the ability to keep the business application running without any noticeable downtime.

Types of outages

Planned outages are deliberate and are scheduled at a convenient time. These involve such activities as:
- Hardware installation or maintenance
- Software maintenance or upgrades of the operating system, the middleware, the database server or the business application
- Database administration such as offline backup, or offline reorganization

Unplanned outages are unexpected outages that are caused by the failure of any system component.
They include hardware failures, software issues, or people and process issues.

History

Various commercially viable examples exist for hardware/software implementations. These include:
  • Tandem NonStop Computers
  • Stratus
    Stratus VOS
    VOS is a proprietary operating system running on Stratus Technologies fault-tolerant computer systems. VOS is available on Stratus's ftServer and Continuum platforms...

  • IBM Parallel Sysplex
    IBM Parallel Sysplex
    In computing, a Parallel Sysplex is a cluster of IBM mainframes acting together as a single system image with z/OS. Used for disaster recovery, Parallel Sysplex combines data sharing and parallel computing to allow a cluster of up to 32 systems to share a workload for high performance and high...


See also

  • Business continuity planning
    Business continuity planning
    Business continuity planning “identifies [an] organization's exposure to internal and external threats and synthesizes hard and soft assets to provide effective prevention and recovery for the organization, whilst maintaining competitive advantage and value system integrity”. It is also called...

  • Disaster recovery
    Disaster recovery
    Disaster recovery is the process, policies and procedures related to preparing for recovery or continuation of technology infrastructure critical to an organization after a natural or human-induced disaster. Disaster recovery is a subset of business continuity...

  • High-availability cluster
    High-availability cluster
    High-availability clusters are groups of computers that support server applications that can be reliably utilized with a minimum of down-time. They operate by harnessing redundant computers in groups or clusters that provide continued service when system components fail...

  • Fault-tolerant system
    Fault-tolerant system
    Fault-tolerance or graceful degradation is the property that enables a system to continue operating properly in the event of the failure of some of its components. A newer approach is progressive enhancement...

  • Service Availability Forum
    Service Availability Forum
    The Service Availability Forum is a consortium that develops, publishes, educates on and promotes open specifications for carrier-grade and mission-critical systems. SA Forum specifications enable faster development and deployment of commercial off-the-shelf ecosystems for highly available...


External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK