| United States-English |
|
|
|
![]() |
Designing Disaster Tolerant High Availability Clusters: > Chapter 1 Disaster Tolerance and Recovery
in an MC/ServiceGuard ClusterTypes of Disaster Tolerant Clusters |
|
To protect against multiple points of failure, a cluster must be geographically dispersed: nodes can be put in different rooms, on different floors of a building, or even in separate buildings or separate cities. The distance between the nodes is dependent on the types of disaster from which you need protection, and on the technology used to replicate data. A local cluster has all nodes located in a single data center and is not disaster tolerant. Because most high-availability clusters are local clusters, this type is included in the discussion as a baseline for comparison with other cluster architectures. A campus cluster has alternate nodes located in different data centers. These data centers can be separate rooms in a building, adjacent buildings, or even buildings separated by some distance. The word "campus" implies that the organization housing the cluster owns or leases the land and buildings such that no outside permission is needed to dig trenches, lay cable, or reroute power circuits. Campus clusters are designed so that no single building failure will cause the cluster to fail. Campus clusters are connected using a high speed cable that guarantees network access between the nodes as long as all guidelines for disaster tolerant architecture are followed. The distance between nodes in a campus cluster is limited by the data replication technology. Architecture requirements for campus clusters usually consist of:
Campus clusters are implemented use standard MC/ServiceGuard with FibreChannel mass storage. A metropolitan cluster is a cluster that has alternate nodes located in different parts of a city or in adjacent cities. Putting nodes further apart increases the likelihood that alternate nodes will be available for failover in the event of a disaster. Metropolitan clusters are very similar to campus clusters with the exception that metropolitan clusters often require right-of-way from local governments or utilities to lay network and data replication cable. This can complicate the design and implementation. Metropolitan clusters also require a different kind of tie-breaker system for ensuring that split-brain situations do not arise. Typically, metropolitan clusters use an arbitrator site containing additional cluster nodes instead of the cluster lock disk. The architectural requirements are the same as for a campus cluster, with the additional constraint of the third data center. And as with a campus cluster, the distance separating the nodes in a metropolitan cluster is limited by the data replication and network technology available. Metropolitan cluster architecture is implemented through two HP products:
A key difference between campus and metropolitan clusters is the data replication technology used. The campus cluster uses FibreChannel and MirrorDisk/UX, which limits the distances between data centers. Metropolitan clusters use data replication based on the capabilities of the HP SureStore E Disk Array XP series or the EMC Symmetrix array, which allow greater distances. A continental cluster provides alternate clusters that are separated by large distances so that wide area networking is used between them. Continental cluster architecture is implemented via the ContinentalClusters product, described fully in Chapter 5. The design is implemented with two distinct ServiceGuard clusters located in different geographic areas. In this architecture, each cluster maintains its own quorum, so an arbitrator data center is not used. ContinentalClusters is architected to use any WAN connection via a TCP/IP protocol; however, due to data replication needs, high speed connections such as T1 or T3/E3 leased lines or switched lines may be required. The key issues concerning a WAN cluster are:
Another way of setting up your continental cluster is the use of cascading failover, an option available for users of the EMC Symmetrix. Cascading failover means that applications are configured to fail over on nodes within two data centers in either cluster. They fail over to a cluster if the entire alternate cluster is down. For example, bi-directional failover allows a cluster to be both a recovery cluster and a primary cluster for different packages. Data replication also follows the cascading model. Data is replicated from the primary disk array to the secondary disk array in the MetroCluster, then data is replicated to the third disk array in the ServiceGuard recovery cluster. A continental cluster with cascading failover uses four data centers distributed between a metropolitan cluster, which serves as a primary cluster, and a standard cluster, which serves as a recovery cluster. The configuration uses three EMC Symmetrix frames, two of which are part of the metropolitan cluster and the other attached to the recovery cluster. The data centers are distributed as follows:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||||||