Availability and redundancy
Google Cloud VMware Engine provides availability for your applications running on VMware in your private cloud environment. The following table lists failure scenarios and the associated availability features that help protect your applications.
Failure scenario | Application protected? | VMware Engine HA feature | VMware HA feature | Google Cloud feature |
---|---|---|---|---|
Disk failure | Yes | Fast replacement of failed node | About the vSAN Default Storage Policy | |
Fan failure | Yes | Redundant fans, fast replacement of failed node | ||
NIC failure | Yes | Redundant NIC, fast replacement of failed node | ||
Host power failure | Yes | Redundant power supply | ||
ESXi host failure | Yes | Fast replacement of failed node | VMware vSphere High Availability | |
VM failure | Yes | VMware vSphere High Availability | Load balancer for stateless VMware VMs | |
Leaf switch port failure | Yes | Redundant NIC | ||
Leaf switch failure | Yes | Redundant leaf switches | ||
Rack failure | Yes | Placement groups | ||
Network connectivity to on-premises | Yes | Redundant networking services | Redundant Dedicated Interconnect and Partner Interconnect circuits | |
Network connectivity | Yes | Redundant Dedicated Interconnect and Partner Interconnect circuits | ||
Regional failure | Yes | Hosting regions |
Availability features
Fast replacement of a failed node
VMware Engine continuously monitors the health of VMware clusters. When VMware Engine detects an ESXi node failure, it adds a new ESXi host to the affected VMware cluster from its pool of readily available nodes and removes the failed node from the cluster. This functionality quickly restores the spare capacity in the VMware cluster, supporting the cluster's resiliency provided by vSAN and VMware vSphere High Availability (HA).
Placement groups
A user who creates a private cloud can select a region and a placement group within the selected region. A placement group is a set of nodes spread across multiple racks but within the same spine network segment. Nodes within the same placement group can reach each other with a maximum of two extra switch hops. A placement group is always within a single availability zone and spans multiple racks. The control plane distributes nodes of a private cloud across multiple racks based on best effort. Nodes in different placement groups are guaranteed to be placed in different racks.
Availability zones
VMware Engine private clouds are hosted in a user-selected Google Cloud location. These locations are composed of regions and zones. A region is a specific geographical location where you can host your resources. Each region has one or more zones.
Resources in different regions are isolated from most types of physical infrastructure and infrastructure software service failures. You can design a robust system and distribute resources across different failure domains to protect your applications and data from data center failures.
Redundant networking services
All Google networking services for the private cloud (including firewall, public IP addresses, internet, Dedicated Interconnect, Partner Interconnect, and Cloud VPN) are highly available and support the SLA.