This document shows you how to deploy a cross-region internal Application Load Balancer with Cloud Run. To set this up, you use a serverless NEG backend for the load balancer.
Serverless NEGs let you use Cloud Run services with your load balancer. After you configure a load balancer with the serverless NEG backend, requests to the load balancer are routed to the Cloud Run backend.
Cross-regional load balancing provides redundancy, so that if a region is unreachable, traffic is automatically diverted to another region. Based on the location of the Envoy, proxy traffic is distributed to Cloud Run services as follows:
- If multi-region Cloud Run services are configured in the same region as the Envoy, the NEG that is located in the same region as the Envoy is preferred. Traffic is sent to the failover region only if outlier detection is enabled and the local NEG is unhealthy.
- If multi-region Cloud Run services are not configured in the same region as the Envoy, traffic is distributed evenly across all NEGs. The NEGs located closer are not preferred.
- If Identity-Aware Proxy is enabled, only a single serverless NEG is supported. You can, however, configure additional Cloud Run services but the load balancer does not send any traffic to them.
Before you begin
Before following this guide, familiarize yourself with the following:
- Internal Application Load Balancer overview, including the Limitations section
- VPC firewall rules
- Serverless network endpoint groups overview
Deploy a Cloud Run service
The instructions on this page assume you already have a Cloud Run service running.
For the example on this page, you can use any of the Cloud Run quickstarts to deploy a Cloud Run service.
To prevent access to the Cloud Run service from the
internet, restrict ingress to internal
. Traffic
from the internal Application Load Balancer is considered internal traffic.
Placing the Cloud Run service in multiple regions helps
prevent failures in a single region. To deploy the
Cloud Run service in REGION_A
and
REGION_B
regions, run the following commands:
gcloud
gcloud run deploy CLOUD_RUN_SERVICE_NAMEA \ --platform=managed \ --allow-unauthenticated \ --ingress=internal \ --region=REGION_A \ --image=IMAGE_URLA
gcloud run deploy CLOUD_RUN_SERVICE_NAMEB \ --platform=managed \ --allow-unauthenticated \ --ingress=internal \ --region=REGION_B \ --image=IMAGE_URLB
Note the name of the service that you create. The rest of this page shows you how to set up a load balancer that routes requests to this service.
Set up an SSL certificate resource
Create a Certificate Manager SSL certificate resource as follows:
- Deploy a global self-managed certificate
- Create a Google-managed certificate issued by your Certificate Authority Service instance
- Create a Google-managed certificate with DNS authorization
We recommend using a Google-managed certificate.
Permissions
To follow this guide, you must be able to create instances and modify a network in a project. You must be either a project owner or editor, or you must have all of the following Compute Engine IAM roles.
Task | Required role |
---|---|
Create networks, subnets, and load balancer components | Compute Network Admin |
Add and remove firewall rules | Compute Security Admin |
Create instances | Compute Instance Admin |
For more information, see the following guides:
Setup overview
You can configure the cross-region internal Application Load Balancer as described in the following diagram:
As shown in the diagram, this example creates a cross-region internal Application Load Balancer in a
VPC network, with one backend
service and two Cloud Run deployments in
REGION_A
and
REGION_B
regions.
The cross-region internal Application Load Balancer setup is described as follows:
A VPC network with the following subnets:
- Subnet
SUBNET_A
and a proxy-only subnet inREGION_A
. - Subnet
SUBNET_B
and a proxy-only subnet inREGION_B
.
You must create proxy-only subnets in each region of a VPC network where you use cross-region internal Application Load Balancers. The region's proxy-only subnet is shared among all cross-region internal Application Load Balancers in the region. Source addresses of packets sent from the load balancer to your service's backends are allocated from the proxy-only subnet. In this example, the proxy-only subnet for the region
REGION_A
has a primary IP address range of10.129.0.0/23
and forREGION_B
has a primary IP address range of10.130.0.0/23
which is the recommended subnet size.- Subnet
A firewall rule that permits proxy-only subnet traffic flows in your network. This means adding one rule that allows TCP port
80
,443
, and8080
traffic from10.129.0.0/23
and10.130.0.0/23
(the range of the proxy-only subnets in this example).Another firewall rule for the health check probes.
A high availability setup that has serverless backends for Cloud Run deployments in
REGION_A
andREGION_B
regions. If the backends in one region happen to be down, traffic fails over to the other region.A global backend service that monitors the usage and health of backends. Ensure that you enable outlier detection on the backend service.
A global URL map that parses the URL of a request and forwards requests to specific backend services based on the host and path of the request URL.
A global target HTTP or HTTPS proxy, which receives a request from the user and forwards it to the URL map. For HTTPS, configure a regional SSL certificate resource. The target proxy uses the SSL certificate to decrypt SSL traffic if you configure HTTPS load balancing. The target proxy can forward traffic to your instances by using HTTP or HTTPS.
Global forwarding rules, which have the internal IP address of your load balancer, to forward each incoming request to the target proxy.
The internal IP address associated with the forwarding rule can come from any subnet in the same network and region. Note the following conditions:
- The IP address can (but does not need to) come from the same subnet as the backend instance groups.
- The IP address must not come from a reserved proxy-only subnet that has
its
--purpose
flag set toGLOBAL_MANAGED_PROXY
. - If you want to use the same internal IP address with multiple forwarding
rules, set the IP address
--purpose
flag toSHARED_LOADBALANCER_VIP
.
Optional: Configure DNS routing policies of type
GEO
to route client traffic to the load balancer VIP in the region closest to the client.
Configure the network and subnets
Within the VPC network, configure a subnet in each region
where your backends are configured. In addition, configure a proxy-only-subnet
in each region in which you want to configure the load balancer.
This example uses the following VPC network, region, and subnets:
Network. The network is a custom mode VPC network named
NETWORK
.Subnets for backends. A subnet named
SUBNET_A
in theREGION_A
region uses10.1.2.0/24
for its primary IP range. Subnet namedSUBNET_A
in theREGION_B
region uses10.1.3.0/24
for its primary IP range.Subnet for proxies. A subnet named
PROXY_SN_A
in theREGION_A
region uses10.129.0.0/23
for its primary IP range. A subnet namedPROXY_SN_B
in theREGION_B
region uses10.130.0.0/23
for its primary IP range.
Cross-region internal Application Load Balancers can be accessed from any region within the VPC. So clients from any region can globally access your load balancer backends.
Configure the backend subnets
Console
In the Google Cloud console, go to the VPC networks page.
Click Create VPC network.
Provide a Name for the network.
In the Subnets section, set the Subnet creation mode to Custom.
Create a subnet for the load balancer's backends. In the New subnet section, enter the following information:
- Provide a Name for the subnet.
- Select a Region: REGION_A
- Enter an IP address range:
10.1.2.0/24
Click Done.
Click Add subnet.
Create a subnet for the load balancer's backends. In the New subnet section, enter the following information:
- Provide a Name for the subnet.
- Select a Region: REGION_B
- Enter an IP address range:
10.1.3.0/24
Click Done.
Click Create.
gcloud
Create the custom VPC network with the
gcloud compute networks create
command:gcloud compute networks create NETWORK --subnet-mode=custom
Create a subnet in the
NETWORK
network in theREGION_A
region with thegcloud compute networks subnets create
command:gcloud compute networks subnets create SUBNET_A \ --network=NETWORK \ --range=10.1.2.0/24 \ --region=REGION_A
Create a subnet in the
NETWORK
network in theREGION_B
region with thegcloud compute networks subnets create
command:gcloud compute networks subnets create SUBNET_B \ --network=NETWORK \ --range=10.1.3.0/24 \ --region=REGION_B
API
Make a POST
request to the
networks.insert
method.
Replace PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/networks { "routingConfig": { "routingMode": "regional" }, "name": "NETWORK", "autoCreateSubnetworks": false }
Make a POST
request to the
subnetworks.insert
method.
Replace PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION_A/subnetworks { "name": "SUBNET_A", "network": "projects/PROJECT_ID/global/networks/NETWORK", "ipCidrRange": "10.1.2.0/24", "region": "projects/PROJECT_ID/regions/REGION_A", }
Make a POST
request to the
subnetworks.insert
method.
Replace PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION_B/subnetworks { "name": "SUBNET_B", "network": "projects/PROJECT_ID/global/networks/NETWORK", "ipCidrRange": "10.1.3.0/24", "region": "projects/PROJECT_ID/regions/REGION_B", }
Configure the proxy-only subnet
A proxy-only subnet provides a set of IP addresses that Google Cloud uses to run Envoy proxies on your behalf. The proxies end connections from the client and create connections to the backends.
This proxy-only subnet is used by all Envoy-based regional load balancers in the same region as the VPC network. There can only be one active proxy-only subnet for a given purpose, per region, per network.
Console
If you're using the Google Cloud console, you can wait and create the proxy-only subnet later on the Load balancing page.
If you want to create the proxy-only subnet now, use the following steps:
In the Google Cloud console, go to the VPC networks page.
- Click the name of the VPC network.
- On the Subnets tab, click Add subnet.
- Provide a Name for the proxy-only subnet.
- In the Region list, select REGION_A.
- In the Purpose list, select Cross-region Managed Proxy.
- In the IP address range field, enter
10.129.0.0/23
. - Click Add.
Create the proxy-only subnet in REGION_B
- Click Add subnet.
- Provide a Name for the proxy-only subnet.
- In the Region list, select REGION_B.
- In the Purpose list, select Cross-region Managed Proxy.
- In the IP address range field, enter
10.130.0.0/23
. - Click Add.
gcloud
Create the proxy-only subnets with the
gcloud compute networks subnets create
command.
gcloud compute networks subnets create PROXY_SN_A \ --purpose=GLOBAL_MANAGED_PROXY \ --role=ACTIVE \ --region=REGION_A \ --network=NETWORK \ --range=10.129.0.0/23
gcloud compute networks subnets create PROXY_SN_B \ --purpose=GLOBAL_MANAGED_PROXY \ --role=ACTIVE \ --region=REGION_B \ --network=NETWORK \ --range=10.130.0.0/23
API
Create the proxy-only subnets with the
subnetworks.insert
method, replacing
PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION_A/subnetworks { "name": "PROXY_SN_A", "ipCidrRange": "10.129.0.0/23", "network": "projects/PROJECT_ID/global/networks/NETWORK", "region": "projects/PROJECT_ID/regions/REGION_A", "purpose": "GLOBAL_MANAGED_PROXY", "role": "ACTIVE" }
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION_B/subnetworks { "name": "PROXY_SN_B", "ipCidrRange": "10.130.0.0/23", "network": "projects/PROJECT_ID/global/networks/NETWORK", "region": "projects/PROJECT_ID/regions/REGION_B", "purpose": "GLOBAL_MANAGED_PROXY", "role": "ACTIVE" }
Create the serverless NEGs
Create a serverless NEG for your Cloud Run service:
gcloud compute network-endpoint-groups create gl7ilb-serverless-neg-a \ --region=REGION_A \ --network-endpoint-type=serverless \ --cloud-run-service=CLOUD_RUN_SERVICE_NAMEA
gcloud compute network-endpoint-groups create gl7ilb-serverless-neg-b \ --region=REGION_B \ --network-endpoint-type=serverless \ --cloud-run-service=CLOUD_RUN_SERVICE_NAMEB
Configure the load balancer
Traffic going from the load balancer to the serverless NEG backends uses special routes defined outside your VPC that are not subject to firewall rules. Therefore, if your load balancer only has serverless NEG backends, you don't need to create firewall rules to allow traffic from the proxy-only subnet to the serverless backend.
Console
Start your configuration
In the Google Cloud console, go to the Load balancing page.
- Click Create load balancer.
- For Type of load balancer, select Application Load Balancer (HTTP/HTTPS) and click Next.
- For Public facing or internal, select Internal and click Next.
- For Cross-region or single region deployment, select Best for cross-region workloads and click Next.
- Click Configure.
Basic configuration
- Provide a Name for the load balancer.
- For Network, select NETWORK.
Configure the frontend with two forwarding rules
For HTTP:
- Click Frontend configuration.
- Provide a Name for the forwarding rule.
- In the Subnetwork region list, select REGION_A.
Reserve a proxy-only subnet
- In the Subnetwork list, select SUBNET_A.
- In the IP address list, click Create IP address. The Reserve a static internal IP address page opens.
- Provide a Name for the static IP address.
- In the Static IP address list, select Let me choose.
- In the Custom IP address field, enter
10.1.2.99
. - Select Reserve.
- Click Done.
- To add the second forwarding rule, click Add frontend IP and port.
- Provide a Name for the forwarding rule.
- In the Subnetwork region list, select REGION_B.
Reserve a proxy-only subnet
- In the Subnetwork list, select SUBNET_B.
- In the IP address list, click Create IP address. The Reserve a static internal IP address page opens.
- Provide a Name for the static IP address.
- In the Static IP address list, select Let me choose.
- In the Custom IP address field, enter
10.1.3.99
. - Select Reserve.
- Click Done.
For HTTPS:
If you are using HTTPS between the client and the load balancer,
you need one or more SSL certificate resources to configure the proxy.
To create an all-regions
Google-managed certificate, see the
following documentation:
- Create a Google-managed certificate issued by your Certificate Authority Service instance.
- Create a Google-managed certificate with DNS authorization.
After you create the Google-managed certificate, attach the certificate directly to the target proxy. Certificate maps are not supported by cross-region internal Application Load Balancers.
To create an all-regions
self-managed certificate, see the
following documentation:
Deploy a regional self-managed certificate.
- Click Frontend configuration.
- Provide a Name for the forwarding rule.
- In the Protocol field, select
HTTPS (includes HTTP/2)
. - Ensure that the Port is set to
443
. - In the Subnetwork region list, select REGION_A.
Reserve a proxy-only subnet
- In the Subnetwork list, select SUBNET_A.
- In the IP address list, click Create IP address. The Reserve a static internal IP address page opens.
- Provide a Name for the static IP address.
- In the Static IP address list, select Let me choose.
- In the Custom IP address field, enter
10.1.3.99
. - Select Reserve.
- In the Add certificate section, select the certificate.
- Optional: To add certificates in addition to the primary SSL certificate:
- Click Add certificate.
- Select the certificate from the list.
- Select an SSL policy from the SSL policy list. If you have not created any SSL policies, a default Google Cloud SSL policy is applied.
- Click Done.
- Provide a Name for the frontend configuration.
- In the Protocol field, select
HTTPS (includes HTTP/2)
. - Ensure that the Port is set to
443
. - In the Subnetwork region list, select REGION_B.
Reserve a proxy-only subnet
- In the Subnetwork list, select SUBNET_B.
- In the IP address list, click Create IP address. The Reserve a static internal IP address page opens.
- Provide a Name for the static IP address.
- In the Static IP address list, select Let me choose.
- In the Custom IP address field, enter
10.1.3.99
. - Select Reserve.
- In the Add certificate section, select the certificate.
- Optional: To add certificates in addition to the primary SSL certificate:
- Click Add certificate.
- Select the certificate from the list.
- Select an SSL policy from the SSL policy list. If you have not created any SSL policies, a default Google Cloud SSL policy is applied.
- Click Done.
- Click Backend configuration.
- In the Create or select backend services list, click Create a backend service.
- Provide a Name for the backend service.
- For Protocol, select HTTP.
- For Named Port, enter
http
. - In the Backend type list, select Serverless network endpoint group.
- In the New backend section:
- In the Serverless network endpoint group list,
select
gl7ilb-serverless-neg-a
. - Click Done.
- To add another backend, click Add backend.
- In the Serverless network endpoint group list,
select
gl7ilb-serverless-neg-b
. - Click Done.
- Click Routing rules.
- For Mode, select Simple host and path rule.
- Ensure that there is only one backend service for any unmatched host and any unmatched path.
- Click Review and finalize.
- Review your load balancer configuration settings.
- Click Create.
Add the second frontend configuration:
Configure the routing rules
Review the configuration
gcloud
Define the backend service with the
gcloud compute backend-services create
command.gcloud compute backend-services create gil7-backend-service \ --load-balancing-scheme=INTERNAL_MANAGED \ --protocol=HTTP \ --global
Add backends to the backend service with the
gcloud compute backend-services add-backend
command.gcloud compute backend-services add-backend gil7-backend-service \ --network-endpoint-group=gl7ilb-serverless-neg-a \ --network-endpoint-group-region=REGION_A \ --global
gcloud compute backend-services add-backend gil7-backend-service \ --network-endpoint-group=gl7ilb-serverless-neg-b \ --network-endpoint-group-region=REGION_B \ --global
Create the URL map with the
gcloud compute url-maps create
command.gcloud compute url-maps create gil7-map \ --default-service=gil7-backend-service \ --global
Create the target proxy.
For HTTP:
Create the target proxy with the
gcloud compute target-http-proxies create
command.gcloud compute target-http-proxies create gil7-http-proxy \ --url-map=gil7-map \ --global
For HTTPS:
To create a Google-managed certificate, see the following documentation:
- Create a Google-managed certificate issued by your Certificate Authority Service instance
- Create a Google-managed certificate with DNS authorization
After you create the Google-managed certificate, attach the certificate directly to the target proxy. Certificate maps are not supported by cross-region internal Application Load Balancers.
To create a self-managed certificate, see the following documentation:
Assign your file paths to variable names.
export LB_CERT=PATH_TO_PEM_FORMATTED_FILE
export LB_PRIVATE_KEY=PATH_TO_LB_PRIVATE_KEY_FILE
Create an all region SSL certificate using the
gcloud certificate-manager certificates create
command.gcloud certificate-manager certificates create gilb-certificate \ --private-key-file=$LB_PRIVATE_KEY \ --certificate-file=$LB_CERT \ –-scope=all-regions
Use the SSL certificate to create a target proxy with the
gcloud compute target-https-proxies create
commandgcloud compute target-https-proxies create gil7-https-proxy \ --url-map=gil7-map \ --certificate-manager-certificates=gilb-certificate
Create two forwarding rules: one with a VIP (
10.1.2.99
) in theREGION_B
region and another one with a VIP (10.1.3.99
) in theREGION_A
region.For custom networks, you must reference the subnet in the forwarding rule. Note that this is the virtual machine (VM) instance subnet, not the proxy subnet.
For HTTP:
Use the
gcloud compute forwarding-rules create
command with the correct flags.gcloud compute forwarding-rules create gil7-forwarding-rule-a \ --load-balancing-scheme=INTERNAL_MANAGED \ --network=NETWORK \ --subnet=SUBNET_B \ --subnet-region=REGION_B \ --address=10.1.3.99 \ --ports=80 \ --target-http-proxy=gil7-http-proxy \ --global
gcloud compute forwarding-rules create gil7-forwarding-rule-b \ --load-balancing-scheme=INTERNAL_MANAGED \ --network=NETWORK \ --subnet=SUBNET_A \ --subnet-region=REGION_A \ --address=10.1.2.99 \ --ports=80 \ --target-http-proxy=gil7-http-proxy \ --global
For HTTPS:
Create the forwarding rule with the
gcloud compute forwarding-rules create
command with the correct flags.gcloud compute forwarding-rules create gil7-forwarding-rule-a \ --load-balancing-scheme=INTERNAL_MANAGED \ --network=NETWORK \ --subnet=SUBNET_B \ --address=10.1.3.99 \ --ports=443 \ --target-https-proxy=gil7-https-proxy \ --global
gcloud compute forwarding-rules create gil7-forwarding-rule-b \ --load-balancing-scheme=INTERNAL_MANAGED \ --network=NETWORK \ --subnet=SUBNET_A \ --address=10.1.2.99 \ --ports=443 \ --target-https-proxy=gil7-https-proxy \ --global
API
Create the global backend service by making a POST
request to the
backendServices.insert
method,
replacing PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/backendServices { "name": "gil7-backend-service", "backends": [ { "group": "projects/PROJECT_ID/zones/ZONE_A/instanceGroups/gl7ilb_serverless_negwest", "balancingMode": "UTILIZATION" }, { "group": "projects/PROJECT_ID/zones/ZONE_B/instanceGroups/gl7ilb_serverless_negeast", } ], "loadBalancingScheme": "INTERNAL_MANAGED" }
Create the URL map by making a POST
request to the
urlMaps.insert
method,
replacing PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/urlMaps { "name": "l7-ilb-map", "defaultService": "projects/PROJECT_ID/global/backendServices/gil7-backend-service" }
For HTTP:
Create the target HTTP proxy by making a POST
request to the
targetHttpProxies.insert
method,
replacing PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/targetHttpProxy { "name": "l7-ilb-proxy", "urlMap": "projects/PROJECT_ID/global/urlMaps/l7-ilb-map" }
Create the forwarding rule by making a POST
request to the
globalforwardingRules.insert
method,
replacing PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/forwardingRules { "name": "gil7-forwarding-rule-a", "IPAddress": "10.1.2.99", "IPProtocol": "TCP", "portRange": "80-80", "target": "projects/PROJECT_ID/global/targetHttpProxies/l7-ilb-proxy", "loadBalancingScheme": "INTERNAL_MANAGED", "subnetwork": "projects/PROJECT_ID/regions/REGION_A/subnetworks/SUBNET_A", "network": "projects/PROJECT_ID/global/networks/NETWORK", "networkTier": "PREMIUM" }
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/forwardingRules { "name": "gil7-forwarding-rule-b", "IPAddress": "10.1.3.99", "IPProtocol": "TCP", "portRange": "80-80", "target": "projects/PROJECT_ID/global/targetHttpProxies/l7-ilb-proxy", "loadBalancingScheme": "INTERNAL_MANAGED", "subnetwork": "projects/PROJECT_ID/regions/REGION_B/subnetworks/SUBNET_B", "network": "projects/PROJECT_ID/global/networks/NETWORK", "networkTier": "PREMIUM" }
For HTTPS:
Read the certificate and private key files, and then create the SSL certificate. The following example shows how to do this with Python.
Create the target HTTPS proxy by making a POST
request to the
targetHttpsProxies.insert
method,
replacing PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/targetHttpsProxy { "name": "l7-ilb-proxy", "urlMap": "projects/PROJECT_ID/global/urlMaps/l7-ilb-map", "sslCertificates": /projects/PROJECT_ID/global/sslCertificates/SSL_CERT_NAME }
Create the forwarding rule by making a POST
request to the
globalForwardingRules.insert
method,
replacing PROJECT_ID
with your project ID.
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/forwardingRules { "name": "gil7-forwarding-rule-a", "IPAddress": "10.1.2.99", "IPProtocol": "TCP", "portRange": "80-80", "target": "projects/PROJECT_ID/global/targetHttpsProxies/l7-ilb-proxy", "loadBalancingScheme": "INTERNAL_MANAGED", "subnetwork": "projects/PROJECT_ID/regions/REGION_A/subnetworks/SUBNET_A", "network": "projects/PROJECT_ID/global/networks/NETWORK", "networkTier": "PREMIUM" }
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/global/forwardingRules { "name": "gil7-forwarding-rule-b", "IPAddress": "10.1.3.99", "IPProtocol": "TCP", "portRange": "80-80", "target": "projects/PROJECT_ID/global/targetHttpsProxies/l7-ilb-proxy", "loadBalancingScheme": "INTERNAL_MANAGED", "subnetwork": "projects/PROJECT_ID/regions/REGION_B/subnetworks/SUBNET_B", "network": "projects/PROJECT_ID/global/networks/NETWORK", "networkTier": "PREMIUM" }
Test the load balancer
Now that the load balancing service is running, you can send traffic to the forwarding rule and observe the traffic being dispersed to different instances.
Configure the firewall rule
This example requires the fw-allow-ssh
firewall rule for the test client VM.
fw-allow-ssh
is an ingress rule that is applicable to the test client VM and
that allows incoming SSH connectivity on TCP port 22
from any
address. You can choose a more restrictive source IP address range for this
rule; for example, you can specify just the IP address ranges of the system from
which you initiate SSH sessions. This example uses the target tag allow-ssh
.
gcloud
Create the
fw-allow-ssh
firewall rule to allow SSH connectivity to VMs with the network tagallow-ssh
. When you omitsource-ranges
, Google Cloud interprets the rule to mean any source.gcloud compute firewall-rules create fw-allow-ssh \ --network=NETWORK \ --action=allow \ --direction=ingress \ --target-tags=allow-ssh \ --rules=tcp:22
Create a VM instance to test connectivity
Create a client VM:
gcloud compute instances create l7-ilb-client-a \ --image-family=debian-12 \ --image-project=debian-cloud \ --network=NETWORK \ --subnet=SUBNET_A \ --zone=ZONE_A \ --tags=allow-ssh
gcloud compute instances create l7-ilb-client-b \ --image-family=debian-12 \ --image-project=debian-cloud \ --network=NETWORK \ --subnet=SUBNET_B \ --zone=ZONE_B \ --tags=allow-ssh
Connect, using SSH, to each client instance.
gcloud compute ssh l7-ilb-client-a \ --zone=ZONE_A
gcloud compute ssh l7-ilb-client-b \ --zone=ZONE_B
Verify that the IP address is serving its hostname.
Verify that the client VM can reach both IP addresses. The command succeeds and returns the name of the backend VM that served the request:
curl 10.1.2.99
curl 10.1.3.99
For HTTPS testing, replace
curl
with:curl -k -s 'https://test.example.com:443' --connect-to test.example.com:443:10.1.2.99:443
curl -k -s 'https://test.example.com:443' --connect-to test.example.com:443:10.1.3.99:443
The
-k
flag causes curl to skip certificate validation.Optional: Use the configured DNS record to resolve the IP address.
curl service.example.com
Run 100 requests and confirm that they are load balanced
For HTTP:
{ RESULTS= for i in {1..100} do RESULTS="$RESULTS:$(curl --silent 10.1.2.99)" done echo "" echo " Results of load-balancing to 10.1.2.99: " echo "***" echo "$RESULTS" | tr ':' '\n' | grep -Ev "^$" | sort | uniq -c echo }
{ RESULTS= for i in {1..100} do RESULTS="$RESULTS:$(curl --silent 10.1.3.99)" done echo "" echo " Results of load-balancing to 10.1.3.99: " echo "***" echo "$RESULTS" | tr ':' '\n' | grep -Ev "^$" | sort | uniq -c echo }
For HTTPS:
{ RESULTS= for i in {1..100} do RESULTS="$RESULTS:$(curl -k -s 'https://test.example.com:443' --connect-to test.example.com:443:10.1.2.99:443)" done echo "" echo " Results of load-balancing to 10.1.2.99: " echo "***" echo "$RESULTS" | tr ':' '\n' | grep -Ev "^$" | sort | uniq -c echo }
{ RESULTS= for i in {1..100} do RESULTS="$RESULTS:$(curl -k -s 'https://test.example.com:443' --connect-to test.example.com:443:10.1.3.99:443)" done echo "" echo " Results of load-balancing to 10.1.3.99: " echo "***" echo "$RESULTS" | tr ':' '\n' | grep -Ev "^$" | sort | uniq -c echo }
Test failover
Verify failover to backends in the
REGION_A
region when backends in theREGION_B
regions are unhealthy or unreachable. We simulate this by removing all of the backends fromREGION_B
:gcloud compute backend-services remove-backend gil7-backend-service \ --network-endpoint-group=gl7ilb-serverless-neg-b \ --network-endpoint-group-zone=ZONE_B
Connect, using SSH, to a client VM in
REGION_B
.gcloud compute ssh l7-ilb-client-b \ --zone=ZONE_B
Send requests to the load balanced IP address in the
REGION_B
region. The command output displays responses from backend VMs inREGION_A
:{ RESULTS= for i in {1..100} do RESULTS="$RESULTS:$(curl -k -s 'https://test.example.com:443' --connect-to test.example.com:443:10.1.3.99:443)" done echo "***" echo "*** Results of load-balancing to 10.1.3.99: " echo "***" echo "$RESULTS" | tr ':' '\n' | grep -Ev "^$" | sort | uniq -c echo }
Additional configuration options
This section expands on the configuration example to provide alternative and additional configuration options. All of the tasks are optional. You can perform them in any order.
Using a URL mask
When creating a serverless NEG, instead of selecting a specific Cloud Run service, you can use a URL mask to point to multiple services serving at the same domain. A URL mask is a template of your URL schema. The serverless NEG uses this template to extract the service name from the incoming request's URL and map the request to the appropriate service.
URL masks are particularly useful if your service is mapped to a custom domain rather than to the default address that Google Cloud provides for the deployed service. A URL mask lets you target multiple services and versions with a single rule even when your application is using a custom URL pattern.
If you haven't already done so, make sure you read the Serverless NEGS overview: URL Masks.
Construct a URL mask
To construct a URL mask for your load balancer, start with the URL of your
service. This example uses a sample serverless app running at
https://example.com/login
. This is the URL where the app's login
service
is served.
- Remove the
http
orhttps
from the URL. You are left withexample.com/login
. - Replace the service name with a placeholder for the URL mask.
- Cloud Run: Replace the
Cloud Run service name with the
placeholder
<service>
. If the Cloud Run service has a tag associated with it, replace the tag name with the placeholder<tag>
. In this example, the URL mask you are left with isexample.com/<service>
.
- Cloud Run: Replace the
Cloud Run service name with the
placeholder
Optional: If the service name can be extracted from the path portion of the URL, the domain can be omitted. The path part of the URL mask is distinguished by the first slash (
/
) character. If a slash (/
) is not present in the URL mask, the mask is understood to represent the host only. Therefore, for this example, the URL mask can be reduced to/<service>
.Similarly, if
<service>
can be extracted from the host part of the URL, you can omit the path altogether from the URL mask.You can also omit any host or subdomain components that come before the first placeholder as well as any path components that come after the last placeholder. In such cases, the placeholder captures the required information for the component.
Here are a few more examples that demonstrate these rules:
This table assumes that you have a custom domain called example.com
and
all your Cloud Run services are being mapped
to this domain.
Service, Tag name | Cloud Run custom domain URL | URL mask |
---|---|---|
service: login | https://login-home.example.com/web | <service>-home.example.com |
service: login | https://example.com/login/web | example.com/<service> or /<service> |
service: login, tag: test | https://test.login.example.com/web | <tag>.<service>.example.com |
service: login, tag: test | https://example.com/home/login/test | example.com/home/<service>/<tag> or /home/<service>/<tag> |
service: login, tag: test | https://test.example.com/home/login/web | <tag>.example.com/home/<service> |
Creating a serverless NEG with a URL mask
Console
For a new load balancer, you can use the same end-to-end process as described previously in this document. When configuring the backend service, instead of selecting a specific service, enter a URL mask.
If you have an existing load balancer, you can edit the backend configuration and have the serverless NEG point to a URL mask instead of a specific service.
To add a URL mask-based serverless NEG to an existing backend service, do the following:
- In the Google Cloud console, go to the Load balancing page.
Go to Load balancing - Click the name of the load balancer that has the backend service you want to edit.
- On the Load balancer details page, click Edit.
- On the Edit global external Application Load Balancer page, click Backend configuration.
- On the Backend configuration page, click Edit for the backend service you want to modify.
- Click Add backend.
- Select Create Serverless network endpoint group.
- For the Name, enter
helloworld-serverless-neg
. - Under Region, the region of the load balancer is displayed.
- Under Serverless network endpoint group type, Cloud Run is the only supported network endpoint group type.
- Select Use URL Mask.
- Enter a URL mask. For information about how to create a URL mask, see Constructing a URL mask.
- Click Create.
- In the New backend, click Done.
- Click Update.
gcloud
To create a serverless NEG with a sample URL mask of
example.com/<service>
:
gcloud compute network-endpoint-groups create SERVERLESS_NEG_MASK_NAME \ --region=REGION \ --network-endpoint-type=serverless \ --cloud-run-url-mask="example.com/<service>"
Use the same IP address between multiple internal forwarding rules
For multiple internal forwarding rules to share the same internal IP address,
you must reserve the IP address and set its --purpose
flag to
SHARED_LOADBALANCER_VIP
.
gcloud
gcloud compute addresses create SHARED_IP_ADDRESS_NAME \ --region=REGION \ --subnet=SUBNET_NAME \ --purpose=SHARED_LOADBALANCER_VIP
Configure DNS routing policies
If your clients are in multiple regions, you might want to make your cross-region internal Application Load Balancer accessible by using VIPs in these regions. This multi-region setup minimizes latency and network transit costs. In addition, it lets you set up a DNS-based, global, load balancing solution that provides resilience against regional outages. For more information, see Manage DNS routing policies and health checks.
gcloud
To create a DNS entry with a 30 second TTL, use the
gcloud dns record-sets create
command.
gcloud dns record-sets create DNS_ENTRY --ttl="30" \ --type="A" --zone="service-zone" \ --routing-policy-type="GEO" \ --routing-policy-data="REGION_A=gil7-forwarding-rule-a@global;REGION_B=gil7-forwarding-rule-b@global" \ --enable-health-checking
Replace the following:
DNS_ENTRY
: DNS or domain name of the record-setFor example,
service.example.com
REGION_A
andREGION_B
: the regions where you have configured the load balancer
API
Create the DNS record by making a POST
request to the
ResourceRecordSets.create
method.
Replace PROJECT_ID with your project ID.
POST https://www.googleapis.com/dns/v1/projects/PROJECT_ID/managedZones/SERVICE_ZONE/rrsets { "name": "DNS_ENTRY", "type": "A", "ttl": 30, "routingPolicy": { "geo": { "items": [ { "location": "REGION_A", "healthCheckedTargets": { "internalLoadBalancers": [ { "loadBalancerType": "globalL7ilb", "ipAddress": "IP_ADDRESS", "port": "80", "ipProtocol": "tcp", "networkUrl": "https://www.googleapis.com/compute/v1/projects/PROJECT_ID/global/networks/lb-network", "project": "PROJECT_ID" } ] } }, { "location": "REGION_B", "healthCheckedTargets": { "internalLoadBalancers": [ { "loadBalancerType": "globalL7ilb", "ipAddress": "IP_ADDRESS_B", "port": "80", "ipProtocol": "tcp", "networkUrl": "https://www.googleapis.com/compute/v1/projects/PROJECT_ID/global/networks/lb-network", "project": "PROJECT_ID" } ] } } ] } } }
Enable outlier detection
You can enable outlier detection on global backend services to identify unhealthy serverless NEGs and reduce the number the requests sent to the unhealthy serverless NEGs.
Outlier detection is enabled on the backend service by using one of the following methods:
- The
consecutiveErrors
method (outlierDetection.consecutiveErrors
), in which a5xx
series HTTP status code qualifies as an error. - The
consecutiveGatewayFailure
method (outlierDetection.consecutiveGatewayFailure
), in which only the502
,503
, and504
HTTP status codes qualify as an error.
Use the following steps to enable outlier detection for an existing backend
service. Note that even after enabling outlier detection, some requests can be
sent to the unhealthy service and return a 5xx
status code to
the clients. To further reduce the error rate, you can configure more aggressive
values for the outlier detection parameters. For more information, see the
outlierDetection
field.
Console
In the Google Cloud console, go to the Load balancing page.
Click the name of the load balancer whose backend service you want to edit.
On the Load balancer details page, click
Edit.On the Edit cross-region internal Application Load Balancer page, click Backend configuration.
On the Backend configuration page, click
Edit for the backend service that you want to modify.Scroll down and expand the Advanced configurations section.
In the Outlier detection section, select the Enable checkbox.
Click
Edit to configure outlier detection.Verify that the following options are configured with these values:
Property Value Consecutive errors 5 Interval 1000 Base ejection time 30000 Max ejection percent 50 Enforcing consecutive errors 100 In this example, the outlier detection analysis runs every one second. If the number of consecutive HTTP
5xx
status codes received by an Envoy proxy is five or more, the backend endpoint is ejected from the load-balancing pool of that Envoy proxy for 30 seconds. When the enforcing percentage is set to 100%, the backend service enforces the ejection of unhealthy endpoints from the load-balancing pools of those specific Envoy proxies every time the outlier detection analysis runs. If the ejection conditions are met, up to 50% of the backend endpoints from the load-balancing pool can be ejected.Click Save.
To update the backend service, click Update.
To update the load balancer, on the Edit cross-region internal Application Load Balancer page, click Update.
gcloud
Export the backend service into a YAML file.
gcloud compute backend-services export BACKEND_SERVICE_NAME \ --destination=BACKEND_SERVICE_NAME.yaml --global
Replace
BACKEND_SERVICE_NAME
with the name of the backend service.Edit the YAML configuration of the backend service to add the fields for outlier detection as highlighted in the following YAML configuration, in the
outlierDetection
section:In this example, the outlier detection analysis runs every one second. If the number of consecutive HTTP
5xx
status codes received by an Envoy proxy is five or more, the backend endpoint is ejected from the load-balancing pool of that Envoy proxy for 30 seconds. When the enforcing percentage is set to 100%, the backend service enforces the ejection of unhealthy endpoints from the load-balancing pools of those specific Envoy proxies every time the outlier detection analysis runs. If the ejection conditions are met, up to 50% of the backend endpoints from the load-balancing pool can be ejected.name: BACKEND_SERVICE_NAME backends: - balancingMode: UTILIZATION capacityScaler: 1.0 group: https://www.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION_A/networkEndpointGroups/SERVERLESS_NEG_NAME - balancingMode: UTILIZATION capacityScaler: 1.0 group: https://www.googleapis.com/compute/v1/projects/PROJECT_ID/regions/REGION_B/networkEndpointGroups/SERVERLESS_NEG_NAME_2 outlierDetection: baseEjectionTime: nanos: 0 seconds: 30 consecutiveErrors: 5 enforcingConsecutiveErrors: 100 interval: nanos: 0 seconds: 1 maxEjectionPercent: 50 port: 80 selfLink: https://www.googleapis.com/compute/v1/projects/PROJECT_ID/global/backendServices/BACKEND_SERVICE_NAME sessionAffinity: NONE timeoutSec: 30 ...
Replace the following:
BACKEND_SERVICE_NAME
: the name of the backend servicePROJECT_ID
: the ID of your projectREGION_A
andREGION_B
: the regions where the load balancer has been configured.SERVERLESS_NEG_NAME
: the name of the first serverless NEGSERVERLESS_NEG_NAME_2
: the name of the second serverless NEG
Update the backend service by importing the latest configuration.
gcloud compute backend-services import BACKEND_SERVICE_NAME \ --source=BACKEND_SERVICE_NAME.yaml --global
Outlier detection is now enabled on the backend service.
Deleting a serverless NEG
A network endpoint group cannot be deleted if it is attached to a backend service. Before you delete a NEG, ensure that it is detached from the backend service.
Console
- To make sure the serverless NEG you want to delete is not
in use by any backend service, go to the Backend services tab on the
Load balancing components page.
Go to Backend services - If the serverless NEG is in use, do the following:
- Click the name of the backend service that is using the serverless NEG.
- Click Edit.
- From the list of Backends, click to remove the serverless NEG backend from the backend service.
- Click Save.
- Go to the Network endpoint group page in the Google Cloud console.
Go to Network endpoint group - Select the checkbox for the serverless NEG you want to delete.
- Click Delete.
- Click Delete again to confirm.
gcloud
To remove a serverless NEG from a backend service, you must specify the region where the NEG was created.
gcloud compute backend-services remove-backend BACKEND_SERVICE_NAME \ --network-endpoint-group=SERVERLESS_NEG_NAME \ --network-endpoint-group-region=REGION \ --region=REGION
To delete the serverless NEG:
gcloud compute network-endpoint-groups delete SERVERLESS_NEG_NAME \ --region=REGION
What's next
- Deploy an internal Application Load Balancer with Cloud Run by using Terraform
- Clean up a load balancing setup
- Deprovision Shared VPC
- Internal Application Load Balancer logging and monitoring
- Troubleshoot issues with internal Application Load Balancers