- 0.58.0 (latest)
- 0.57.0
- 0.56.0
- 0.55.0
- 0.54.0
- 0.53.0
- 0.52.0
- 0.51.0
- 0.50.0
- 0.49.0
- 0.48.0
- 0.47.0
- 0.46.0
- 0.45.0
- 0.44.0
- 0.43.0
- 0.42.0
- 0.41.0
- 0.40.0
- 0.39.0
- 0.38.0
- 0.37.0
- 0.36.0
- 0.35.0
- 0.34.0
- 0.33.0
- 0.32.0
- 0.31.0
- 0.30.0
- 0.29.0
- 0.28.0
- 0.27.0
- 0.26.0
- 0.25.0
- 0.24.0
- 0.23.0
- 0.22.0
- 0.21.0
- 0.20.0
- 0.19.0
- 0.18.0
- 0.17.0
- 0.16.0
- 0.15.0
- 0.14.0
- 0.13.0
- 0.12.0
- 0.11.0
- 0.10.0
- 0.9.1
- 0.8.0
- 0.7.0
- 0.6.0
- 0.5.0
- 0.4.0
- 0.3.0
- 0.2.0
- 0.1.0
Reference documentation and code samples for the Vertex AI V1 API class Google::Cloud::AIPlatform::V1::ModelContainerSpec.
Specification of a container for serving predictions. Some fields in this message correspond to fields in the Kubernetes Container v1 core specification.
Inherits
- Object
Extended By
- Google::Protobuf::MessageExts::ClassMethods
Includes
- Google::Protobuf::MessageExts
Methods
#args
def args() -> ::Array<::String>
-
(::Array<::String>) — Immutable. Specifies arguments for the command that runs when the container
starts. This overrides the container's
CMD
. Specify this field as an array of executable and arguments, similar to a DockerCMD
's "default parameters" form.If you don't specify this field but do specify the command field, then the command from the
command
field runs without any additional arguments. See the Kubernetes documentation about how thecommand
andargs
fields interact with a container'sENTRYPOINT
andCMD
.If you don't specify this field and don't specify the
command
field, then the container'sENTRYPOINT
andCMD
determine what runs based on their default behavior. See the Docker documentation about howCMD
andENTRYPOINT
interact.In this field, you can reference environment variables set by Vertex AI and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax:
$(VARIABLE_NAME)
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with$$
; for example:$$(VARIABLE_NAME)
This field corresponds to theargs
field of the Kubernetes Containers v1 core API.
#args=
def args=(value) -> ::Array<::String>
-
value (::Array<::String>) — Immutable. Specifies arguments for the command that runs when the container
starts. This overrides the container's
CMD
. Specify this field as an array of executable and arguments, similar to a DockerCMD
's "default parameters" form.If you don't specify this field but do specify the command field, then the command from the
command
field runs without any additional arguments. See the Kubernetes documentation about how thecommand
andargs
fields interact with a container'sENTRYPOINT
andCMD
.If you don't specify this field and don't specify the
command
field, then the container'sENTRYPOINT
andCMD
determine what runs based on their default behavior. See the Docker documentation about howCMD
andENTRYPOINT
interact.In this field, you can reference environment variables set by Vertex AI and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax:
$(VARIABLE_NAME)
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with$$
; for example:$$(VARIABLE_NAME)
This field corresponds to theargs
field of the Kubernetes Containers v1 core API.
-
(::Array<::String>) — Immutable. Specifies arguments for the command that runs when the container
starts. This overrides the container's
CMD
. Specify this field as an array of executable and arguments, similar to a DockerCMD
's "default parameters" form.If you don't specify this field but do specify the command field, then the command from the
command
field runs without any additional arguments. See the Kubernetes documentation about how thecommand
andargs
fields interact with a container'sENTRYPOINT
andCMD
.If you don't specify this field and don't specify the
command
field, then the container'sENTRYPOINT
andCMD
determine what runs based on their default behavior. See the Docker documentation about howCMD
andENTRYPOINT
interact.In this field, you can reference environment variables set by Vertex AI and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax:
$(VARIABLE_NAME)
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with$$
; for example:$$(VARIABLE_NAME)
This field corresponds to theargs
field of the Kubernetes Containers v1 core API.
#command
def command() -> ::Array<::String>
-
(::Array<::String>) — Immutable. Specifies the command that runs when the container starts. This
overrides the container's
ENTRYPOINT.
Specify this field as an array of executable and arguments, similar to a
Docker
ENTRYPOINT
's "exec" form, not its "shell" form.If you do not specify this field, then the container's
ENTRYPOINT
runs, in conjunction with the args field or the container'sCMD
, if either exists. If this field is not specified and the container does not have anENTRYPOINT
, then refer to the Docker documentation about howCMD
andENTRYPOINT
interact.If you specify this field, then you can also specify the
args
field to provide additional arguments for this command. However, if you specify this field, then the container'sCMD
is ignored. See the Kubernetes documentation about how thecommand
andargs
fields interact with a container'sENTRYPOINT
andCMD
.In this field, you can reference environment variables set by Vertex AI and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax:
$(VARIABLE_NAME)
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with$$
; for example:$$(VARIABLE_NAME)
This field corresponds to thecommand
field of the Kubernetes Containers v1 core API.
#command=
def command=(value) -> ::Array<::String>
-
value (::Array<::String>) — Immutable. Specifies the command that runs when the container starts. This
overrides the container's
ENTRYPOINT.
Specify this field as an array of executable and arguments, similar to a
Docker
ENTRYPOINT
's "exec" form, not its "shell" form.If you do not specify this field, then the container's
ENTRYPOINT
runs, in conjunction with the args field or the container'sCMD
, if either exists. If this field is not specified and the container does not have anENTRYPOINT
, then refer to the Docker documentation about howCMD
andENTRYPOINT
interact.If you specify this field, then you can also specify the
args
field to provide additional arguments for this command. However, if you specify this field, then the container'sCMD
is ignored. See the Kubernetes documentation about how thecommand
andargs
fields interact with a container'sENTRYPOINT
andCMD
.In this field, you can reference environment variables set by Vertex AI and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax:
$(VARIABLE_NAME)
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with$$
; for example:$$(VARIABLE_NAME)
This field corresponds to thecommand
field of the Kubernetes Containers v1 core API.
-
(::Array<::String>) — Immutable. Specifies the command that runs when the container starts. This
overrides the container's
ENTRYPOINT.
Specify this field as an array of executable and arguments, similar to a
Docker
ENTRYPOINT
's "exec" form, not its "shell" form.If you do not specify this field, then the container's
ENTRYPOINT
runs, in conjunction with the args field or the container'sCMD
, if either exists. If this field is not specified and the container does not have anENTRYPOINT
, then refer to the Docker documentation about howCMD
andENTRYPOINT
interact.If you specify this field, then you can also specify the
args
field to provide additional arguments for this command. However, if you specify this field, then the container'sCMD
is ignored. See the Kubernetes documentation about how thecommand
andargs
fields interact with a container'sENTRYPOINT
andCMD
.In this field, you can reference environment variables set by Vertex AI and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax:
$(VARIABLE_NAME)
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with$$
; for example:$$(VARIABLE_NAME)
This field corresponds to thecommand
field of the Kubernetes Containers v1 core API.
#deployment_timeout
def deployment_timeout() -> ::Google::Protobuf::Duration
- (::Google::Protobuf::Duration) — Immutable. Deployment timeout. Limit for deployment timeout is 2 hours.
#deployment_timeout=
def deployment_timeout=(value) -> ::Google::Protobuf::Duration
- value (::Google::Protobuf::Duration) — Immutable. Deployment timeout. Limit for deployment timeout is 2 hours.
- (::Google::Protobuf::Duration) — Immutable. Deployment timeout. Limit for deployment timeout is 2 hours.
#env
def env() -> ::Array<::Google::Cloud::AIPlatform::V1::EnvVar>
-
(::Array<::Google::Cloud::AIPlatform::V1::EnvVar>) — Immutable. List of environment variables to set in the container. After the
container starts running, code running in the container can read these
environment variables.
Additionally, the command and args fields can reference these variables. Later entries in this list can also reference earlier entries. For example, the following example sets the variable
VAR_2
to have the valuefoo bar
:json [ { "name": "VAR_1", "value": "foo" }, { "name": "VAR_2", "value": "$(VAR_1) bar" } ]
If you switch the order of the variables in the example, then the expansion does not occur.
This field corresponds to the
env
field of the Kubernetes Containers v1 core API.
#env=
def env=(value) -> ::Array<::Google::Cloud::AIPlatform::V1::EnvVar>
-
value (::Array<::Google::Cloud::AIPlatform::V1::EnvVar>) — Immutable. List of environment variables to set in the container. After the
container starts running, code running in the container can read these
environment variables.
Additionally, the command and args fields can reference these variables. Later entries in this list can also reference earlier entries. For example, the following example sets the variable
VAR_2
to have the valuefoo bar
:json [ { "name": "VAR_1", "value": "foo" }, { "name": "VAR_2", "value": "$(VAR_1) bar" } ]
If you switch the order of the variables in the example, then the expansion does not occur.
This field corresponds to the
env
field of the Kubernetes Containers v1 core API.
-
(::Array<::Google::Cloud::AIPlatform::V1::EnvVar>) — Immutable. List of environment variables to set in the container. After the
container starts running, code running in the container can read these
environment variables.
Additionally, the command and args fields can reference these variables. Later entries in this list can also reference earlier entries. For example, the following example sets the variable
VAR_2
to have the valuefoo bar
:json [ { "name": "VAR_1", "value": "foo" }, { "name": "VAR_2", "value": "$(VAR_1) bar" } ]
If you switch the order of the variables in the example, then the expansion does not occur.
This field corresponds to the
env
field of the Kubernetes Containers v1 core API.
#grpc_ports
def grpc_ports() -> ::Array<::Google::Cloud::AIPlatform::V1::Port>
-
(::Array<::Google::Cloud::AIPlatform::V1::Port>) — Immutable. List of ports to expose from the container. Vertex AI sends gRPC
prediction requests that it receives to the first port on this list. Vertex
AI also sends liveness and health checks to this port.
If you do not specify this field, gRPC requests to the container will be disabled.
Vertex AI does not use ports other than the first one listed. This field corresponds to the
ports
field of the Kubernetes Containers v1 core API.
#grpc_ports=
def grpc_ports=(value) -> ::Array<::Google::Cloud::AIPlatform::V1::Port>
-
value (::Array<::Google::Cloud::AIPlatform::V1::Port>) — Immutable. List of ports to expose from the container. Vertex AI sends gRPC
prediction requests that it receives to the first port on this list. Vertex
AI also sends liveness and health checks to this port.
If you do not specify this field, gRPC requests to the container will be disabled.
Vertex AI does not use ports other than the first one listed. This field corresponds to the
ports
field of the Kubernetes Containers v1 core API.
-
(::Array<::Google::Cloud::AIPlatform::V1::Port>) — Immutable. List of ports to expose from the container. Vertex AI sends gRPC
prediction requests that it receives to the first port on this list. Vertex
AI also sends liveness and health checks to this port.
If you do not specify this field, gRPC requests to the container will be disabled.
Vertex AI does not use ports other than the first one listed. This field corresponds to the
ports
field of the Kubernetes Containers v1 core API.
#health_probe
def health_probe() -> ::Google::Cloud::AIPlatform::V1::Probe
- (::Google::Cloud::AIPlatform::V1::Probe) — Immutable. Specification for Kubernetes readiness probe.
#health_probe=
def health_probe=(value) -> ::Google::Cloud::AIPlatform::V1::Probe
- value (::Google::Cloud::AIPlatform::V1::Probe) — Immutable. Specification for Kubernetes readiness probe.
- (::Google::Cloud::AIPlatform::V1::Probe) — Immutable. Specification for Kubernetes readiness probe.
#health_route
def health_route() -> ::String
-
(::String) —
Immutable. HTTP path on the container to send health checks to. Vertex AI intermittently sends GET requests to this path on the container's IP address and port to check that the container is healthy. Read more about health checks.
For example, if you set this field to
/bar
, then Vertex AI intermittently sends a GET request to the/bar
path on the port of your container specified by the first value of thisModelContainerSpec
's ports field.If you don't specify this field, it defaults to the following value when you [deploy this Model to an Endpoint][google.cloud.aiplatform.v1.EndpointService.DeployModel]:
/v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict
The placeholders in this value are replaced as follows:ENDPOINT: The last segment (following
endpoints/
)of the Endpoint.name][] field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as theAIP_ENDPOINT_ID
environment variable.)DEPLOYED_MODEL: DeployedModel.id of the
DeployedModel
. (Vertex AI makes this value available to your container code as theAIP_DEPLOYED_MODEL_ID
environment variable.)
#health_route=
def health_route=(value) -> ::String
-
value (::String) —
Immutable. HTTP path on the container to send health checks to. Vertex AI intermittently sends GET requests to this path on the container's IP address and port to check that the container is healthy. Read more about health checks.
For example, if you set this field to
/bar
, then Vertex AI intermittently sends a GET request to the/bar
path on the port of your container specified by the first value of thisModelContainerSpec
's ports field.If you don't specify this field, it defaults to the following value when you [deploy this Model to an Endpoint][google.cloud.aiplatform.v1.EndpointService.DeployModel]:
/v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict
The placeholders in this value are replaced as follows:ENDPOINT: The last segment (following
endpoints/
)of the Endpoint.name][] field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as theAIP_ENDPOINT_ID
environment variable.)DEPLOYED_MODEL: DeployedModel.id of the
DeployedModel
. (Vertex AI makes this value available to your container code as theAIP_DEPLOYED_MODEL_ID
environment variable.)
-
(::String) —
Immutable. HTTP path on the container to send health checks to. Vertex AI intermittently sends GET requests to this path on the container's IP address and port to check that the container is healthy. Read more about health checks.
For example, if you set this field to
/bar
, then Vertex AI intermittently sends a GET request to the/bar
path on the port of your container specified by the first value of thisModelContainerSpec
's ports field.If you don't specify this field, it defaults to the following value when you [deploy this Model to an Endpoint][google.cloud.aiplatform.v1.EndpointService.DeployModel]:
/v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict
The placeholders in this value are replaced as follows:ENDPOINT: The last segment (following
endpoints/
)of the Endpoint.name][] field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as theAIP_ENDPOINT_ID
environment variable.)DEPLOYED_MODEL: DeployedModel.id of the
DeployedModel
. (Vertex AI makes this value available to your container code as theAIP_DEPLOYED_MODEL_ID
environment variable.)
#image_uri
def image_uri() -> ::String
-
(::String) — Required. Immutable. URI of the Docker image to be used as the custom
container for serving predictions. This URI must identify an image in
Artifact Registry or Container Registry. Learn more about the container
publishing
requirements,
including permissions requirements for the Vertex AI Service Agent.
The container image is ingested upon ModelService.UploadModel, stored internally, and this original path is afterwards not used.
To learn about the requirements for the Docker image itself, see Custom container requirements.
You can use the URI to one of Vertex AI's pre-built container images for prediction in this field.
#image_uri=
def image_uri=(value) -> ::String
-
value (::String) — Required. Immutable. URI of the Docker image to be used as the custom
container for serving predictions. This URI must identify an image in
Artifact Registry or Container Registry. Learn more about the container
publishing
requirements,
including permissions requirements for the Vertex AI Service Agent.
The container image is ingested upon ModelService.UploadModel, stored internally, and this original path is afterwards not used.
To learn about the requirements for the Docker image itself, see Custom container requirements.
You can use the URI to one of Vertex AI's pre-built container images for prediction in this field.
-
(::String) — Required. Immutable. URI of the Docker image to be used as the custom
container for serving predictions. This URI must identify an image in
Artifact Registry or Container Registry. Learn more about the container
publishing
requirements,
including permissions requirements for the Vertex AI Service Agent.
The container image is ingested upon ModelService.UploadModel, stored internally, and this original path is afterwards not used.
To learn about the requirements for the Docker image itself, see Custom container requirements.
You can use the URI to one of Vertex AI's pre-built container images for prediction in this field.
#ports
def ports() -> ::Array<::Google::Cloud::AIPlatform::V1::Port>
-
(::Array<::Google::Cloud::AIPlatform::V1::Port>) — Immutable. List of ports to expose from the container. Vertex AI sends any
prediction requests that it receives to the first port on this list. Vertex
AI also sends
liveness and health
checks
to this port.
If you do not specify this field, it defaults to following value:
json [ { "containerPort": 8080 } ]
Vertex AI does not use ports other than the first one listed. This field corresponds to the
ports
field of the Kubernetes Containers v1 core API.
#ports=
def ports=(value) -> ::Array<::Google::Cloud::AIPlatform::V1::Port>
-
value (::Array<::Google::Cloud::AIPlatform::V1::Port>) — Immutable. List of ports to expose from the container. Vertex AI sends any
prediction requests that it receives to the first port on this list. Vertex
AI also sends
liveness and health
checks
to this port.
If you do not specify this field, it defaults to following value:
json [ { "containerPort": 8080 } ]
Vertex AI does not use ports other than the first one listed. This field corresponds to the
ports
field of the Kubernetes Containers v1 core API.
-
(::Array<::Google::Cloud::AIPlatform::V1::Port>) — Immutable. List of ports to expose from the container. Vertex AI sends any
prediction requests that it receives to the first port on this list. Vertex
AI also sends
liveness and health
checks
to this port.
If you do not specify this field, it defaults to following value:
json [ { "containerPort": 8080 } ]
Vertex AI does not use ports other than the first one listed. This field corresponds to the
ports
field of the Kubernetes Containers v1 core API.
#predict_route
def predict_route() -> ::String
-
(::String) —
Immutable. HTTP path on the container to send prediction requests to. Vertex AI forwards requests sent using projects.locations.endpoints.predict to this path on the container's IP address and port. Vertex AI then returns the container's response in the API response.
For example, if you set this field to
/foo
, then when Vertex AI receives a prediction request, it forwards the request body in a POST request to the/foo
path on the port of your container specified by the first value of thisModelContainerSpec
's ports field.If you don't specify this field, it defaults to the following value when you [deploy this Model to an Endpoint][google.cloud.aiplatform.v1.EndpointService.DeployModel]:
/v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict
The placeholders in this value are replaced as follows:ENDPOINT: The last segment (following
endpoints/
)of the Endpoint.name][] field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as theAIP_ENDPOINT_ID
environment variable.)DEPLOYED_MODEL: DeployedModel.id of the
DeployedModel
. (Vertex AI makes this value available to your container code as theAIP_DEPLOYED_MODEL_ID
environment variable.)
#predict_route=
def predict_route=(value) -> ::String
-
value (::String) —
Immutable. HTTP path on the container to send prediction requests to. Vertex AI forwards requests sent using projects.locations.endpoints.predict to this path on the container's IP address and port. Vertex AI then returns the container's response in the API response.
For example, if you set this field to
/foo
, then when Vertex AI receives a prediction request, it forwards the request body in a POST request to the/foo
path on the port of your container specified by the first value of thisModelContainerSpec
's ports field.If you don't specify this field, it defaults to the following value when you [deploy this Model to an Endpoint][google.cloud.aiplatform.v1.EndpointService.DeployModel]:
/v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict
The placeholders in this value are replaced as follows:ENDPOINT: The last segment (following
endpoints/
)of the Endpoint.name][] field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as theAIP_ENDPOINT_ID
environment variable.)DEPLOYED_MODEL: DeployedModel.id of the
DeployedModel
. (Vertex AI makes this value available to your container code as theAIP_DEPLOYED_MODEL_ID
environment variable.)
-
(::String) —
Immutable. HTTP path on the container to send prediction requests to. Vertex AI forwards requests sent using projects.locations.endpoints.predict to this path on the container's IP address and port. Vertex AI then returns the container's response in the API response.
For example, if you set this field to
/foo
, then when Vertex AI receives a prediction request, it forwards the request body in a POST request to the/foo
path on the port of your container specified by the first value of thisModelContainerSpec
's ports field.If you don't specify this field, it defaults to the following value when you [deploy this Model to an Endpoint][google.cloud.aiplatform.v1.EndpointService.DeployModel]:
/v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict
The placeholders in this value are replaced as follows:ENDPOINT: The last segment (following
endpoints/
)of the Endpoint.name][] field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as theAIP_ENDPOINT_ID
environment variable.)DEPLOYED_MODEL: DeployedModel.id of the
DeployedModel
. (Vertex AI makes this value available to your container code as theAIP_DEPLOYED_MODEL_ID
environment variable.)
#shared_memory_size_mb
def shared_memory_size_mb() -> ::Integer
- (::Integer) — Immutable. The amount of the VM memory to reserve as the shared memory for the model in megabytes.
#shared_memory_size_mb=
def shared_memory_size_mb=(value) -> ::Integer
- value (::Integer) — Immutable. The amount of the VM memory to reserve as the shared memory for the model in megabytes.
- (::Integer) — Immutable. The amount of the VM memory to reserve as the shared memory for the model in megabytes.
#startup_probe
def startup_probe() -> ::Google::Cloud::AIPlatform::V1::Probe
- (::Google::Cloud::AIPlatform::V1::Probe) — Immutable. Specification for Kubernetes startup probe.
#startup_probe=
def startup_probe=(value) -> ::Google::Cloud::AIPlatform::V1::Probe
- value (::Google::Cloud::AIPlatform::V1::Probe) — Immutable. Specification for Kubernetes startup probe.
- (::Google::Cloud::AIPlatform::V1::Probe) — Immutable. Specification for Kubernetes startup probe.