Updates an existing deployed model. Updatable fields include
min_replica_count
, max_replica_count
, autoscaling_metric_specs
,
disable_container_logging
(v1 only), and enable_container_logging
(v1beta1 only).
This method waits—the workflow execution is paused—until the operation is
complete, fails, or times out. The default timeout value is 1800
seconds (30
minutes) and can be changed to a maximum value of 31536000
seconds (one year)
for long-running operations using the connector_params
field. See the
Connectors reference.
The connector uses polling to monitor the long-running operation, which might generate additional billable steps. For more information about retries and long-running operations, refer to Understand connectors.
The polling policy for the long-running operation can be configured. To set the
connector-specific parameters (connector_params
), refer to
Invoke a connector call.
Arguments
Parameters | |
---|---|
endpoint |
Required. The name of the Endpoint resource into which to mutate a DeployedModel. Format: |
region |
Required. Region of the HTTP endpoint. For example, if region is set to |
body |
Required. |
Raised exceptions
Exceptions | |
---|---|
ConnectionError |
In case of a network problem (such as DNS failure or refused connection). |
HttpError |
If the response status is >= 400 (excluding 429 and 503). |
TimeoutError |
If a long-running operation takes longer to finish than the specified timeout limit. |
TypeError |
If an operation or function receives an argument of the wrong type. |
ValueError |
If an operation or function receives an argument of the right type but an inappropriate value. For example, a negative timeout. |
OperationError |
If the long-running operation finished unsuccessfully. |
ResponseTypeError |
If the long-running operation returned a response of the wrong type. |
Response
If successful, the response contains an instance of GoogleLongrunningOperation
.
Subworkflow snippet
Some fields might be optional or required. To identify required fields, refer to the API documentation.
YAML
- mutateDeployedModel: call: googleapis.aiplatform.v1beta1.projects.locations.endpoints.mutateDeployedModel args: endpoint: ... region: ... body: deployedModel: automaticResources: maxReplicaCount: ... minReplicaCount: ... dedicatedResources: autoscalingMetricSpecs: ... machineSpec: acceleratorCount: ... acceleratorType: ... machineType: ... tpuTopology: ... maxReplicaCount: ... minReplicaCount: ... disableExplanations: ... displayName: ... enableAccessLogging: ... enableContainerLogging: ... explanationSpec: metadata: featureAttributionsSchemaUri: ... inputs: ... latentSpaceSource: ... outputs: ... parameters: examples: exampleGcsSource: dataFormat: ... gcsSource: uris: ... gcsSource: ... nearestNeighborSearchConfig: ... neighborCount: ... presets: modality: ... query: ... integratedGradientsAttribution: blurBaselineConfig: maxBlurSigma: ... smoothGradConfig: featureNoiseSigma: noiseSigma: ... noiseSigma: ... noisySampleCount: ... stepCount: ... outputIndices: ... sampledShapleyAttribution: pathCount: ... topK: ... xraiAttribution: blurBaselineConfig: ... smoothGradConfig: ... stepCount: ... id: ... model: ... serviceAccount: ... sharedResources: ... updateMask: ... result: mutateDeployedModelResult
JSON
[ { "mutateDeployedModel": { "call": "googleapis.aiplatform.v1beta1.projects.locations.endpoints.mutateDeployedModel", "args": { "endpoint": "...", "region": "...", "body": { "deployedModel": { "automaticResources": { "maxReplicaCount": "...", "minReplicaCount": "..." }, "dedicatedResources": { "autoscalingMetricSpecs": "...", "machineSpec": { "acceleratorCount": "...", "acceleratorType": "...", "machineType": "...", "tpuTopology": "..." }, "maxReplicaCount": "...", "minReplicaCount": "..." }, "disableExplanations": "...", "displayName": "...", "enableAccessLogging": "...", "enableContainerLogging": "...", "explanationSpec": { "metadata": { "featureAttributionsSchemaUri": "...", "inputs": "...", "latentSpaceSource": "...", "outputs": "..." }, "parameters": { "examples": { "exampleGcsSource": { "dataFormat": "...", "gcsSource": { "uris": "..." } }, "gcsSource": "...", "nearestNeighborSearchConfig": "...", "neighborCount": "...", "presets": { "modality": "...", "query": "..." } }, "integratedGradientsAttribution": { "blurBaselineConfig": { "maxBlurSigma": "..." }, "smoothGradConfig": { "featureNoiseSigma": { "noiseSigma": "..." }, "noiseSigma": "...", "noisySampleCount": "..." }, "stepCount": "..." }, "outputIndices": "...", "sampledShapleyAttribution": { "pathCount": "..." }, "topK": "...", "xraiAttribution": { "blurBaselineConfig": "...", "smoothGradConfig": "...", "stepCount": "..." } } }, "id": "...", "model": "...", "serviceAccount": "...", "sharedResources": "..." }, "updateMask": "..." } }, "result": "mutateDeployedModelResult" } } ]