ClusterConfig(mapping=None, *, ignore_unknown_fields=False, **kwargs)
The cluster config.
Attributes | |
---|---|
Name | Description |
config_bucket |
str
Optional. A Cloud Storage bucket used to stage job dependencies, config files, and job driver console output. If you do not specify a staging bucket, Cloud Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's staging bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket (see `Dataproc staging bucket |
gce_cluster_config |
Optional. The shared Compute Engine config settings for all instances in a cluster. |
master_config |
Optional. The Compute Engine config settings for the master instance in a cluster. |
worker_config |
Optional. The Compute Engine config settings for worker instances in a cluster. |
secondary_worker_config |
Optional. The Compute Engine config settings for additional worker instances in a cluster. |
software_config |
Optional. The config settings for software inside the cluster. |
lifecycle_config |
Optional. The config setting for auto delete cluster schedule. |
initialization_actions |
Sequence[
Optional. Commands to execute on each node after config is completed. By default, executables are run on master and all worker nodes. You can test a node's role metadata to run an executable on a master or worker node, as shown below using curl (you can also use wget ):
::
ROLE=$(curl -H Metadata-Flavor:Google
http://metadata/computeMetadata/v1beta2/instance/attributes/dataproc-role)
if [[ "${ROLE}" == 'Master' ]]; then
... master specific actions ...
else
... worker specific actions ...
fi
|
encryption_config |
Optional. Encryption settings for the cluster. |
autoscaling_config |
Optional. Autoscaling config for the policy associated with the cluster. Cluster does not autoscale if this field is unset. |
endpoint_config |
Optional. Port/endpoint configuration for this cluster |
security_config |
Optional. Security related configuration. |
gke_cluster_config |
Optional. The Kubernetes Engine config for Dataproc clusters deployed to Kubernetes. Setting this is considered mutually exclusive with Compute Engine-based options such as gce_cluster_config , master_config ,
worker_config , secondary_worker_config , and
autoscaling_config .
|