Reference documentation and code samples for the Google Cloud Dataproc V1 Client class ClusterConfig.
The cluster config.
Generated from protobuf message google.cloud.dataproc.v1.ClusterConfig
Methods
__construct
Constructor.
Parameters | |
---|---|
Name | Description |
data |
array
Optional. Data for populating the Message object. |
↳ config_bucket |
string
Optional. A Cloud Storage bucket used to stage job dependencies, config files, and job driver console output. If you do not specify a staging bucket, Cloud Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's staging bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket (see Dataproc staging and temp buckets). This field requires a Cloud Storage bucket name, not a |
↳ temp_bucket |
string
Optional. A Cloud Storage bucket used to store ephemeral cluster and jobs data, such as Spark and MapReduce history files. If you do not specify a temp bucket, Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's temp bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket. The default bucket has a TTL of 90 days, but you can use any TTL (or none) if you specify a bucket (see Dataproc staging and temp buckets). This field requires a Cloud Storage bucket name, not a |
↳ gce_cluster_config |
Google\Cloud\Dataproc\V1\GceClusterConfig
Optional. The shared Compute Engine config settings for all instances in a cluster. |
↳ master_config |
Google\Cloud\Dataproc\V1\InstanceGroupConfig
Optional. The Compute Engine config settings for the cluster's master instance. |
↳ worker_config |
Google\Cloud\Dataproc\V1\InstanceGroupConfig
Optional. The Compute Engine config settings for the cluster's worker instances. |
↳ secondary_worker_config |
Google\Cloud\Dataproc\V1\InstanceGroupConfig
Optional. The Compute Engine config settings for a cluster's secondary worker instances |
↳ software_config |
Google\Cloud\Dataproc\V1\SoftwareConfig
Optional. The config settings for cluster software. |
↳ initialization_actions |
array<Google\Cloud\Dataproc\V1\NodeInitializationAction>
Optional. Commands to execute on each node after config is completed. By default, executables are run on master and all worker nodes. You can test a node's |
↳ encryption_config |
Google\Cloud\Dataproc\V1\EncryptionConfig
Optional. Encryption settings for the cluster. |
↳ autoscaling_config |
Google\Cloud\Dataproc\V1\AutoscalingConfig
Optional. Autoscaling config for the policy associated with the cluster. Cluster does not autoscale if this field is unset. |
↳ security_config |
Google\Cloud\Dataproc\V1\SecurityConfig
Optional. Security settings for the cluster. |
↳ lifecycle_config |
Google\Cloud\Dataproc\V1\LifecycleConfig
Optional. Lifecycle setting for the cluster. |
↳ endpoint_config |
Google\Cloud\Dataproc\V1\EndpointConfig
Optional. Port/endpoint configuration for this cluster |
↳ metastore_config |
Google\Cloud\Dataproc\V1\MetastoreConfig
Optional. Metastore configuration. |
↳ dataproc_metric_config |
Google\Cloud\Dataproc\V1\DataprocMetricConfig
Optional. The config for Dataproc metrics. |
↳ auxiliary_node_groups |
array<Google\Cloud\Dataproc\V1\AuxiliaryNodeGroup>
Optional. The node group settings. |
getConfigBucket
Optional. A Cloud Storage bucket used to stage job dependencies, config files, and job driver console output.
If you do not specify a staging bucket, Cloud
Dataproc will determine a Cloud Storage location (US,
ASIA, or EU) for your cluster's staging bucket according to the
Compute Engine zone where your cluster is deployed, and then create
and manage this project-level, per-location bucket (see
Dataproc staging and temp
buckets).
This field requires a Cloud Storage bucket name, not a gs://...
URI to
a Cloud Storage bucket.
Returns | |
---|---|
Type | Description |
string |
setConfigBucket
Optional. A Cloud Storage bucket used to stage job dependencies, config files, and job driver console output.
If you do not specify a staging bucket, Cloud
Dataproc will determine a Cloud Storage location (US,
ASIA, or EU) for your cluster's staging bucket according to the
Compute Engine zone where your cluster is deployed, and then create
and manage this project-level, per-location bucket (see
Dataproc staging and temp
buckets).
This field requires a Cloud Storage bucket name, not a gs://...
URI to
a Cloud Storage bucket.
Parameter | |
---|---|
Name | Description |
var |
string
|
Returns | |
---|---|
Type | Description |
$this |
getTempBucket
Optional. A Cloud Storage bucket used to store ephemeral cluster and jobs data, such as Spark and MapReduce history files. If you do not specify a temp bucket, Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's temp bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket. The default bucket has a TTL of 90 days, but you can use any TTL (or none) if you specify a bucket (see Dataproc staging and temp buckets).
This field requires a Cloud Storage bucket name, not a gs://...
URI to
a Cloud Storage bucket.
Returns | |
---|---|
Type | Description |
string |
setTempBucket
Optional. A Cloud Storage bucket used to store ephemeral cluster and jobs data, such as Spark and MapReduce history files. If you do not specify a temp bucket, Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's temp bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket. The default bucket has a TTL of 90 days, but you can use any TTL (or none) if you specify a bucket (see Dataproc staging and temp buckets).
This field requires a Cloud Storage bucket name, not a gs://...
URI to
a Cloud Storage bucket.
Parameter | |
---|---|
Name | Description |
var |
string
|
Returns | |
---|---|
Type | Description |
$this |
getGceClusterConfig
Optional. The shared Compute Engine config settings for all instances in a cluster.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\GceClusterConfig|null |
hasGceClusterConfig
clearGceClusterConfig
setGceClusterConfig
Optional. The shared Compute Engine config settings for all instances in a cluster.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\GceClusterConfig
|
Returns | |
---|---|
Type | Description |
$this |
getMasterConfig
Optional. The Compute Engine config settings for the cluster's master instance.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\InstanceGroupConfig|null |
hasMasterConfig
clearMasterConfig
setMasterConfig
Optional. The Compute Engine config settings for the cluster's master instance.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\InstanceGroupConfig
|
Returns | |
---|---|
Type | Description |
$this |
getWorkerConfig
Optional. The Compute Engine config settings for the cluster's worker instances.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\InstanceGroupConfig|null |
hasWorkerConfig
clearWorkerConfig
setWorkerConfig
Optional. The Compute Engine config settings for the cluster's worker instances.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\InstanceGroupConfig
|
Returns | |
---|---|
Type | Description |
$this |
getSecondaryWorkerConfig
Optional. The Compute Engine config settings for a cluster's secondary worker instances
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\InstanceGroupConfig|null |
hasSecondaryWorkerConfig
clearSecondaryWorkerConfig
setSecondaryWorkerConfig
Optional. The Compute Engine config settings for a cluster's secondary worker instances
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\InstanceGroupConfig
|
Returns | |
---|---|
Type | Description |
$this |
getSoftwareConfig
Optional. The config settings for cluster software.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\SoftwareConfig|null |
hasSoftwareConfig
clearSoftwareConfig
setSoftwareConfig
Optional. The config settings for cluster software.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\SoftwareConfig
|
Returns | |
---|---|
Type | Description |
$this |
getInitializationActions
Optional. Commands to execute on each node after config is completed. By default, executables are run on master and all worker nodes.
You can test a node's role
metadata to run an executable on
a master or worker node, as shown below using curl
(you can also use
wget
):
ROLE=$(curl -H Metadata-Flavor:Google
http://metadata/computeMetadata/v1/instance/attributes/dataproc-role)
if [[ "${ROLE}" == 'Master' ]]; then
... master specific actions ...
else
... worker specific actions ...
fi
Returns | |
---|---|
Type | Description |
Google\Protobuf\Internal\RepeatedField |
setInitializationActions
Optional. Commands to execute on each node after config is completed. By default, executables are run on master and all worker nodes.
You can test a node's role
metadata to run an executable on
a master or worker node, as shown below using curl
(you can also use
wget
):
ROLE=$(curl -H Metadata-Flavor:Google
http://metadata/computeMetadata/v1/instance/attributes/dataproc-role)
if [[ "${ROLE}" == 'Master' ]]; then
... master specific actions ...
else
... worker specific actions ...
fi
Parameter | |
---|---|
Name | Description |
var |
array<Google\Cloud\Dataproc\V1\NodeInitializationAction>
|
Returns | |
---|---|
Type | Description |
$this |
getEncryptionConfig
Optional. Encryption settings for the cluster.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\EncryptionConfig|null |
hasEncryptionConfig
clearEncryptionConfig
setEncryptionConfig
Optional. Encryption settings for the cluster.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\EncryptionConfig
|
Returns | |
---|---|
Type | Description |
$this |
getAutoscalingConfig
Optional. Autoscaling config for the policy associated with the cluster.
Cluster does not autoscale if this field is unset.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\AutoscalingConfig|null |
hasAutoscalingConfig
clearAutoscalingConfig
setAutoscalingConfig
Optional. Autoscaling config for the policy associated with the cluster.
Cluster does not autoscale if this field is unset.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\AutoscalingConfig
|
Returns | |
---|---|
Type | Description |
$this |
getSecurityConfig
Optional. Security settings for the cluster.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\SecurityConfig|null |
hasSecurityConfig
clearSecurityConfig
setSecurityConfig
Optional. Security settings for the cluster.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\SecurityConfig
|
Returns | |
---|---|
Type | Description |
$this |
getLifecycleConfig
Optional. Lifecycle setting for the cluster.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\LifecycleConfig|null |
hasLifecycleConfig
clearLifecycleConfig
setLifecycleConfig
Optional. Lifecycle setting for the cluster.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\LifecycleConfig
|
Returns | |
---|---|
Type | Description |
$this |
getEndpointConfig
Optional. Port/endpoint configuration for this cluster
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\EndpointConfig|null |
hasEndpointConfig
clearEndpointConfig
setEndpointConfig
Optional. Port/endpoint configuration for this cluster
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\EndpointConfig
|
Returns | |
---|---|
Type | Description |
$this |
getMetastoreConfig
Optional. Metastore configuration.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\MetastoreConfig|null |
hasMetastoreConfig
clearMetastoreConfig
setMetastoreConfig
Optional. Metastore configuration.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\MetastoreConfig
|
Returns | |
---|---|
Type | Description |
$this |
getDataprocMetricConfig
Optional. The config for Dataproc metrics.
Returns | |
---|---|
Type | Description |
Google\Cloud\Dataproc\V1\DataprocMetricConfig|null |
hasDataprocMetricConfig
clearDataprocMetricConfig
setDataprocMetricConfig
Optional. The config for Dataproc metrics.
Parameter | |
---|---|
Name | Description |
var |
Google\Cloud\Dataproc\V1\DataprocMetricConfig
|
Returns | |
---|---|
Type | Description |
$this |
getAuxiliaryNodeGroups
Optional. The node group settings.
Returns | |
---|---|
Type | Description |
Google\Protobuf\Internal\RepeatedField |
setAuxiliaryNodeGroups
Optional. The node group settings.
Parameter | |
---|---|
Name | Description |
var |
array<Google\Cloud\Dataproc\V1\AuxiliaryNodeGroup>
|
Returns | |
---|---|
Type | Description |
$this |