Class SparkBatch (5.12.0)

SparkBatch(mapping=None, *, ignore_unknown_fields=False, **kwargs)

A configuration for running an `Apache Spark <https://spark.apache.org/>`__ batch workload.

This message has oneof_ fields (mutually exclusive fields). For each oneof, at most one member field can be set at the same time. Setting any member of the oneof automatically clears all other members.

.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields
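The clearing behavior described above can be illustrated without the library. The following is a minimal plain-Python sketch, not the proto-plus implementation: ``DriverOneof`` and its ``which`` method are hypothetical names introduced for illustration, and returning an empty string for an unset member is an assumption modeled on proto3 string defaults.

```python
class DriverOneof:
    """Hypothetical stand-in that mimics oneof semantics for the `driver` group."""

    # The two member fields of the `driver` oneof listed on this page.
    _MEMBERS = ("main_jar_file_uri", "main_class")

    def __init__(self):
        # Bypass our own __setattr__ to create the backing store.
        object.__setattr__(self, "_values", {})

    def __setattr__(self, name, value):
        if name not in self._MEMBERS:
            raise AttributeError(f"unknown field: {name}")
        # Setting one member clears every other member of the oneof.
        self._values.clear()
        self._values[name] = value

    def __getattr__(self, name):
        # Reached only when normal attribute lookup fails.
        if name in DriverOneof._MEMBERS:
            # Assumption: an unset string member reads as an empty string.
            return self._values.get(name, "")
        raise AttributeError(name)

    def which(self):
        """Return the name of the member currently set, or None."""
        return next(iter(self._values), None)


driver = DriverOneof()
driver.main_class = "org.example.Main"
driver.main_jar_file_uri = "gs://example-bucket/app.jar"  # clears main_class
print(driver.which())
```

The sketch keeps at most one entry in its backing dictionary, which is the simplest way to guarantee that two members are never set together.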

Attributes

main_jar_file_uri : str
    Optional. The HCFS URI of the jar file that contains the main class. This field is a member of oneof_ ``driver``.

main_class : str
    Optional. The name of the driver main class. The jar file that contains the class must be in the classpath or specified in ``jar_file_uris``. This field is a member of oneof_ ``driver``.

args : MutableSequence[str]
    Optional. The arguments to pass to the driver. Do not include arguments that can be set as batch properties, such as ``--conf``, since a collision can occur that causes an incorrect batch submission.

jar_file_uris : MutableSequence[str]
    Optional. HCFS URIs of jar files to add to the classpath of the Spark driver and tasks.

file_uris : MutableSequence[str]
    Optional. HCFS URIs of files to be placed in the working directory of each executor.

archive_uris : MutableSequence[str]
    Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: ``.jar``, ``.tar``, ``.tar.gz``, ``.tgz``, and ``.zip``.
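
The attribute constraints above can be checked before a message is built. The following is a minimal sketch that assembles a plain dictionary using the field names from this page; ``build_spark_batch_mapping`` and ``SUPPORTED_ARCHIVE_SUFFIXES`` are hypothetical names, the bucket paths are placeholders, and the ``--conf`` check covers only that one flag as an example of a property-style argument. It assumes the resulting dictionary would be passed as the ``mapping`` argument shown in the constructor signature; the library itself is not imported here.

```python
# File types listed for archive_uris on this page.
SUPPORTED_ARCHIVE_SUFFIXES = (".jar", ".tar", ".tar.gz", ".tgz", ".zip")


def build_spark_batch_mapping(*, main_class=None, main_jar_file_uri=None,
                              args=(), jar_file_uris=(), file_uris=(),
                              archive_uris=()):
    """Hypothetical helper: validate inputs and return a dict of SparkBatch fields."""
    # main_class and main_jar_file_uri belong to the same oneof (driver),
    # so reject a request that tries to set both.
    if main_class and main_jar_file_uri:
        raise ValueError("set main_class or main_jar_file_uri, not both")

    # Flag driver arguments that look like batch properties (only --conf here).
    bad_args = [a for a in args if a == "--conf" or a.startswith("--conf=")]
    if bad_args:
        raise ValueError(f"pass these as batch properties instead: {bad_args}")

    # Reject archives whose suffix is not in the supported list.
    bad_archives = [u for u in archive_uris
                    if not u.endswith(SUPPORTED_ARCHIVE_SUFFIXES)]
    if bad_archives:
        raise ValueError(f"unsupported archive type: {bad_archives}")

    mapping = {
        "args": list(args),
        "jar_file_uris": list(jar_file_uris),
        "file_uris": list(file_uris),
        "archive_uris": list(archive_uris),
    }
    # Include only the driver member that was actually provided.
    if main_class:
        mapping["main_class"] = main_class
    if main_jar_file_uri:
        mapping["main_jar_file_uri"] = main_jar_file_uri
    return mapping


mapping = build_spark_batch_mapping(
    main_class="org.example.Main",
    jar_file_uris=["gs://example-bucket/app.jar"],
    args=["1000"],
)
print(mapping)
```

With the library installed, the dictionary could then be supplied as ``SparkBatch(mapping=mapping)``. Validating up front surfaces a conflicting driver setting as an error instead of letting the later assignment silently clear the earlier one.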