Dataproc Batch

Dataproc Batch in Google Cloud is a managed service for running batch workloads using open-source data processing frameworks like Spark, Hive, and PySpark. It allows you to submit jobs without managing clusters directly, as resources are provisioned and scaled automatically for the duration of the job. This makes it efficient for running data transformations, analytics, and machine learning tasks on demand.

gcp.dataproc_batch

Fields

TitleIDTypeData TypeDescription
_keycorestring
ancestorscorearray<string>
create_timecoretimestampOutput only. The time when the batch was created.
creatorcorestringOutput only. The email address of the user who created the batch.
datadog_display_namecorestring
environment_configcorejsonOptional. Environment configuration for the batch execution.
labelscorearray<string>Optional. The labels to associate with this batch. Label keys must contain 1 to 63 characters, and must conform to RFC 1035 (https://www.ietf.org/rfc/rfc1035.txt). Label values may be empty, but, if present, must contain 1 to 63 characters, and must conform to RFC 1035 (https://www.ietf.org/rfc/rfc1035.txt). No more than 32 labels can be associated with a batch.
namecorestringOutput only. The resource name of the batch.
operationcorestringOutput only. The resource name of the operation associated with this batch.
organization_idcorestring
parentcorestring
project_idcorestring
project_numbercorestring
pyspark_batchcorejsonOptional. PySpark batch config.
resource_namecorestring
runtime_configcorejsonOptional. Runtime configuration for the batch execution.
runtime_infocorejsonOutput only. Runtime information about batch execution.
spark_batchcorejsonOptional. Spark batch config.
spark_r_batchcorejsonOptional. SparkR batch config.
spark_sql_batchcorejsonOptional. SparkSql batch config.
statecorestringOutput only. The state of the batch.
state_historycorejsonOutput only. Historical state information for the batch.
state_messagecorestringOutput only. Batch state details, such as a failure description if the state is FAILED.
state_timecoretimestampOutput only. The time when the batch entered a current state.
tagscorehstore
uuidcorestringOutput only. A batch UUID (Unique Universal Identifier). The service generates this value when it creates the batch.