Cloud Data Fusion Instance

A Cloud Data Fusion instance is a deployment of Cloud Data Fusion, a fully managed, cloud-native data integration service for building and managing ETL and ELT data pipelines visually. It provides a graphical interface for designing data flows, connecting to data sources, transforming data, and loading it into target systems. The service supports both batch and streaming data processing and integrates with other Google Cloud services such as BigQuery, Cloud Storage, and Dataproc. It is intended to simplify data integration and reduce development time across hybrid and multi-cloud environments.

gcp.datafusion_instance

Fields

| Title | ID | Type | Data Type | Description |
|---|---|---|---|---|
| | _key | core | string | |
| | accelerators | core | json | Output only. List of accelerators enabled for this CDF instance. |
| | ancestors | core | array<string> | |
| | api_endpoint | core | string | Output only. Endpoint on which the REST APIs are accessible. |
| | available_version | core | json | Output only. Available versions that the instance can be upgraded to using UpdateInstanceRequest. |
| | create_time | core | timestamp | Output only. The time the instance was created. |
| | crypto_key_config | core | json | Optional. The crypto key configuration. This field is used by the Customer-Managed Encryption Keys (CMEK) feature. |
| | datadog_display_name | core | string | |
| | dataplex_data_lineage_integration_enabled | core | bool | Optional. Option to enable the Dataplex Lineage Integration feature. |
| | dataproc_service_account | core | string | Optional. User-managed service account to set on Dataproc when Cloud Data Fusion creates Dataproc to run data processing pipelines. This allows users to have fine-grained access control on Dataproc's accesses to cloud resources. |
| | description | core | string | Optional. A description of this instance. |
| | disabled_reason | core | array<string> | Output only. If the instance state is DISABLED, the reason for disabling the instance. |
| | enable_rbac | core | bool | Optional. Option to enable granular role-based access control. |
| | enable_stackdriver_logging | core | bool | Optional. Option to enable Dataproc Stackdriver Logging. |
| | enable_stackdriver_monitoring | core | bool | Optional. Option to enable Stackdriver Monitoring. |
| | enable_zone_separation | core | bool | Output only. Option to enable granular zone separation. |
| | event_publish_config | core | json | Optional. Option to enable and pass metadata for event publishing. |
| | gcp_display_name | core | string | Optional. Display name for an instance. |
| | gcs_bucket | core | string | Output only. Cloud Storage bucket generated by Data Fusion in the customer project. |
| | labels | core | array<string> | The resource labels for instance to use to annotate any related underlying resources such as Compute Engine VMs. The character '=' is not allowed to be used within the labels. |
| | logging_config | core | json | Optional. The logging configuration for this instance. This field is supported only in CDF versions 6.11.0 and above. |
| | maintenance_events | core | json | Output only. The maintenance events for this instance. |
| | maintenance_policy | core | json | Optional. Configure the maintenance policy for this instance. |
| | name | core | string | Output only. The name of this instance is in the form of projects/{project}/locations/{location}/instances/{instance}. |
| | network_config | core | json | Optional. Network configuration options. These are required when a private Data Fusion instance is to be created. |
| | organization_id | core | string | |
| | p4_service_account | core | string | Output only. Service agent for the customer project. |
| | parent | core | string | |
| | patch_revision | core | string | Optional. Current patch revision of the Data Fusion. |
| | private_instance | core | bool | Optional. Specifies whether the Data Fusion instance should be private. If set to true, all Data Fusion nodes will have private IP addresses and will not be able to access the public internet. |
| | project_id | core | string | |
| | project_number | core | string | |
| | region_id | core | string | |
| | resource_name | core | string | |
| | satisfies_pzi | core | bool | Output only. Reserved for future use. |
| | satisfies_pzs | core | bool | Output only. Reserved for future use. |
| | service_account | core | string | Output only. Deprecated. Use tenant_project_id instead to extract the tenant project ID. |
| | service_endpoint | core | string | Output only. Endpoint on which the Data Fusion UI is accessible. |
| | state | core | string | Output only. The current state of this Data Fusion instance. |
| | state_message | core | string | Output only. Additional information about the current state of this Data Fusion instance if available. |
| | tags | core | hstore_csv | |
| | tenant_project_id | core | string | Output only. The name of the tenant project. |
| | type | core | string | Required. Instance type. |
| | update_time | core | timestamp | Output only. The time the instance was last updated. |
| | version | core | string | Optional. Current version of the Data Fusion. Only specifiable in Update. |
| | workforce_identity_service_endpoint | core | string | Output only. Endpoint on which the Data Fusion UI is accessible to third-party users. |
| | zone | core | string | Optional. Name of the zone in which the Data Fusion instance will be created. Only DEVELOPER instances use this field. |
| | zone_id | core | string | |
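
The name field is documented as having the form projects/{project}/locations/{location}/instances/{instance}, and private_instance indicates whether nodes use private IP addresses only. As a minimal sketch of how a consumer might work with these two fields, the Python below splits a name into its parts and flags records that are not marked private. It assumes each record is available as a plain dictionary keyed by the field IDs listed here; the helper names parse_instance_name and is_not_private are hypothetical and not part of any API, and how records are actually fetched is outside the scope of this page.

```python
import re
from typing import Optional

# Pattern derived from the documented form of the `name` field:
# projects/{project}/locations/{location}/instances/{instance}
_NAME_RE = re.compile(
    r"^projects/(?P<project>[^/]+)"
    r"/locations/(?P<location>[^/]+)"
    r"/instances/(?P<instance>[^/]+)$"
)


def parse_instance_name(name: str) -> Optional[dict]:
    """Split a `name` value into project, location, and instance parts.

    Returns None when the value does not match the documented form.
    (Hypothetical helper, for illustration only.)
    """
    match = _NAME_RE.match(name)
    return match.groupdict() if match else None


def is_not_private(record: dict) -> bool:
    """Return True when `private_instance` is false, null, or absent.

    Treating a missing or null value as "not private" is an assumption
    made for this sketch, not documented behavior.
    """
    return not record.get("private_instance", False)


# Example record; the values are made up for illustration.
record = {
    "name": "projects/my-proj/locations/us-central1/instances/etl-dev",
    "private_instance": False,
    "type": "DEVELOPER",
}

parts = parse_instance_name(record["name"])
if parts is not None and is_not_private(record):
    print(f"{parts['instance']} in {parts['location']} is not private")
```

The same pattern extends to other optional fields, for example checking whether crypto_key_config is populated to find instances without CMEK configured.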