DataSync HDFS Location

An AWS DataSync HDFS location represents a connection point to a Hadoop Distributed File System (HDFS) cluster. It allows AWS DataSync to transfer data between HDFS and AWS storage services such as Amazon S3, Amazon EFS, or Amazon FSx. The resource stores configuration details such as the HDFS NameNode addresses, authentication type, and security settings, which DataSync uses when migrating or synchronizing data from on-premises or self-managed Hadoop environments into AWS.

aws.datasync_location_hdfs

Fields

| Title | ID | Type | Data Type | Description |
| --- | --- | --- | --- | --- |
| | _key | core | string | |
| | account_id | core | string | |
| | agent_arns | core | array<string> | The ARNs of the DataSync agents that can connect with your HDFS cluster. |
| | authentication_type | core | string | The type of authentication used to determine the identity of the user. |
| | block_size | core | int64 | The size of the data blocks to write into the HDFS cluster. |
| | creation_time | core | timestamp | The time that the HDFS location was created. |
| | kerberos_principal | core | string | The Kerberos principal with access to the files and folders on the HDFS cluster. This parameter is used if the AuthenticationType is defined as KERBEROS. |
| | kms_key_provider_uri | core | string | The URI of the HDFS cluster's Key Management Server (KMS). |
| | location_arn | core | string | The ARN of the HDFS location. |
| | location_uri | core | string | The URI of the HDFS location. |
| | name_nodes | core | json | The NameNode that manages the HDFS namespace. |
| | qop_configuration | core | json | The Quality of Protection (QOP) configuration, which specifies the Remote Procedure Call (RPC) and data transfer protection settings configured on the HDFS cluster. |
| | replication_factor | core | int64 | The number of DataNodes to replicate the data to when writing to the HDFS cluster. |
| | simple_user | core | string | The user name to identify the client on the host operating system. This parameter is used if the AuthenticationType is defined as SIMPLE. |
| | tags | core | hstore | |
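
The field descriptions appear to mirror the response of the DataSync DescribeLocationHdfs API. As a minimal sketch under that assumption, the Python below renames the PascalCase response keys to the field IDs listed here and checks that the user field matching authentication_type is populated. The FIELD_MAP table, the to_record and auth_issues helpers, and all sample values are hypothetical and for illustration only. The _key, account_id, and tags fields are left out because this page does not say where they come from.

```python
from datetime import datetime, timezone
from typing import Any, Dict, List

# Assumed mapping from DescribeLocationHdfs response keys (PascalCase)
# to the field IDs documented on this page.
FIELD_MAP = {
    "AgentArns": "agent_arns",
    "AuthenticationType": "authentication_type",
    "BlockSize": "block_size",
    "CreationTime": "creation_time",
    "KerberosPrincipal": "kerberos_principal",
    "KmsKeyProviderUri": "kms_key_provider_uri",
    "LocationArn": "location_arn",
    "LocationUri": "location_uri",
    "NameNodes": "name_nodes",
    "QopConfiguration": "qop_configuration",
    "ReplicationFactor": "replication_factor",
    "SimpleUser": "simple_user",
}


def to_record(response: Dict[str, Any]) -> Dict[str, Any]:
    """Rename known response keys to field IDs; absent keys become None."""
    return {field: response.get(key) for key, field in FIELD_MAP.items()}


def auth_issues(record: Dict[str, Any]) -> List[str]:
    """Report a missing user field for the configured authentication type."""
    issues = []
    auth = record.get("authentication_type")
    if auth == "KERBEROS" and not record.get("kerberos_principal"):
        issues.append("authentication_type is KERBEROS but kerberos_principal is empty")
    if auth == "SIMPLE" and not record.get("simple_user"):
        issues.append("authentication_type is SIMPLE but simple_user is empty")
    return issues


# Hypothetical response; every value below is made up for illustration.
sample_response = {
    "LocationArn": "arn:aws:datasync:us-east-1:111122223333:location/loc-0123456789abcdef0",
    "LocationUri": "hdfs://namenode.example.internal:8020/data/",
    "NameNodes": [{"Hostname": "namenode.example.internal", "Port": 8020}],
    "BlockSize": 134217728,
    "ReplicationFactor": 3,
    "AuthenticationType": "SIMPLE",
    "SimpleUser": "hadoop",
    "AgentArns": ["arn:aws:datasync:us-east-1:111122223333:agent/agent-0123456789abcdef0"],
    "CreationTime": datetime(2024, 1, 15, tzinfo=timezone.utc),
}

record = to_record(sample_response)
print(record["authentication_type"], auth_issues(record))
```

With boto3, a response of this shape would typically come from the DataSync client's describe_location_hdfs(LocationArn=...) call. The sketch takes a plain dictionary instead so it runs without AWS credentials.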