Glue DataBrew Dataset

Glue DataBrew Dataset is a resource in AWS Glue DataBrew that defines the data you want to prepare and transform. It represents a collection of data from sources such as Amazon S3, Amazon Redshift, or other supported data stores. The dataset acts as the input for DataBrew projects, enabling users to visually clean, normalize, and enrich data without writing code.

aws.databrew_dataset

Fields

TitleIDTypeData TypeDescription
_keycorestring
account_idcorestringThe ID of the Amazon Web Services account that owns the dataset.
create_datecoretimestampThe date and time that the dataset was created.
created_bycorestringThe Amazon Resource Name (ARN) of the user who created the dataset.
formatcorestringThe file format of a dataset that is created from an Amazon S3 file or folder.
format_optionscorejsonA set of options that define how DataBrew interprets the data in the dataset.
inputcorejsonInformation on how DataBrew can find the dataset, in either the Glue Data Catalog or Amazon S3.
last_modified_bycorestringThe Amazon Resource Name (ARN) of the user who last modified the dataset.
last_modified_datecoretimestampThe last modification date and time of the dataset.
namecorestringThe unique name of the dataset.
path_optionscorejsonA set of options that defines how DataBrew interprets an Amazon S3 path of the dataset.
resource_arncorestringThe unique Amazon Resource Name (ARN) for the dataset.
tagscorehstore