This product is not supported for your selected Datadog site. ().
Cette page n'est pas encore disponible en français, sa traduction est en cours.
Si vous avez des questions ou des retours sur notre projet de traduction actuel, n'hésitez pas à nous contacter.

aws_databrew_dataset

account_id

Type: STRING

create_date

Type: TIMESTAMP
Provider name: CreateDate
Description: The date and time that the dataset was created.

created_by

Type: STRING
Provider name: CreatedBy
Description: The Amazon Resource Name (ARN) of the user who created the dataset.

format

Type: STRING
Provider name: Format
Description: The file format of a dataset that is created from an Amazon S3 file or folder.

format_options

Type: STRUCT
Provider name: FormatOptions
Description: A set of options that define how DataBrew interprets the data in the dataset.

  • csv
    Type: STRUCT
    Provider name: Csv
    Description: Options that define how CSV input is to be interpreted by DataBrew.
    • delimiter
      Type: STRING
      Provider name: Delimiter
      Description: A single character that specifies the delimiter being used in the CSV file.
    • header_row
      Type: BOOLEAN
      Provider name: HeaderRow
      Description: A variable that specifies whether the first row in the file is parsed as the header. If this value is false, column names are auto-generated.
  • excel
    Type: STRUCT
    Provider name: Excel
    Description: Options that define how Excel input is to be interpreted by DataBrew.
    • header_row
      Type: BOOLEAN
      Provider name: HeaderRow
      Description: A variable that specifies whether the first row in the file is parsed as the header. If this value is false, column names are auto-generated.
    • sheet_indexes
      Type: UNORDERED_LIST_INT32
      Provider name: SheetIndexes
      Description: One or more sheet numbers in the Excel file that will be included in the dataset.
    • sheet_names
      Type: UNORDERED_LIST_STRING
      Provider name: SheetNames
      Description: One or more named sheets in the Excel file that will be included in the dataset.
  • json
    Type: STRUCT
    Provider name: Json
    Description: Options that define how JSON input is to be interpreted by DataBrew.
    • multi_line
      Type: BOOLEAN
      Provider name: MultiLine
      Description: A value that specifies whether JSON input contains embedded new line characters.

input

Type: STRUCT
Provider name: Input
Description: Information on how DataBrew can find the dataset, in either the Glue Data Catalog or Amazon S3.

  • data_catalog_input_definition
    Type: STRUCT
    Provider name: DataCatalogInputDefinition
    Description: The Glue Data Catalog parameters for the data.
    • catalog_id
      Type: STRING
      Provider name: CatalogId
      Description: The unique identifier of the Amazon Web Services account that holds the Data Catalog that stores the data.
    • database_name
      Type: STRING
      Provider name: DatabaseName
      Description: The name of a database in the Data Catalog.
    • table_name
      Type: STRING
      Provider name: TableName
      Description: The name of a database table in the Data Catalog. This table corresponds to a DataBrew dataset.
    • temp_directory
      Type: STRUCT
      Provider name: TempDirectory
      Description: Represents an Amazon location where DataBrew can store intermediate results.
      • bucket
        Type: STRING
        Provider name: Bucket
        Description: The Amazon S3 bucket name.
      • bucket_owner
        Type: STRING
        Provider name: BucketOwner
        Description: The Amazon Web Services account ID of the bucket owner.
      • key
        Type: STRING
        Provider name: Key
        Description: The unique name of the object in the bucket.
  • database_input_definition
    Type: STRUCT
    Provider name: DatabaseInputDefinition
    Description: Connection information for dataset input files stored in a database.
    • database_table_name
      Type: STRING
      Provider name: DatabaseTableName
      Description: The table within the target database.
    • glue_connection_name
      Type: STRING
      Provider name: GlueConnectionName
      Description: The Glue Connection that stores the connection information for the target database.
    • query_string
      Type: STRING
      Provider name: QueryString
      Description: Custom SQL to run against the provided Glue connection. This SQL will be used as the input for DataBrew projects and jobs.
    • temp_directory
      Type: STRUCT
      Provider name: TempDirectory
      • bucket
        Type: STRING
        Provider name: Bucket
        Description: The Amazon S3 bucket name.
      • bucket_owner
        Type: STRING
        Provider name: BucketOwner
        Description: The Amazon Web Services account ID of the bucket owner.
      • key
        Type: STRING
        Provider name: Key
        Description: The unique name of the object in the bucket.
  • metadata
    Type: STRUCT
    Provider name: Metadata
    Description: Contains additional resource information needed for specific datasets.
    • source_arn
      Type: STRING
      Provider name: SourceArn
      Description: The Amazon Resource Name (ARN) associated with the dataset. Currently, DataBrew only supports ARNs from Amazon AppFlow.
  • s3_input_definition
    Type: STRUCT
    Provider name: S3InputDefinition
    Description: The Amazon S3 location where the data is stored.
    • bucket
      Type: STRING
      Provider name: Bucket
      Description: The Amazon S3 bucket name.
    • bucket_owner
      Type: STRING
      Provider name: BucketOwner
      Description: The Amazon Web Services account ID of the bucket owner.
    • key
      Type: STRING
      Provider name: Key
      Description: The unique name of the object in the bucket.

last_modified_by

Type: STRING
Provider name: LastModifiedBy
Description: The Amazon Resource Name (ARN) of the user who last modified the dataset.

last_modified_date

Type: TIMESTAMP
Provider name: LastModifiedDate
Description: The last modification date and time of the dataset.

name

Type: STRING
Provider name: Name
Description: The unique name of the dataset.

path_options

Type: STRUCT
Provider name: PathOptions
Description: A set of options that defines how DataBrew interprets an Amazon S3 path of the dataset.

  • files_limit
    Type: STRUCT
    Provider name: FilesLimit
    Description: If provided, this structure imposes a limit on a number of files that should be selected.
    • max_files
      Type: INT32
      Provider name: MaxFiles
      Description: The number of Amazon S3 files to select.
    • order
      Type: STRING
      Provider name: Order
      Description: A criteria to use for Amazon S3 files sorting before their selection. By default uses DESCENDING order, i.e. most recent files are selected first. Another possible value is ASCENDING.
    • ordered_by
      Type: STRING
      Provider name: OrderedBy
      Description: A criteria to use for Amazon S3 files sorting before their selection. By default uses LAST_MODIFIED_DATE as a sorting criteria. Currently it’s the only allowed value.
  • last_modified_date_condition
    Type: STRUCT
    Provider name: LastModifiedDateCondition
    Description: If provided, this structure defines a date range for matching Amazon S3 objects based on their LastModifiedDate attribute in Amazon S3.
    • expression
      Type: STRING
      Provider name: Expression
      Description: The expression which includes condition names followed by substitution variables, possibly grouped and combined with other conditions. For example, “(starts_with :prefix1 or starts_with :prefix2) and (ends_with :suffix1 or ends_with :suffix2)”. Substitution variables should start with ‘:’ symbol.
    • values_map
      Type: MAP_STRING_STRING
      Provider name: ValuesMap
      Description: The map of substitution variable names to their values used in this filter expression.
  • parameters
    Type: STRING
    Provider name: Parameters
    Description: A structure that maps names of parameters used in the Amazon S3 path of a dataset to their definitions.

resource_arn

Type: STRING
Provider name: ResourceArn
Description: The unique Amazon Resource Name (ARN) for the dataset.

tags

Type: UNORDERED_LIST_STRING