---
title: Getting Started with Datadog
description: Datadog, the leading service for cloud-scale monitoring.
breadcrumbs: Docs > Infrastructure > Datadog Resource Catalog
---

# aws_databrew_dataset{% #aws_databrew_dataset %}

## `account_id`{% #account_id %}

**Type**: `STRING`

## `create_date`{% #create_date %}

**Type**: `TIMESTAMP`**Provider name**: `CreateDate`**Description**: The date and time that the dataset was created.

## `created_by`{% #created_by %}

**Type**: `STRING`**Provider name**: `CreatedBy`**Description**: The Amazon Resource Name (ARN) of the user who created the dataset.

## `format`{% #format %}

**Type**: `STRING`**Provider name**: `Format`**Description**: The file format of a dataset that is created from an Amazon S3 file or folder.

## `format_options`{% #format_options %}

**Type**: `STRUCT`**Provider name**: `FormatOptions`**Description**: A set of options that define how DataBrew interprets the data in the dataset.

- `csv`**Type**: `STRUCT`**Provider name**: `Csv`**Description**: Options that define how CSV input is to be interpreted by DataBrew.
  - `delimiter`**Type**: `STRING`**Provider name**: `Delimiter`**Description**: A single character that specifies the delimiter being used in the CSV file.
  - `header_row`**Type**: `BOOLEAN`**Provider name**: `HeaderRow`**Description**: A variable that specifies whether the first row in the file is parsed as the header. If this value is false, column names are auto-generated.
- `excel`**Type**: `STRUCT`**Provider name**: `Excel`**Description**: Options that define how Excel input is to be interpreted by DataBrew.
  - `header_row`**Type**: `BOOLEAN`**Provider name**: `HeaderRow`**Description**: A variable that specifies whether the first row in the file is parsed as the header. If this value is false, column names are auto-generated.
  - `sheet_indexes`**Type**: `UNORDERED_LIST_INT32`**Provider name**: `SheetIndexes`**Description**: One or more sheet numbers in the Excel file that will be included in the dataset.
  - `sheet_names`**Type**: `UNORDERED_LIST_STRING`**Provider name**: `SheetNames`**Description**: One or more named sheets in the Excel file that will be included in the dataset.
- `json`**Type**: `STRUCT`**Provider name**: `Json`**Description**: Options that define how JSON input is to be interpreted by DataBrew.
  - `multi_line`**Type**: `BOOLEAN`**Provider name**: `MultiLine`**Description**: A value that specifies whether JSON input contains embedded new line characters.

## `input`{% #input %}

**Type**: `STRUCT`**Provider name**: `Input`**Description**: Information on how DataBrew can find the dataset, in either the Glue Data Catalog or Amazon S3.

- `data_catalog_input_definition`**Type**: `STRUCT`**Provider name**: `DataCatalogInputDefinition`**Description**: The Glue Data Catalog parameters for the data.
  - `catalog_id`**Type**: `STRING`**Provider name**: `CatalogId`**Description**: The unique identifier of the Amazon Web Services account that holds the Data Catalog that stores the data.
  - `database_name`**Type**: `STRING`**Provider name**: `DatabaseName`**Description**: The name of a database in the Data Catalog.
  - `table_name`**Type**: `STRING`**Provider name**: `TableName`**Description**: The name of a database table in the Data Catalog. This table corresponds to a DataBrew dataset.
  - `temp_directory`**Type**: `STRUCT`**Provider name**: `TempDirectory`**Description**: Represents an Amazon location where DataBrew can store intermediate results.
    - `bucket`**Type**: `STRING`**Provider name**: `Bucket`**Description**: The Amazon S3 bucket name.
    - `bucket_owner`**Type**: `STRING`**Provider name**: `BucketOwner`**Description**: The Amazon Web Services account ID of the bucket owner.
    - `key`**Type**: `STRING`**Provider name**: `Key`**Description**: The unique name of the object in the bucket.
- `database_input_definition`**Type**: `STRUCT`**Provider name**: `DatabaseInputDefinition`**Description**: Connection information for dataset input files stored in a database.
  - `database_table_name`**Type**: `STRING`**Provider name**: `DatabaseTableName`**Description**: The table within the target database.
  - `glue_connection_name`**Type**: `STRING`**Provider name**: `GlueConnectionName`**Description**: The Glue Connection that stores the connection information for the target database.
  - `query_string`**Type**: `STRING`**Provider name**: `QueryString`**Description**: Custom SQL to run against the provided Glue connection. This SQL will be used as the input for DataBrew projects and jobs.
  - `temp_directory`**Type**: `STRUCT`**Provider name**: `TempDirectory`
    - `bucket`**Type**: `STRING`**Provider name**: `Bucket`**Description**: The Amazon S3 bucket name.
    - `bucket_owner`**Type**: `STRING`**Provider name**: `BucketOwner`**Description**: The Amazon Web Services account ID of the bucket owner.
    - `key`**Type**: `STRING`**Provider name**: `Key`**Description**: The unique name of the object in the bucket.
- `metadata`**Type**: `STRUCT`**Provider name**: `Metadata`**Description**: Contains additional resource information needed for specific datasets.
  - `source_arn`**Type**: `STRING`**Provider name**: `SourceArn`**Description**: The Amazon Resource Name (ARN) associated with the dataset. Currently, DataBrew only supports ARNs from Amazon AppFlow.
- `s3_input_definition`**Type**: `STRUCT`**Provider name**: `S3InputDefinition`**Description**: The Amazon S3 location where the data is stored.
  - `bucket`**Type**: `STRING`**Provider name**: `Bucket`**Description**: The Amazon S3 bucket name.
  - `bucket_owner`**Type**: `STRING`**Provider name**: `BucketOwner`**Description**: The Amazon Web Services account ID of the bucket owner.
  - `key`**Type**: `STRING`**Provider name**: `Key`**Description**: The unique name of the object in the bucket.

## `last_modified_by`{% #last_modified_by %}

**Type**: `STRING`**Provider name**: `LastModifiedBy`**Description**: The Amazon Resource Name (ARN) of the user who last modified the dataset.

## `last_modified_date`{% #last_modified_date %}

**Type**: `TIMESTAMP`**Provider name**: `LastModifiedDate`**Description**: The last modification date and time of the dataset.

## `name`{% #name %}

**Type**: `STRING`**Provider name**: `Name`**Description**: The unique name of the dataset.

## `path_options`{% #path_options %}

**Type**: `STRUCT`**Provider name**: `PathOptions`**Description**: A set of options that defines how DataBrew interprets an Amazon S3 path of the dataset.

- `files_limit`**Type**: `STRUCT`**Provider name**: `FilesLimit`**Description**: If provided, this structure imposes a limit on a number of files that should be selected.
  - `max_files`**Type**: `INT32`**Provider name**: `MaxFiles`**Description**: The number of Amazon S3 files to select.
  - `order`**Type**: `STRING`**Provider name**: `Order`**Description**: A criteria to use for Amazon S3 files sorting before their selection. By default uses DESCENDING order, i.e. most recent files are selected first. Another possible value is ASCENDING.
  - `ordered_by`**Type**: `STRING`**Provider name**: `OrderedBy`**Description**: A criteria to use for Amazon S3 files sorting before their selection. By default uses LAST_MODIFIED_DATE as a sorting criteria. Currently it's the only allowed value.
- `last_modified_date_condition`**Type**: `STRUCT`**Provider name**: `LastModifiedDateCondition`**Description**: If provided, this structure defines a date range for matching Amazon S3 objects based on their LastModifiedDate attribute in Amazon S3.
  - `expression`**Type**: `STRING`**Provider name**: `Expression`**Description**: The expression which includes condition names followed by substitution variables, possibly grouped and combined with other conditions. For example, "(starts_with :prefix1 or starts_with :prefix2) and (ends_with :suffix1 or ends_with :suffix2)". Substitution variables should start with ':' symbol.
  - `values_map`**Type**: `MAP_STRING_STRING`**Provider name**: `ValuesMap`**Description**: The map of substitution variable names to their values used in this filter expression.
- `parameters`**Type**: `STRING`**Provider name**: `Parameters`**Description**: A structure that maps names of parameters used in the Amazon S3 path of a dataset to their definitions.

## `resource_arn`{% #resource_arn %}

**Type**: `STRING`**Provider name**: `ResourceArn`**Description**: The unique Amazon Resource Name (ARN) for the dataset.

## `tags`{% #tags %}

**Type**: `UNORDERED_LIST_STRING`