---
title: Getting Started with Datadog
description: Datadog, the leading service for cloud-scale monitoring.
breadcrumbs: Docs > Infrastructure > Datadog Resource Catalog
---

# gcp_storageinsights_dataset_config{% #gcp_storageinsights_dataset_config %}

## `ancestors`{% #ancestors %}

**Type**: `UNORDERED_LIST_STRING`

## `cloud_storage_object_path`{% #cloud_storage_object_path %}

**Type**: `STRING`**Provider name**: `cloudStorageObjectPath`**Description**: Input only. Cloud Storage object path containing a list of project or folder numbers to include in the dataset; it cannot contain a mix of project and folders. The object must be a text file where each line has one of the following entries: - Project number, formatted as `projects/{project_number}`, for example, `projects/1234567890`. - Folder identifier, formatted as `folders/{folder_number}`, for example, `folders/9876543210`. Path must be in the format `gs://{bucket_name}/{object_name}`.

## `create_time`{% #create_time %}

**Type**: `TIMESTAMP`**Provider name**: `createTime`**Description**: Output only. The UTC time at which the dataset configuration was created. This is auto-populated.

## `dataset_config_state`{% #dataset_config_state %}

**Type**: `STRING`**Provider name**: `datasetConfigState`**Description**: Output only. State of the `datasetConfig`.**Possible values**:

- `CONFIG_STATE_UNSPECIFIED` - Unspecified state.
- `CONFIG_STATE_ACTIVE` - Active configuration indicates that the configuration is actively ingesting data.
- `CONFIG_STATE_VERIFICATION_IN_PROGRESS` - In this state, the configuration is being verified for various permissions.
- `CONFIG_STATE_CREATED` - Configuration is created and further processing needs to happen.
- `CONFIG_STATE_PROCESSING` - Configuration is under processing

## `description`{% #description %}

**Type**: `STRING`**Provider name**: `description`**Description**: Optional. A user-provided description for the dataset configuration. Maximum length: 256 characters.

## `exclude_cloud_storage_buckets`{% #exclude_cloud_storage_buckets %}

**Type**: `STRUCT`**Provider name**: `excludeCloudStorageBuckets`

- `cloud_storage_buckets`**Type**: `UNORDERED_LIST_STRUCT`**Provider name**: `cloudStorageBuckets`
  - `bucket_name`**Type**: `STRING`**Provider name**: `bucketName`**Description**: Cloud Storage bucket name.
  - `bucket_prefix_regex`**Type**: `STRING`**Provider name**: `bucketPrefixRegex`**Description**: A regex pattern for bucket names matching the regex. Regex should follow the syntax specified in `google/re2` on GitHub.

## `exclude_cloud_storage_locations`{% #exclude_cloud_storage_locations %}

**Type**: `STRUCT`**Provider name**: `excludeCloudStorageLocations`

- `locations`**Type**: `UNORDERED_LIST_STRING`**Provider name**: `locations`

## `gcp_status`{% #gcp_status %}

**Type**: `STRUCT`**Provider name**: `status`**Description**: Output only. Status of the `datasetConfig`.

- `code`**Type**: `INT32`**Provider name**: `code`**Description**: The status code, which should be an enum value of google.rpc.Code.
- `message`**Type**: `STRING`**Provider name**: `message`**Description**: A developer-facing error message, which should be in English. Any user-facing error message should be localized and sent in the google.rpc.Status.details field, or localized by the client.

## `identity`{% #identity %}

**Type**: `STRUCT`**Provider name**: `identity`**Description**: Identity used by this `datasetConfig`.

- `name`**Type**: `STRING`**Provider name**: `name`**Description**: Output only. Name of the identity.
- `type`**Type**: `STRING`**Provider name**: `type`**Description**: Type of identity to use for the datasetConfig.**Possible values**:
  - `IDENTITY_TYPE_UNSPECIFIED` - Default is unspecified and should not be used.
  - `IDENTITY_TYPE_PER_CONFIG` - Google managed service account per resource.
  - `IDENTITY_TYPE_PER_PROJECT` - Google managed service account per project.

## `include_cloud_storage_buckets`{% #include_cloud_storage_buckets %}

**Type**: `STRUCT`**Provider name**: `includeCloudStorageBuckets`

- `cloud_storage_buckets`**Type**: `UNORDERED_LIST_STRUCT`**Provider name**: `cloudStorageBuckets`
  - `bucket_name`**Type**: `STRING`**Provider name**: `bucketName`**Description**: Cloud Storage bucket name.
  - `bucket_prefix_regex`**Type**: `STRING`**Provider name**: `bucketPrefixRegex`**Description**: A regex pattern for bucket names matching the regex. Regex should follow the syntax specified in `google/re2` on GitHub.

## `include_cloud_storage_locations`{% #include_cloud_storage_locations %}

**Type**: `STRUCT`**Provider name**: `includeCloudStorageLocations`

- `locations`**Type**: `UNORDERED_LIST_STRING`**Provider name**: `locations`

## `include_newly_created_buckets`{% #include_newly_created_buckets %}

**Type**: `BOOLEAN`**Provider name**: `includeNewlyCreatedBuckets`**Description**: If set to `true`, the request includes all the newly created buckets in the dataset that meet the inclusion and exclusion rules.

## `labels`{% #labels %}

**Type**: `UNORDERED_LIST_STRING`

## `link`{% #link %}

**Type**: `STRUCT`**Provider name**: `link`**Description**: Details of the linked dataset.

- `dataset`**Type**: `STRING`**Provider name**: `dataset`**Description**: Output only. Dataset name for linked dataset.
- `linked`**Type**: `BOOLEAN`**Provider name**: `linked`**Description**: Output only. State of the linked dataset.

## `name`{% #name %}

**Type**: `STRING`**Provider name**: `name`**Description**: Identifier. name of resource

## `organization_id`{% #organization_id %}

**Type**: `STRING`

## `organization_number`{% #organization_number %}

**Type**: `INT64`**Provider name**: `organizationNumber`**Description**: Optional. Organization resource ID that the source projects should belong to. Projects that do not belong to the provided organization are not considered when creating the dataset.

## `organization_scope`{% #organization_scope %}

**Type**: `BOOLEAN`**Provider name**: `organizationScope`**Description**: Defines the options for providing a source organization for the dataset.

## `parent`{% #parent %}

**Type**: `STRING`

## `project_id`{% #project_id %}

**Type**: `STRING`

## `project_number`{% #project_number %}

**Type**: `STRING`

## `region_id`{% #region_id %}

**Type**: `STRING`

## `resource_name`{% #resource_name %}

**Type**: `STRING`

## `retention_period_days`{% #retention_period_days %}

**Type**: `INT32`**Provider name**: `retentionPeriodDays`**Description**: Number of days of history that must be retained.

## `skip_verification_and_ingest`{% #skip_verification_and_ingest %}

**Type**: `BOOLEAN`**Provider name**: `skipVerificationAndIngest`**Description**: Optional. If set to `false`, then all the permission checks must be successful before the system can start ingesting data. This field can only be updated before the system ingests data for the first time. Any attempt to modify the field after data ingestion starts results in an error.

## `source_folders`{% #source_folders %}

**Type**: `STRUCT`**Provider name**: `sourceFolders`**Description**: Defines the options for providing source folders for the dataset.

- `folder_numbers`**Type**: `UNORDERED_LIST_INT64`**Provider name**: `folderNumbers`**Description**: Optional. The list of folder numbers to include in the dataset.

## `source_projects`{% #source_projects %}

**Type**: `STRUCT`**Provider name**: `sourceProjects`**Description**: Defines the options for providing source projects for the dataset.

- `project_numbers`**Type**: `UNORDERED_LIST_INT64`**Provider name**: `projectNumbers`

## `tags`{% #tags %}

**Type**: `UNORDERED_LIST_STRING`

## `uid`{% #uid %}

**Type**: `STRING`**Provider name**: `uid`**Description**: Output only. System generated unique identifier for the resource.

## `update_time`{% #update_time %}

**Type**: `TIMESTAMP`**Provider name**: `updateTime`**Description**: Output only. The UTC time at which the dataset configuration was last updated. This is auto-populated.

## `zone_id`{% #zone_id %}

**Type**: `STRING`
