A singleton resource under a Processor
which configures a collection of documents.
JSON representation |
---|
{ "name": string, "state": enum ( |
Fields | |
---|---|
name |
Dataset resource name. Format: |
state |
Required. State of the dataset. Ignored when updating dataset. |
satisfiesPzs |
Output only. Reserved for future use. |
satisfiesPzi |
Output only. Reserved for future use. |
Union field
|
|
gcsManagedConfig |
Optional. User-managed Cloud Storage dataset configuration. Use this configuration if the dataset documents are stored under a user-managed Cloud Storage location. |
documentWarehouseConfig |
Optional. Deprecated. Warehouse-based dataset configuration is not supported. |
unmanagedDatasetConfig |
Optional. Unmanaged dataset configuration. Use this configuration if the dataset documents are managed by the document service internally (not user-managed). |
Union field
|
|
spannerIndexingConfig |
Optional. A lightweight indexing source with low latency and high reliability, but lacking advanced features like CMEK and content-based search. |
GCSManagedConfig
Configuration specific to the Cloud Storage-based implementation.
JSON representation |
---|
{
"gcsPrefix": {
object ( |
Fields | |
---|---|
gcsPrefix |
Required. The Cloud Storage URI (a directory) where the documents belonging to the dataset must be stored. |
GcsPrefix
Specifies all documents on Cloud Storage with a common prefix.
JSON representation |
---|
{ "gcsUriPrefix": string } |
Fields | |
---|---|
gcsUriPrefix |
The URI prefix. |
DocumentWarehouseConfig
Configuration specific to the Document AI Warehouse-based implementation.
JSON representation |
---|
{ "collection": string, "schema": string } |
Fields | |
---|---|
collection |
Output only. The collection in Document AI Warehouse associated with the dataset. |
schema |
Output only. The schema in Document AI Warehouse associated with the dataset. |
UnmanagedDatasetConfig
This type has no fields.
Configuration specific to an unmanaged dataset.
SpannerIndexingConfig
This type has no fields.
Configuration specific to spanner-based indexing.