Full name: projects.locations.processors.dataset.importDocuments
Import documents into a dataset.
HTTP request
POST https://{endpoint}/v1beta3/{dataset}:importDocuments
Where {endpoint}
is one of the supported service endpoints.
Path parameters
Parameters | |
---|---|
dataset |
Required. The dataset resource name. Format: projects/{project}/locations/{location}/processors/{processor}/dataset It takes the form |
Request body
The request body contains data with the following structure:
JSON representation |
---|
{
"batchDocumentsImportConfigs": [
{
object ( |
Fields | |
---|---|
batchDocumentsImportConfigs[] |
Required. The Cloud Storage uri containing raw documents that must be imported. |
Response body
If successful, the response body contains an instance of Operation
.
Authorization scopes
Requires the following OAuth scope:
https://www.googleapis.com/auth/cloud-platform
For more information, see the Authentication Overview.
IAM Permissions
Requires the following IAM permission on the dataset
resource:
documentai.datasets.createDocuments
For more information, see the IAM documentation.
BatchDocumentsImportConfig
Config for importing documents. Each batch can have its own dataset split type.
JSON representation |
---|
{ "batchInputConfig": { object ( |
Fields | |
---|---|
batchInputConfig |
The common config to specify a set of documents used as input. |
Union field
|
|
datasetSplit |
Target dataset split where the documents must be stored. |
autoSplitConfig |
If set, documents will be automatically split into training and test split category with the specified ratio. |
AutoSplitConfig
The config for auto-split.
JSON representation |
---|
{ "trainingSplitRatio": number } |
Fields | |
---|---|
trainingSplitRatio |
Ratio of training dataset split. |