.document_understanding.GcsDestination
The Google Cloud Storage location to write
the output to.
pages_per_shard
int
The maximum number of pages to include in each output Document
shard JSON on Google Cloud Storage.
The valid range is [1, 100]. If not specified, the default
value is 20.
For example, a PDF file with 100 pages produces 100 parsed
pages. If pages_per_shard = 20, then 5 Document shard JSON
files, each containing 20 parsed pages, are written under the
prefix [OutputConfig.gcs_destination.uri][] with the suffix
pages-x-to-y.json, where x and y are 1-indexed page numbers.
Example GCS outputs with 157 pages and pages_per_shard = 50:
pages-001-to-050.json
pages-051-to-100.json
pages-101-to-150.json
pages-151-to-157.json
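The sharding arithmetic described above can be sketched locally. The helper below is hypothetical and is not part of the client library: the name shard_names, the width parameter, and the three-digit zero padding are assumptions inferred only from the 157-page example, and the service's actual padding rule may differ.

```python
def shard_names(total_pages, pages_per_shard=20, width=3):
    """Return the expected shard file suffixes for a parsed document.

    Hypothetical helper for illustration only. `width` (zero-padding of
    page numbers) is an assumption taken from the documented example.
    """
    # The documented valid range for pages_per_shard is [1, 100].
    if not 1 <= pages_per_shard <= 100:
        raise ValueError("pages_per_shard must be in the range [1, 100]")

    names = []
    # Page numbers are 1-indexed; each shard covers pages start..end inclusive.
    for start in range(1, total_pages + 1, pages_per_shard):
        end = min(start + pages_per_shard - 1, total_pages)
        names.append(f"pages-{start:0{width}d}-to-{end:0{width}d}.json")
    return names


if __name__ == "__main__":
    for name in shard_names(157, pages_per_shard=50):
        print(name)
```

With 157 pages and pages_per_shard = 50, this produces four names, the last covering pages 151 to 157, matching the example listing above.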
Methods
__delattr__
__delattr__(key)
Delete the value on the given field.
This is generally equivalent to setting a falsy value.
__eq__
__eq__(other)
Return True if the messages are equal, False otherwise.
__ne__
__ne__(other)
Return True if the messages are unequal, False otherwise.
__setattr__
__setattr__(key, value)
Set the value on the given field.
For well-known protocol buffer types that are marshalled, either
the protocol buffer object or the Python equivalent is accepted.