Table(
documentai_object: google.cloud.documentai_v1.types.document.Document.Page.Table,
document_text: dataclasses.InitVar[str],
)
Represents a wrapped documentai.Document.Page.Table.
Attributes | |
---|---|
Name | Description |
documentai_object |
google.cloud.documentai.Document.Page.Table
Required. The original google.cloud.documentai.Document.Page.Table object. |
document_text |
str
Required. UTF-8 encoded text in reading order from the document. |
body_rows |
List[List[str]]
Required. A list of body rows. |
header_rows |
List[List[str]]
Required. A list of header rows. |
Methods
to_csv
to_csv() -> str
Returns a csv str.
.. code-block:: python
from google.cloud.documentai_toolbox import Document
def sample_table_to_csv():
#Wrap document from gcs_path
merged_document = Document('gs://abc/def/gh/1')
#Use first page
page = merged_document.pages[0]
#export the first table in page 1 to csv
csv_text = page.tables[0].to_csv()
print(csv_text)
Parameter | |
---|---|
Name | Description |
dataframe |
pd.Dataframe
Required. Two-dimensional, size-mutable, potentially heterogeneous tabular data. |
Returns | |
---|---|
Type | Description |
str | The table in csv format. |
to_dataframe
to_dataframe() -> pandas.core.frame.DataFrame
Returns pd.DataFrame from documentai.table
Returns | |
---|---|
Type | Description |
pd.DataFrame | The DataFrame of the table. |