Changelog

0.13.5-alpha (2024-07-02)

Bug Fixes

  • Refactor page.py to improve performance and organization (#316) (bee4f62)

0.13.4-alpha (2024-06-13)

Bug Fixes

  • Change if condition typo in _get_children_of_element() (#313) (7495e0e)

0.13.3-alpha (2024-03-11)

Bug Fixes

  • Drop Python 3.7 Support (71e6c51)

  • Escape html special characters in hocr_document_template.xml.j2 (#279) (2d9f05b)

  • Require google-api-core >= 2.17.1 (71e6c51)

  • Require numpy >= 1.23.5 (71e6c51)

  • Require pandas >= 2.0.0 (71e6c51)

  • Require pikepdf >= 8.0.0 (71e6c51)

  • Require Pillow >= 10.0.0 (71e6c51)

  • Require proto-plus >= 1.22.3 (71e6c51)

0.13.2-alpha (2024-03-08)

Bug Fixes

  • Add trailing slash if not present for gcs_prefix in Document.from_gcs() to cover matching prefixes edge case. (#274) (b4762e8)

0.13.1-alpha (2024-03-04)

Bug Fixes

  • Changed client_info import and added new quickstart samples (#268) (c4b1d58), closes #266

0.13.0-alpha (2024-02-26)

Features

  • Added gcs_uri parameter to Document.from_gcs() to allow importing of a single Document JSON (#261) (f654a5d)

0.12.2-alpha (2024-02-02)

Bug Fixes

  • Reduce API polling for Document.from_batch_process_operation() (#249) (0677299)

0.12.1-alpha (2024-02-02)

Bug Fixes

0.12.0-alpha (2023-11-30)

Features

  • Add support for Python 3.12 (1ff4bf7)

  • Introduce compatibility with native namespace packages (#201) (d178acb)

Bug Fixes

  • Implement lazy-loading of properties to speed up initialization (#205) (7001c76)

  • Prevent sorting entities labeled in Document AI Workbench (#200) (d843e51)

0.11.2-alpha (2023-11-07)

Bug Fixes

  • Updates to hOCR Template to follow hOCR Spec (#195) (3f52e82)

0.11.1-alpha (2023-10-23)

Bug Fixes

0.11.0-alpha (2023-10-18)

Features

0.10.3-alpha (2023-10-06)

Bug Fixes

  • docai_utilities.py to return Optional (#176) (028bc37)

0.10.2-alpha (2023-10-03)

Bug Fixes

  • Change ocr_line <span> to include all ocr_word (#169) (bc44dab)

0.10.1-alpha (2023-09-05)

Bug Fixes

  • Add handling for documents missing all layout elements. (#161) (1ac6f5e)

0.10.0-alpha (2023-08-09)

Features

  • Add export merged sharded Document proto (#145) (a5e1f5c)

Bug Fixes

0.9.1-alpha (2023-07-27)

Bug Fixes

  • Internal refactoring to improve efficiency and readability. No external-facing changes. (82ac823)

0.9.0-alpha (2023-07-07)

Features

0.8.0-alpha (2023-06-07)

Features

  • Add convert_document_to_annotate_file_json (#124) (a6b75fc)

0.7.0-alpha (2023-05-31)

Features

  • Added text_annotation to vision conversion (#114) (27196bb)

0.6.0-alpha (2023-04-17)

Features

0.5.0-alpha (2023-04-07)

Features

  • Add Import Document from Batch Process Metadata & Operation (#88) (f95bbea)

  • Added Export Images functionality (#96) (383e105)

  • Update Max Files per Batch Request to 1000 (#91) (3bbc0f0)

0.4.1-alpha (2023-03-21)

Miscellaneous Chores

0.4.0-alpha (2023-03-09)

Features

  • Add config based annotation converter (#72) (735514e)

  • Added Batch creation for Cloud Storage documents. (#66) (c32a371)

  • Added list_gcs_document_tree (#75) (d18d1dc)

Bug Fixes

  • Handle Edge Case where GCS Shards are out of order (#69) (709fe86)

0.3.0-alpha (2023-02-27)

Features

  • Added docproto to AnnotateFile convertor (#63) (f6dd89a)

0.2.1-alpha (2023-02-15)

Documentation

0.2.0-alpha (2023-02-15)

Features

  • Add entities_to_dict() and entities_to_bigquery() to Document wrapper (#50) (494fa86)

  • Add PDF Splitter (#51) (8359911)

  • Added Support for Form Fields (#48) (6d74548)

0.1.1-alpha (2023-02-08)

Bug Fixes

  • Updated Pip install name in README (#52) (dad8c8b)

Documentation

  • samples: Added quickstart sample (#27) (23a0791)

0.1.0-alpha (2023-01-31)

Features

Miscellaneous Chores

  • Set initial version to 0.1.0-alpha (b01c38b)

Documentation

0.1.0-alpha (2022-11-24)

Features