Commit
Feat: add more output format for table inference (#263)
This PR addresses [CORE-2307](https://unstructured-ai.atlassian.net/browse/CORE-2307)

- add a new kwarg to `UnstructuredTableTransformerModel.run_prediction`: `output_format`
- the default `output_format` is `html`, which is the current behavior: output an HTML string representation of the table
- another available option is `dataframe`, which returns a pandas dataframe representation of the table
- if not specified, or for any other string value of `output_format`, it returns a list of dictionaries: the table cell format, which is the original output format from table transformer
- `unstructured.model.tables.recognize` no longer accepts the `out_html` kwarg and now only returns the table cell format

[CORE-2307]: https://unstructured-ai.atlassian.net/browse/CORE-2307?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ

---------

Co-authored-by: qued <[email protected]>
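The `output_format` dispatch described in the commit message can be illustrated with a minimal sketch. This is not the code from this commit: `format_table`, `cells_to_grid`, and the `row`/`col`/`text` cell keys are hypothetical names chosen for illustration, and the real table cell format may use different keys.

```python
from html import escape


def cells_to_grid(cells):
    """Arrange cell dicts into a row-major grid of strings (hypothetical helper)."""
    if not cells:
        return []
    n_rows = max(c["row"] for c in cells) + 1
    n_cols = max(c["col"] for c in cells) + 1
    grid = [["" for _ in range(n_cols)] for _ in range(n_rows)]
    for c in cells:
        grid[c["row"]][c["col"]] = c["text"]
    return grid


def format_table(cells, output_format="html"):
    """Return the table as HTML, a pandas DataFrame, or the raw cell list."""
    if output_format == "html":
        rows = "".join(
            "<tr>" + "".join(f"<td>{escape(text)}</td>" for text in row) + "</tr>"
            for row in cells_to_grid(cells)
        )
        return f"<table>{rows}</table>"
    if output_format == "dataframe":
        # Imported lazily so the other formats work without pandas installed.
        import pandas as pd

        return pd.DataFrame(cells_to_grid(cells))
    # Any other value falls back to the cell format (list of dictionaries).
    return cells


# Assumed, simplified cell format: one dict per cell.
cells = [
    {"row": 0, "col": 0, "text": "name"},
    {"row": 0, "col": 1, "text": "qty"},
    {"row": 1, "col": 0, "text": "apple"},
    {"row": 1, "col": 1, "text": "3"},
]
```

In this sketch, `format_table(cells)` yields an HTML string, `format_table(cells, output_format="dataframe")` yields a DataFrame (pandas required), and any other value such as `"cells"` returns the list of dictionaries unchanged.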
Showing 5 changed files with 107 additions and 16 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
| | `@@ -1 +1 @@` |
1 | | `-__version__ = "0.7.10-dev1" # pragma: no cover` |
| 1 | `+__version__ = "0.7.10-dev2" # pragma: no cover` |