No inferences found by Textract #87
Comments
@nekogeko thank you for reaching out. Can you please confirm if you are uploading the documents to process in the S3 bucket directly or in the UI? A few things to check in addition to the above
|
Hi, I'm uploading the document from the UI. I've been testing with management.png, document type generic. The CloudWatch logs for the Textract Lambda do not show any errors.
The entry in DynamoDB has its case status set to success. I see inferences being created in S3 under a folder hierarchy that looks like
Alongside this file are other files, including entity-*.json and textract-analyze.json. The inference files are requested and retrieved from the UI, and the key-value pairs tab shows 10 key-value pairs found, but the Raw Text section says "No Raw Text detected". |
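One way to narrow down a symptom like this is to check whether the stored text-detection inference actually contains text before suspecting the UI. The sketch below assumes the inference file holds a standard Amazon Textract response (a `Blocks` array whose `LINE` entries carry a `Text` field); the `extractLines` helper and the sample data are hypothetical and not part of the solution's code.

```typescript
// Minimal subset of the Textract response shape used by this sketch.
interface TextractBlock {
  BlockType: string;
  Text?: string;
}

interface TextractResponse {
  Blocks?: TextractBlock[];
}

// Hypothetical helper: collect the text of every LINE block, in order.
function extractLines(response: TextractResponse): string[] {
  return (response.Blocks ?? [])
    .filter((block) => block.BlockType === "LINE" && typeof block.Text === "string")
    .map((block) => block.Text as string);
}

// Made-up sample standing in for a downloaded inference file.
const sampleInference: TextractResponse = {
  Blocks: [
    { BlockType: "PAGE" },
    { BlockType: "LINE", Text: "Employee name" },
    { BlockType: "WORD", Text: "Employee" },
    { BlockType: "LINE", Text: "Start date" },
  ],
};

const detectedLines = extractLines(sampleInference);
console.log(detectedLines.length > 0 ? detectedLines.join("\n") : "No LINE blocks found");
```

If a check like this finds lines in the stored file while the UI still reports "No Raw Text detected", the problem is more likely in how the front end reads the response than in the Textract processing itself.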
Thank you for the additional detail. Investigating the issue. |
It appears that the issue may be in the JavaScript code responsible for retrieving the number of pages from the back-end response in document.js |
@nekogeko thank you for reaching out. You're right. While we work on releasing the fix for this, we will share it with you here so that you can add it to your code in the meantime. In

    if (validInferences.includes(InferenceName.TEXTRACT_DETECT_TEXT)) {
        unformattedtextractDetectResponse = await baseQuery(
            `${INFERENCES_PATH}/${arg.selectedCaseId}/${arg.selectedDocumentId}/${InferenceName.TEXTRACT_DETECT_TEXT}`
        );
    }

you'll need to make these changes:

Line 26:
    - unformattedtextractDetectResponse = await baseQuery(
    + const response = await baseQuery(

and after line 28 add:
    + unformattedtextractDetectResponse = response.data as any;

I've attached a screenshot of what this change should look like. |
thanks, I will test this and get back to you |
I confirm that the issue is resolved |
Describe the bug
When processing the sample documents packaged with the solution, files are uploaded successfully, but no raw data is visible after a file has been processed. Key/value pairs are visible.
To Reproduce
Deploy the application using one of the provided workflows (single-doc-textract.json, default.json), create a case, and upload the management.png document with a document type (generic or Passport, depending on the chosen workflow config). Manually start the job and wait for the case status to show that processing is complete. Then go to the playground view to see the extracted data.
Expected behavior
Since Textract is in the selected workflow, data is expected to be displayed in the Raw data section.