-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create hocr parser #1
Comments
Looking at https://gist.github.com/dcloud/9173113, fwiw |
Basics done in 9036cdc. |
Note that spans for words are sometimes ocrx_word and sometimes just ocr_word -- in other words, the x is sometimes missing. |
Ah, I wasn't sure about that. Reopening. Do you know the difference (what the x means)? |
I dunno. I'm not sure it's intentional or a bug. But since I ran into this On Mon, Feb 24, 2014 at 12:22 PM, Daniel Cloud [email protected]:
|
Yeah, so fwiw ocrx_word might not be a formal part of the spec -- this doc On Mon, Feb 24, 2014 at 12:22 PM, Daniel Cloud [email protected]:
|
Found existing one in Github, but it didn't work. See if we can quick write one of our own.
The text was updated successfully, but these errors were encountered: