-
Notifications
You must be signed in to change notification settings - Fork 241
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature request] Can you add webpages text highlighting to Read Aloud #358
Comments
We try to break the text into paragraphs rather than sentences, because the latency associated with cloud voices may cause delay between sentences to be too long. On the other hand, some native voices like the "Google US English" require us to break text into chunks that must be no longer than 15 seconds due to the voice engine's imposed limitation. In other words, the chunk size varies depending on the voice and we just have to accept that. So we need to highlight the text on the page that corresponds to the current chunk being read. The challenge indeed is that the boundaries of the chunk, i.e. its start/end indices, may not align precisely at the DOM element boundaries. The start/end index may fall in the middle of a span. This mean we have to break the span up into two spans just for the purpose of text highlighting. This is slightly tricky to do, as our algorithm has to be designed to work for light mode and darkmode, and work on a variety of websites on the internet that use varying markups and styling. For this reason I have not invested the time into this, though I've seen a few other extension have been able to do this fairly successfully. |
I'll try exploring the source code of those extensions you mention to see what makes them tick. maybe I can find some function they made have used to accomplish their feats. the errors I encounter with text highlighter in the past:
The other extensions: first glanceI find that from a first glance that these extensions you mention seems to have "solved" the problem of highlighting text. but with a closer examination it seems to me that they just sidestep the issue all together by reconstructing the site's main contents in their own style while keep most of the site's formatting "the same". Main the culprits founded doing this is "speechify" and "Natural Reader". Like it looks perfect at a glance but look a little closer and it feels off in some way. I find that it leads to some jarring formatting errors for text heavy sites that I visit frequently. like some text having slightly smaller or larger font-sizes then usually. easy to miss if you're not looking for it. though it does allow for some cool visual aid feature like a Dyslexia font changer and a clear reader mode too. Edge's: first glanceI think the best approach to this is to follow how edge does it highlighting of text. by only changing what is needed to be change and leave everything else alone; this does seem to cause little of column shifting occasionally. it seems to me that edge tense to keep it simple. by highlighting in light blue, the current paragraphs its reading and the individually words in yellow. the words themselves aren't completely in sync with what's being said but that's kind of okay since audio syncing is something that's easy to ignore if you're not looking for it. though edges approach is imperfect and still suffers from desyncing especially over large spans of text. though I do believe that much of the internet's webpages are pretty standardize in my opinion with much of the contents being very simple html documents with very little CSS standing in the way. if you're worry about CSS priority fighting then why not place the highlighter under a custom tag and set it to a high priority like edge. besides nobody expect perfection from a free open-source extension. Footnotes
|
If this ever gets implemented, please have an op-out option. I like how this extension allows me to interact with the text (highlight, copy) without it changing the reading position like with other extensions and the Edge built-in Read Aloud feature. I'm willing to trade that for the text highlighting capability. The best thing for my use would be to have 2 modes:
|
see also #179 |
the requested feature for Read Aloud that I wish for is the ability to have the read text be highlighted on the webpages as it is being read too me. instead of having a small box in the top left-hand corner. why not have the selected text be highlighted on the page itself instead , This would make it easier to follow along and focus on the content on the webpage itself.
Holy-molly I just relies how offensive this might be, but I mean no harm. I'm just formulating the possible issue and or milestone you guys might face. I'm not a great programmer just a beginner at this stuff and the first thing that comes to mind is thinking up obstacles that come be face and milestones that must be meant to continue.
- Possible obstruction that could be stopping you guys [ I guess but might be wrong]?
is the reason why it can't be done is that Read-Aloud load the whole webpage into the speech synthesizer
is an alternative reason why it can't be done is that you need a language tokenizers to tell the difference between each paragraph , sentence, or word on a webpage
Milestones [Phase 1]
Be able to Highlight A single sentence on the webpageMilestones [Phase 2]
Milestones [Phase 3]
Here the thing about this phase. it will require a UI over hall that I can't visualize yet until I got a firm grasp CSS know how.
sorry
Footnotes
I don't just mean Paragraph. I'm also Including
<div>
,<section>
,<body>
↩ ↩2A Meta-tag is just a html/xml tag (e.g. "
</p>
") ↩The text was updated successfully, but these errors were encountered: