Hello Applevis!
As a certain youtuber would say, I hope you're all having a lovely day :)
I am on sequoia rc, I literally have 0 idea whether or not this would work the same way on previous versions although I'd tend to think so.
So basically in college we were working and I was in team with someone who handed out the final copy written on paper to the teacher. For the sake of not losing any class related material, I asked him to send me the photo of what we'd done together.
So I received 2 jpeg. I just download them, thinking that I will do proper OCR on my windows laptop because it's just easier there. To organize it better I create one pdf with both files thanks to Finder actions. Then I open it in Preview, thinking I'll just do vo shift l, just to have an idea of which page is which.
I was then so shocked to discover that not only it'd done OCR, so each pages had text readable with Voiceover, but it had even kept some of the formatting along. So I had 2 pages, with readable text, 1 table per page, where rows and columns were 90% accurate, readable and sorted as they were written.
Now, I know that openbook from freedom scientific is a real scam, but at this extent?
Good luck of doing all that in windows, offline, with first party apps, in less than 30s.
Apple, you've done an incredible job on this one, thank you!
Comments
Mac OCR
That sounds fantastic. Any idea if it's just a feature of Preview or if it's system wide?
I only recently discovered VO+Shift+L which is occasionally handy for simple OCR but that's the only way I know how to get it to do anything similar.
I remember a few years ago, there was a new MacOs feature that allowed you to literally copy text out of an image using the mouse. I could see then and it was quite magical, although I only ever really used it on a screenshot from a web page telling me about the feature and I never figured out how to do something similar with VoiceOver.
Preview and Pdfs with Voiceover
Apple did a great job with preview in Mac OS Sequoia. Sequoia resolves the issue of reading Pdfs on Mac, now I can read any pdf on mac very easy. I didn't know the ocr feature, is it only availeable on english?
I experience the same
I agree, it's so easy with macOS Sequoia to read PDFs with VoiceOver. In July my Mac was broken for two and a half weeks, and already there working with my old Windows laptop was a real fight. So easy things became so difficult!
Not sure if it was like that…
Not sure if it was like that before but even in the print window we have ocr automatically performed on the pdf preview of what will be printed out. The printing experience on the mac is way better than windows by the way, no need to add a printer on the network it automatically appears in the printers list unlike windows.
Woah!
That's amazing! I didn't think the Apple image description feature was very useful based on my experience with Big Sur, but maybe it's significantly improved? How has PDF support improved? Does VoiceOver properly navigate links, headings, tables, lists, etc with the standard VO navigation commands and the rotor? I thought High Sierra made some significant improvements, but there were still bugs where it wouldn't jump to elements you thought it should, so I'm curious just how much of a big deal Sequoia really is.
No it's only inside pdf…
No it's only inside pdf viewer and not with vo shift l, re read what I wrote.
A pleasant surprise in MAC OS 15
Yesterday I happen to save a screen shot. I reviewed the screen shot in the preview app, I pressed VO plus L to understand about the image, but it did not give me enough information. I accidently interacted with the image and was in for a pleasant surprise!!!!! Voice over read the content of the image and I was able to move through line by line and read it.
I was unsure whether the voice over read the text in the screen shot or the content of the window from where I took the screen shot. I tried with another screen shot the same process, Yahoo !!!! it again worked perfectly. I think it is one of the unsung feature of voice over in MAC OS 15.
Yeah in quicktime too. When…
Yeah in quicktime too. When you screen record something you can brows through text displayed with standard voiceouer commands in quicktime. I'm really curious whether this would work for other regular video as well or not. I am not sure if it's voiceover or some ocr engine pushed and built-in systemwide but it works and works great.
Amazing
I just tried this in a medical document which I downloaded a few weeks ago from my local healthcare provider's website. They have done an excellent job in the first place at least where VoiceOver is concerned, but VO-Shift-L does indeed seem to work in .pdf documents now. You go Apple!
I never said vo-shift-l is…
I never said vo-shift-l is the trigger for ocr (which is not the case for me). In any case maybe cause of the update to RC I am not able to reproduce any of what I described in this thread. I think there is a way to force ocr from Preview but I can't remember it.
I think it's really…
I think it's really systemwide ocr, not related to voiceover at all.
Navigating pages
I get a quarterly newsletter from my local blindness organisation. Last time it was a PDF with a load of unlabelled images.
This time, I opened it up in Sequoia and it just gives me a series of pages like this: "Page 14 containing".
But If I VO+Shift+L it does OCR the images which I assume are in the pages. I think this is the behaviour I get in previous versions of MacOs.
I think you are talking about something better than this. Am I missing something? I can't seem to interact with the pages.
And seriously I can't believe that an organisation specifically for blind people can't even create an accessible PDF. It makes me really angry. The only bit they managed to make accessible is the bit telling me how to donate. I can't say I am particularly inclined to give them anything. What a joke.
Sad to hear. Indeed what I…
Sad to hear. Indeed what I did was really specific and as I wrote I'm not sure if it's only a sequoia thing, and I'm only able to reproduce this with by merging gpegs in pdf and the picture was taken on an iphone to so... Anyways.
I just open PDFs on my mac…
I just open PDFs on my mac in safari these days and then use hand off to view them on my iPhone.
More and more the Ipad/iPhone seems like the way to go for accessibility.
Inconsistent
I discovered a while back that interacting with an image would give me text in Preview, just like in Safari. And, I can scan pages directly from my flatbed scanner. However, as soon as I got excited about it, it stopped working consistently, long before I upgraded to Sequoia, including in Safari. Sometimes it works great. Not sure what the secret sauce is.
Regarding Windows, I was able to read an image-only PDF with narrator really quickly/well with NVDA+R. And, Jaws Convenient OCR feature is the best scanning option available on Windows today, since OneStep Reader (formerly KNFBReader) and Kurzweil 1000 are both abandonware. OpenBook, too, hadn't been updated in 10 years or more, last time I checked.
Side note: As much as I've bit**ed about Preview in years past, Adobe on Windows has gotten so frustrating that I actually go to My Mac to read PDFs now!
PDF have been cursed since…
PDF have been cursed since forever. Openbook is genuinely a scam from FS at this point, I"d rather pay for macos 14.0 anytime, at least the thing actually works. OpenBook is a pile of bugs that will never be patched according to a wordpress blog post from 2021 (correlated unofficially by FS themselves, the the lack of future updates I mean) and you have to pay $1000 to get it. It has a poor internal screen reader, whenever you press save or print the sr won't read, jaws won't either and your best luck is still nvda object navigation mode. They should honestly be sued, you can't just sell an unofficial abandonware at such a high price while being the facto one of the most accessibility influence worldwide. They even dared update the code enough for win10+ compatibility, but nothing else. An OCR (engine unmaintained) software (unmaintained) still costing $1000 in 2025 just for blind? I let you use your own judgement. I understand that maybe the old age demographic or people less comfortable with modern tech might benefit, but guys please just make it free as you don't even bother maintaining it!
Anyways.
Preview biggest problem now is just lists and nested lists, and text attribute announcements with vo-t, otherwise it's indeed incredible now.
Now just mae accessible MS Office in MacOS and I will done with
nice experience... now to leave windows for ever, just need that MS Office to become as smooth in formatting as in windows and I will come back again to macOS.
Greetings