Converting JPEG or Image Files to Accessible PDFs Introducing Optical Character Recognition (OCR)
Why use OCR? If you have a document that is comprised of scanned pages from either a book or a
journal article that you’ve used to create a PDF, you may need to remediate the resulting file to make it
edtiable, searchable, and / or readable to individuals who will acess your file in a variety of ways. While
the previously mentioned document features improve accessibility and navigabilty for all users, they are
particlularly helpful to those who are visually impaired and are therefore using a screen reading device
to assist them in accessing the content of your file. This task of converting your scanned image-file to a
readable text can be completed in Adobe Acrobat using Optical Character Recognition (OCR) tools sets.
1. To begin the process of creating an accessible PDF, open the file in Adobe Acrobat. Any recent
version of Acrobat will work for this process.
2. To determine whether or not your PDF is an image file, do the following: from the View tab’s
drop down menu, select Read Out Loud > Activate Read Out Loud.
3. Now return to the View tab once again, and from the drop-down menu, select Read This Page
Only. If the Read Aloud feature announces the following statement: Warning, empty page,” then
the file is an image file and not a text file.
4. To begin the process of converting the document, open the toolbox panel to the right of your
main document pane using the fly out panel button.
5. With this side panel, you are provided a list of tools. Select Action Wizard from the list of tools
there. If you do not see that particluar tool in the panel, type Action Wizard in the “Find your
tools here,” field at the top of the tools panel. Select the Action Wizard tool.
6. A pop-up window will appear next, click on Make Accessible located at the top of the list of the
pop up. Click the blue Start button.
7. A Description pop-up window will appear next. Untick the Leave As Is button below the first title
field, and give the document an internal title name. Using the term OCR in your title, may help
to organize and find files you have made accessible using OCR tools. Select OK.
8. Next, another pop-up window appears labeled Recognize Text - General Settings. From the
Document Language drop down menu, select whatever language that the document is written
in. For Output, select Searchable Image and set Downsample To 600 dpi (use the highest dpi
provided to you in the list of options there). Select OK.
9. Next, a pop-up window appears inquiring whether or not there are fillable forms in the file, (for
purposes of this tutorial, my file has no forms). Select No, Skip this Step.
10. For the Read Aloud function, you’re asked again to select a reading language. Select the
language you want the document to be read to users in. Select OK.
11. If there’s an image on your page, a pop-up will appear asking you to add alternative text. Enter a
description of the image omitting any introducotry verbiage such as, “This is an image of,” since
the screen reader will say that automatically without human prompting. So you may write
something like, “woman posed with flower arrangement.” or whatever is the subject of your
image. If the image highlighted is not an image but a decorative element or something that
otherwise doesn’t need to be read to users, tick the Decorative figure box, and leave the text
window empty. Select Save & Close.
12. Next an Accessibility Checker Options window pop-up appears. Untick the box that says Create
accesility report unless you track those types of documents, for ADA compliance reasons.
Leave all of the rest of the defaulted options in the pop-up selected, and select Start Checking.
13. At this point, a panel to the left of your document panel will open (if you don’t see it opened,
click the triangular open panel button). The panel shows you an Accessility Checker report. It is
likely to report that the document has issues. Twirl open the corresponding menu and right-click
on each of the issues with blue question marks next to them. Right click on the issue and select
Pass for each item noted. Close the panel. Save your document.
14. Now from the View tab, select Read Out Loud > Activate Read Out Loud. Repeat and select Read
This Page Only, and listen to the reading. If the reading function starts to read your page, it is
now accessible to a screen reader, and your project of converting an image file to a text file is
concluded. Be sure to save your file (note that including OCR in the file title will be helpful for
relocating the file at a later point in time).