Product FAQ - OCR Free

Introduction

Q 01: What is OCR?

Q 02: How accurate is the OCR text in the articles?

Q 03: What to do when you notice a mistake in the OCR text?

Q 04: Can I scan a paper copy and convert it into a Word file?

Q 05: Is the converted document an editable version?

Q 06: Does it preserve formatting and layout of documents?

Q 07: Can I OCR screen shots?

Q 08: How long will it take to recognize my image?

Q 09: What is the best scanning resolution for OCR?

Q 10: Will it handle a page that has been rotated when scanned?


Q 01: What is OCR?

A: Optical character recognition (OCR) is the process by which software converts the printed text on an original newspaper page into electronic, searchable text. OCR text is produced automatically from the scanned page and has not been manually reviewed or corrected. OCR makes it possible to search large quantities of full text, but it is not 100% accurate. Accuracy depends on a variety of factors, for example the condition of the original newspaper or microfilm, the quality of the paper, the size and style of the font, and the column layout.


Q 02: How accurate is the OCR text in the articles?

A: We have scanned the original newspaper pages using high-quality scanners and an optical character recognition (OCR) process that converts the printed text to electronic text. Both processes are set up to produce the most accurate results possible; however, it is inevitable that some errors slip through. The quality of the original newspaper affects the outcome and accuracy of the OCR process. A range of factors can reduce accuracy, including:

Highly complex layout

Radical differences in layout over time

Variable font sizes and character types (especially Gothic)

Narrow space between lines

Narrow gutter between columns

Missing or misprinted text

Poor quality or deteriorated inks

Poor quality or deteriorated papers

Irregular alignment of characters in hand-set press

Annotations by hand

Graphic devices and/or elements
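
To make "accuracy" concrete, here is a minimal sketch of one common way to score OCR output against a hand-corrected reference: character accuracy based on edit distance. This is an illustration only, not necessarily how accuracy is measured for this product. The function names and the sample strings are assumptions made up for the example.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum inserts, deletes, and substitutions to turn a into b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def character_accuracy(reference, ocr_text):
    """Share of the reference text recovered: 1 - distance / len(reference), floored at 0."""
    if not reference:
        return 1.0 if not ocr_text else 0.0
    return max(0.0, 1.0 - edit_distance(reference, ocr_text) / len(reference))


# Hypothetical example: a corrected line and its raw OCR reading.
reference = "Narrow gutter between columns"
ocr_text = "Narrow gutter betwcen colunms"
print(round(character_accuracy(reference, ocr_text), 3))
```

A score of 1.0 means the OCR text matches the reference exactly. Each wrong, missing, or extra character lowers the score.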


Q 03: What to do when you notice a mistake in the OCR text?

A: When you have selected an article on a page in the viewer, you can view its original OCR text. Because the OCR text is produced automatically, it often contains errors. You can correct these errors line by line. Microsoft Word is a convenient tool for finding and fixing them.
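
If you review many lines, a simple script can help point out likely errors before you correct them by hand. The sketch below is a rough heuristic and not a feature of the program: it flags words that mix letters and digits, which is a typical sign of a misread character (for example "0" in place of "o"). The function name and the sample sentence are assumptions for illustration.

```python
import re


def suspicious_tokens(line):
    """Return words that contain both letters and digits, a common sign of OCR misreads."""
    tokens = re.findall(r"[A-Za-z0-9]+", line)
    return [
        t for t in tokens
        if re.search(r"[A-Za-z]", t) and re.search(r"[0-9]", t)
    ]


# Hypothetical OCR line with two digit-for-letter errors.
print(suspicious_tokens("The c0lumn on page 12 was rnisread in 1ine 3"))
```

The heuristic will miss letter-for-letter errors such as "rn" read in place of "m", and it will also flag legitimate words such as "3rd", so treat its output as a list of places to check rather than a list of confirmed mistakes.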


Q 04: Can I scan a paper copy and convert it into a Word file?

A: Yes. The program can activate the scanner connected to your PC and scan a paper copy directly into the program. You can then save the result as a text file or a Word document.


Q 05: Is the converted document an editable version?

A: Yes. The converted document can be edited in its native format, but only if the program that opens that format is installed on the computer.


Q 06: Does it preserve formatting and layout of documents?

A: No. The result is always plain text.


Q 07: Can I OCR screen shots?

A: Our OCR engine requires images at a resolution of 150 dpi or greater, so it is not optimally suited to reading PC screenshots, which are only 72 dpi. An additional problem is that the fonts used in web design are usually sans serif, which are more difficult to recognize than serif fonts. However, because we process each image to improve accuracy, it may still be possible to OCR screenshots. Just give it a try. In most cases the result is faster than typing everything by hand.


Q 08: How long will it take to recognize my image?

A: Recognition time depends on the image quality. On average, recognizing one file takes several seconds.


Q 09: What is the best scanning resolution for OCR?

A: Most OCR engines are optimized for 300 dpi images. Scanning at true 300 dpi optical resolution is very important. Scanning at a lower resolution and then using scanner software to increase the dpi afterwards does not improve OCR results, because no new detail is captured. Where the characters on a page are very small (4 points or less), scanning at 400 dpi can improve character recognition. This again requires a scanner that supports true 400 dpi optical resolution.
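
The effect of resolution on small type can be worked out with simple arithmetic. A typographic point is 1/72 of an inch, so a font's nominal height in pixels is roughly the point size multiplied by the dpi and divided by 72. The sketch below prints that figure for 4-point type at several resolutions. The function name is an assumption for illustration, and the result is the nominal font height, not the exact height of any single letter.

```python
POINTS_PER_INCH = 72  # one typographic point is 1/72 inch


def nominal_height_px(point_size, dpi):
    """Approximate pixel height of a font of the given point size scanned at the given dpi."""
    return point_size * dpi / POINTS_PER_INCH


# Compare how many pixels 4-point type gets at different scan resolutions.
for dpi in (72, 150, 300, 400):
    print(f"{dpi} dpi: {nominal_height_px(4, dpi):.1f} px")
```

The fewer pixels each character gets, the less detail the OCR engine has to distinguish similar shapes, which is why very small type benefits from the step up to 400 dpi.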


Q 10: Will it handle a page that has been rotated when scanned?

A: Yes. Click the rotate buttons to turn the page to the correct orientation before running recognition.
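
For readers curious about what a 90-degree rotation does to an image, here is a minimal sketch that treats the image as a grid of pixel values. It only illustrates the idea and is not the program's own code. The function name and the tiny sample grid are assumptions.

```python
def rotate_clockwise(grid):
    """Rotate a rectangular grid of pixels 90 degrees clockwise."""
    # Reverse the row order, then read the columns off as the new rows.
    return [list(row) for row in zip(*grid[::-1])]


# Hypothetical 2 x 3 "image": two rows of three pixel values.
page = [
    [1, 2, 3],
    [4, 5, 6],
]
for row in rotate_clockwise(page):
    print(row)
```

Applying the rotation four times returns the original grid, which matches the behavior you would expect from clicking a rotate button four times.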

