General
"Optical character recognition, abbreviation is OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website. OCR makes it possible to edit the text, search for a word or phrase, store it more compactly, display or print a copy free of scanning artifacts, and apply techniques such as machine translation, text-to-speech and text mining to it. OCR is a field of research in pattern recognition, artificial intelligence, and computer vision."
Quote taken from www.wikipedia.org, definition of term "OCR"
License conditions: http://creativecommons.org/licenses/by-sa/3.0/
The initial point is an image file (raster graphic) that will be generated by Print2CAD from the file being converted.
This image file has the name "file name_KazOcr.tif". It is automatically generated in the directory of the drawing being converted.
When using the OCR method with text separated in lines, a clear line weight is needed.
This OCR line weight can be specified in the OCR text recognition interface.
It is usually set to 0.4mm. If the texts are displayed blurry in the OCR viewer, set a lower value.

Another helpful tool of the OCR text recognition, is the separation of the texts.
A lot of PDFs feature texts as hatch. If this is the case, the separation can help to improve the text recognition by automatically discarding of distracting lines, etc.


