GNU Ocrad is an OCR (Optical Character Recognition) program and library based on a feature extraction method. It reads images in pbm (bitmap), pgm (greyscale), or ppm (color) formats and produces text in byte (8-bit) or UTF-8 formats. It also includes a layout analyzer that is able to separate the columns or blocks of text normally found on printed pages. Ocrad can be used as a stand-alone console application, or as a backend to other programs.
| Field | Value |
|---|---|
| Tags | OCR |
| Licenses | GPLv3 |
| Operating Systems | POSIX |
| Implementation | C++ |
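
Since the description lists pbm among the accepted input formats, a small sketch of producing such a file may help when feeding Ocrad from another program. The function below serializes a 1-bit image as plain-text PBM ("P1"). The name `to_plain_pbm` and the row-of-bools image representation are assumptions made for this illustration; they are not part of Ocrad or its library.

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper (not part of Ocrad): serialize a 1-bit image as
// plain-text PBM ("P1"). In PBM, 1 means black (ink) and 0 means white.
std::string to_plain_pbm(const std::vector<std::vector<bool>>& rows)
{
    if (rows.empty() || rows.front().empty())
        throw std::invalid_argument("image must have at least one pixel");
    const std::size_t width = rows.front().size();

    // Header: magic number, then width and height in pixels.
    std::string out = "P1\n" + std::to_string(width) + " " +
                      std::to_string(rows.size()) + "\n";

    for (const auto& row : rows) {
        if (row.size() != width)
            throw std::invalid_argument("all rows must have the same width");
        std::size_t col = 0;
        for (bool ink : row) {
            out += ink ? '1' : '0';
            // Plain PBM lines should stay within 70 characters, so wrap
            // long rows; whitespace between bits is not significant.
            if (++col % 70 == 0 && col != width) out += '\n';
        }
        out += '\n';
    }
    return out;
}
```

Writing the returned string to a file (say `page.pbm`, a name chosen here only as an example) should give an image that can be passed to the stand-alone program as a file argument, for instance `ocrad page.pbm`.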


Release Notes: Two new filters have been added: "upper_num" and "upper_num_only". The description of "OCRAD_result_blocks" in the manual has been fixed.


Release Notes: Character recognition has been improved (L vs Z). The filters "letters_only" and "numbers_only" now remove leading whitespace. "ocrad.texinfo" has been renamed to "ocrad.texi".


Release Notes: Character recognition has been improved (L vs Z).


Release Notes: Scaling and smoothing are now performed before thresholding. Character recognition has been improved (D-O, H-N, O-Q, V-Y, merged TT). The new library function "OCRAD_set_utf8_format" has been added. Small improvements have been made to the manual and the man page. Quote characters in messages have been changed as advised by the GNU Coding Standards.


Release Notes: Character recognition has been improved (D vs O).