[Athen] Announcing the OCRopus Open Source OCR

Berkowitz, Daniel J djbrky at bu.edu
Wed Apr 11 06:43:12 PDT 2007


From the good people of /. [Slashdot <http://slashdot.org/>]

Google has just announced work on OCRopus <http://code.google.com/p/ocropus/>,
which it says it hopes will "advance the state of the art in optical
character recognition and related technologies". "The goal of the project
is to advance the state of the art in optical character recognition and
related technologies, and to deliver a high quality OCR system suitable
for document conversions, electronic libraries, vision impaired users,
historical document analysis, and general desktop use."

Google is "hoping for contributions by the open source community in
areas such as adapting the system to additional languages, creating a
Gnome desktop application, integration with Gnome desktop search,
web-based tools for proofing and training, language modeling, additional
character recognition engines, and other useful tools and add-ons."

Late last year Google announced their intention to release (re-release?)
Tesseract OCR, an obscure open-source program apparently familiar to only the
most ardent Linux developer.

Posted By D. Berkowitz to Access Technologists Higher Education Network
at 4/11/2007 09:13:00 AM
