One of the oldest and best OCR engines was the Tesseract OCR engine. It was the most accurate and was used for Accuracy tests. The source code reads binary grey and colour images as well as text. A reader is inbuilt so as to read TIFF images that are uncompressed and libtiff can be added if you need to read compressed images.
Another application specially made for the Android development platform is the Mezzofanti and it allows augmentation of various forms of text and in various viewing options. All the augmentation of the text can be used in plenty of ways:
You can translate it in any of the forty languages available
You can search it on Wiki as well as Google
You can even look it up in a dictionary.
The name is Mezzofanti in honour of the Italian Giuseppe Caspar Mezzofanti who was an excellent linguist as well as a hyperpolygot. He didn't speak just a few languages but a whole bunch of them - thirty eight to be precise and a total of forty dialects. What better name could be found for this application which can translate into forty languages as well?
The Technology behind this:
The OCR engine is nothing else but a whole new version of Tesseract 2.03 which has been developed by Google for Android development as it holds the licence under Apache. It is currently available in 5 different languages namely - English, German, Spanish, Italian and French. There are no worries as the OCR works with all other languages as well, but the only condition is that they should be written with the Latin script.
The core of the OCR is programmed in Java
The translation engine gives you almost perfect translations even along with grammar as it works with Google translate
The interface amongst the OCR engine which works on C++ and Java is done within JNI
Licences:
All the code is available to android developers the world over for free under the Apache Licence version 2.0 and is posted at Google code whether they may be considered with offshore android development or outsource android development.
Another application specially made for the Android development platform is the Mezzofanti and it allows augmentation of various forms of text and in various viewing options. All the augmentation of the text can be used in plenty of ways:
You can translate it in any of the forty languages available
You can search it on Wiki as well as Google
You can even look it up in a dictionary.
The name is Mezzofanti in honour of the Italian Giuseppe Caspar Mezzofanti who was an excellent linguist as well as a hyperpolygot. He didn't speak just a few languages but a whole bunch of them - thirty eight to be precise and a total of forty dialects. What better name could be found for this application which can translate into forty languages as well?
The Technology behind this:
The OCR engine is nothing else but a whole new version of Tesseract 2.03 which has been developed by Google for Android development as it holds the licence under Apache. It is currently available in 5 different languages namely - English, German, Spanish, Italian and French. There are no worries as the OCR works with all other languages as well, but the only condition is that they should be written with the Latin script.
The core of the OCR is programmed in Java
The translation engine gives you almost perfect translations even along with grammar as it works with Google translate
The interface amongst the OCR engine which works on C++ and Java is done within JNI
Licences:
All the code is available to android developers the world over for free under the Apache Licence version 2.0 and is posted at Google code whether they may be considered with offshore android development or outsource android development.

MD @ Mobi People INC. Working For Clients for Various types of mobile application / software development. Working from last 10 years in web based software & Moile based application development industry.
0 comments:
Post a Comment