[Chargeable] RSTessOCR - Optical Character Recognition Offline

XverhelstX

Well-Known Member
Licensed User
Longtime User
RSTessOCR - Optical Character Recognition


Information:
RSTessOCR is a port/wrapper for the tesseract-ocr library found here: https://code.google.com/p/tesseract-ocr/.

Requirements:
-
Basic4Android Full Version
- Android 2.2+
- RSTessOcr

What will you receive:
- Project
- Library files
- Documentation
- Readme file

Payment:
Payment can be made with Paypal at the following address:
xtremelyvirtualstudio@gmail.com
The price of the library is 15 Euros.
After transaction has been made, i'll send the library files to the buyer within 24 hours.


Documentation:



Sample:
You can find an example apk file here:
https://www.dropbox.com/s/ab9c8xbn3ds79a1/RSTessOcr.apk
Note the size of the library is big because of the added language file in the Assets.
When deploying your app with this library, you should download the language file(s) from an external webservice to reduce the apk size (with 20 mb)

Regards,
Tomas
 

bluedude

Well-Known Member
Licensed User
Longtime User
Tried it but not sure if it works. I get a lot of weird characters in a document. Which language does the demo apk support?
 

XverhelstX

Well-Known Member
Licensed User
Longtime User
Tried it but not sure if it works. I get a lot of weird characters in a document. Which language does the demo apk support?

Hi

The 'problem' is related in the Tesseract library itself being not able to recognize the text because the image might not be clear.
The language is in english.

The best thing to do is do some image preprocessing on the image:
http://stackoverflow.com/questions/9480013/image-processing-to-improve-tesseract-ocr-accuracy

- Binarize and de-noise the image
- Greyscale and sharpen the image
- Set whitelist and blacklist to remove unwanted charachters.
- Focus on the text before taking a picture.

You can also set the mode you wish to use:
(I got best result with OEM_CUBE_ONLY, but takes a lot longer)

  • OEM_CUBE_ONLY As Int
    Run Cube only - better accuracy, but slower
  • OEM_DEFAULT As Int
    Default OCR engine mode.
  • OEM_TESSERACT_CUBE_COMBINED As Int
    Run both and combine results - best accuracy
  • OEM_TESSERACT_ONLY As Int
    Run Tesseract only - fastest

I'll try to get an image enhancing solution

Regards,
Tomas
 

WebbyBoy

Member
Licensed User
Longtime User
Hi,

I get an error when using the demo. When trying to scan, I get an error advising the picture must be in ARGB_8888 format.
 

laviniut

Active Member
Licensed User
Longtime User
I tried a simple application with tesseract from here. It don't worked too well and i realized that it need a lot of time to get the result, so i prefer the online solution, i mean the result is get from a remote server based on tesseract which work faster.
 

laviniut

Active Member
Licensed User
Longtime User
Last edited:

TheMightySwe

Active Member
Licensed User
Longtime User
Do anyone know if there is a simpler library for OCR that only needs to read a line or a word of text?

Like on a licence plate or similar?
 

dibb3386

Member
Licensed User
Longtime User
Is this lib dead? If not does it come with support? Not tried the demo apk yet, phone just died, but questions still remain. I need an offline solution for image to text. Ive seen a few people having problems with it mainly lighting for the image taken, what about plain printed text nothing special? or is this the same issue everyone is having with lighting problems?
 

WebbyBoy

Member
Licensed User
Longtime User
My problem was converting the image to ARGB 8888, but I've not tried this lib in a while. Android phoyo dev has come along quite a bit.
 

dibb3386

Member
Licensed User
Longtime User
Cheers for your reply WebbyBoy. Ive not heard back from dev so project is on hold till i find another solution. Will check out phoyo. Thanks
 

jesuslizonsoluciones

Member
Licensed User
Longtime User
Good morning,
I’ve been trying to purchase the following but I get a message from Paypal that you can’t receive payments at this moment. Is there any other way to purchase the product?
 

lemonisdead

Well-Known Member
Licensed User
Longtime User
Thomas seems to have left
 

llorenç Galofré

Member
Licensed User
Longtime User
Good morning:
I'm trying to adapt an OFFLINE OCR application and I'm interested in this RSTESSOCR, but I sent an email to the direcciñon on the first post, without any response. How do I do to get this library and example?, thank you.
 
Cookies are required to use this site. You must accept them to continue using the site. Learn more…