OCR online free

laviniut · Jul 11, 2013

Erel or somebody else, how can i use
http://weocr.ocrgrid.org/docs/WeOCR-API.html
http://weocr.ocrgrid.org/non-interactive.html
to make an ocr application ?
(there is a server used by weocr: http://www.ocr-extract.com, i want to use because it knows romanian)
please help me !

Erel · Jul 12, 2013

Does this server allow you to upload files with an API?

laviniut · Jul 12, 2013

Yes. server use weocr api from the link.
you can see here http://weocr.ocrgrid.org/non-interactive.html
in the api you must replace server name and ocr-extract is a weocr server.

Erel · Jul 14, 2013

This code will help you with building the file uploading request: Android Http Multipart requests

laviniut · Jul 15, 2013

Thank you.

laviniut · Aug 5, 2013

I tried to build a post request, but i failed. please help me!
my code is attached.
I think is needed more simple syntax request.
api from ocr server above say:

1. Introduction to WeOCR API

WeOCR API has been designed so that it can provide various
applications with a very simple means for accessing online OCR
services.

WeOCR API uses HTTP GET and POST methods.
No SOAP nor special protocol is used.

2. Non-interactive use of WeOCR server

WeOCR servers can be used non-interactively as well as via an
HTML page. To do so, you just call the CGI program directly
using a web tool such as cURL.

Example:
$ curl -F userfile=@Image_File_Name \
-F outputencoding="utf-8" \
-F outputformat="txt" \
http://Server_Address/cgi-bin/weocr/submit.cgi >result.txt

You probably need to use --max-time and --connect-timeout
options as well to avoid undesirable blockings of the process.

If you specify outputformat="txt", the first line of the
output data is used as the status line. The first line will be
blank upon successful recognition.

3. Locating the server

To find a WeOCR server suitable for your application, visit
http://weocr.ocrgrid.org/

(There may be some other WeOCR search engine sites.)

Every WeOCR web site has a server spec file "srvspec.xml"
in the site top directory. The CGI program can be located by
looking at the following entry in the spec file.

<ocrserver specversion="1.x">
<svinfo>
<cgi> ... </cgi>

4. Parameters

WeOCR servers accept the following parameters.
The default value is used if the parameter is not specified.

outputencoding: Output Encoding {utf-8, ...} (default: utf-8)
specifies the encoding of the output data.
For example, "utf-8" and "iso-8859-15" specify UTF-8 and
Latin9 (ISO-8859-15), respectively.
At least UTF-8 must be supported by the server.

contentlang: Contents Language {eng, deu, ...} (default: auto)
specifies the language used in the document image in case
the server supports multiple languages. The special value
"auto" allows the server to assume the language supported
by it or to detect automatically the language(s).
The language code is based on ISO 639-3, although some
derivatives may be accepted by some servers.

outputformat: Output Format {html,txt} (default: txt)
specifies the format of the output data.

If the output format is "txt", the first line of the text
data is used as the status line. A blank line represents
"no error".

and ocr server specification say:
<svinfo>

<revision>130602.01</revision>

<svengine name="OCRExtract" version="0.10"/>
<title>OCR server with Tesseract 3.02.02</title>
<url>http://www.ocr-extract.com/</url>
<cgi>http://www.ocr-extract.com/api/rest/extract</cgi>
<specxml>http://www.ocr-extract.com/srvspec.xml</specxml>
<organization>cluster:Systems CSG GmbH</organization>
<department>SaaS and Appliances</department>
<address>Gletscherstr.13, 16341 Panketal</address>
<country>GERMANY</country>
<contact>webmaster[a t]clustersystems.de</contact>


<svtype>single</svtype>


<svlevel>regular</svlevel>
</svinfo>

so, what is the URL i need to type in postrequest ?
and how can i get it to work ?

Frank · Oct 24, 2013

Hi, did you get any further with this? I would be interested in an OCR solution as well.
Thanks, Frank

Erel · Oct 25, 2013

Have you seen this example: [Example] Add OCR features to your Android application ?

mjtaryan · Nov 8, 2013

Erel, how fast is the turn around from the service? I'm wanting to create a "live" handheld ocr to speech app for visually impaired persons. Therefore, I need my app to be able to photograph the "page", send the image, get the results and convert it to speech quickly and in real time. It would seem having our own library (or a free and thoroughly tested/debugged wrapper for Tesseract) might speed up the process -- among many other reasons; hint, hint

Erel · Nov 8, 2013

You will need to test whether the speed meets your requirements.

laviniut · Nov 8, 2013

I am also working at my PhD for visually impaired persons and i used Erel'l example http://www.b4x.com/android/forum/threads/example-add-ocr-features-to-your-android-application.27080/ with success. Time is not exactly realtime but i think it can be satisfying and is more acurate and more quicker than wrapped library of Tesseract discussed here.

OCR online free

laviniut

Active Member

Erel

B4X founder

laviniut

Active Member

Erel

B4X founder

laviniut

Active Member

laviniut

Active Member

Attachments

Frank

Member

Erel

B4X founder

mjtaryan

Active Member

Erel

B4X founder

laviniut

Active Member