B4A Library TextRecognitionAndroid - OCR

Johan Schoeman · Aug 8, 2020

A wrap for this Github project.

1. Jar and XML attached - copy them to your additional libs folder
2. B4A sample project attached
3. resource.zip - extract the folder and copy the folder and its contents to be on the same folder level as that of the /Files folder of the B4A project

Click on the active preview to bring back the Blocks, Lines, and Words (events will be raised is the B4A project)
Watch the B4A Log when you click the active preview with text inside the active preview.

Sure you will figure it out.

Note that it does not show the graphic overlay!
Take note of the B4A manifest file when starting a new project.

".....Google Play Services must be active on a device......"

Edit: Update in post #14 of this thread

B4X:

#Region  Project Attributes
    #ApplicationLabel: AndroidTextRecognition
    #VersionCode: 1
    #VersionName:
    'SupportedOrientations possible values: unspecified, landscape or portrait.
    #SupportedOrientations: unspecified
    #CanInstallToExternalStorage: False
#End Region

#AdditionalRes: ..\resource

#Region  Activity Attributes
    #FullScreen: False
    #IncludeTitle: True
#End Region

Sub Process_Globals
    'These global variables will be declared once when the application starts.
    'These variables can be accessed from all modules.
    Private xui As XUI
    Dim rp As RuntimePermissions

End Sub

Sub Globals
    'These global variables will be redeclared each time the activity is created.
    Private atr1 As AnroidTextRecognition
End Sub

Sub Activity_Create(FirstTime As Boolean)

    Activity.LoadLayout("Layout")

End Sub

Sub Activity_Resume



End Sub

Sub Activity_Pause (UserClosed As Boolean)



End Sub

Sub Button1_Click

    Dim Result As Boolean = True                                                                 
    If Not(rp.Check(rp.PERMISSION_CAMERA)) Then                                                 
        rp.CheckAndRequest(rp.PERMISSION_CAMERA)                                                 
        Wait For Activity_PermissionResult (Permission As String, Result As Boolean)             
    End If                                                                                       
    If Result Then                                                                               
        atr1.startScanning
    End If
End Sub

Sub atr1_text_scanned_blocks(Blocks As Object)

    Dim scannedBlocks As List = Blocks
    For i = 0 To scannedBlocks.Size - 1
        Log("scannedBlocks(" & i & ") = " & scannedBlocks.Get(i))
    Next
    Log(" ")

End Sub

Sub atr1_text_scanned_lines(Lines As Object)
    Dim scannedLines As List = Lines
    For i = 0 To scannedLines.Size - 1
        Log("scannedLines(" & i & ") = " & scannedLines.Get(i))
    Next
    Log(" ")

End Sub

Sub atr1_text_scanned_words(Words As Object)
    Dim scannedWords As List = Words
    For i = 0 To scannedWords.Size - 1
        Log("scannedWords(" & i & ") = " & scannedWords.Get(i))
    Next
    Log(" ")

End Sub

simone77gl · Apr 11, 2021

problem compilation:

Generazione file R. Error
c:\android\tools\..\extras\b4a_remote\androidx\savedstate\savedstate\1.1.0\unpacked-savedstate-1.1.0\res\values\values.xml:3: error: Found tag id where item is expected
c:\android\tools\..\extras\b4a_remote\androidx\lifecycle\lifecycle-viewmodel\2.3.1\unpacked-lifecycle-viewmodel-2.3.1\res\values\values.xml:3: error: Found tag id where item is expected
c:\android\tools\..\extras\b4a_remote\androidx\lifecycle\lifecycle-runtime\2.3.1\unpacked-lifecycle-runtime-2.3.1\res\values\values.xml:3: error: Found tag id where item is expected

victormedranop · May 17, 2021

Johan Schoeman Is there a way to use the front cam instead ??

Thanks.

Victor

ilan · Oct 21, 2021

Thank you very much @Johan Schoeman.
this is the first OCR lib that really worked for me (after so many tries)

i have tried the example from post#14

One question please. can we scan text from bitmap instead of the live camera?

thank you

Johan Schoeman · Oct 21, 2021

ilan said:
Thank you very much @Johan Schoeman.
this is the first OCR lib that really worked for me (after so many tries)

i have tried the example from post#14

One question please. can we scan text from bitmap instead of the live camera?

thank you

OCR - Extracting text from a bitmap using the Play Services Vision API

This request comes from here: https://www.b4x.com/android/forum/threads/ocr-offline-on-screen.84867/#post-538774 The attached project extracts text from a bitmap that you can pass to the library making use of the Android Vision API. Sample code: #Region Project Attributes...

www.b4x.com

ilan · Oct 21, 2021

Johan Schoeman said:
OCR - Extracting text from a bitmap using the Play Services Vision API

This request comes from here: https://www.b4x.com/android/forum/threads/ocr-offline-on-screen.84867/#post-538774 The attached project extracts text from a bitmap that you can pass to the library making use of the Android Vision API. Sample code: #Region Project Attributes...

www.b4x.com

thank you so much @Johan Schoeman this is exactly what i was looking for!! ???

adriano.freitas · Nov 18, 2021

Hi!

Very good job! Congratulations!

I just have one question: I would like to use it as a function for a specific part of my application. A panel that is displayed or hidden as needed. For the sake of use of device resources, I think that the ideal would be to turn on the camera when displaying the panel to do the OCR and after reading the text, turn off the camera when hiding the panel, preventing the system from having resources used unnecessarily.

How can I do this?

adriano.freitas · Nov 18, 2021

Another thing I noticed when inserting the library into my application is that it takes two "startscans" for the preview to be functional. Why?

Thanks

Johan Schoeman · Nov 19, 2021

adriano.freitas said:
Another thing I noticed when inserting the library into my application is that it takes two "startscans" for the preview to be functional. Why?

Thanks

Does the original sample code behave this way or just when you use it in your project?

adriano.freitas · Nov 19, 2021

Hi!

In this case, it happened in my application, because in it, unlike the example, I shouldn't leave the camera on all the time. Thus, when reading a text, I need the camera to be turned off (I use stopscan, followed by visible false) and the user can turn it on again on a button (I use start scan and visible=true). So, after disabling the camera once, I need to double-tap the enable button so that it actually becomes active again.

I must emphasize that I capture the text via a button as well and not by tapping the image (according to an update posted at the request of someone else).

Johan Schoeman · Nov 20, 2021

adriano.freitas said:
Hi!

In this case, it happened in my application, because in it, unlike the example, I shouldn't leave the camera on all the time. Thus, when reading a text, I need the camera to be turned off (I use stopscan, followed by visible false) and the user can turn it on again on a button (I use start scan and visible=true). So, after disabling the camera once, I need to double-tap the enable button so that it actually becomes active again.

I must emphasize that I capture the text via a button as well and not by tapping the image (according to an update posted at the request of someone else).

Probably best that you upload a sample project that illustrates what it is that you are seeing.

adriano.freitas · Nov 20, 2021

Johan Schoeman said:
Probably best that you upload a sample project that illustrates what it is that you are seeing.

Hi!

The whole project I think will be more confusing because it's a big application, with a lot of code and it wouldn't be relevant, so I'm going to copy here the parts of the code that refer to OCR, ok? My natural language is Portuguese, so some variable names and the like may be in this language.

Basically it's the code below. Just what I talked about before, it already happens:

B4X:

Private Sub OCRCamera_Click
	'
	'  Sub to start OCR by activating the camera
	'
       OCRTextoEscaneado.Text=""	
	Try
		OCRPreview.startScanning		
	Catch
		Log(LastException)
	End Try
        '
	OCRPreview.Visible=True
End Sub


Private Sub OCRCameraDesligar_Click
	'
	' Finish OCR. Turn off camera.
	'
	Try
	        OCRPreview.stopScanning		
	Catch
		Log(LastException)
	End Try
	OCRPreview.Visible=False
	OCRPainel.Visible=False
	PainelTela.Visible=False
End Sub


Private Sub OCRLer_Click
	'
	' To take a text from camera
	'
	Private Blocos   As List
	Private Texto    As String
	Private Altura   As Int
	Private Contador As Int
	'
	' Start Scan
	'
	Try
		OCRPreview.startScanning
	Catch
		Log(LastException)
	End Try
	Sleep(0)
	'
	' Blocos
	'
	Blocos.Initialize
	Blocos = OCRPreview.TextBlocks
	'
	'  Turn scan off
	'
	Try
		OCRPreview.stopScanning
	Catch
		Log("Nada")
	End Try
	'
	OCRPreview.Visible=False
	Sleep(0)
	'
	If (Blocos.IsInitialized=False) Then
		Texto=""
	Else
		'
		For Contador = 0 To Blocos.Size - 1
			Texto = Texto & Blocos.Get(Contador) & CRLF
		Next
	End If
	'
	Texto=Texto.Trim
	'
	If (Texto="") Then
		Texto=MsgOCRNaoFoiPossivelLer & CRLF
	End If
	'
	Texto=AFFuncoes.Left(Texto,Texto.Length-1)
	OCRTextoEscaneado.Text=Texto
	'
	Altura=StringUtilities.MeasureMultilineTextHeight(QRCodeTextoEscaneado,Texto)
	OCRTextoEscaneado.Height=Altura
	OCRScroll.Panel.Height=Altura
End Sub

Johan Schoeman · Nov 21, 2021

adriano.freitas said:

Hi!

The whole project I think will be more confusing because it's a big application, with a lot of code and it wouldn't be relevant, so I'm going to copy here the parts of the code that refer to OCR, ok? My natural language is Portuguese, so some variable names and the like may be in this language.

Basically it's the code below. Just what I talked about before, it already happens:

B4X:

Private Sub OCRCamera_Click
    '
    '  Sub to start OCR by activating the camera
    '
       OCRTextoEscaneado.Text=""   
    Try
        OCRPreview.startScanning       
    Catch
        Log(LastException)
    End Try
        '
    OCRPreview.Visible=True
End Sub


Private Sub OCRCameraDesligar_Click
    '
    ' Finish OCR. Turn off camera.
    '
    Try
            OCRPreview.stopScanning       
    Catch
        Log(LastException)
    End Try
    OCRPreview.Visible=False
    OCRPainel.Visible=False
    PainelTela.Visible=False
End Sub


Private Sub OCRLer_Click
    '
    ' To take a text from camera
    '
    Private Blocos   As List
    Private Texto    As String
    Private Altura   As Int
    Private Contador As Int
    '
    ' Start Scan
    '
    Try
        OCRPreview.startScanning
    Catch
        Log(LastException)
    End Try
    Sleep(0)
    '
    ' Blocos
    '
    Blocos.Initialize
    Blocos = OCRPreview.TextBlocks
    '
    '  Turn scan off
    '
    Try
        OCRPreview.stopScanning
    Catch
        Log("Nada")
    End Try
    '
    OCRPreview.Visible=False
    Sleep(0)
    '
    If (Blocos.IsInitialized=False) Then
        Texto=""
    Else
        '
        For Contador = 0 To Blocos.Size - 1
            Texto = Texto & Blocos.Get(Contador) & CRLF
        Next
    End If
    '
    Texto=Texto.Trim
    '
    If (Texto="") Then
        Texto=MsgOCRNaoFoiPossivelLer & CRLF
    End If
    '
    Texto=AFFuncoes.Left(Texto,Texto.Length-1)
    OCRTextoEscaneado.Text=Texto
    '
    Altura=StringUtilities.MeasureMultilineTextHeight(QRCodeTextoEscaneado,Texto)
    OCRTextoEscaneado.Height=Altura
    OCRScroll.Panel.Height=Altura
End Sub

Hard to tell without a sample project. Dont know from your code what panel is a parent or a child.

adriano.freitas · Nov 21, 2021

I understand, but in fact the project itself is big and would generate more confusion. What I could say is that with regard to OCR there is nothing more. And as a parent I use a simple panel, the size of the entire screen... but anyway, I'll see if I can create a small project that replicates the problem and send it.

Thank you for your attention!

Ebic · Mar 4, 2023

Hi Joh, congratulations for this beautiful library.
I wanted to make a request: is it possible to save the image that comes
scanned for a different use?
Thank you.

Johan Schoeman · Mar 6, 2023

Ebic said:
Hi Joh, congratulations for this beautiful library.
I wanted to make a request: is it possible to save the image that comes
scanned for a different use?
Thank you.

Not sure this lib is still relevant. ML Kit should be the new to go if I am not mistaken.

cl0ud14 · Mar 6, 2023

Johan Schoeman said:
Not sure this lib is still relevant. ML Kit should be the new to go if I am not mistaken.

I used your lib and it still works but what is this ML Kit you speak of? I tried looking for it in the forums and I only found language identification.

cl0ud14 · Mar 8, 2023

Also, how can I zoom in/out the camera?

santiago · Nov 6, 2025

Sir, Thanks for your code.Its works fine.
But please, how can I set these parameters :
a) autofocus
b) camera resolution capture.
If I use it in a smartphone like Galaxy capture works great. But if I use it with a PDA with printer resolution is not so good .

I found the camera API parameters but I am not able to set them.

Thanks in advance for your time and help

B4A Library TextRecognitionAndroid - OCR

Attachments

Member

Well-Known Member

Expert

Expert

Expert

Active Member

Active Member

Expert

Active Member

Expert

Active Member

Expert

Active Member

Member

Expert

New Member

New Member

Member

Similar Threads

Privacy & Transparency

Privacy & Transparency