Android Code Snippet B4XOCRPage - Specific OCR capture area (using MLKIT GMS Vision OCR)

Theera · May 28, 2025

Hi Magma,
I've problem how to code Main.ocrresult ,after tested your code. where is refering to Main.

Main.ocrresult=textResultLabel.Text

Magma · May 28, 2025

Theera said:
Hi Magma,
I've problem how to code Main.ocrresult ,after tested your code. where is refering to Main.

Yes this is a snippet (crop i can say)...

you must save somewhere the value of txtresultlabel.text ...i thought is as a public string to use it... in other b4xpages... may be - change it with anything u want.

Theera · May 28, 2025

Magma said:
Yes this is a snippet (crop i can say)...

you must save somewhere the value of txtresultlabel.text ...i thought is as a public string to use it... in other b4xpages... may be - change it with anything u want.

Thank you for kind of you.

Surreal · Jul 3, 2025

Hi Magma,

Good morning, I am very interested in your work, but do you have a small project file to try?

thank you very much

Magma · Jul 3, 2025

Surreal said:
Hi Magma,

Good morning, I am very interested in your work, but do you have a small project file to try?

thank you very much

Hi there ... I will try to have one for you... until tomorrow

Surreal · Jul 3, 2025

Hi, thank you so much, there is no rush, so when you can.

Have a nice day

Surreal · Jul 5, 2025

Magma said:
Hi there ... I will try to have one for you... until tomorrow

Hi, Magma,
Good morning, good news?

Thanks for your help

Magma · Jul 5, 2025

...done...

attaching here..

Surreal · Jul 5, 2025

Magma said:
...done...

attaching here..

Great, thank you, now I study.
Have a nice day.

Surreal · Jul 6, 2025

Magma said:
...done...

attaching here..

Hi Magma,
I saw your code and it is excellent, I like it a lot.
I would like to ask you a question, I use in one of my projects the clsCameraIntent
I know that it is obsolete now, but I can not modify it now, I was wondering if your AddCropOverlay procedure is applicable in some way to this class.

Thanks for your patience and Have a nice day.

Magma · Jul 6, 2025

Can you share the part of code or project and your thought?

Surreal · Jul 6, 2025

Magma said:
Can you share the part of code or project and your thought?

Unfortunately the project is substantial, but I'll try to explain what it does.
Among the various functions, with the clsCameraIntent I activate the system camera (after activating the GPS and the acquisition of the coordinates in the camera - Option Save Position) I photograph a car and its license plate thus obtaining a geolocalized photo of the car, which I save in the smartphone, subsequently I send it to a Desktop software developed in WPF that searches for the license plate and the geolocalization and shows it on the map.
Everything works perfectly.
Looking at your project I tried with the OCR to identify the license plate, your system is perfect because I frame only the portion that interests me, that is the license plate, and as you can see from your modified project it works very well.
Subsequently I use the license plate found to check in the database, with SQL and DBUtils and its functions, if it is present and registered!
Now the problem is that using the clsCameraIntent I can't understand how to crop the image as you do.

I should implement Camera2 and CamEx2 in my project.

I hope I explained myself.
Thank you.

I attach your project, with the very small changes made by me.

Magma · Jul 6, 2025

Well it is not has to do with what camera lib...(camera2 ofcourse is a must for today use)

Crop is has to do with a picture/bitmap you will grab...

Didn't see your modified project... I am out this weekend

But for plates you will need tesseract or better ...Yolo project ... a python project... so need to run somewhere a service...

Text recognizer is for simple uses... and not having all language pack... for ex. Arabic

drgottjr · Jul 8, 2025

while i myself like the idea of a pre-cropped frame, it might be of interest to forum members how android handles ocr.

android ocr engine "automatically" identifies blocks of text. the extracted text is made available to the client in several forms. in addition,
the location of the various blocks of text found within the image is also returned to the client. in other words, although the user can pre-crop
a section of the image for text extraction, the api crops all the blocks found. it is possible to display all the blocks or only those of interest.
this is not quite the same as pre-cropping a particular block, but if you pre-crop a particular block, you will only get the pre-cropped block.
if you let android do its thing, you get all the blocks. if the image has 4 or 5 blocks, and you pre-crop a block, you will have to take 4 or 5
pictures of the same page if you wanted the various blocks (or you have to code a way to move the pre-crop frame around).

anyway, and for what it's worth, here is an example of how android handles text extraction when left to its own devices:

Magma · Jul 9, 2025

Surreal said:
Unfortunately the project is substantial, but I'll try to explain what it does.
Among the various functions, with the clsCameraIntent I activate the system camera (after activating the GPS and the acquisition of the coordinates in the camera - Option Save Position) I photograph a car and its license plate thus obtaining a geolocalized photo of the car, which I save in the smartphone, subsequently I send it to a Desktop software developed in WPF that searches for the license plate and the geolocalization and shows it on the map.
Everything works perfectly.
Looking at your project I tried with the OCR to identify the license plate, your system is perfect because I frame only the portion that interests me, that is the license plate, and as you can see from your modified project it works very well.
Subsequently I use the license plate found to check in the database, with SQL and DBUtils and its functions, if it is present and registered!
Now the problem is that using the clsCameraIntent I can't understand how to crop the image as you do.

I should implement Camera2 and CamEx2 in my project.

I hope I explained myself.
Thank you.

I attach your project, with the very small changes made by me.

B4X:

Sub IsValidPlate(valore As String) As Boolean
    Dim matcher1 As Matcher
    '
    'ITALIAN LICENSE PLATE
    '
    matcher1 = Regex.Matcher("[A-Z]{2}[0-9]{3}[A-Z]{2}", valore)
    If matcher1.Find = True Then        
        Return True
    Else
        Return False
    End If
End Sub

You mean added this... yes is a way... the problem comes when a plate is old, or a public vehicle (not having the same serial) or an agricultural machine-vehicle... perhaps a rusty plate...

Those can easily passed with the help of tesseract and learning models... or an AI (using a chatgpt assistant will have cost but for sure will help)

Tesseract will need some plates captured (more than hundred)

YOLOv8: State-of-the-Art Computer Vision Model

Learn all you need to know about YOLOv8, a computer vision model that supports training models for object detection, classification, and segmentation.

yolov8.com

(a python solution as a backend server will be also helpful)

TILogistic · Jul 9, 2025

Magma said:
Well it is not has to do with what camera lib...(camera2 ofcourse is a must for today use)

Crop is has to do with a picture/bitmap you will grab...

Didn't see your modified project... I am out this weekend

But for plates you will need tesseract or better ...Yolo project ... a python project... so need to run somewhere a service...

Text recognizer is for simple uses... and not having all language pack... for ex. Arabic

This is called automatic number plate recognition (ANPR)
I developed what the member @Surreal mentions some time ago with TextRecognition based on MLKit, and there are techniques you should use with the MLKIT APIs and the Camera.

Yolo:

GitHub - jrguignan/Proyecto-Deteccion_de_Matriculas: Se usa YOLOv10 para detectar vehículos en la vía, para luego detectar sus matriculas y usar tesseract-OCR para leer las matrículas

Se usa YOLOv10 para detectar vehículos en la vía, para luego detectar sus matriculas y usar tesseract-OCR para leer las matrículas - jrguignan/Proyecto-Deteccion_de_Matriculas

github.com

TILogistic · Jul 9, 2025

Tips for Post #15
Use ML Kit's OCR to process an image and obtain a text object containing the text blocks (TextBlock).
Traverse the blocks and obtain their coordinates (boundingBox or cornerPoints).
Draw rectangles on the original image at the position of each block to highlight them.

Android Code Snippet B4XOCRPage - Specific OCR capture area (using MLKIT GMS Vision OCR)

Expert

Expert

Expert

Member

Expert

Member

Member

Expert

Attachments

Member

Member

Expert

Member

Attachments

Expert

Expert

Attachments

Expert

Expert

Expert

Similar Threads