How do I read the seven segment font(OCR)?

Question

1.00/5 (1 vote)

See more:

Hello community,

I have been working on extracting text from images, specifically focusing on seven-segment fonts,
using .NET. Unfortunately, my attempts with popular libraries like Tesseract and IronOcr have been
unsuccessful, as they seem to excel with normal English fonts.

Here's a brief overview of my approach so far:
1. Tesseract: Limited to normal English fonts, unable to recognize seven-segment characters.
2. IronOcr: Similar limitations, not suitable for seven-segment fonts.

Despite these efforts, I'm facing challenges in accurately extracting text from images with
seven-segment fonts.

Link to : Image Dataset Folder

What I have tried:

Additionally, I've experimented with image processing techniques, including:
• Cropping and zooming to the text region.
• Applying gray, black and white, and binarization filters.

Posted 15-Dec-23 2:44am

Punit Kanet

Updated 15-Dec-23 5:52am

v2

Add a Solution

Comments

[no name] 17-Dec-23 14:31pm

Machine learning / pattern recognition. You feed it the font with the "answers" (a training set; a "subset" of the sample set). Then run tests with the whole sample set.
https://mdfarragher.medium.com/optical-character-recognition-with-c-ml-net-and-net-core-3cf71864b815

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)