Digit OCR Solution for Handwritten and Printed Digit Recognition

Frank Xue
Oct 22, 2022
VeryUtils Digit OCR is a solution for extracting handwritten and printed numbers from various kinds of documents, images and photos. For example, if you have an image that contains a set of handwritten digits and need to extract each digit from it, this Digit OCR software is the right choice.

https://veryutils.com/digit-ocr-solution

Converting written or printed text into digital text is called OCR (Optical Character Recognition). Detecting handwritten characters is a very difficult task. Today's OCR software and algorithms cannot reach 100% accuracy; even a person cannot always recognize what is written. In general, an OCR application works best when it deals with a limited domain. For example, it is possible to recognize postcodes, license numbers, city names, credit card numbers, passport numbers, etc.

Many text detection systems are built on traditional image processing methods and work well with printed text, but when they are used for handwritten text recognition in images they can produce unexpected results, and the recognition quality is poor. Our image processing engineers have developed a handwritten Text & Digit detection prototype that applies a custom algorithm, provides a high level of accuracy, and eliminates most OCR limitations.

We have a demo version of VeryUtils Digit OCR software for your evaluation:

https://veryutils.com/dl.php/DigitOCR.zip

You may evaluate VeryUtils Digit OCR software by following these steps:

1. Download it to your system and unzip it to a folder.

2. Open a CMD window. You can run the following command line to train the Digit OCR engine first; if you want to use the already trained data, you can skip this step:

digit_recognizer.exe --mode train --file "digit-template.png"

3. After you train the OCR engine, you may test it with the following command lines:

digit_recognizer.exe --mode test --file "test1.png"

digit_recognizer.exe --mode test --file "test2.png"

digit_recognizer.exe --mode test --file "test3.png"

digit_recognizer.exe --mode test --file "test4.jpg"

digit_recognizer.exe --mode test --file "test5.jpg"

4. This demo version is for evaluation purposes only. We provide a Custom Development Service based on our OCR technology.

VeryUtils Digit OCR Solution:

The VeryUtils Digit Recognizer engine uses KNN (k-nearest neighbors) classification and CNN (convolutional neural network) algorithms. We have optimized these algorithms so that they better support the recognition of handwritten digits.

VeryUtils Digit OCR Solution is divided into three main stages. The first stage pre-processes the input image and extracts every digit in it. The second stage builds the machine learning model (KNN or CNN) and trains it so that it is ready for use. The third stage tests the model to get the result of our solution.

1-Pre-processing:

--Read the image into our software.

--Convert the colour image to a grayscale image.

--Apply a global threshold to get the digits in black and the background in white.

--Invert the image so that it matches the data set.

--Use a denoising algorithm to get rid of noise objects, which are small groups of connected pixels.

--Use an edge detection algorithm to find the digits in the image and label each one with a unique number.

--Loop over the image to cut each digit out and append it to a list.

--For every extracted digit, find the HOG descriptor of length 36, which will be used for measuring the distance.
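
The pre-processing steps above can be sketched in a few lines of dependency-free Python. This is only an illustration, not the VeryUtils implementation: the function names, the threshold of 128 and the `min_pixels` noise limit are assumptions, connected-component labelling stands in for the edge detection step, and the 2 x 2 cells with 9 orientation bins is simply one HOG layout that happens to yield 36 values. Resizing each crop to a fixed size is left out for brevity.

```python
import math
from collections import deque

def to_gray(rgb_rows):
    """Rows of (r, g, b) tuples -> rows of 0-255 luminance values."""
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row]
            for row in rgb_rows]

def binarize_inverted(gray, threshold=128):
    """Global threshold plus inversion: dark ink -> 1, light paper -> 0."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]

def digit_boxes(binary, min_pixels=3):
    """Label 4-connected blobs, drop blobs below min_pixels (noise), and
    return bounding boxes (top, left, bottom, right) sorted left to right."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not binary[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue, blob = deque([(y, x)]), []
            while queue:  # breadth-first flood fill of one blob
                cy, cx = queue.popleft()
                blob.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and binary[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(blob) >= min_pixels:
                ys, xs = [p[0] for p in blob], [p[1] for p in blob]
                boxes.append((min(ys), min(xs), max(ys), max(xs)))
    return sorted(boxes, key=lambda box: box[1])

def crop(binary, box):
    """Cut one bounding box (inclusive coordinates) out of the image."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in binary[top:bottom + 1]]

def hog_descriptor(img, cells=2, bins=9):
    """Histogram of oriented gradients over a cells x cells grid.
    With the defaults this yields 2 * 2 * 9 = 36 values."""
    h, w = len(img), len(img[0])
    ch, cw = h // cells, w // cells
    hist = [0.0] * (cells * cells * bins)
    for y in range(ch * cells):
        for x in range(cw * cells):
            # Central differences, clamped at the image border.
            gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue
            angle = math.degrees(math.atan2(gy, gx)) % 180.0  # unsigned direction
            b = min(int(angle * bins / 180.0), bins - 1)
            cell = (y // ch) * cells + (x // cw)
            hist[cell * bins + b] += magnitude
    descriptor = []
    for c in range(cells * cells):  # L2-normalise each cell's histogram
        part = hist[c * bins:(c + 1) * bins]
        norm = math.sqrt(sum(v * v for v in part)) + 1e-9
        descriptor.extend(v / norm for v in part)
    return descriptor

# Tiny synthetic "photo": white paper, two dark strokes, one speck of noise.
gray = [[255] * 12 for _ in range(8)]
for y in range(1, 7):
    gray[y][2] = gray[y][3] = 0   # first stroke
    gray[y][7] = gray[y][8] = 0   # second stroke
gray[0][11] = 0                   # single-pixel noise, removed by min_pixels

binary = binarize_inverted(gray)
boxes = digit_boxes(binary)
descriptors = [hog_descriptor(crop(binary, box)) for box in boxes]
print(boxes, [len(d) for d in descriptors])
```

Sorting the boxes by their left edge keeps the digits in reading order, which matters when the recognized digits are later joined into a number.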

2-Training:

--Load our data set ("mnist-original").

--Use a data analysis algorithm to split the data set into training parts.

--Loop over every image in the training parts to find its HOG descriptor, as in pre-processing.

--Use KNN classification to build our model and train it on the split training parts.

--Save the model to disk for future use in testing.
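
As a rough sketch of the training stage, the following stdlib-only Python splits a labelled set, "trains" a brute-force KNN model and saves it to disk. Everything here is an assumption made for illustration: the function names, the 80/20 split, `k=3`, the pickle file format, and the toy 2-value vectors that stand in for the HOG descriptors of the "mnist-original" images.

```python
import math
import os
import pickle
import random
import tempfile
from collections import Counter

def split_dataset(samples, train_fraction=0.8, seed=0):
    """Shuffle (features, label) pairs, then cut them into two parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

def train_knn(samples, k=3):
    """KNN is a lazy learner: training only stores the labelled vectors."""
    return {"k": k, "samples": [(list(f), label) for f, label in samples]}

def predict(model, features):
    """Majority vote among the k stored vectors closest to `features`."""
    by_distance = sorted(model["samples"], key=lambda s: math.dist(s[0], features))
    votes = Counter(label for _, label in by_distance[:model["k"]])
    return votes.most_common(1)[0][0]

def save_model(model, path):
    """Write the model (a plain dict) to disk with pickle."""
    with open(path, "wb") as fh:
        pickle.dump(model, fh)

# Toy data: two well-separated clusters of 2-value vectors, labelled 0 and 1.
rng = random.Random(42)
data = [([rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)], label)
        for label, centre in ((0, 0.0), (1, 10.0)) for _ in range(20)]

train_part, held_out = split_dataset(data)
model = train_knn(train_part, k=3)
model_path = os.path.join(tempfile.gettempdir(), "digit_knn_demo.pkl")
save_model(model, model_path)
print(len(train_part), len(held_out), predict(model, [0.5, 0.5]))
```

Because KNN keeps every training vector, the saved file grows with the training set and each prediction scans all stored vectors; a production engine would likely use an indexed search instead.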

3-Testing our model:

--Now load the model from disk to test it and find its accuracy.

--Use new images to test the model.
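
The testing stage can be sketched the same way: load a stored model, classify labelled samples it has not seen, and report the fraction it gets right. The dict-based model layout, the file name and the sample vectors below are hypothetical; real inputs would be the descriptors of the new test images.

```python
import math
import os
import pickle
import tempfile
from collections import Counter

def load_model(path):
    """Read a pickled model; only load files from a source you trust."""
    with open(path, "rb") as fh:
        return pickle.load(fh)

def predict(model, features):
    """Majority vote among the k stored vectors closest to `features`."""
    by_distance = sorted(model["samples"], key=lambda s: math.dist(s[0], features))
    votes = Counter(label for _, label in by_distance[:model["k"]])
    return votes.most_common(1)[0][0]

def accuracy(model, labelled_samples):
    """Fraction of samples whose predicted label matches the expected one."""
    hits = sum(predict(model, f) == label for f, label in labelled_samples)
    return hits / len(labelled_samples)

# Stand-in for a model file written by an earlier training run.
path = os.path.join(tempfile.gettempdir(), "digit_knn_accuracy_demo.pkl")
stored = {"k": 3, "samples": [([0, 0], 0), ([1, 0], 0), ([0, 1], 0),
                              ([10, 10], 1), ([9, 10], 1), ([10, 9], 1)]}
with open(path, "wb") as fh:
    pickle.dump(stored, fh)

model = load_model(path)
# The last sample is labelled 0 but lies next to the 1s, so it is missed.
new_samples = [([0.5, 0.5], 0), ([9.5, 9.5], 1), ([1.0, 1.0], 0), ([8.0, 8.0], 0)]
score = accuracy(model, new_samples)
print(f"accuracy: {score:.2f}")
```

Measuring accuracy only on samples that were not used for training is what makes the number meaningful; scoring the training vectors themselves would overstate the result.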

Intelligent Video Analytics

Our technology can help you upgrade your existing camera and monitoring systems with custom AI-based video analytics. It facilitates early threat warnings with real-time alerts, detects objects, people, or specific behaviors, and extracts other valuable information to structure your video data for further analysis, all without manual monitoring or the need to install costly new equipment.

Optical Character Recognition (OCR) & Data Extraction

We apply advanced image processing and custom OCR algorithms to digitize handwritten & printed text, automate data entry, and perform robotic process automation. We employ intelligent document recognition technologies and value verification techniques to build custom Data Extraction algorithms that transform unstructured documents into machine-readable data.

We provide a Custom Development Service based on our OCR solution, so you can build, test, and launch your product with our experienced and talented team. Using tried and true Agile development processes, our development team brings your product to life. No matter the technology or challenge, we can do it!

We can help you plan, design, and build your entire digital ecosystem.

Get in touch and schedule an appointment with one of our specialists via the VeryUtils platform:

https://veryutils.com/contact
