
How to deskew an image

25 Apr 2006
The article describes an algorithm to calculate the skew angle of an image.

Introduction

This article describes an algorithm, implemented in VB.NET, to deskew an image.

Background

Deskewing an image helps a lot if you want to do OCR, OMR, or barcode detection, or simply want to improve the readability of scanned images. For example, think of a camera that automatically photographs goods carrying a barcode. If the skew angle is too large, the barcode cannot be detected. After deskewing, the barcode can be read.

Before deskewing:

Image 1

After deskewing:

Image 2

Using the code

The following code determines the skew angle of the image bmpIn and then rotates the image by the negative angle to straighten it:

VB
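' Analyse the input bitmap
Dim sk As New gmseDeskew(bmpIn)
' Skew angle in degrees
Dim skewangle As Double = sk.GetSkewAngle
' Rotate by the negative skew angle (RotateImage is a helper that is not shown here)
Dim bmpOut As Bitmap = RotateImage(bmpIn, -skewangle)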
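
The RotateImage helper is not shown in the snippet above. As a minimal sketch, assuming a GDI+ (System.Drawing) environment, such a helper could look like the following. The body is an assumption for illustration and not necessarily the helper that ships with gmseDeskew: it rotates around the image centre, keeps the original size, and fills the uncovered corners with white.

```vb
Imports System.Drawing
Imports System.Drawing.Drawing2D

Module RotateImageSketch
    ' Hypothetical helper: rotates bmp by angleDegrees around its centre.
    ' Positive angles turn the image clockwise on screen (GDI+ convention).
    Function RotateImage(ByVal bmp As Bitmap, ByVal angleDegrees As Double) As Bitmap
        Dim result As New Bitmap(bmp.Width, bmp.Height)
        Using g As Graphics = Graphics.FromImage(result)
            ' Corners that the rotated image no longer covers stay white
            g.Clear(Color.White)
            g.InterpolationMode = InterpolationMode.HighQualityBicubic
            ' Move the origin to the centre, rotate, then move it back
            g.TranslateTransform(CSng(bmp.Width / 2), CSng(bmp.Height / 2))
            g.RotateTransform(CSng(angleDegrees))
            g.TranslateTransform(CSng(-bmp.Width / 2), CSng(-bmp.Height / 2))
            ' Passing width and height avoids DPI-dependent scaling
            g.DrawImage(bmp, 0, 0, bmp.Width, bmp.Height)
        End Using
        Return result
    End Function
End Module
```

Because the output keeps the input size, content near the corners can be clipped for larger angles. For the small skew angles discussed in this article, that is usually acceptable.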

Points of interest

The basic idea of the algorithm is:

  • Find reference lines in the image.
  • Calculate the angle of the lines.
  • Calculate the skew angle as an average of the angles.
  • Rotate the image.

The lines are detected with the Hough transform. Infinitely many lines pass through each point in the image. To find the reference lines, we let each point vote for all the lines that pass through it. The lines that collect the most votes are our reference lines.

First, we need the parameterization of a line. A line can be parameterized as:

y = m*x+t

with slope m and offset t. We are interested in the angle rather than the slope. The angle alpha between the line and the x-axis satisfies:

m=tan(alpha)=sin(alpha)/cos(alpha)

We get:

y=sin(alpha)/cos(alpha)*x+t

Multiplying both sides by cos(alpha) and writing d = t*cos(alpha), this is equivalent to:

y*cos(alpha)-x*sin(alpha)=d
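
To make the parameterization concrete, here is a small sketch that evaluates d for a point and an angle. The module and function names are made up for this illustration. Note that Math.Sin and Math.Cos expect radians, so the angle is converted from degrees first.

```vb
Imports System

Module LineParameterCheck
    ' Returns d = y*cos(alpha) - x*sin(alpha) for an angle given in degrees.
    Function DistanceParameter(ByVal x As Double, ByVal y As Double,
                               ByVal alphaDegrees As Double) As Double
        Dim alpha As Double = alphaDegrees * Math.PI / 180.0
        Return y * Math.Cos(alpha) - x * Math.Sin(alpha)
    End Function

    ' Returns the y value of the line y = tan(alpha)*x + t at position x.
    Function PointOnLine(ByVal x As Double, ByVal alphaDegrees As Double,
                         ByVal t As Double) As Double
        Return Math.Tan(alphaDegrees * Math.PI / 180.0) * x + t
    End Function
End Module
```

For a line with alpha = 10 degrees and t = 5, calling DistanceParameter(x, PointOnLine(x, 10, 5), 10) gives the same value for every x, namely t*cos(alpha), which is about 4.924 (up to floating-point rounding). That is exactly why all points of one line vote for the same (alpha, d) cell.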

We cannot search an infinite parameter space, so we have to define a discrete one. We search for all lines with:

-20<=alpha<=20

where alpha is measured in degrees and sampled in steps of 0.2, and we convert d to an integer.
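
The discrete angle range can be sketched as a mapping between an array index and an angle. The names below are assumptions chosen for this example. With a range of -20 to 20 degrees and a step of 0.2, there are 201 angle values.

```vb
Imports System

Module AngleGrid
    Public Const AlphaStart As Double = -20.0   ' smallest angle in degrees
    Public Const AlphaStep As Double = 0.2      ' step size in degrees
    Public Const AlphaSteps As Integer = 201    ' -20.0, -19.8, ..., 20.0

    ' Converts an index (0..200) to its angle in degrees.
    Function IndexToAlpha(ByVal index As Integer) As Double
        Return AlphaStart + index * AlphaStep
    End Function

    ' Converts an angle in degrees to the nearest index.
    Function AlphaToIndex(ByVal alphaDegrees As Double) As Integer
        Return CInt(Math.Round((alphaDegrees - AlphaStart) / AlphaStep))
    End Function
End Module
```

Looping over the integer index and deriving the angle from it avoids the drift that builds up when 0.2 is added repeatedly to a floating-point loop variable.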

The basic algorithm in pseudo code:

Create a two-dimensional matrix Hough and initialize the values with 0
for y=0 to Height-1
   for x=0 to Width-1
     if Point(x,y) is black then
       for alpha=-20 to 20 step 0.2
         ' alpha is in degrees; dOffset=Width+Height keeps the second index non-negative
         d = Trunc(y*cos(alpha)-x*sin(alpha))
         Hough(Round((alpha+20)*5), d+dOffset) += 1
       next alpha
     end if
   next x
next y
Find the top 20 (alpha,d) pairs that have the highest count in the Hough matrix
Calculate the skew angle as an average of the alphas
Rotate the image by -skew angle
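
The pseudo code can be turned into a compact VB.NET sketch. This is not the gmseDeskew source: the module name, the Boolean(,) input (True marks a black pixel), and the index offset are assumptions made for this example. It also uses Math.Floor instead of Trunc so that all d bins have the same width.

```vb
Imports System

Module HoughSkewSketch
    Const AlphaStart As Double = -20.0   ' degrees
    Const AlphaStep As Double = 0.2      ' degrees
    Const AlphaSteps As Integer = 201    ' -20.0 ... 20.0
    Const TopLines As Integer = 20

    ' black(x, y) = True marks a black pixel. Returns the skew angle in degrees.
    Function GetSkewAngle(ByVal black As Boolean(,)) As Double
        Dim width As Integer = black.GetLength(0)
        Dim height As Integer = black.GetLength(1)
        ' |d| never exceeds width + height, so this shift keeps indices >= 0
        Dim dOffset As Integer = width + height
        Dim hough(AlphaSteps - 1, 2 * dOffset) As Integer

        ' Precompute sine and cosine for every discrete angle (in radians)
        Dim sinA(AlphaSteps - 1) As Double
        Dim cosA(AlphaSteps - 1) As Double
        For i As Integer = 0 To AlphaSteps - 1
            Dim rad As Double = (AlphaStart + i * AlphaStep) * Math.PI / 180.0
            sinA(i) = Math.Sin(rad)
            cosA(i) = Math.Cos(rad)
        Next

        ' Voting: every black point votes for all lines passing through it
        For y As Integer = 0 To height - 1
            For x As Integer = 0 To width - 1
                If black(x, y) Then
                    For i As Integer = 0 To AlphaSteps - 1
                        Dim d As Integer = CInt(Math.Floor(y * cosA(i) - x * sinA(i)))
                        hough(i, d + dOffset) += 1
                    Next
                End If
            Next
        Next

        ' Pick the cells with the highest counts and average their angles
        Dim sum As Double = 0.0
        Dim found As Integer = 0
        For n As Integer = 1 To TopLines
            Dim best As Integer = 0
            Dim bestI As Integer = -1
            Dim bestD As Integer = -1
            For i As Integer = 0 To AlphaSteps - 1
                For d As Integer = 0 To 2 * dOffset
                    If hough(i, d) > best Then
                        best = hough(i, d)
                        bestI = i
                        bestD = d
                    End If
                Next
            Next
            If bestI < 0 Then Exit For       ' no cell with votes is left
            sum += AlphaStart + bestI * AlphaStep
            found += 1
            hough(bestI, bestD) = 0          ' remove this cell from the search
        Next

        If found = 0 Then Return 0.0
        Return sum / found
    End Function
End Module
```

The top-20 search simply rescans the matrix for each pick. That is easy to read and cheap compared with the voting loop, but a sorted list of candidates would work as well.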

The algorithm is computationally expensive. To save some time, the number of voting points is reduced. For each text line, you can draw many lines with different angles through the letters:

Image 3

For deskewing, only the bottom line is important.

Image 4

The points on the bottom line have a lower neighbour that is white. So, we only let points (x,y) vote that satisfy:

  • The point (x,y) is black.
  • The lower neighbour (x,y+1) is white.
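
The two conditions above can be sketched as a small predicate. The function name and the Boolean(,) representation (True marks a black pixel) are assumptions for this example. The source does not say how the last image row is handled; here the area below the image is treated as white, which is also an assumption.

```vb
Module VotingPointFilter
    ' Returns True if (x, y) is black and its lower neighbour (x, y+1) is white.
    Function IsVotingPoint(ByVal black As Boolean(,),
                           ByVal x As Integer, ByVal y As Integer) As Boolean
        If Not black(x, y) Then Return False
        ' Assumption: pixels below the last row count as white
        If y = black.GetLength(1) - 1 Then Return True
        Return Not black(x, y + 1)
    End Function
End Module
```

For a filled black square, only the pixels of its bottom row pass this test, so the number of voting points drops from the area of the square to its width.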

References

The article was taken from GMSE Imaging.

History

  • 03-30-06: Original article.
  • 03-31-06: More explanations about the Hough algorithm.
  • 04-25-06: Added the References section.

License

This article, along with any associated source code and files, is licensed under The Apache License, Version 2.0.

