Click here to Skip to main content
15,303,965 members
Articles / Programming Languages / C#


22 bookmarked

Stripping Accents from Latin Characters: A Foray into Unicode Normalization

Rate me:
Please Sign up or sign in to vote.
4.63/5 (19 votes)
20 Mar 2006Apache9 min read
How to turn accented characters into ASCII for search purposes.




This article, along with any associated source code and files, is licensed under The Apache License, Version 2.0


About the Author

Evan Stein
United Kingdom United Kingdom
I'm a London-based software developer. Originally from New York, I came here in 1997 to run European application development for Standard & Poors. I now work independently ... and I'm still here!

Having seen how US software behaves outside the US, I'm keenly interested in problems of global and multilingual software design. I also used to write intelligence-gathering software, and still can't resist a well-turned algorithm! Before my IT career I was in music, and I'm now combining both interests in a highly-exciting 'Project-X'. I could tell you what it is, but ....

When not thinking about all of the above, I'm fascinated by all aspects of different cultures. (You can't take New York out of the New Yorker.) Interests include jazz, classical and world music, languages, history and ethnic food. I'm also an amateur travel writer and photographer, and run a site at, which you're welcome to stop by and visit!

Comments and Discussions