I have an input string that contains composite Unicode characters, like:
"Leppӓnen" == "\x004c\x0065\x0070\x0070\x04d3\x006e\x0065\x006e"
I want to convert this to use the precomposed characters, i.e.:
"Leppӓnen" == "\x004c\x0065\x0070\x0070\x00e4\x006e\x0065\x006e"
I have tried:
My last resort will be writing a method to manually look for the normalized versions of these characters and substitute the precomposed characters, but I was hoping there was a framework or Win32 function to do this.
If you have no idea what I'm talking about, see: http://en.wikipedia.org/wiki/Unicode_equivalence
To see the character sets I'm talking about, see: http://en.wikibooks.org/wiki/Unicode/Character_reference/0000-0FFF
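For reference, here is a minimal sketch of the kind of conversion I am after. It uses Python's standard `unicodedata` module purely for illustration; the helper name `to_precomposed` is made up for this example. As far as I know, the .NET counterpart is `String.Normalize(NormalizationForm.FormC)` and the Win32 one is `NormalizeString`, but I have not verified either against my input. One caveat the sketch shows: canonical composition only merges a base letter with a combining mark (Latin `a` + U+0308 becomes U+00E4). U+04D3 is itself a precomposed Cyrillic letter, so normalization should leave it unchanged and it would still need an explicit substitution.

```python
import unicodedata


def to_precomposed(text: str) -> str:
    """Return text in Normalization Form C (canonical composition)."""
    return unicodedata.normalize("NFC", text)


# Latin "a" (U+0061) followed by COMBINING DIAERESIS (U+0308)
# is canonically equivalent to the precomposed U+00E4.
decomposed = "Lepp\u0061\u0308nen"
composed = to_precomposed(decomposed)
print([f"{ord(c):04x}" for c in composed])

# U+04D3 (CYRILLIC SMALL LETTER A WITH DIAERESIS) is not canonically
# equivalent to U+00E4, so NFC does not turn it into the Latin letter.
cyrillic = "Lepp\u04d3nen"
print(to_precomposed(cyrillic) == cyrillic)
```

If the input really contains U+04D3 rather than `a` + U+0308, a small lookup table for such look-alike characters would still be needed on top of normalization.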
Edited 1-Sep-11 3:33am