Unicode in XML & .NET






Jul 26, 2006
Test with Unicode operations in XML and .NET
Introduction
Unicode in XML and .NET
Section 2.2 of the XML specification says: "All XML processors must accept the UTF-8 and UTF-16 encodings of ISO 10646…". Unicode characters are therefore accepted in any XML document, and every node, element, and attribute can be written in Unicode. Here is a sample to show how it looks in Tamil Unicode. (Tamil is a South Indian language.)
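As a minimal sketch of such a document, the C# snippet below loads a small XML string whose element and attribute names are Tamil, then prints the root name, an attribute value, and an element value. The document content, the class name `TamilXmlSample`, and the Tamil names are hypothetical and chosen only for illustration; `XmlDocument` is the standard `System.Xml` class.

```csharp
using System;
using System.Text;
using System.Xml;

public static class TamilXmlSample
{
    // Hypothetical sample data. Approximate glosses:
    // நூல் = book, மொழி = language, பெயர் = name, தமிழ் = Tamil.
    public const string Xml =
        "<நூல் மொழி=\"தமிழ்\">" +
        "<பெயர்>திருக்குறள்</பெயர்>" +
        "</நூல்>";

    public static XmlDocument Load()
    {
        XmlDocument doc = new XmlDocument();
        doc.LoadXml(Xml); // throws XmlException if the markup is not well-formed
        return doc;
    }

    public static void Main()
    {
        // Ask the console to emit UTF-8; whether the glyphs render depends on the console font.
        Console.OutputEncoding = Encoding.UTF8;

        XmlDocument doc = Load();
        Console.WriteLine(doc.DocumentElement.Name);                  // root element name
        Console.WriteLine(doc.DocumentElement.GetAttribute("மொழி"));   // attribute value
        Console.WriteLine(doc.DocumentElement["பெயர்"].InnerText);     // child element text
    }
}
```

The source file needs to be saved in a Unicode encoding such as UTF-8 so that the compiler reads the Tamil literals correctly.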

Not only can the XML itself be in Unicode; its values can also be read from code. Another interesting feature is that you can use Unicode in string variable names, function names, class names, and namespace names too.
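The following is a rough sketch of what Unicode identifiers can look like in C#, assuming the source file is saved as UTF-8. Every Tamil name here (the namespace, class, method, and variables) is a hypothetical choice for illustration, and the glosses in the comments are approximate.

```csharp
using System;
using System.Text;
using System.Xml;

namespace நூலகம்                                   // roughly "library"
{
    public static class நூல்                        // roughly "book"
    {
        // Returns the text of the first <பெயர்> child of the root element, or null if absent.
        public static string தலைப்பு(string xml)     // roughly "title"
        {
            XmlDocument ஆவணம் = new XmlDocument();   // roughly "document"
            ஆவணம்.LoadXml(xml);
            XmlElement உறுப்பு = ஆவணம்.DocumentElement["பெயர்"];  // roughly "element"
            return உறுப்பு == null ? null : உறுப்பு.InnerText;
        }
    }

    public static class Program
    {
        public static void Main()
        {
            Console.OutputEncoding = Encoding.UTF8;
            string xml = "<நூல்><பெயர்>திருக்குறள்</பெயர்></நூல்>";
            Console.WriteLine(நூல்.தலைப்பு(xml));
        }
    }
}
```

C# accepts letters from any script at the start of an identifier and combining marks after the first character, which is why Tamil vowel signs can appear inside these names.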

I was thrilled to write code in Unicode. I have heard that Japanese developers write code in Japanese. For now it is unclear how far coding in Unicode will help, and I can't guess what will happen in the coming days.
What is Unicode?
The Unicode Standard defines the universal character set. Its primary goal is to provide an unambiguous encoding of the content of plain text, ultimately covering all languages in the world. In its fourth major version at the time of writing, Unicode contains a large number of characters covering most of the scripts currently in use in the world. It also contains additional characters for interoperability with older character encodings, and characters with control-like functions, included primarily to allow unambiguous interpretation of plain text. Unicode provides specifications for the use of all of these characters.
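To make the idea of one character set with several encodings concrete, here is a small sketch that prints the code point of a single Tamil letter and its bytes in UTF-8 and UTF-16. The class name `EncodingSample` and the helper `Hex` are hypothetical; the `Encoding` and `BitConverter` members used are standard .NET.

```csharp
using System;
using System.Text;

public static class EncodingSample
{
    // Formats the encoded bytes of 'text' as hyphen-separated hex pairs.
    public static string Hex(Encoding encoding, string text)
    {
        return BitConverter.ToString(encoding.GetBytes(text));
    }

    public static void Main()
    {
        string letter = "\u0BA4"; // TAMIL LETTER TA

        Console.WriteLine("U+{0:X4}", (int)letter[0]);              // U+0BA4
        Console.WriteLine(Hex(Encoding.UTF8, letter));              // E0-AE-A4 (three bytes)
        Console.WriteLine(Hex(Encoding.Unicode, letter));           // A4-0B (UTF-16, little-endian)
        Console.WriteLine(Hex(Encoding.BigEndianUnicode, letter));  // 0B-A4 (UTF-16, big-endian)
    }
}
```

The code point is the same in every case; only the byte sequence used to store it differs between encodings.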