
Unicode in XML & .NET


Jul 26, 2006


Test with Unicode operations in XML and .NET


Introduction

Unicode in XML and .NET

 
The XML specification (section 2.2) says: ‘All XML processors must accept the UTF-8 and UTF-16 encodings of ISO 10646…’. Unicode characters are therefore accepted in any XML document, and every node, element, and attribute can be written in Unicode. Here is a sample showing how this looks in Tamil, a South Indian language.
 
Sample image
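Since the original screenshot is not reproduced here, the following is only a minimal sketch of the idea, not the author's sample. It assumes a hypothetical document whose element and attribute names are Tamil words picked for illustration (நூல் = book, பெயர் = name, மொழி = language), parses it with `XmlDocument`, and compares the byte counts of the same text in UTF-8 and UTF-16, the two encodings the specification requires.

```csharp
using System;
using System.Text;
using System.Xml;

public static class TamilXmlSketch
{
    // Hypothetical sample, not taken from the article:
    // element and attribute names are written in Tamil.
    public const string Sample =
        "<நூல் மொழி=\"தமிழ்\"><பெயர்>திருக்குறள்</பெயர்></நூல்>";

    // Parses the sample and returns the name of the root element.
    public static string RootName()
    {
        var doc = new XmlDocument();
        doc.LoadXml(Sample);           // throws XmlException if the names were not valid XML names
        return doc.DocumentElement.Name;
    }

    // Number of bytes the sample occupies when encoded as UTF-8.
    public static int Utf8Length()
    {
        return Encoding.UTF8.GetByteCount(Sample);
    }

    // Number of bytes the sample occupies when encoded as UTF-16 (no byte order mark).
    public static int Utf16Length()
    {
        return Encoding.Unicode.GetByteCount(Sample);
    }

    public static void Main()
    {
        Console.OutputEncoding = Encoding.UTF8;   // so the console can attempt to show Tamil text
        Console.WriteLine("Root element: " + RootName());
        Console.WriteLine("UTF-8 bytes:  " + Utf8Length());
        Console.WriteLine("UTF-16 bytes: " + Utf16Length());
    }
}
```

Save the source file as UTF-8 so the compiler reads the Tamil literals correctly. Whether the console actually renders the characters depends on the installed fonts.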
 
Not only can the XML itself be in Unicode; its values can also be accessed from code. Another interesting feature is that you can write the names of strings, functions, classes, and namespaces in Unicode too.
 
Sample image
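As a rough illustration of both points (the original screenshot is not available, so this is an assumed reconstruction rather than the author's code), the sketch below uses Tamil identifiers for a namespace, a class, a method, and local variables, and reads the value of a Tamil-named element. All names here are hypothetical choices: தமிழ் (Tamil), நூலகம் (library), நூல்பெயர் (book name), ஆவணம் (document), கணு (node).

```csharp
using System;
using System.Text;
using System.Xml;

namespace தமிழ்                       // namespace name in Tamil
{
    public static class நூலகம்         // class name in Tamil
    {
        // Returns the text of the first <பெயர்> child of the root element,
        // or null if there is no such element.
        public static string நூல்பெயர்(string xml)
        {
            var ஆவணம் = new XmlDocument();
            ஆவணம்.LoadXml(xml);

            // The XmlNode indexer looks up the first child element with this name.
            XmlElement கணு = ஆவணம்.DocumentElement["பெயர்"];
            return கணு == null ? null : கணு.InnerText;
        }

        public static void Main()
        {
            Console.OutputEncoding = Encoding.UTF8;
            string உரை = "<நூல்><பெயர்>திருக்குறள்</பெயர்></நூல்>";
            Console.WriteLine(நூல்பெயர்(உரை));
        }
    }
}
```

C# accepts these names because its identifier rules are defined in terms of Unicode letter and combining-mark categories rather than ASCII. The source file again needs to be saved in a Unicode encoding such as UTF-8.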
 
I was thrilled to write code in Unicode. I have heard that Japanese developers write code in Japanese. For now, I am not sure how far coding in Unicode will help, and I can't guess what will happen in the coming days.
 
What is Unicode?
The Unicode Standard defines the universal character set. Its primary goal is to provide an unambiguous encoding of the content of plain text, ultimately covering all languages in the world. Currently in its fourth major version, Unicode contains a large number of characters covering most of the currently used scripts in the world. It also contains additional characters for interoperability with older character encodings, and characters with control-like functions included primarily for reasons of providing unambiguous interpretation of plain text. Unicode provides specifications for use of all of these characters.