This is U+0647, '
ه', Arabic Letter "Heh".
This is the trick of Arabic writing system fully implemented in Windows and, to best of my knowledge, in nearly all modern OS: if you put some Arabic letters together, they change their glyph to form proper connections with each other. Look: "
ههه". This is the same Arabic "Heh" repeated three times, even though this string looks like three different letters.
Now, you may ask: how to find it out? After all, you are not going ask me about every single character, are you?
To do this, you should understand how Unicode and UTFs work. Unicode is nothing like 16-bit encoding like may mistakenly think, rather, this is a standard defining one-to-one correspondence between "characters", understood as cultural entities abstracted from their glyph forms, and integer numbers, understood as abstract mathematical integer numbers, without any concern about their bit size or computer presentation. Those numbers are called "code points". So, the core Unicode does not define encoding. Encodings can be different and are defined by UTFs. Now, this page is in UTF-8, which is a byte code with variable size per character. So, if I simply tried to read your UTF-8 text in binary form, it would be hard to recognize the code points in this code. The only straightforward encoding with one-to-one correspondence between the encoding words and code points is UTF-64 (UTF-64LE or UTF-64RE). However, I knew for sure, that all Arabic subset lies in the BMP (Based Multilingual Plane) with code points within first 17 bits. Few more 16-bit places exists, but Arabic language is way too popular for that; the extra plains are reserved form much more exotic writing systems.
So, I copies your text in the text file and saved it as UTF-16LE (in Window jargon, it is called "Unicode files", but in fact this is UTF-16LE), opened it in the binary editors and recognized 4 identical Arabic code points U+0647. To find out what is it, I used the Windows application Character Map (Charmap.EXE) bundled with every version of Windows. It provided me with the information on the subset ("Unicode sub-range") of Arabic writing system (using code points U+06XX) and the information on this character.
Learn about it:
http://en.wikipedia.org/wiki/Unicode[
^],
http://unicode.org/[
^];
http://en.wikipedia.org/wiki/Code_point[
^],
http://en.wikipedia.org/wiki/Basic_Multilingual_Plane#Basic_Multilingual_Plane[
^],
http://en.wikipedia.org/wiki/UTF[
^],
http://unicode.org/faq/utf_bom.html[
^].
—SA