| Characters of the world |
|
There are Latin*1(or Roman), Russian (or Cyrillic), Greek, Arabic, Hebrew, Amharic, Devanagari (one of Indians *2), Thai, Laotian, Tibetan, and CJK characters*3 here.
Other famous characters are Indian*2, Georgian, Armenian, Burmese (in Myanmar), Khmer (in Cambodia), and Mongolian.
| Language name | ``Hello'' in the language | Remarks | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| in English | in the language | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Amharic |
|
| *4 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Arabic | العربية |
| *5Arabic*6 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Czech |
| Dobrý den | - | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Danish | Dansk | Hej, Goddag | - | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Dutch | Nederlands | Hallo, Dag | - | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Flemish | Vlaams | Dutch spoken in Belgium | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| English | English | Hello | - | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Esperanto | Esperanto
| Saluton
| An artificial language
| Estonian
| Eesti
| Tere, Tervist
| -
| FORTRAN
| FORTRAN
| PROGRAM
| A programming language, not a natural language.
| Finnish
| Suomi
| Hei
| -
| French
| Français
| Bonjour, Salut
| -
| German
| Deutsch Nord
| Guten Tag
| Low German, Northern
| Deutsch Süd
| Grüß Gott
| High German, Southern, Standard
| Greek
|
-
| Hebrew
|
עברית
|
*5
| Hindi
|
हिन्दी
|
| ,
Devanagari*2
| Italian
| Italiano
| Ciao, Buon giorno
| -
| Lao
|
| ,
-
| Maltese
| Malti
| Ciao
| An island in the Mediterranean Sea
| Norwegian
| Norsk
| Hei, God dag
| -
| Polish
| Polski
|
Hej,
|
,Gen Dobré
-
| Russian
|
-
| Slovak
|
-
| Spanish
| Español
| ¡Hola!
| -
| Swedish
| Svenska
| Hej, Goddag
| -
| Thai
|
| ,
-
| Tibetan
|
-
| Tigrigna
|
*4
| Turkish
| Türkçe
| Merhaba
| -
| Vietnamese
|
*7
| Yugoslavian
| -
| Zdravo
| not seem to be present Serbian.
| Japanese
|
| ,
Chinese and Kana*3
| Chinese
|
| ,
,
Simplified Chinese*3
| Cantonese
|
| ,
| ,
Traditional Chinese*3
| Korean
|
| ,
Hangul*3
| | ||||||||||||||||||
| (*0) |
GNU Emacs
20.7.2 (i386-vine-linux-gnu, X toolkit) of Wed Jan 17 2001 on apollo.athome
After downloading the fonts from the ftp server of Mule Projects' site, choosing "Mule - Show Script Examples" from the menu bar of the editer, the characters will appear below the following message. This is a list of ways to say hello in various languages. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*1) |
Latin spreads over much of Europe & Africa and
the almost whole Americas & Oceania.
And there are various character sets,
because each Latin language adds or omits a few letters,
or attaches small marks
(acute /circumflex /grave /macron accent, cedilla, dieresis /umlaut,
etc.)
to some of letters.
Since character encoding is required to express them on a computer,
an international agency (iso-8859 of
ISO),
nations (
ASCII
of US,
KOI8-R
of Russia,
JIS
of Japan, etc.) or enterprises (Windows of
Microsoft
and Macintosh of
Apple)
declare their standards for some languages. Now
Unicode (ISO 10646) is developing
to cover any of the world's writing systems.
The following table classifies languages according to iso-8859.
(See also
Numerical notations of the world.)
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*2) |
There are many kinds of characters in India.
(See Indian bank note)
Sinhala spoken by more than 70% Sri Lankan is isolated Indo-European, and Sinhalese belongs to Dravidian series. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*3) |
Chinese, Japanese and Korean are called
CJK characters in a pack.
In comparison with other alphabetical languages, these have more than
102~103 times kinds of characters.
In computers a character takes 2 bytes, so that they are also called
double byte characters.
China uses only Chinese, Japan Chinese and Kana, Korea Hangul only in
principle. Chinese is ideographic, Kana phonetic, Hangul both at the
same time.
Chinese has long history, about 3500 years, are propagated to neighboring countries and its shape was changed according to the circumstances. Now China and Japan usually use Chinese, Vietnam and Korea gave up Chinese and have changed for Latin and Hangul respectively.
Chinese character in Japan is called Kanji. In general, JIS ( Japanese Industrial Standard) of Kanji is JIS X 0208, but this does not cover all Kanji, the following standards exist.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*4) | Amharic characters are used in Amharic spoken in Ethiopia and Tigrigna spoken in Eritrea etc. Similarly to Japanese Kana, a character is assigned to each pair of a consonant and a vowel, as a result Amharic has about 250 kinds of characters which is the most in the world except CJK characters. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*5) |
Arabic, accurately speaking the languages that use Arabic characters,
and Hebrew are written and read from right to left.
Mule supports almost all of input methods and coding systems of computer-processable languages. In my installation of EmacsBut figures are written and read from left to right in these languages. Arabic has own figures which are different from Arabic figures spread all over the world. (See Figures of the world) Opening arabic.el supplied from the Mule project by Emacs, following mapping is obtained.Right-to-left writing is not yet supported.So I reversed the strings and made it image on the editer. For printing the text file that the editer output, the tool that convert it to PostScript format is prepared. The strings seem to be reversed if necessary in the conversion process.
There are other unique writing systems in the worlds. Mongolian system by not Russian but Mongolian characters is vertical. Japanese system has both horizontal and vertical ones, the latter is traditional style, adopted in papers or literary books today. The order of vertical lines is left to right in Mongolian, opposite in Japanese.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*6) |
Arabic is written and read from right to left,
and the word is consists of concatenated consonant letters.
On their top or bottom, some vowels are, if necessary, attached.
The sound of a vowel is any of a, i, u and the letter looks like
an accent.
Instead of lower or upper, each consonant letter has
4 cases, top, middle, tail and independent which make possible to write
a well-balanced word with a stroke.
Mongolian has 3 cases, and each word seems to have a vertical lead. The number of Arabic characters differs among the languages that adopted Arabic.
There are two popular fonts in Arabic, the one in this page is Naskh, and the other is Nastaliq:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| (*7) | Present Vietnamese adopts Latin characters. Mainly vowel letters are extended by 3 symbols (Hacek, circumflex and apostrophe), and another 5 accents (acute, grave, tilde etc.) for 6 tones can be attached over or under them, so there are max 18 patterns for a each vowel. Vietnamese in this page includes acute accented ê, under-dotted ê and a, the dot is an accent not a scratch nor a blur. |
| Updated on 2003.7.10 | First edition : 2001.12.25 |