Unicode Test Page (Arabic)
Contents
Arabic letters based on ISO 8859-6
Extended Arabic letters ordered by code
Extended Arabic letters grouped by language
Ligatures ordered by alphabet
Ligatures displayed in 2-dim tables
Points, Glyph part
Combining marks
Punctuation, Signs, Marks and Symbols
Subtending marks
Honorifics
Koranic annotation signs
Digits
Noncharacters and Special
Related Sites
The Arabic alphabet has 28 letters, not counting hamza and the diacritical marks. Counting the glyphs for the contextual forms of letters, Unicode Ver.4 provides about 1,000 codes for this script.
227 codes: 0600-0670 Arabic proper (ISO 8859-6) and 0671-06FF Arabic (Extended), letters outside Arabic proper
627 codes: FB50-FDFF Arabic Presentation Forms-A, positional forms of letters outside Arabic proper
141 codes: FE70-FEFF Arabic Presentation Forms-B, positional forms of the letters of Arabic proper
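The counts above can be checked roughly with Python's standard `unicodedata` module. This is only a sketch: it treats a code point as assigned when it has a character name, and it uses whatever Unicode version ships with the running interpreter, so the figures will generally differ from the Ver.4 counts quoted here. The `RANGES` labels and the `count_assigned` helper are names made up for this illustration.

```python
import unicodedata

# Inclusive ranges as listed above; the labels are for display only.
RANGES = {
    "Arabic (0600-06FF)": (0x0600, 0x06FF),
    "Arabic Presentation Forms-A (FB50-FDFF)": (0xFB50, 0xFDFF),
    "Arabic Presentation Forms-B (FE70-FEFF)": (0xFE70, 0xFEFF),
}

def count_assigned(first, last):
    """Count code points in [first, last] that have a character name."""
    return sum(
        1 for cp in range(first, last + 1)
        if unicodedata.name(chr(cp), "")  # "" (falsy) when there is no name
    )

counts = {label: count_assigned(lo, hi) for label, (lo, hi) in RANGES.items()}
for label, n in counts.items():
    print(f"{n:4d}  {label}")
print(f"{sum(counts.values()):4d}  total")
```

Noncharacters and unassigned code points have no name, so they are left out of the totals.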
This page rearranges the Code Charts so that the naskhi and nastaliq typefaces can be compared, shown on the upper and the lower row of each entry respectively. The columns of the tables are laid out as follows.
Columns: fin ← med ← ini, iso, typical form, Code, Name [transliteration], Remark
fin = final form, med = medial form, ini = initial form, iso = isolated form, Code = Unicode code point
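The positional-form columns correspond to separate code points in Arabic Presentation Forms-B. The sketch below looks them up from the compatibility decomposition data in Python's `unicodedata` module, assuming single-letter forms decompose as a tag such as `<initial>` followed by one base code point. `FORM_TAGS` and `positional_forms` are illustrative names, not part of any library.

```python
import unicodedata

FORM_TAGS = ("<isolated>", "<final>", "<medial>", "<initial>")

def positional_forms(letter):
    """Map form tag -> presentation-form character for one base letter."""
    target = f"{ord(letter):04X}"
    forms = {}
    for cp in range(0xFE70, 0xFF00):  # Arabic Presentation Forms-B
        parts = unicodedata.decomposition(chr(cp)).split()
        # Keep only single-letter forms, e.g. "<initial> 0636".
        if len(parts) == 2 and parts[0] in FORM_TAGS and parts[1] == target:
            forms[parts[0]] = chr(cp)
    return forms

for tag, ch in positional_forms("\u0636").items():  # 0636 dad
    print(tag, f"{ord(ch):04X}", unicodedata.name(ch))
```

Letters that do not join to the following letter, such as alef or dal, come back with only the isolated and final forms, which matches the "- -" entries in the tables.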
Arabic letters based on ISO 8859-6
- - ا 0627 alef
[ a ]
- - ا
ب 0628 beh
[ b ]
ب
ت 062A teh
[ t ]
ت
ث 062B theh
[th]
ث
ج 062C jeem
[ j ]
ج
ح 062D hah
[ ḥ ]
ح
خ 062E khah
[kh]
خ
- - د 062F dal
[ d ]
- - د
- - ذ 0630 thal
[dh]
- - ذ
- - ر 0631 reh
[ r ]
- - ر
- - ز 0632 zain
[ z ]
- - ز
س 0633 seen
[ s ]
س
ش 0634 sheen
[sh]
ش
ص 0635 sad
[ ṣ ]
ص
ﺿ ض 0636 dad
[ ḍ ]
ﺿ ض
ط 0637 tah
[ ṭ ]
ط
ظ 0638 zah
[ ẓ ]
ظ
ع 0639 ain*
[ ` ]
ع
غ 063A ghain
[gh]
غ
ف 0641 feh
[ f ]
ف
ق 0642 qaf
[ q ]
ق
ك 0643 kaf
[ k ]
ك
ل 0644 lam
[ l ]
ل
م 0645 meem
[ m ]
م
ن 0646 noon
[ n ]
ن
ه 0647 heh
[ h ]
ه
- - و 0648 waw
[ w ]
- - و
ي 064A yeh
[ y ]
ي
* ain
voiced pharyngeal fricative
→ 01B9 ƹ Latin small letter ezh reversed
→ 02BF ʿ modifier letter left half ring (transliteration of Arabic ain)
- - - ء 0621 hamza
[ ' ]
glottal stop [catch]
→ 02BE ʾ modifier letter right half ring (transliteration of Arabic hamza)
- - - ء
- - آ 0622 alef with madda above ≡ 0627 ا (alef) 0653 ٓ (maddah above)
- - آ
- - أ 0623 alef with hamza above ≡ 0627 ا (alef) 0654 ٔ (hamza above)
- - ؤ 0624 waw with hamza above ≡ 0648 و (waw) 0654 ٔ (hamza above)
- - إ 0625 alef with hamza below ≡ 0627 ا (alef) 0655 ٕ (hamza below)
ئ 0626 yeh with hamza above ≡ 064A ي (yeh) 0654 ٔ (hamza above)
ئ
- - ة 0629 teh marbuta * closed ت (teh)
teh sometimes takes this form at the end of a word. In most cases it marks a woman's name or the feminine form of an adjective or noun.
- - ة
- - - - ـ 0640 tatweel
= kashida
• inserted to stretch characters
• also used with Syriac
ى 0649 alef maksura *
• represents yeh-shaped letter with no dots in any positional form
The alef that forms the long vowel aa sometimes takes this form at the end of a word.
ى
- - - - ٮ 066E dotless beh
(archaic letter) (Not ISO 8859-6)
- - - - ٯ 066F dotless qaf
(archaic letter) (Not ISO 8859-6)
* → "learn arabic online".
* alef with fathatan is encoded as a ligature in Arabic Presentation Forms-A. (→ FD3D)
* Among the extended Arabic letters, Kazakh has a sign called high hamza, which is placed at the upper right of a letter. (→ 0674)
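The canonical equivalences (≡) listed above for 0622 to 0626 can be reproduced with Unicode normalization. A minimal sketch using the standard `unicodedata` module follows; the `PRECOMPOSED` list and the `decompose` helper are names chosen for this example.

```python
import unicodedata

# The five precomposed letters listed above with a canonical decomposition.
PRECOMPOSED = ["\u0622", "\u0623", "\u0624", "\u0625", "\u0626"]

def decompose(ch):
    """Return the NFD code points of ch as 4-digit hex strings."""
    return [f"{ord(c):04X}" for c in unicodedata.normalize("NFD", ch)]

for ch in PRECOMPOSED:
    print(f"{ord(ch):04X}", unicodedata.name(ch), "->", " ".join(decompose(ch)))

# NFC goes the other way: alef + hamza below recombines to 0625.
recomposed = unicodedata.normalize("NFC", "\u0627\u0655")
print(f"{ord(recomposed):04X}")
```

Comparing strings after normalizing both sides to the same form avoids treating the precomposed and decomposed spellings as different text.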
Points from ISO 8859-6 and glyphs for spacing forms
Columns: shadda with (med, iso), med, iso, Code, Name, Remark
- - ً 064B fathatan -
- - ٌ 064C dammatan -
- - ٍ 064D kasratan -
َ 064E fatha a (short vowel)
ُ 064F damma u (short vowel)
ِ 0650 kasra i (short vowel)
- - ّ 0651 shadda doubles the consonant.
- - ﹿ ْ 0652 sukun
• marks absence of a vowel after the base consonant
• used in some Korans to mark a long vowel as ignored
→ 06E1 ۡ Arabic small high dotless head of khah
- - - ٰ 0670 superscript alef
• actually a vowel sign, despite the name
(Not ISO 8859-6)
Glyph part
FE73 tail fragment
• for compatibility with certain legacy character sets
Combining marks
Code Name Remark
ٓ 0653 maddah above -
ٔ 0654 hamza above -
ٔ
ٕ 0655 hamza below -
ٖ 0656 subscript alef -
ٗ 0657 inverted damma -
٘ 0658 mark noon ghunna
Kashmiri and Baluchi
• indicates nasalization in Urdu
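The points and combining marks listed above (064B to 0658, plus 0670) are nonspacing marks that attach to the preceding letter. The sketch below prints their general category and canonical combining class from Python's `unicodedata` module; the exact class values come from the interpreter's Unicode data, and `MARKS` and `describe` are illustrative names.

```python
import unicodedata

# 064B..0658 plus superscript alef 0670, as listed in the tables above.
MARKS = [chr(cp) for cp in range(0x064B, 0x0659)] + ["\u0670"]

def describe(ch):
    """Return (code, general category, canonical combining class, name)."""
    return (
        f"{ord(ch):04X}",
        unicodedata.category(ch),
        unicodedata.combining(ch),
        unicodedata.name(ch),
    )

for row in map(describe, MARKS):
    print(*row)
```

A nonzero combining class is what lets normalization reorder a sequence such as shadda plus fatha into a canonical order.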
Punctuation, Signs, Marks and Symbols
، 060C comma
• also used with Thaana and Syriac in modern text
→ 002C , comma
؍ 060D date separator -
؛ 061B semicolon
• also used with Thaana and Syriac in modern text
→ 003B ; semicolon
؟ 061F question mark
• also used with Thaana and Syriac in modern text
→ 003F ? question mark
٪ 066A percent sign
→ 0025 % percent sign
٫ 066B decimal separator -
٬ 066C thousands separator
→ 0027 ' apostrophe
→ 2019 right single quotation mark
٭ 066D five pointed star
→ 002A * asterisk
۔ 06D4 full stop
Urdu
۔
﴾ FD3E ornate left parenthesis. Left means closing.
﴿ FD3F ornate right parenthesis. Right means opening.
۽ 06FD Sindhi ampersand (Signs for Sindhi)
۾ 06FE Sindhi postposition men (Signs for Sindhi) → Sindhi and Lahnda in the 1911 Edition Encyclopedia
؎ 060E poetic verse sign (Poetic marks)
؏ 060F misra (Poetic marks)
﷼ FDFC Rial sign
(Currency sign) → Word Ligature
≈ <isolated> 0631 ر (reh) 06CC ی (Farsi yeh) 0627 ا (alef) 0644 ل (lam)
﷽ FDFD Bismillah Ar-rahman Ar-raheem (Symbol) → Word Ligature
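The compatibility mapping (≈) given above for the Rial sign can be read straight from the character database. The following sketch uses Python's `unicodedata` module; `RIAL_SIGN` and `letters` are names introduced only for this example.

```python
import unicodedata

RIAL_SIGN = "\uFDFC"

# Raw decomposition field: a tag followed by the four letter codes.
print(unicodedata.decomposition(RIAL_SIGN))

# NFKD applies the compatibility mapping and yields the plain letters.
letters = unicodedata.normalize("NFKD", RIAL_SIGN)
for ch in letters:
    print(f"{ord(ch):04X}", unicodedata.name(ch))
```

Because this is a compatibility mapping and not a canonical one, NFD and NFC leave FDFC unchanged; only the NFKD and NFKC forms expand it.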
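Subtending marks
The transitive verb "subtend" seems hard to translate into Japanese. (Example) Each side subtends an angle of 120° at the Fermat point.
The dictionary gives "v. Geom. To be opposite to and delimit. [L subtendere, to extend beneath]"; judging from the glyphs, the word may be used here in the sense of subtendere.
Code Name Remark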
؀ 0600 number sign -
؁ 0601 sanah -
؂ 0602 footnote marker -
؃ 0603 safha cf. safah
Honorifics
Code Name Remark
ؐ 0610 sallallahou alayhe wassallam
• represents sallallahu alayhe wasallam
"may god's peace and blessings be upon him"
ؑ 0611 alayhe assallam
• represents alayhe assalam
"upon him be peace"
ؒ 0612 rahmatullah alayhe
• represents rahmatullah alayhe
"may god have mercy upon him"
ؓ 0613 radi allahou anhu
• represents radi allahu 'anhu
"may god be pleased with him"
ؔ 0614 takhallus
• sign placed over the name or nom-de-plume of a poet, or in some writings used to mark all proper names
Koranic annotation signs
Code Name Remark
ؕ 0615 small high tah
• marks a recommended pause position in some Korans published in Iran and Pakistan
• should not be confused with the small tah sign used as a diacritic for some letters such as 0679 ٹ
ۖ 06D6 small high ligature sad with lam with alef maksura -
ۗ 06D7 small high ligature qaf with lam with alef maksura -
ۘ 06D8 small high meem initial form -
ۙ 06D9 small high lam alef -
ۚ 06DA small high jeem -
ۛ 06DB small high three dots -
ۜ 06DC small high seen -
۝ 06DD end of ayah -
۞ 06DE start of rub el hizb -
۟ 06DF small high rounded zero -
۠ 06E0 small high upright rectangular zero -
ۡ 06E1 small high dotless head of khah
= Arabic jazm
• used in some Korans (Qur'ans) to mark absence of a vowel
→ 0652 ْ Arabic sukun
ۢ 06E2 small high meem isolated form -
ۣ 06E3 small low seen -
ۤ 06E4 small high madda -
ۥ 06E5 small waw -
ۦ 06E6 small yeh -
ۧ 06E7 small high yeh -
ۨ 06E8 small high noon -
۩ 06E9 place of sajdah -
۪ 06EA empty centre low stop -
۫ 06EB empty centre high stop -
۬ 06EC rounded high stop with filled centre -
ۭ 06ED small low meem -
Arabic-Indic digits
These digits are used with Arabic proper.
Eastern Arabic-Indic digits
These digits are used with Arabic-script languages of Iran, Pakistan, and India (Persian, Sindhi, Urdu, etc.). For details of variations in preferred glyphs, see the block description for the Arabic script.
٠ ٠ 0660 zero
١ ١ 0661 one
٢ ٢ 0662 two
٣ ٣ 0663 three
٤ ٤ 0664 four
٥ ٥ 0665 five
٦ ٦ 0666 six
٧ ٧ 0667 seven
٨ ٨ 0668 eight
٩ ٩ 0669 nine
۰ 06F0 -
۱ 06F1 -
۲ 06F2 -
۳ 06F3 -
۴ 06F4
• Persian has a different glyph than Sindhi and Urdu
۵ 06F5
• Persian, Sindhi, and Urdu share glyph different from Arabic
۶ 06F6
• Persian, Sindhi, and Urdu have glyphs different from Arabic
۷ 06F7
• Urdu and Sindhi have glyphs different from Arabic
۸ 06F8 -
۹ 06F9 -
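Both digit series carry the decimal values 0 to 9, so they can be converted to ASCII digits programmatically. Below is a minimal sketch with Python's `unicodedata` module; `to_ascii_digits` is a hypothetical helper name, and the function converts every character of category Nd, not only the two series shown here.

```python
import unicodedata

def to_ascii_digits(text):
    """Replace every decimal digit (category Nd) with its ASCII equivalent."""
    return "".join(
        str(unicodedata.decimal(ch)) if unicodedata.category(ch) == "Nd" else ch
        for ch in text
    )

arabic_indic = "\u0664\u0662"  # 0664 four, 0662 two
eastern = "\u06F4\u06F2"       # 06F4 four, 06F2 two

print(to_ascii_digits(arabic_indic), to_ascii_digits(eastern))

# Python's int() also accepts these digits directly.
print(int(arabic_indic) + int(eastern))
```

The glyph differences noted for 06F4 to 06F7 are a font matter; the numeric values are the same in every language.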
Noncharacters
The 32 code points in this range are intended for process-internal use and are not permitted for interchange.
<not a character> FDD0 - FDEF
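A program that follows this rule might filter noncharacters out before sending text elsewhere. The sketch below is one way to do that in Python; besides FDD0 to FDEF it also treats the last two code points of every plane (such as FFFE and FFFF) as noncharacters. `is_noncharacter` and `strip_noncharacters` are names made up for this illustration.

```python
def is_noncharacter(cp):
    """True for FDD0..FDEF and for the last two code points of each plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE

def strip_noncharacters(text):
    """Drop noncharacters from text before it is interchanged."""
    return "".join(ch for ch in text if not is_noncharacter(ord(ch)))

# Count the noncharacters inside Arabic Presentation Forms-A (FB50-FDFF).
in_block = sum(is_noncharacter(cp) for cp in range(0xFB50, 0xFE00))
print(in_block)
print(repr(strip_noncharacters("\u0627\uFDD0\u0628")))
```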
Special
Code Name Remark
 FEFF Zero Width No-Break Space
= Byte Order Mark (BOM), ZWNBSP
• may be used to detect byte order by contrast with the noncharacter code point FFFE
• use as an indication of non-breaking is deprecated; see 2060 instead
→ 200B zero width space
→ 2060 word joiner
→ FFFE <not a character>
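The byte-order detection described above works because FEFF read with the wrong byte order appears as the noncharacter FFFE. A minimal sketch for UTF-16 data, using the BOM constants from Python's `codecs` module, is shown here; `detect_utf16_order` is a hypothetical helper, and it does not try to tell UTF-16 apart from UTF-32, whose little-endian BOM starts with the same two bytes.

```python
import codecs

def detect_utf16_order(data):
    """Return 'big', 'little', or None from the first two bytes."""
    if data.startswith(codecs.BOM_UTF16_BE):  # bytes FE FF
        return "big"
    if data.startswith(codecs.BOM_UTF16_LE):  # bytes FF FE
        return "little"
    return None

# FEFF followed by alef (0627), serialized big-endian without an extra BOM.
sample = "\uFEFF\u0627".encode("utf-16-be")
print(detect_utf16_order(sample))
```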
Related Sites
N2413-4 Proposal to add Marks and Digits in Arabic Code Block (for Urdu)
Misra, Safah, Nuqtatain, Jazm, small high tah, Bismillah ligature, digits 0 to 9
First edition : 2003.7.20