Английская Википедия:Hindi–Urdu transliteration

Материал из Онлайн справочника
Перейти к навигацииПерейти к поиску

Шаблон:Short description Hindi–Urdu (Devanagari: Шаблон:Lang, Nastaliq: Шаблон:Lang) (also known as Hindustani)[1][2] is the lingua franca of modern-day Northern India and Pakistan (together classically known as Hindustan).[3] Modern Standard Hindi is officially registered in Indian Republic as a standard written using Devanagari script, and Urdu is officially registered in Pakistan as a standard written using extended Perso-Arabic script.

Hindi–Urdu transliteration (or Hindustani transliteration) is essential for Hindustani speakers to understand each other's text, and especially important since the underlying language of both the Hindi & Urdu registers are almost the same.[4] Transliteration is theoretically possible because of the common Hindustani phonology underlying Hindi-Urdu. In present day, the Hindustani language is seen as a unifying language,[5] as initially proposed by Mahatma Gandhi to resolve the Hindi–Urdu controversy.[6] ("Hindustani" is not to be confused with followers of Hinduism, as 'Hindu' in Persian means 'Indo')

Technically, a direct one-to-one script mapping or rule-based lossless transliteration of Hindi-Urdu is not possible, majorly since Hindi is written in an abugida script and Urdu is written in an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.[7] However, there have been dictionary-based mapping attempts which have yielded very high accuracy, providing near-to-perfect transliterations.[8] For literary domains, a mere transliteration between Hindi-Urdu will not suffice as formal Hindi is more inclined towards Sanskrit vocabulary whereas formal Urdu is more inclined towards Persian and Arabic vocabulary; hence a system combining transliteration and translation would be necessary for such cases.[9]

In addition to Hindi-Urdu, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Sindhi (written in extended Perso-Arabic in Sindh of Pakistan and in Devanagari by Sindhis in partitioned India), Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).[10][11][12]

Vowels

Hindustani vowels
IPA Hindi ISO 15919 Urdu[13] Approxi. English
equivalent
Initial Final Final Medial Initial
Шаблон:IPA link[14] a Шаблон:Lang Шаблон:Lang Шаблон:Nq Шаблон:Lang about
Шаблон:IPA link ā Шаблон:Lang Шаблон:Lang far
Шаблон:IPA link ि i Шаблон:Lang Шаблон:Nq Шаблон:Lang still
Шаблон:IPA link ī Шаблон:Nq Шаблон:Lang fee
Шаблон:IPA link u Шаблон:Lang Шаблон:Nq Шаблон:Lang book
Шаблон:IPA link ū Шаблон:Nq Шаблон:Lang moon
Шаблон:IPA link ē Шаблон:Lang Шаблон:Lang Шаблон:Lang mateШаблон:Verification needed
Шаблон:IPA link ai Шаблон:Nq Шаблон:Nq Шаблон:Lang fairy
Шаблон:IPA link ō Шаблон:Lang Шаблон:Lang force
Шаблон:IPA link au Шаблон:Nq Шаблон:Lang lot (Received Pronunciation)
Шаблон:IPA link[15] h Шаблон:Lang (Aspirated sounds) cake
Шаблон:IPA link[16] Шаблон:Lang Шаблон:Lang Шаблон:Lang nasal vowel faun
(Шаблон:IPA, etc.)
jungle

Consonants

Hindustani has a rich set of consonants in its full-alphabet, since it has a mixed-vocabulary (rekhta) derived from Old Hindi (from Dehlavi), with loanwords from Parsi (from Pahlavi) and Arabic languages, all of which itself are from 3 different language-families respectively: Indo-Aryan, Iranian and Semitic.

The following table provides an approximate one-to-one mapping for Hindi-Urdu consonants,[17] especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings,[18] but rather a readable text for both the readers. Note that Hindi–Urdu transliteration schemes can be used for Punjabi language as well, for Gurmukhi (East Punjabi) to Shahmukhi (West Punjabi) conversion, since Shahmukhi is a superset of Urdu alphabets (with 2 extra consonants) and Gurmukhi font can be easily converted to Devanagari font.

Hindustani Consonants
PersoArabic Roman Devanagari Comments
ک k
کھ kh
ق q क़ The nuqta, in colloquial settings, is sometimes ignored in Hindi and written as क[19][20]
خ k͟h ख़ The nuqta, in colloquial settings, is sometimes ignored in Hindi and written as ख[19][20]
گ g
غ g͟h ग़ The nuqta, in colloquial settings, is sometimes ignored in Hindi and written as ग[19][20]
گھ gh
چ c
چھ ch
ج j
جھ jh
ز z ज़ The nuqta, in colloquial settings, is sometimes ignored in Hindi and written as ज[19][20]
ذ ज़़ (Approximated in Devanagari for one-to-one map. Actually same sound as ज़)
ض (Approximated in Devanagari for one-to-one map. Actually same sound as ज़)
ظ ॹ़ (Approximated in Devanagari for one-to-one map. Actually same sound as ज़)
ژ zh झ़ Used in direct Farsi loan-words
ٹ
ٹھ ṭh
ڈ
ڈھ ḍh
ڑ ड़ Colloquially, ṛ is often confused with ḍ and vice versa
ڑھ ṛh ढ़ Colloquially, ṛh is often confused with ḍh and vice versa
ت t
تھ th
ط त़ (Approximated in Devanagari for one-to-one map. Actually same sound as त)
د d
دھ dh
ن n
پ p
پھ ph
ف f फ़ The nuqta, in colloquial settings, is sometimes ignored in Hindi and written as फ[19][20]
ب b
بھ bh
م m
ی y
ر r
ل l
و v و is transcribed as /w/ for Arabic words and /v/ for Indo-Iranian words
w व़
ش sh
س s
ص स़ (Approximated in Devanagari for one-to-one map. Actually same sound as स)
ث स़़ (Approximated in Devanagari for one-to-one map. Actually same sound as स)
ہ h
ح ह़ (Approximated in Devanagari for one-to-one map. Actually same sound as ह)
ۃ ह॒ Used only for Arabic-derived words (approximated in Devanagari)
ھ h ھ is generally only used for aspirated consonants. Any individual usage is generally considered an error and to be taken as ہ
ع ʿ ʿ Variable consonant placeholder

Sanskrit consonants

The following consonants are mostly used in words that are directly borrowed or adapted from Sanskrit.

Perso-Arabic Roman Devanagari Remarks
ن٘
ݩ ñ ݩ was introduced to write Gojri[21]
ݨ ݨ was introduced to write Shahmukhi[21]
لؕ Rarely used in Shahmukhi
ݜ ṣh ݜ was introduced to write Shina[21]
ڔّ

Implosive consonants

These consonants are mostly found only in languages like Sindhi and Saraiki.

Perso-Arabic Roman Devanagari
ڳ
ڄ
ݙ/ڏ
ٻ ॿ

Numerals

Usage Numeral System
Urdu East-Arabic ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹
International Hindu-Arabic 0 1 2 3 4 5 6 7 8 9
Hindi Modern Devanagari

Punctuations & Symbols

Script Period Question Mark Comma Semi-colon Slash Percent End of verse
Perso-Arabic ۔ ؟ ، ؛ ؍ ٪ ۝
Modern Devanagari ? , ; / %

Sample text

The following is an excerpt from the Hindustani poem Tarānah-e-Hindi written by Muhammad Iqbal.

Perso-Arabic Devanagari Roman English translation
सारे जहाँ से अच्छा,
हिन्दुसिताँ हमारा।
हम बुलबुलें हैं इसकी,
यह गुलसिताँ हमारा॥
sāre jahā̃ se acchā,
hindusitā̃ hamārā.
ham bulbulẽ haĩ iskī,
yah gulsitā̃ hamārā..
Better than the entire world,
is our India.
We are its nightingales,
and it (is) our garden abode.

See also

Шаблон:Portal

References

Шаблон:Reflist

  1. Шаблон:Cite web
  2. Шаблон:Cite book
  3. Шаблон:Cite book
  4. Шаблон:Cite journal
  5. Шаблон:Cite web
  6. Шаблон:Cite web
  7. Шаблон:Cite journalШаблон:Dead linkШаблон:Cbignore
  8. Шаблон:Cite journal
  9. Шаблон:Cite journal
  10. Шаблон:Cite web
  11. Шаблон:Cite web
  12. Шаблон:Cite journal
  13. Diacritics in Urdu are normally not written and usually implied and interpreted based on the context of the sentence
  14. Шаблон:IPA occurs as a conditioned allophone of Шаблон:IPA near an Шаблон:IPA surrounded on both sides by schwas. Usually, the second schwa undergoes syncopation, and the resultant is just an Шаблон:IPA preceding an Шаблон:IPA. Hindi does not have a letter to represent ə as it is usually implied
  15. Hindi has individual letters for aspirated consonants whereas Urdu has a specific letter to represent an aspirated consonant
  16. No words in Hindustani can begin with a nasalised letter/diacritic. In Urdu the initial form (letter) for representing a nasalised word is: ن٘ (nūn + small nūn ghunna diacritic)
  17. Шаблон:Citation
  18. Шаблон:Cite journal
  19. 19,0 19,1 19,2 19,3 19,4 Шаблон:Cite book
  20. 20,0 20,1 20,2 20,3 20,4 Шаблон:Cite book
  21. 21,0 21,1 21,2 Шаблон:Cite web