I am studying Farsi. It's now been a couple of months that I have been attending classes. First of all, it's not easy for an empirical positivist like me to understand the rather free-flowing mechanism of learning languages. Grammar rules, punctuation and vocabulary are all foreign bodies to me. But interestingly enough, I read this article:
“Urdu is a mixture of Persian, Arabic and Turkish words formed with the intermingling of invading Muslim armies and local Hindi-speaking Hindus. It’s a Turkish word which means army camp, horde, etc.”
Almost everyone who knows something about the Urdu language knows this statement, which is logically incomprehensible, historically incorrect and linguistically misleading. Irrespective of who made this statement, why it was made and when it was first made, one thing can be said with utmost certainty: it has had a profound socio-political and socio-linguistic impact on the Indian Subcontinent for the last 150 years, i.e., since the advent of the British Raj in this region.
One can draw a few inferences from this falsehood, which has shaped our perception, consciously and subconsciously, that Urdu is not a native language of the Indian Subcontinent but rather a language of foreign invaders. Consequently, it must be disowned if not hated.
Historically it is incorrect because the Muslim rulers did not introduce any new language. Instead they gave a new script (Perso-Arabic, or Nastaliq) that could accommodate the spoken language of India. They even invented new signs or letters for sounds that are purely local and were missing from the existing Perso-Arabic script, i.e., all the aspirated sounds of Bha, Pha, Tha, Gha, Dha, Rha, Lha and retroflex sounds like Rah, Taa, Daa, etc. Hence all the tens of thousands of words spoken in Urdu containing these sounds have their origin in the early Vedic or middle Vedic era, i.e., 400 to 600BC.
In addition, all the infinitives (Masader) ending in the Na sound, like Aana, Jaana, Khana, Peena, Uthna, Baithna, Perhna, Likhna, Sona, Jagna, Chalna, Bhagna, Larna, Dhukna, Boolna, Sunna, Kehna, etc., are words of this language, which was spoken as the vernacular of the early Vedic and middle Vedic period.
The process of loaning words from other languages is a sign of a living and progressing language. Urdu is one such language.
Urdu speakers did not become Muslim by adopting Persian or Arabic words, nor are they now converting to Christianity by adopting English words into their vocabulary. On the contrary, perhaps, the Indian ruling elite believe that they will become Muslims if they use Arabic and Persian words, so they are making conscious efforts to replace most of these words with Sanskrit words. This policy may have socio-political advantages, albeit not without socio-political repercussions.
As a student of linguistics, I can make only one comment on this current policy of India: such a policy leads to secluding more people and ethnic groups rather than integrating them.
Let's analyse a few historical facts. The Turks started converting to Islam in 920AD with the invasion of the Arabs. The Arabs set foot on the soil of Sindh in 711AD, but they had been in constant contact with the Indian Subcontinent for centuries prior to Islam. The Ottoman Turk Empire was established in 1299. Lahore was under the rule of Mahmud Ghaznavi in 1254AD.
However, the interaction of Persian-speaking people with India via trade goes back to the BC era. In short, the cultural association of Persian and Arabic-speaking people with the Indian people predates Islam.
Arabic, Persian and Turkish vocabularies were brought to India by traders, invaders and preachers. Turkish is the only language which was restricted to invaders or rulers, whereas Arabic was introduced by traders as well as early invaders and remained restricted to present-day Sindh and southern Punjab. Persian, the most influential of the three, remained the language of invaders, traders and preachers over the centuries. Interestingly, the language of preachers and Sufis and early Muslim poets in India always remained Persian … neither Turkish nor Arabic.
Coming to the misstatement mentioned at the beginning of this article, let’s analyse the empirical data: how many Turkish words has Urdu borrowed? Based on this data, we can infer whether Turkish had any part in the making of Urdu. Leaving aside the syntax and grammar of the two, which are completely different, many people believe the theory of Urdu’s derivation from Turkish, and a few have attempted to prove it too.
One such attempt was made by Mr Purdil Khattak, who wrote Urdu aur Turki Kay Mushtarik Alfaz, published by Muqtedarah Qaumi Zuban (1987), Islamabad. He made a great effort and was able to list only 2,608 words which are commonly spoken by a Turkish speaker and an Urdu speaker. Even if we take this statement as it is, in a language which has over 3,00,000 words with a base of more than 80,000 lexemes (as contained in the 21 volumes of the Urdu Lughat of the Urdu Lughat Board Karachi, a meticulous work completed in 25 years), 2,608 words is under 0.9 per cent of the total, which does nothing to support the claim that Turkish contributed to the formation of Urdu.
The most interesting part of this research is that the list of 2,608 words common in Turkish and Urdu only contains 24 words which are pure Turkish. The rest are either Arabic, Persian or English words used commonly by Turks and Urdu speakers.
This list contains 1,546 pure Arabic words, most of them Quranic, such as ayat (Quranic verse), bait (house), azeem (great), barq (thunder), jahil (illiterate), jannat (heaven), jamal (beauty), jaib (pocket), jehad (holy war), dakhil (interior), jurm (crime), dalil (proof), deen (religion), ambiya (prophets), ahim (important), fatwa (religious decree), atraf (sides), fashi (eloquent), ghafil (indolent), fikr (thought), khaber (news), hakim (ruler), haal (present), khalis (pure), khas (special), harb (war), hilal (crescent), khilaf (opposite), hudood (limits), and so on.
In the same list, 485 words are pure Persian, borrowed by Turkish, such as aab-o-hawa (weather), ambaar (heap), asoodah (well off), ashiyana (home/nest), arzoo (desire), arasta (decorated), badan (body), bahaar (spring season), bohran (crisis), buland (high), badter (worst), Beyzar (dejected), kahkasan (galaxy), kiswar (country), kutubkhana (library), madad (help), marasim (relations), masroor (ecstatic), mard (man), maakhana (pub), medaan (ground), murdar (dead), etc.
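For anyone who wants to check the arithmetic behind these figures, here is a rough sketch in Python. It uses only the numbers quoted in the article; the "other" bucket is my own inference (whatever is left of the 2,608 after the Turkish, Arabic and Persian counts), and the variable and function names are made up for illustration.

```python
# Figures quoted in the article; nothing here is independently verified.
TOTAL_URDU_WORDS = 300_000   # the article's "over 3,00,000" (i.e. 300,000) words
SHARED_WORDS = 2_608         # words listed as common to Turkish and Urdu

breakdown = {
    "Turkish": 24,
    "Arabic": 1_546,
    "Persian": 485,
}
# Assumption: the remainder is English and anything else the article does not itemise.
breakdown["other"] = SHARED_WORDS - sum(breakdown.values())


def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100 * part / whole


shared_share = percent(SHARED_WORDS, TOTAL_URDU_WORDS)
turkish_share = percent(breakdown["Turkish"], TOTAL_URDU_WORDS)

print(f"Shared list as share of Urdu vocabulary: {shared_share:.2f}%")
print(f"Pure Turkish words as share of Urdu vocabulary: {turkish_share:.3f}%")
for origin, count in breakdown.items():
    share = percent(count, SHARED_WORDS)
    print(f"{origin:>8}: {count:5d} ({share:.1f}% of the shared list)")
```

On these numbers the whole shared list is under one per cent of the vocabulary, and the 24 pure Turkish words are a small fraction of even that.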
But there are so many words that I grew up with that I find common. Angur (grapes), rooz (day), hafte (week), maa (month), saal (year), jangal (forest), manzil (house), raah (path / road), yek baar (once), do bar (twice), sefed (white), buzugh (big/elder), nazik (thin), tazeh (fresh), khaley (empty), khoub (good), nerem (soft), ananas (pineapple), peyaz (onions), lobeya (beans), perendeh (bird), sheyer (lion), lebas (clothes) and so on and so forth.
I am struggling with the verbs, the compound verbs, and the fact that the sentence structure is Subject Object Verb, but the words are quite common, and I am surprised at this level of similarity between Hindi and Urdu and Persian. But I guess I shouldn't be surprised. Fascinating. I am also heavily struggling with writing right to left and the damn diacritical marks. The punctuation / diacritical marks are so important; you have to be precise and remember what the hell each one is. Not just that, you could have the same letter but it could be a vowel or it could be a consonant. And one letter can have three pronunciations. Just kill me. Why the hell did I have to pick this up? I should have picked up something simpler, like mine clearing.