Skip to main content

Entering Pali Diacritics & the Niggahīta/Anusvara (ṃ)

Pali diacritic marks
Diacritics, or diacritical marks, are those curious glyphs added to a letter. The term derives from Ancient Greek.
ā ī ū ṅ  ñ ṇ ṭ ṭh ḍ ḍh ṇ ḷ ṃ ṁ ŋ
Pali is a phonetic language and has no written alphabet of its own. Ever since the 1st century, scholars have relied on their own native alphabets to write Pali ! European scholars have thus transliterated Pali into the Roman alphabet and this required its augmentation with additional characters represented by letter-pairs and diacritics. This was fine whilst Pali literature was mainly printed, but with the introduction of computers, the problem arose of how to represent these characters within a standard ASCII font.

Many differing methods have been adopted over the years meaning unfortunately that there is no standard way of representing Pali's diacritic characters via the then  limited character sets available on PCs. As a result the student will encounter a variety of legacy approaches some of which include:
  • Ignoring them altogether. This method is used by Access to Insight. For example:
panatipata veramani sikkha-padam samadiyami.
  • Another method is to represent long vowels with doubled characters ( aa ii uu); and placing punctuation marks before letters to represent the consonants (  .r .t .th .d .dh .n .m .s .l ~n "n). This is also called the Velthuis system. For example:
paa.naatipaataa verama.nii sikkhaa-pada.m samaadiyaami.
  • The development of HTML lead a few to use some sort of HTML accent: ä ï ü, à ì ù, â î û ñ etc. Example:
pâ.nâtipâtâ verama.nî sikkhâ-pada.m samâdiyâmi.
  • Others have employed capitalized letters to represent the diacritics. Though simple, it is hard to distinguish between the palatal and guttural n. Example:
pANAtipAtA veramaNI sikkhA-padaM samAdiyAmi.

However, the introduction of modern Unicode fonts has meant the problem of representing diacritics is now trivial. Unicode fonts include Tahoma, Arial MS Unicode and the latest version of Times New Roman (in MS word from 2010 onward). And these fonts have all the characters we need. But it's a pain to have to insert them as special symbols...

So how can I type them? -Direct Input

This brings us to the second problem which is how to enter these characters using a standard keyboard. Many tools/apps have created their own methods; often through menu selection.

To aid input of diacritics I've created a JavaScript tool: Pali diacritics converter tool. Simply type in Velthuis character combinations as below and they will magically change into Unicode.

Here's a table for codes comparisons:
character
UDP
Unicode number
Velthuis
HTML code
ā
a macron
\u0101
aa
ā
ñ
n tilde
\u00f1
~n
ñ
ī
i macron
\u012b
ii
ī
d dot-under
\u1e0d
.d
ḍ
n dot-over
\u1e45
“n
ṇ
l dot-under
\u1e37
.l
ḷ
t dot-under
\u1e6d
.t
ṭ
m dot-over
\u1e41
“m
ṁ
ū
u macron
\u016b
uu
ū
n dot-under
\u1e47
.n
ṇ
m dot-under
\u1e43
.m
ṃ
ŋ

\u014b

ŋ

What I really want is: Pali Keyboard

There are also dedicated keyboard tools, for instance see the Pali Keyboard, which allow direct typing of diacritic characters when using word processors, web browsers etc.. by simple key combinations. Simply follow the instructions to install and start typing away...

A quick note on looking up words - the Niggahīta 

Another source of variation among texts is the representation of the nasal niggahīta sound (also called anusvāra), which in western script it has been transliterated as η, ṁ or ṃ.

Just to add to the confusion, when occurring in the middle of a word, the ‘ṃ’ in some Pali texts can be substituted by a nasal - ṅ, ñ, ṇ, n, or just plain m; which means some texts spell words with ‘ṃ’ and some with a nasal! We will look at this issue in the  next post  Pali alphabet & Dictionaries.

Dictionaries generally use the nasal. So if you come across a Pali word which has ‘ṃ’ in the middle of it, you have to replace it with a nasal in order to find the word in the dictionary! Also the order of words in dictionaries containing the niggahīta ṃ can be difficult to navigate and so I recommend using the search function.

Again, as an aid I've created another tool Pali Dictionary lookup tool that will take Unicode and re-format to be consistent with the PED. It also produces direct links to any entry too!

As an aside, it’s also important to note that ‘ti the marker of direct speech affects the spelling of the word immediately preceding it  (due to Sandhi) in two ways: an immediately preceding vowel becomes lengthened and the niggahīta ṃ changes to a nasal before ‘ti and sometimes ‘ca. So, when looking up words, these effects must first be reversed.


See the next post on the Pali alphabet & Dictionaries.

More posts

Comments

Popular posts from this blog

Sutta Number to PTS reference converter

Easily look up PTS references in the Sutta Piṭaka. [New expanded coverage tables. Includes alternate numbering. Search accepts space, comma or period separated numbers, case insensitive, diacritic insensitive.] Quick Jump Table   PTS Vol Dīgha Nikāya (DN) D i , D ii , D iii Majjhima Nikāya (MN)   M i , M ii , M iii Saṁyutta Nikāya (SN) S i , S ii , S iii , S iv , S v Aṅguttara Nikāya (AN)   A i , A ii , A iii , A iv , A v Go to Khuddaka Nikaya (KN) ( New! KN ref converter ) Use the quick jump table above; Note: For Vol. i of SN, there are two sets of PTS page numbers for each sutta. This is because the L. Feer editions differs from Somaratne 1999. Or type a Sutta number or name into the search boxes below to search that column of the table!

What is Pali Language? A little history

In all these grammar tutorials we have never stopped to ask: What is Pali?” “What does the word mean?” “What are the origins of Pali? And this is what we will investigate in this post.... What is Pali & Who Speaks it? Well, let's get the obvious answer out of the way: Pali is an ancient Middle Indo-Aryan language native to the Indian subcontinent. It is widely studied because it is the language, in which, the scriptures of Theravada school of Buddhism - or Tipiṭaka - have been preserved and passed down. True. Today Pali is studied mainly to gain access to Theravada Buddhist scriptures, and is frequently chanted in a ritual context. But when we say a ' language ', most languages are named either after a population or a region, and we have no evidence of a region called Pali or even a population of Pali speakers... Along side this, nowhere in the Pali canon itself is there a mention of a language called Pali!

Karaniya Metta Sutta Snp 1.8 | translation guide

Continuing my look at the Sutta Nipata, - which contains some of the oldest portion of the canon. This translation guide analyses the Mettā Sutta (Snp 1.8) which, is perhaps the most popular discourse for reciting. It is a poem on the virtues of mettā, loving-kindness. It is a brief set of ten verse which lays out the ethical foundations for good will, and shows how to develop it in meditation. The word “Metta” is derived from the Pali word “Mitta” meaning “friend”. In the English, it has been translated as kindness, loving kindness, love and goodwill. Verse composition can throw up challenges to the amateur translator as it often employs alternate forms of words according to the needs of the metre. As such, we may find some unusual archaic endings in verse, or instances where words have been altered (vowel lengths, dropping of a niggahãta etc.) in order to meet the metre. And several examples of these can be found in the verses below.

PTS reference converter: Khuddaka Nikaya

Easily look up PTS references in the Khuddaka Nikaya. (Or look up PTS references in the first four Nikayas ) The Khuddaka Nikaya is the fifth Nikaya known as Collection of Minor Texts , it consists of 15 to 18 books depending on tradition. I beleive, the original Pali Canon has 15 books while the Thai edition has 17 and the Burmese edition 18 books. These then have been arranged and numbered in a couple of different ways.

Attadanda Sutta Snp 4.15 | Translation guide

For those wishing a bit more of a challenge the Atthaka-vagga, of the Sutta Nipata, according to scholars, may well be the oldest portion of the entire canon. It is composed mostly in verse, and includes some lovely imagery. In this guide I've chosen the Attadaṇḍā Sutta (Snp 4.15) which, according to tradition, was taught by the Buddha when he went to settle a quarrel. The title of the sutta, translated here as "arming oneself," conveys the image of a person taking up a stick, (daṇḍa) - the stick being a common symbol in Indian literature for both violence and punishment. This then sets out the basic problem of confrontation  - people hurting each other. The sutta continues by outlining the cause  - the misplaced wish for security driven by desire - and finally, its solution - letting go of the ego.