Skip to main content

Entering Pali Diacritics & the Niggahīta/Anusvara (ṃ)

Pali diacritic marks
Diacritics, or diacritical marks, are those curious glyphs added to a letter. The term derives from Ancient Greek.
ā ī ū ṅ  ñ ṇ ṭ ṭh ḍ ḍh ṇ ḷ ṃ ṁ ŋ
Pali is a phonetic language and has no written alphabet of its own. Ever since the 1st century, scholars have relied on their own native alphabets to write Pali ! European scholars have thus transliterated Pali into the Roman alphabet and this required its augmentation with additional characters represented by letter-pairs and diacritics. This was fine whilst Pali literature was mainly printed, but with the introduction of computers, the problem arose of how to represent these characters within a standard ASCII font.

Many differing methods have been adopted over the years meaning unfortunately that there is no standard way of representing Pali's diacritic characters via the then  limited character sets available on PCs. As a result the student will encounter a variety of legacy approaches some of which include:
  • Ignoring them altogether. This method is used by Access to Insight. For example:
panatipata veramani sikkha-padam samadiyami.
  • Another method is to represent long vowels with doubled characters ( aa ii uu); and placing punctuation marks before letters to represent the consonants (  .r .t .th .d .dh .n .m .s .l ~n "n). This is also called the Velthuis system. For example:
paa.naatipaataa verama.nii sikkhaa-pada.m samaadiyaami.
  • The development of HTML lead a few to use some sort of HTML accent: ä ï ü, à ì ù, â î û ñ etc. Example:
pâ.nâtipâtâ verama.nî sikkhâ-pada.m samâdiyâmi.
  • Others have employed capitalized letters to represent the diacritics. Though simple, it is hard to distinguish between the palatal and guttural n. Example:
pANAtipAtA veramaNI sikkhA-padaM samAdiyAmi.

However, the introduction of modern Unicode fonts has meant the problem of representing diacritics is now trivial. Unicode fonts include Tahoma, Arial MS Unicode and the latest version of Times New Roman (in MS word from 2010 onward). And these fonts have all the characters we need. But it's a pain to have to insert them as special symbols...

So how can I type them? -Direct Input

This brings us to the second problem which is how to enter these characters using a standard keyboard. Many tools/apps have created their own methods; often through menu selection.

To aid input of diacritics I've created a JavaScript tool: Pali diacritics converter tool. Simply type in Velthuis character combinations as below and they will magically change into Unicode.

Here's a table for codes comparisons:
character
UDP
Unicode number
Velthuis
HTML code
ā
a macron
\u0101
aa
ā
ñ
n tilde
\u00f1
~n
ñ
ī
i macron
\u012b
ii
ī
d dot-under
\u1e0d
.d
ḍ
n dot-over
\u1e45
“n
ṇ
l dot-under
\u1e37
.l
ḷ
t dot-under
\u1e6d
.t
ṭ
m dot-over
\u1e41
“m
ṁ
ū
u macron
\u016b
uu
ū
n dot-under
\u1e47
.n
ṇ
m dot-under
\u1e43
.m
ṃ
ŋ

\u014b

ŋ

What I really want is: Pali Keyboard

There are also dedicated keyboard tools, for instance see the Pali Keyboard, which allow direct typing of diacritic characters when using word processors, web browsers etc.. by simple key combinations. Simply follow the instructions to install and start typing away...

A quick note on looking up words - the Niggahīta 

Another source of variation among texts is the representation of the nasal niggahīta sound (also called anusvāra), which in western script it has been transliterated as η, ṁ or ṃ.

Just to add to the confusion, when occurring in the middle of a word, the ‘ṃ’ in some Pali texts can be substituted by a nasal - ṅ, ñ, ṇ, n, or just plain m; which means some texts spell words with ‘ṃ’ and some with a nasal! We will look at this issue in the  next post  Pali alphabet & Dictionaries.

Dictionaries generally use the nasal. So if you come across a Pali word which has ‘ṃ’ in the middle of it, you have to replace it with a nasal in order to find the word in the dictionary! Also the order of words in dictionaries containing the niggahīta ṃ can be difficult to navigate and so I recommend using the search function.

Again, as an aid I've created another tool Pali Dictionary lookup tool that will take Unicode and re-format to be consistent with the PED. It also produces direct links to any entry too!

As an aside, it’s also important to note that ‘ti the marker of direct speech affects the spelling of the word immediately preceding it  (due to Sandhi) in two ways: an immediately preceding vowel becomes lengthened and the niggahīta ṃ changes to a nasal before ‘ti and sometimes ‘ca. So, when looking up words, these effects must first be reversed.


See the next post on the Pali alphabet & Dictionaries.

More posts

Comments

Popular posts from this blog

What is Pali Language? A little history

In all these grammar tutorials we have never stopped to ask: What is Pali?” “What does the word mean?” “What are the origins of Pali? And this is what we will investigate in this post.... What is Pali & Who Speaks it? Well, let's get the obvious answer out of the way: Pali is an ancient Middle Indo-Aryan language native to the Indian subcontinent. It is widely studied because it is the language, in which, the scriptures of Theravada school of Buddhism - or Tipiṭaka - have been preserved and passed down. True. Today Pali is studied mainly to gain access to Theravada Buddhist scriptures, and is frequently chanted in a ritual context. But when we say a ' language ', most languages are named either after a population or a region, and we have no evidence of a region called Pali or even a population of Pali speakers... Along side this, nowhere in the Pali canon itself is there a mention of a language called Pali!

Sutta Number to PTS reference converter

Easily look up PTS references in the Sutta Piṭaka. [New expanded coverage tables. Includes alternate numbering. Search accepts space, comma or period separated numbers, case insensitive, diacritic insensitive.] Quick Jump Table   PTS Vol Dīgha Nikāya (DN) D i , D ii , D iii Majjhima Nikāya (MN)   M i , M ii , M iii Saṁyutta Nikāya (SN) S i , S ii , S iii , S iv , S v Aṅguttara Nikāya (AN)   A i , A ii , A iii , A iv , A v Go to Khuddaka Nikaya (KN) ( New! KN ref converter ) Use the quick jump table above; Note: For Vol. i of SN, there are two sets of PTS page numbers for each sutta. This is because the L. Feer editions differs from Somaratne 1999. Or type a Sutta number or name into the search boxes below to search that column of the table!

Learn Pali: Best way to start? 5 Tips to make it easy

Once people have answered the question: "Why learn Pali?" The next query is: "How do I learn Pali?" Here’s the way I suggest you begin with your study of Pali. Build foundations for language learning Start at the right level Stick with it Build vocabulary Make use of the Pali language tools 1 Build foundations for language learning One thing that you really should have before beginning to learn Pali is a basic understanding of general grammatical terms and concepts. Many of the Pali language grammar guides seem to assume you have studied Sanskrit or Latin before. If you haven’t, and you really don’t know the difference between a subject and an object, or the meanings of such terms as nouns, verbs, adjectives, pronouns, prepositions, or declension and conjugation - then perhaps you should spend some time studying English grammar. I found that even though I'm a native English sp

Pali Alphabet & using the PED Dictionary

What is the Pali Alphabet? How is it arranged? How do you use a Pali dictionary? Well, the first thing to note is that Pāli is written in many different scripts. As the buddhist canon travelled, local people used either Indian scripts or switched to their own local scripts. For more on local scripts see here . But for now we are just interested in the Romanized Pali alphabet. This runs Vowels first, then consonants. as follows: a, ā, i, ī, u, ū, e, o (ṃ*) k, kh, g, gh, (ṅ) c, ch, j, jh, ñ ṭ, ṭh, ḍ, (ḍh), (ṇ) t, th, d, dh, n p, ph, b, bh, m y, r, l, (ḷ), (ḷh), v, s, h [Note: the letters in brackets have no entries in the dictionary.] Pali is a phonetic  language so each entry above represents a single sound. Every letter always has the same pronunciation regardless of its context, so no letter has more than one pronunciation, and no sounds are represented by more than one letter. As there are more sounds in Pāli than letters in the roman alphabet, to make distinctions, special

Velthuis to Diacritics converter & Pali Dictionary Lookup Tool

Convert to Unicode Type these Velthuis codes to convert into Pali unicode. aa → ā .m → ṃ ii → ī .t → ṭ uu → ū .d → ḍ "n → ṅ .n → ṇ