Romanization of Armenian - AbsoluteAstronomy.com

Hübschmann-Meillet

In linguistic literature on Classical Armenian, the commonly used transliteration is that of Hübschmann-Meillet (1913).

It uses a dot above mark to express the aspirates, t῾, ch῾, č῾, p῾, k῾. However, the correct support of this diacritic has been poor for long in the past and was not very common on many usual applications and computer fonts or rendering systems. Some documents were published using the ASCII backquote ` U+0060 as a fallback (or even the ASCII apostrophe ' U+0027 when there was no confusion possible), but the preferred character today is the left half-ring modifier letter (see below).

Also, some ambiguities were not solved to work with modern vernacular Armenian, which has two dialects, both using two possible orthographies (besides, the modern orthography is used for Classical Armenian in modern publications).

BGN/PCGN (1981)

BGN/PCGN romanization

BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use .The systems have been approved by the BGN and the PCGN for...

(1981) uses a right single quotation mark

Quotation mark

Quotation marks or inverted commas are punctuation marks at the beginning and end of a quotation, direct speech, literal title or name. Quotation marks can also be used to indicate a different meaning of a word or phrase than the one typically associated with it and are often used to express irony...

to express aspirates, t’, ch’, ts’, p’, k’, diverging from the original spiritus asper

Spiritus asper

In the polytonic orthography of Ancient Greek, the rough breathing , is a diacritical mark used to indicate the presence of an sound before a vowel, diphthong, or rho. It remained in the polytonic orthography even after the Hellenistic period, when the sound disappeared from the Greek language...

motivation.

This romanization was taken up by ISO (1996), and considered obsolete. This system is a loose transcription and is not reversible (without using dictionnary lookup), notably for single Armenian letters romanized into digraphs (these non reversible, or ambiguous romanizations are shown in a red cell in the table below).

Some Armenian letters have several romanizations, depending on their context:

the Armenian vowel letter Ե/ե should be romanized as ye initially or after the vowel characters Ե/ե, Է/է, Ը/ը, Ի/ի, Ո/ո, ՈՒ/ու and Օ/օ; in all other cases it should be romanized as e;
the Armenian vowel letter Ո/ո should be romanized as vo initially, except in the word եո where it should be romanized as ov; in all other cases it should be romanized as o;
the Armenian consonant letter Վ/վ should be romanized yev initially, in isolation or after the vowel characters Ե/ե, Է/է, Ը/ը, Ի/ի, Ո/ո, ՈՒ/ու and Օ/օ; in all other cases it should be romanized as ev.

ISO 9985 (1996)

ISO 9985 (1996) is the international standard for transliteration of the modern Armenian alphabet. Like with the BGN/PCGN romanization, the right single quotation mark is used to denote most of the aspirates.

This system is reversible because it avoids the use of digraphs and returns to the Hübschmann-Meillet (however some diacritics for vowels are also modified).

The aspirate series is not given fully consistent treatment in ISO 9985; while p’, t’, c’, k’ are romanised with a quotation mark, չ č is not, and instead its unaspirated counterpart ճ is notated with an underdot appearing nowhere else in the system. Note that in this scheme, č (signifying չ) collides with the Hübschmann-Meillet transliteration (where it signifies ճ).

This system is recommended for international bibliographic text interchange (it is also the base of simplified romanizations found to localize the Armenian toponomy of for transliterating human names), where it works very well with the common ISO 8859-2 Latin encoding used in Central Europe.

ALA-LC (1997)

ALA-LC (1997) is largely compatible with BGN/PCGN, but returns to expressing aspirates with a left single quotation mark

Quotation mark

(in fact the modifier letter left half-ring ʿ U+02BF, US-MARC hexadecimal code B0, that is also used to denote ayin

Ayin

' or ' is the sixteenth letter in many Semitic abjads, including Phoenician, Aramaic, Hebrew and Arabic . It is the twenty-first letter in the new Persian alphabet...

in Arabic, so some documents may contain either the preferred left half-ring, or sometimes the ASCII backquote ` U+0060).

This standard changes the transliteration scheme used between Classical/Eastern Armenian and Western Armenian for the Armenian consonnants represented by swapping the pairs b vs. p, g vs. k, d vs. t, dz vs. ts and ch vs. j.

In all cases, and to make this romanizatrion less ambiguous and reversible,

a soft sign (a prime, US-MARC hexadecimal code A7) is inserted between two separate letters that would otherwise be interpreted as a digraph (in red in the table below); no prime is present in the middle of romanized digraphs zh, kh, ts, dz, gh and ch representing a single Armenian letter;
with the Classical Armenian orthography only, the vowel represented by e will be represented by y instead, when it is at the initial position in a name and followed by another vowel; this difficulty has disappeared in modern Armenian with the reformed orthography that changed the original Armenian letter in such case;
with the Classical Armenian orthography only, the vowel represented by y will be represented by h instead, when it is at the initial position of a word or of a radical in a compound word; this difficulty has disappeared in modern Armenian with the reformed orthography that changed the original Armenian letter in such case.

ASCII-only input methods

On various Armenian websites, non-standard transliterators have appeared to allow inputting modern Western or Eastern Armenian text using ASCII only characters. It is not a proper transliterator but can be convenient for users that don't have Armenian keyboards.

Despite these input methods are commonly used, they are not obeying to any approved international or Armenian standard, so they are not recommended for the romanization of Armenian. Note that the input methods recognize the Latin digraphs zh, dz, gh, tw, sh, vo, ch, rr for Classic or Eastern Armenian, and zh, dz, tz, gh, vo, ch, rr for Western Armenian, but offer no way to disambiguate words where the digraphs should not be recognized.

Some Armenian letters are entered as Latin digraphs, and may also be followed by the input of an ASCII single quote (which acts as the only letter modifier recognized) but this quote does not always mean that the intended Armenian letter should be aspirated (this may be the reverse for the input ch'), it is also used as a vowel modifier. Due to ambiguities, texts must be corrected by entering an intermediate dummy character before entering the second Latin letter or quote, then removing the dummy character, so that the automatic input converter keeps the Armenian letters distinct.

Transliteration table

Some Armenian letters have very different phonetic sounds between Classical or Eastern Armenian and Western Armenian, so that the usage of Armenian letters is different between the two sub-branches of the language.

This is made visible in the table below by coloring transliterations specific to Classical or Eastern Armenian on green background, and those for Western Armenian on blue background. Other letters are transliterated independently of the language branch. However, cells with red background contain transliterations that are context dependent.

Armenian script	capital	Ա	Բ	Գ	Դ	Ե	Զ	Է	Ը	Թ	Ժ	Ի	Լ	Խ	Ծ	Կ	Հ	Ձ	Ղ	Ճ	Մ
	capital	0531	0532	0533	0534	0535	0536	0537	0538	0539	053A	053B	053C	053D	053E	053F	0540	0541	0542	0543	0544
	small	ա	բ	գ	դ	ե	զ	է	ը	թ	ժ	ի	լ	խ	ծ	կ	հ	ձ	ղ	ճ	մ
	small	0561	0562	0563	0564	0565	0566	0567	0568	0569	056A	057B	056C	056D	056E	057F	0570	0571	0572	0573	0574
Romanization of Classical or Eastern Armenian	ASCII ASCII The American Standard Code for Information Interchange is a character-encoding scheme based on the ordering of the English alphabet. ASCII codes represent text in computers, communications equipment, and other devices that use text... input	a	b	g	d	e	z	e'	y'	t'	zh	i	l	x	c'	k	h	dz	gh	tw	m
	Hübschmann-Meillet							ê	ə	t῾	ž				c			j	ł	č
	ISO 9985							ē	ë	t’	ž				ç			j	ġ	č̣
	BGN/PCGN BGN/PCGN romanization BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use .The systems have been approved by the BGN and the PCGN for...					e, ye		e	y	t’	zh			kh	ts			dz	gh	ch
	ALA-LC					e, y		ē	ě	tʿ					ts			dz		ch
Romanization of Western Armenian	ALA-LC		p	k	t	e, y		ē	ě	tʿ					dz	g		ts		j
Romanization of Western Armenian	ASCII ASCII The American Standard Code for Information Interchange is a character-encoding scheme based on the ordering of the English alphabet. ASCII codes represent text in computers, communications equipment, and other devices that use text... input		p	k	t	e		e'	y	t'				x	dz	g		tz		j

Armenian script	capital	Յ	Ն	Շ	Ո	Չ	Պ	Ջ	Ռ	Ս	Վ	Տ	Ր	Ց	Ւ	Փ	Ք	Օ	Ֆ	ՈՒ
	capital	0545	0546	0547	0548	0549	054A	054B	054C	054D	054E	054F	0550	0551	0552	0553	0554	0555	0556	0548 0552
	small	յ	ն	շ	ո	չ	պ	ջ	ռ	ս	վ	տ	ր	ց	ւ	փ	ք	օ	ֆ	ու	եւ
	small	0575	0576	0577	0578	0579	057A	057B	057C	057D	057E	057F	0580	0581	0582	0583	0584	0585	0586	0578 0582	0587
Romanization of Classical or Eastern Armenian	ASCII ASCII The American Standard Code for Information Interchange is a character-encoding scheme based on the ordering of the English alphabet. ASCII codes represent text in computers, communications equipment, and other devices that use text... input	y	n	sh	vo	ch	p	j	rr	s	v	t	r	c	w	p'	k', q	o	f	u	ev
	Hübschmann-Meillet			š	o	č῾		ǰ	r̄					c῾		p῾	k῾	ô		u	ev
	ISO 9985			š	o	č		ǰ	ṙ					c’		p’	k’	ò		ow	ew
	BGN/PCGN BGN/PCGN romanization BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use .The systems have been approved by the BGN and the PCGN for...			sh	o, vo	ch’		j	rr					ts’		p’	k’	o		u	ev, yev
	ALA-LC	y, h			o	chʿ		j	ṛ					tsʿ		pʿ	kʿ	ō			ew, ev
Romanization of Western Armenian	ALA-LC	y, h			o	chʿ	b	ch	ṛ			d		tsʿ		pʿ	kʿ	ō			ew, ev
Romanization of Western Armenian	ASCII ASCII The American Standard Code for Information Interchange is a character-encoding scheme based on the ordering of the English alphabet. ASCII codes represent text in computers, communications equipment, and other devices that use text... input	h'			vo	ch	b	ch'	rr			d		c		p'	k', q	o			ev

Note that in the table above, the last two columns refer to digraphs, not isolated letters. However the last column displays the ligature that is used only as an isolated symbol for the short Armenian word ew (meaning and) in a way similar to the ampersand (&) in the Latin script (the ligature should not be used within other Armenian words so it is not really ambiguous); the same transliteration to ew (classical Armenian) or ev (reformed orthography) will be used for the letters this ligature represents, when they are used as digraphs: it used to refer to the w consonant, now it refers to the v consonant.

Armenian script also uses some other digraphs that are often written as optional ligatures, in lowercase only (five of them are encoded in Unicode only for full roundtrip compatibility with some legacy encodings); when present, these ligatures (which are purely typographic and carry no semantic distinction in normal Armenian texts) must be romanized by decomposing their component letters.

Bibliographic references

Antoine Meillet and Heinrich Hübschmann, Altarmenisches Elementarbuch, Heidelberg, 1913 (2nd edition, 1980).

External links

Armenian Transliteration Converter Supports both Eastern and Western pronunciations of Armenian, includes a spell checker.
Transliteration of Armenian by Thomas T. Pedersen, in KNAB (Kohanimeandmebaas, Place Names Database) of Eesti Keele Instituut (Institute of the Estonian Language).broken link