The first extant Breton texts, contained in the
Leyden manuscript, were written at the end of the 8th century, 50 years prior to the
Strasbourg Oaths, which are considered to be the earliest example of
French. Like many medieval
orthographies, Old and Middle Breton had an orthography that was at first not standardised, and the spelling of a particular word varied at the author's discretion. In 1499, however, the
Catholicon, was published; as the first dictionary written for both French and Breton, it became a point of reference on how to transcribe the language. The orthography presented in the
Catholicon was largely similar to that of French, in particular with respect to the representation of vowels, as well as the use of both the Latinate
digraph , which was a remnant of the sound change > in Latin, and the Brittonic or to represent before front vowels. As phonetic and phonological differences between the dialects began to magnify, many regions, particularly the
Vannes country, began to devise their own orthographies. Many of those orthographies were more closely related to the French model albeit with some modifications. Examples of modifications include the replacement of Old Breton - with - to denote word-final (an evolution of Old Breton in the Vannes dialect) and the use of - to denote the initial mutation of (today, this mutation is written ). In the 1830s,
Jean-François Le Gonidec created a modern phonetic system for the language. During the early 20th century, a group of writers known as elaborated and reformed Le Gonidec's system. They made it more suitable as a super-dialectal representation of the dialects of
Cornouaille, Leon and Trégor (known as from , and in Breton). This
KLT orthography was established in 1911. At the same time, writers of the more divergent Vannetais dialect developed a phonetic system, which was also based on that of Le Gonidec. Following proposals that had been made during the 1920s, the KLT and Vannetais orthographies were merged in 1941 to create an orthographic system to represent all four dialects. This ("wholly unified") orthography was significant for the inclusion of the digraph , which represents a in Vannetais and corresponds to a in the KLT dialects. In 1955,
François Falcʼhun and the group proposed a new orthography, which was designed to use a set of
graphemes closer to the conventions of French. This ("University Orthography", known in Breton as ) was given official recognition by the French authorities as the "official orthography of Breton in French education". It was opposed in the region and was only by the magazine and the publishing house Emgleo Breiz, which disappeared in 2015. In the 1970s, a new standard orthography was devised: the or . This system is based on the derivation of the words. Today, most writers continue to use the
Peurunvan orthography, and it is the version taught in most Breton-language schools.
Alphabet Breton is written in the
Latin script.
Peurunvan, the most commonly used orthography, consists of the following letters: : a, b, ch, cʼh, d, e, f, g, h, i, j, k, l, m, n, o, p, r, s, t, u, v, w, y, z The
circumflex,
grave accent,
trema and
tilde appear on some letters. The
diacritics are used in the following way: : â, ê, î, ô, û, ù, ü, ñ
Differences between and Both orthographies use the above alphabet although is used only in . Differences between the two systems are particularly noticeable in word endings. In Peurunvan, final
obstruents, which are devoiced in absolute final position and voiced in
sandhi before
voiced sounds, are represented by a grapheme that indicates a
voiceless sound. In OU they are written as voiced but represented as voiceless before suffixes: "big", "bigger". In addition, Peurunvan maintains the KLT convention, which distinguishes noun/adjective pairs by nouns written with a final voiced consonant and adjectives with a voiceless one. No distinction is made in pronunciation. , vs. .
Pronunciation of the Breton alphabet C (as a single letter),
Q and
X appear mainly in loanwords. ⟨ks⟩ or ⟨gz⟩ may be used to represent /ks/ or /ɡz/.
Notes: • Vocative particle: "O Brittany". • Word-initially. • Word-finally. • Unwritten lenition of and spirantization of > . • Unstressed represent in Leoneg but in the other dialects. The realisations appear mainly before (also less often before ), semivowels , consonant clusters beginning with or . Stressed long represent . • In Gwenedeg velars are palatalized before and , i.e. , , , , , , represent . In the case of word-final and palatalization to also occurs after . • Before a vowel other than the digraph is written instead of , e.g. "to drive", radical , 1PS preterite , 3PS preterite . • Silent in words such as , , , , , and . Always silent in Gwenedeg and Leoneg. • is realized as when it precedes or follows a vowel (or when between vowels), but in words such as , , it represents (in orthography may be used: , , ). • represents when it follows a vowel, after a consonant it represents . But before a vowel other than , is written instead of , e.g. "to follow", radical , 1PS preterite , 3PS preterite . In some regions may be heard instead of . • Word-finally after a cluster of unvoiced consonants. • In front of . • The digraph is realized like when preceded or followed by a vowel (or when between vowels), but in words such as , , it represents . • The digraph represents plural endings. Its pronunciation varies by dialect: rating geographically from Northwest Leon to Southeast Gwened. • usually represents , but word-finally (except in word-final ) it represents in KLT, in Gwenedeg and in Goëlo. The pronunciation is retained word-finally in verbs. In words , , , , , it represents in KLT, in Gwenedeg and in Goëlo. Word-finally following it represents . • But silent in words such as , , , , , , , , , , ', , , , , , , , , , , , , , , , , , , . is generally silent in Kerneweg, Tregerieg and Gwenedeg, but in Leoneg is always pronounced. • Used to distinguish words such as "river", "heir", "town" (also written ) from "sense", "bold", "dear". • Used to distinguish "circuit/tour" from "foot". • In northern dialects (mainly in Leoneg), there is a tendency to voice between vowels. also appears as the lenition of and mixed mutation of . • The lenition of and the spirantization of are both represented by is mainly pronounced although in certain regions (especially for the spirantization of in Cornouaille) and (in some Haut-Vannetais varieties) also occur. • The pronunciation of varies by dialect, nowadays uvular (or ) is standard; occurs in Leoneg, or in Tregerieg, and in Gwenedeg. • In Gwenedeg an unstressed often represents . • Lenited varieties of may appear word-initially in case of soft mutation. • In Leoneg in front of a nasal. • In Leoneg represents before . • In Leoneg represents or before . • In Leoneg represents . • Before a vowel. • Forms of the indefinite article. • A conservative realisation of the initial mutation of and , used in certain parts of the Vannes country. ==Sample texts==