The following languages have letters with diacritics that are orthographically distinct from those without diacritics.
Latin script Baltic :*
Latvian has the following letters: , , , , , , , , , , :*
Lithuanian. In general usage, where letters appear with the caron (, and ), they are considered as separate letters from , or and collated separately; letters with the
ogonek (, , and ), the
macron () and the
overdot () are considered as separate letters as well, but not given a unique collation order.
Celtic :*
Welsh uses the circumflex, diaeresis, acute, and grave accents on its seven vowels , , , , , , (hence the composites , , , , , , , , , , , , , , , , , , , , , , , , , , , ). However all except the circumflex (which is used as a macron) are fairly rare. :* Following spelling reforms since the 1970s,
Scottish Gaelic uses graves only, which can be used on any vowel (, , , , ). Formerly acute accents could be used on , and , which were used to indicate a specific vowel quality. With the elimination of these accents, the new orthography relies on the reader having prior knowledge of pronunciation of a given word. :*
Manx uses the cedilla diacritic combined with h to give the digraph (pronounced ) to mark the distinction between it and the digraph (pronounced or ). Other diacritics used in Manx included the circumflex and diaeresis, as in , , , etc. to mark the distinction between two similarly spelled words but with slightly differing pronunciation. :*
Irish uses only acute accents to mark long vowels, following the 1948 spelling reform.
Lenition is indicated using an
overdot in
Gaelic type (,,, , , , , ); in
Roman type, a suffixed is used. Thus, is equivalent to . :*
Breton does not have a single orthography (spelling system), but uses diacritics for a number of purposes. The diaeresis is used to mark that two vowels are pronounced separately and not as a diphthong/digraph. The circumflex is used to mark long vowels, but usually only when the vowel length is not predictable by phonology. Nasalization of vowels may be marked with a tilde, or following the vowel with the letter . The plural suffix -où is used as a unified spelling to represent a suffix with a number of pronunciations in different dialects, and to distinguish this suffix from the digraph which is pronounced as . An apostrophe is used to distinguish , pronounced as the digraph is used in other Celtic languages, from the French-influenced digraph ch, pronounced .
Finno-Ugric :*
Estonian has a distinct letter , which contains a tilde. Estonian vowels with
double-dot diacritics , , are similar to German, but these are also distinct letters, unlike
German umlauted letters. All four have their own place in the alphabet, between and .
Carons in or appear only in foreign proper names and
loanwords. Also these are distinct letters, placed in the alphabet between
s and
t. :*
Finnish uses double-dotted vowels ( and ). As in Swedish and Estonian, these are regarded as individual letters, rather than 'vowel + diacritic' combinations (as happens in German). It also uses the characters , and in foreign names and loanwords. In the Finnish and Swedish alphabets, , and collate as separate letters after , the others as variants of their base letter. :*
Hungarian uses the double-dot, the acute and double acute diacritics (the last is unique to Hungarian): (, ), (, , , , ) and (, ). The acute accent indicates the long form of a vowel (in case of /, /, /) while the double acute performs the same function for and . The acute accent can also indicate a different sound (more open, as in case of /, /). Both long and short forms of the vowels are listed separately in the
Hungarian alphabet, but members of the pairs /, /, /, /, /, / and / are collated in dictionaries as the same letter. :*
Livonian has the following letters: , , , , , , , , , , , , , , , , , .
Germanic :*
German uses the
two-dots diacritic (): letters , , , used to indicate the
fronting of back vowels (see
umlaut (linguistics)). :*
Dutch uses acute, circumflex, grave and two-dots diacritics with most vowels and cedilla with c, as in French. This results in , , , , , , , , , , , , , , , and . This is mostly on words (and names) originating from French (like
crème, café, gêne, façade). The acute accent is also used to stress the vowel (like
één). The two-dots diacritic is used as a linguistic diaeresis (a
vowel hiatus) that splits the two vowels, e.g.,
reële, reünie, coördinatie), rather than to indicate a linguistic as used in German. :*
Afrikaans uses 16 additional vowel forms, both uppercase and lowercase: , , , , , , , , , , , , , , , . :*
Faroese uses acutes and some additional letters. All are considered separate letters and have their own place in the alphabet: , , , , and . :*
Icelandic uses acutes and other additional letters. All are considered separate letters, and have their own place in the alphabet: , , , , , and . :*
Danish and
Norwegian use additional characters like the o-slash and the a-overring . These letters come after and in the order , . Historically, the has developed from a ligature by writing a small superscript over a lowercase ; if an character is unavailable, some Scandinavian languages allow the substitution of a doubled
a, thus . The Scandinavian languages collate these letters after , but have different national
collation standards. :*
Swedish uses a-diaeresis () and o-diaeresis () in the place of () and slashed o () in addition to the a-overring (). Historically, the two-dots diacritic for the Swedish letters and developed from a small Gothic written above the letters. These letters are collated after , in the order , , .
Romance :* In
Asturian,
Galician and
Spanish, the character is a letter and collated between
n and
o. :*
Asturian uses an underdot: (
lower case, ), and (
lower case ) :*
Catalan uses the acute accent , , , , the grave accent , , , the diaeresis , , the cedilla , and the
interpunct . ::* In
Valencian, the circumflex , , , , may also be used. :*
Corsican uses the following in
its alphabet: /, /, /, /, /. :*
French uses four diacritics, appearing on vowels (circumflex, acute, grave, diaeresis) and the cedilla appearing in . :*
Italian uses two diacritics, appearing on vowels (acute, grave) :*
Leonese: could use or . :*
Portuguese uses a tilde with the vowels and and a cedilla with c. :*
Romanian uses a
breve on the letter
a () to indicate the sound
schwa , as well as a circumflex over the letters
a () and
i () for the sound . Romanian also writes a
comma below the letters
s () and
t () to represent the sounds and , respectively. These characters are collated after their non-diacritic equivalent. :*
Spanish uses acute accents (, , , , ) to indicate stress falling on a different syllable than the one it would fall on based on default rules, and to distinguish certain one-syllable homonyms (e.g. (masculine singular definite article) and [he]). The acute accent is also used to break up sequences of vowels that would normally be pronounced as a diphthong into two syllables, as in the word . Diaeresis is used on u only, to distinguish the combinations from , e.g. . The tilde on is not considered a diacritic as is considered a distinct letter from , not a mutated form of it.
Slavic :*
Gaj's Latin alphabet, used in
Croatian and latinized
Serbian, has the symbols , , , and , which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. It also has one
digraph including a diacritic,
dž, which is also alphabetized independently, and follows and precedes in the alphabetical order. :* The
Czech alphabet uses the acute (lowercase
á é í ó ú ý, uppercase
Á É Í Ó Ú Ý), caron (lowercase
č ď ě ň ř š ť ž, uppercase
Č Ď Ě Ň Ř Š Ť Ž), and for one letter (lowercase
ů, uppercase
Ů) the ring. (In ď and ť the caron is modified to look rather like an apostrophe.) Letter with caron are considered separate letters, whereas vowels are considered only as longer variants of the unaccented letters. Acute does not affect alphabetical order, letters with caron are ordered after original counterparts. :*
Polish has the following letters:
ą ć ę ł ń ó ś ź ż. These are considered to be separate letters: each of them is placed in the alphabet immediately after its Latin counterpart (e.g. between and ), and are placed after in that order. :* The
Serbian Cyrillic alphabet has no diacritics, instead it has a grapheme (
glyph) for every letter of
its Latin counterpart (including Latin letters with diacritics and the digraphs dž,
lj and
nj). :* The
Slovak alphabet uses the acute (lowercase
á é í ó ú ý ĺ ŕ, uppercase
Á É Í Ó Ú Ý Ĺ Ŕ), caron (lowercase
č ď ľ ň š ť ž dž, uppercase
Č Ď Ľ Ň Š Ť Ž DŽ), umlaut (
ä Ä) and circumflex accent (
ô Ô). All of those are considered separate letters and are placed directly after the original counterpart in the
alphabet. :* The basic
Slovenian alphabet has the symbols , , and , which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. Letters with a
caron are placed right after the letters as written without the diacritic. The letter ('d with bar') may be used in non-transliterated foreign words, particularly names, and is placed after and before .
Turkic :*
Azerbaijani includes the distinct Turkish alphabet letters
Ç,
Ğ,
I,
İ,
Ö,
Ş and
Ü. :*
Crimean Tatar includes the distinct Turkish alphabet letters
Ç,
Ğ,
I,
İ,
Ö,
Ş and
Ü. Unlike Turkish, Crimean Tatar also has the letter
Ñ. :*
Gagauz includes the distinct Turkish alphabet letters
Ç,
Ğ,
I,
İ,
Ö and
Ü. Unlike Turkish, Gagauz also has the letters
Ä,
Ê Ș and
Ț.
Ș and
Ț are derived from the
Romanian alphabet for the same sounds. Sometime the Turkish
Ş may be used instead of
Ș. :*
Turkish uses a with a breve (), two letters with
two dots ( and , representing two rounded front vowels), two letters with a cedilla ( and , representing the affricate and the fricative ), and also possesses a dotted capital (and a
dotless lowercase representing a high unrounded back vowel). In Turkish each of these are separate letters, rather than versions of other letters, where dotted capital and lower case are the same letter, as are dotless capital and lowercase .
Typographically, and are sometimes rendered with an
underdot, as in . The new Azerbaijani, Crimean Tatar, and Gagauz alphabets are based on the Turkish alphabet and its same diacriticized letters, with some additions. :*
Turkmen includes the distinct Turkish alphabet letters
Ç,
Ö,
Ş and
Ü. In addition, Turkmen uses A with diaeresis (
Ä) to represent , N with caron () to represent the
velar nasal , Y with acute () to represent the
palatal approximant , and Z with caron () to represent .
Other :*
Albanian has two special letters
Ç and
Ë upper and lowercase. They are placed next to the most similar letters in the alphabet, c and e correspondingly. :*
Esperanto has the symbols
ŭ,
ĉ, ĝ, ĥ, ĵ and
ŝ, which are included in the alphabet, and considered separate letters. :*
Filipino also has the character
ñ as a letter and is collated between letters
n and
ng. :* Modern
Greenlandic does not use any diacritics, although
ø and
å are used to spell loanwords, especially from Danish and English. From 1851 until 1973, Greenlandic was written in an alphabet invented by
Samuel Kleinschmidt, where
long vowels and
geminate consonants were indicated by diacritics on vowels (in the case of consonant gemination, the diacritics were placed on the vowel preceding the affected consonant). For example, the name
Kalaallit Nunaat was spelled
Kalâdlit Nunât. This scheme uses the
circumflex (◌̂) to indicate a long vowel (e.g. ; modern: ), an
acute accent (◌́) to indicate gemination of the following consonant: (i.e. ; modern: ) and, finally, a
tilde (◌̃) or a
grave accent (◌̀), depending on the author, indicates vowel length and gemination of the following consonant (e.g. ; modern: ). , used only before , are now written in Greenlandic. :*
Hawaiian uses the kahakō (
macron) over vowels, although there is some disagreement over considering them as individual letters. The kahakō over a vowel can completely change the meaning of a word that is spelled the same but without the kahakō. :*
Kurdish uses the symbols
Ç,
Ê,
Î,
Ş and
Û with other 26 standard Latin alphabet symbols. :*
Lakota alphabet uses the
caron for the letters
č,
ȟ,
ǧ,
š, and
ž. It also uses the
acute accent for stressed vowels á, é, í, ó, ú, áŋ, íŋ, úŋ. :*
Malay uses some diacritics such as
á, ā, ç, í, ñ, ó, š, ú. Uses of diacritics was continued until late 19th century except
ā and
ē. :*
Maltese uses a C, G, and Z with a dot over them (Ċ, Ġ, Ż), and also has an H with an extra horizontal bar. For uppercase H, the extra bar is written slightly above the usual bar. For lowercase H, the extra bar is written crossing the vertical, like a
t, and not touching the lower part (
Ħ, ħ). The above characters are considered separate letters. The letter 'c' without a dot has fallen out of use due to redundancy. 'Ċ' is pronounced like the English 'ch' and 'k' is used as a hard c as in 'cat'. 'Ż' is pronounced just like the English 'Z' as in 'Zebra', while 'Z' is used to make the sound of 'ts' in English (like 'tsunami' or 'maths'). 'Ġ' is used as a soft 'G' like in 'geometry', while the 'G' sounds like a hard 'G' like in 'log'. The digraph 'għ' (called
għajn after the
Arabic letter name
ʻayn for غ) is considered separate, and sometimes ordered after 'g', whilst in other volumes it is placed between 'n' and 'o' (the Latin letter 'o' originally evolved from the shape of
Phoenician ʻayin, which was traditionally collated after Phoenician
nūn). :* The
romanization of Syriac uses the altered letters of.
Ā, Č, Ḏ, Ē, Ë, Ġ, Ḥ, Ō, Š, Ṣ, Ṭ, Ū, Ž alongside the 26 standard Latin alphabet symbols. :*
Vietnamese uses the
horn diacritic for the letters
ơ and
ư; the
circumflex for the letters
â,
ê, and
ô; the
breve for the letter
ă; and a bar through the letter
đ. Separately, it also has á, à, ả, ã and ạ, the five tones used for vowels besides the flat tone 'a'.
Cyrillic letters :*
Belarusian and
Uzbek Cyrillic have a letter . :* Belarusian,
Bulgarian, Russian and Ukrainian have the letter . :* Belarusian and
Russian have the letter . In Russian, this letter is usually replaced by , although it has a different pronunciation. The use of instead of does not affect the pronunciation.
Ё is always used in children's books and in dictionaries. A
minimal pair is все (''vs'e
, "everybody" pl.) and всё (vs'o'', "everything" n. sg.). In Belarusian the replacement by is a mistake; in Russian, it is permissible to use either or for but the former is more common in everyday writing (as opposed to instructional or juvenile writing). :* The Cyrillic
Ukrainian alphabet has the letters , and . Ukrainian
Latynka has many more. :*
Macedonian has the letters and . :* In Bulgarian and
Macedonian the possessive pronoun ѝ (
ì, "her") is spelled with a grave accent in order to distinguish it from the conjunction и (
i, "and"). :* The acute accent above any vowel in Cyrillic alphabets is used in dictionaries, books for children and foreign learners to indicate the word stress, it also can be used for disambiguation of similarly spelled words with different lexical stresses. ==Diacritics that do not produce new letters==