The Romance languages for the most part have continued to use the Latin alphabet while adapting it to their evolution. One exception was Romanian, where before the nineteenth century, the
Romanian Cyrillic alphabet was used due to Slavic influence after the Roman retreat. A Cyrillic alphabet was also used for Romanian (then called Moldovan) in the
USSR. The non-Christian populations of Spain also used the scripts of their religions (
Arabic and
Hebrew) to write Romance languages such as
Judaeo-Spanish and
Mozarabic in
aljamiado.
Letters The
classical Latin alphabet of 23 letters –
A,
B,
C,
D,
E,
F,
G,
H,
I,
K,
L,
M,
N,
O,
P,
Q,
R,
S,
T,
V,
X,
Y,
Z – was
modified and augmented in various ways to yield the spelling systems of the Romance languages. In particular, the single Latin letter
V split into
V (consonant) and
U (vowel), and the letter
I split into
I and
J. The Latin letter
K and the new letter
W, which came to be widely used in
Germanic languages, are seldom used in most Romance languages – mostly for unassimilated foreign names and words. Indeed, in Italian prose is properly . Portuguese and Catalan eschew importation of "foreign" letters more than most languages. Thus Wikipedia is in Catalan but in Spanish; chikungunya, sandwich, kiwi are , , in Portuguese but , , in Spanish. While most of the 23 basic Latin letters have maintained their phonetic value, for some of them it has diverged considerably; and the new letters added since the Middle Ages have been put to different uses in different scripts. Some letters, notably
H and
Q, have been variously combined in
digraphs or
trigraphs (see below) to represent phonetic phenomena that could not be recorded with the basic Latin alphabet, or to get around previously established spelling conventions. Most languages added auxiliary marks (
diacritics) to some letters, for these and other purposes. The spelling systems of most Romance languages are fairly simple, and consistent within any language. Spelling rules are typically
phonemic (as opposed to being strictly
phonetic); as a result of this, the actual pronunciation of standard written forms can vary substantially according to the speaker's accent (which may differ by region) or the position of a sound in the word or utterance (
allophony). The following letters have notably different values between languages, or between Latin and the Romance languages: :
B, V: Merged in Spanish and some dialects of Catalan, where both letters represent a single phoneme pronounced as either or depending on position, with no differentiation between
B and
V. :
C: Generally a "hard" , but "soft" (
fricative or
affricate) before
e,
i, or
y. :
G: Generally a "hard" , but "soft" (fricative or affricate) before
e,
i, or
y. In some languages, like Spanish, the hard
g, phonemically , is pronounced as a fricative after vowels. In Romansch, the soft
g is a
voiced palatal plosive or a voiced
alveolo-palatal affricate . :
H:
Silent in most languages; used to form various
digraphs. But represents in Romanian, Walloon and Gascon Occitan. :
J: Represents the fricative in most languages, the
palatal approximant in Romansh and in several of the languages of Italy, and [x] or [h] in Spanish (depending on the
variety). Italian does not use this letter in native words, replacing it with
gi before a vowel. :
Q: As in Latin, its phonetic value is that of a hard
c, i.e. , and in native words it is almost always followed by a (sometimes silent)
u. Romanian does not use this letter in native words, using
ch instead. :
S: Generally
voiceless , but in some languages it can be voiced instead in certain contexts (especially between vowels). In Spanish, Romanian, Galician and several varieties of Italian, it is always pronounced voiceless between vowels. If the phoneme /s/ is represented by the letter
S, predictable assimilations are normally not shown (e.g. Italian 'sled', spelled
slitta but pronounced , never with ). Also at the end of syllables it may represent special
allophonic pronunciations. In Romansh, it also stands for a voiceless or voiced fricative, or , before certain consonants. :
W: No Romance language uses this letter in native words, with the exception of
Walloon. :
X: Its pronunciation is rather variable, both between and within languages. In the Middle Ages, the
languages of Iberia used this letter to denote the
voiceless postalveolar fricative , which is still the case in modern
Catalan and
Portuguese. With the Renaissance the classical pronunciation – or similar
consonant clusters, such as , , or – were frequently reintroduced in
latinisms and hellenisms. In
Venetian it represents , and in
Ligurian the
voiced postalveolar fricative . Italian does not use this letter in native words for historical reasons. :
Y: This letter is not used in most languages, with the prominent exceptions of French and Spanish, where it represents before vowels (or various similar fricatives such as the
palatal fricative , in Spanish), and the vowel or
semivowel elsewhere. :
Z: In most languages it represents the sound . However, in Italian it denotes the affricates and (which are two separate phonemes, but rarely contrast; among the few examples of minimal pairs are "ray" with , "race" with (both are phonetically long between vowels); in Romansh the voiceless affricate ; and in Galician and Spanish it denotes either the
voiceless dental fricative or . Otherwise, letters that are not combined as digraphs generally represent the same phonemes as suggested by the
International Phonetic Alphabet (IPA), whose design was, in fact, greatly influenced by Romance spelling systems.
Digraphs and trigraphs Since most Romance languages have more sounds than can be accommodated in the Roman Latin alphabet they all resort to the use of digraphs and trigraphs – combinations of two or three letters with a single phonemic value. The concept (but not the actual combinations) is derived from Classical Latin, which used, for example,
TH,
PH, and
CH when transliterating the Greek letters "θ", "ϕ" (later "φ"), and "χ". These were once
aspirated sounds in Greek before changing to corresponding fricatives, and the
H represented what sounded to the Romans like an following , , and respectively. Some of the digraphs used in modern scripts are: :
CI: used in Italian, Romance languages in Italy, Corsican and Romanian to represent before
A,
O, or
U. In Sicilian, it is used for in contexts where it descended from Latin
FL, as in
ciamma. :
CH: used in Italian, Romance languages in Italy, Corsican, Romanian, Romansh and
Sardinian to represent before
E or
I (including yod ); in
Occitan, Spanish, Astur-leonese and Galician; or in Romansh before
A,
O or
U; and in most other languages. In Catalan it is used in some old spelling conventions for . :
CHI/CHJ: used in Corsican and Sicilian to represent . In Corsican, it is always written as
CHJ, but in Sicilian, the spelling might vary with
CHI. :
DD: used in
Sicilian and
Sardinian to represent the
voiced retroflex plosive . In recent history more accurately transcribed as
DDH. :
DJ: used in Walloon and Catalan for . :
DZ: used in Aromanian for . :
GI: used in Italian, Romance languages in Italy, Corsican and Romanian to represent before
A,
O, or
U, and in Romansh to represent or or (before
A,
E,
O, and
U) or :
GH: used in Italian, Romance languages in Italy, Corsican, Romanian, Romansh and
Sardinian to represent before
E or
I (including yod ), and in Galician for the
voiceless pharyngeal fricative (not standard sound). :
GHI/GHJ: used in Corsican and Sicilian to represent . In Corsican, it is always written as
GHJ, but in Sicilian, the spelling might vary with
GHI. :
GL: used in Romansh before consonants and
I and at the end of words for . :
GLI: used in Italian and Corsican for and Romansh for . :
GN: used in French, some Romance languages in Italy, Corsican, Romansh Walloon for , as in
champignon; in Italian to represent , as in "ogni" or "lo gnocco". :
GU: used before
E or
I to represent or in all Romance languages except Italian, Romance languages in Italy, Corsican, Romansh, and Romanian, which use
GH instead. :
IG: used at the end of word in Catalan for , as in
maig,
safareig or
enmig. :
IX: used between vowels or at the end of word in Catalan/Aragonese for , as in
caixa/
caixa or
calaix/
calaixo. :
JH: used in Walloon for /ʒ/ or /h/. :
LH: used in Portuguese and Occitan for . :
LJ: used in Aromanian for . :
LL: used in Spanish, Catalan, Galician, Astur-leonese, Norman and Dgèrnésiais, originally for which has merged in some cases with . Represents in French unless it follows
I (
i) when it represents (or in some dialects). As in Italian, it is used in Occitan for a
long . :
L·L: used in Catalan for a geminate consonant . :
N-: used in Piedmontese and Ligurian for between two vowels, although sometimes
NN- is preferred in Ligurian instead. :
NH: used in Portuguese and Occitan for , used in official Galician for . :
NJ: used in Aromanian for . :
NN: used in
Leonese for , in Italian for geminate . :
NY: used in Catalan, Aragonese and Walloon for . :
QU: represents in Italian, Romance languages in Italy, and Romansh; in French, Astur-leonese (normally before
e or
i); (before
e or
i) or (normally before
a or
o) in Occitan, Catalan and Portuguese; in Spanish (always before
e or
i). :
RR: used between vowels in several languages (Occitan, Catalan, Spanish) to denote a
trilled or a
guttural R, instead of the
flap . :
SC: used before
E or
I in Italian, Romance languages in Italy as , in European Portuguese as and in French, Brazilian Portuguese, Catalan and
Hispanic American Spanish as in words of certain etymology (notice this would represent in standard peninsular Spanish) :
SCH: used in Romansh for or , in Italian for before
E or
I, including yod . :
SCI: used in Italian, Romance languages in Italy, and Corsican to represent before
A,
O, or
U. :
SH: used in
Aranese, Spanish (almost only in foreign words), Occitan, Walloon and Aromanian for . :
SS: used in French, Portuguese, Piedmontese, Romansh, Occitan, and Catalan for between vowels, in Italian, Romance languages of Italy, and Corsican for long . :
TS: used in Catalan and Aromanian for . :
TSH: used in Walloon for /tʃ/. :
TG: used in Romansh for or . In Catalan is used for before
E and
I, as in
metge or
fetge. :
TH: used in Jèrriais for ; used in Aranese for either or . :
TJ: used between vowels and before
A,
O or
U, in Catalan for , as in
sotjar or
mitjó. :
TSCH: used in Romansh for . :
TX: used at the beginning or at the end of word or between vowels in Catalan for , as in
txec,
esquitx or
atxa. :
TZ: used in Catalan for . :
XH: used in Walloon for /ʃ/ or /h/, depending on the dialect. While the digraphs
CH,
PH,
RH and
TH were at one time used in many words of Greek origin, most languages have now replaced them with
C/QU,
F,
R and
T. Only French has kept these
etymological spellings, which now represent or , , and , respectively.
Double consonants Gemination, in the languages where it occurs, is usually indicated by doubling the consonant, except when it does not contrast phonemically with the corresponding short consonant, in which case gemination is not indicated. In
Jèrriais, long consonants are marked with an apostrophe: is a long , is a long , and is a long . The phonemic contrast between geminate and single consonants is widespread in
Italian, and normally indicated in the traditional orthography: 'done' vs. 'fate, destiny'; 's/he, it fell' vs. 's/he, it falls'. The double consonants in French orthography, however, are merely etymological. In Catalan, the gemination of is marked by a ("flying point"): .
Diacritics Romance languages also introduced various marks (
diacritics) that may be attached to some letters, for various purposes. In some cases, diacritics are used as an alternative to digraphs and trigraphs; namely to represent a larger number of sounds than would be possible with the basic alphabet, or to distinguish between sounds that were previously written the same. Diacritics are also used to mark word stress, to indicate exceptional pronunciation of letters in certain words, and to distinguish words with same pronunciation (
homophones). Depending on the language, some letter-diacritic combinations may be considered distinct letters, e.g. for the purposes of
lexical sorting. This is the case, for example, of Romanian
ș () and Spanish
ñ (). The following are the most common use of diacritics in Romance languages. •
Vowel quality: the system of marking
close-mid vowels with an
acute accent,
é, and
open-mid vowels with a
grave accent,
è, is widely used (e.g. Catalan, French, Italian). Portuguese, however, uses the
circumflex (
ê) for the former, and the acute (
é), for the latter. Some minority Romance languages use an
umlaut (diaeresis mark) in the case of
ä, ö, ü to indicate fronted vowel variants, as in
German. Centralized vowels () are indicated variously (
â in Portuguese,
ă/î in Romanian,
ë in
Piedmontese, etc.). In French, Occitan and Romanian, these accents are used whenever necessary to distinguish the appropriate vowel quality, but in the other languages, they are used only when it is necessary to mark unpredictable stress, or in some cases to distinguish homophones. •
Vowel length: French uses a circumflex to indicate what had been a
long vowel (although nowadays this rather indicates a difference in vowel quality, if it has any effect at all on pronunciation). This same usage is found in some minority languages. •
Nasality: Portuguese marks
nasal vowels with a
tilde (
ã) when they occur before other written vowels and in some other instances. •
Palatalization: some historical
palatalizations are indicated with the
cedilla (
ç) in French, Catalan, Occitan and Portuguese. In Spanish and several other world languages influenced by it, the grapheme
ñ represents a
palatal nasal consonant. •
Separate pronunciation: when a vowel and another letter that would normally be combined into a
digraph with a single sound are exceptionally pronounced apart, this is often indicated with a
diaeresis mark on the vowel. This is particularly common in the case of
gü /ɡw/ before
e or
i, because plain
gu in this case would be pronounced /ɡ/. This usage occurs in Spanish, French, Catalan and Occitan, and occurred before the 2009 spelling reform in Brazilian Portuguese. French also uses the diaeresis on the second of two adjacent vowels to indicate that both are pronounced separately, as in
Noël "Christmas" and
haïr "to hate". •
Stress: the stressed vowel in a polysyllabic word may be indicated with an accent, when it cannot be predicted by rule. In Italian, Portuguese and Catalan, the choice of accent (acute, grave or circumflex) may depend on vowel quality. When no quality needs to be indicated, an acute accent is normally used (
ú), but Italian and Romansh use a grave accent (
ù). Portuguese puts a diacritic on all stressed monosyllables that end in
a e o as es os, to distinguish them from unstressed function words:
chá "tea",
más "bad (fem. pl.)",
sé "seat (of government)",
dê "give! (imperative)",
mês "month",
só "only",
nós "we" (cf.
mas "but",
se "if/oneself",
de "of",
nos "us"). Word-final stressed vowels in polysyllables are marked by the grave accent in Italian, thus
università "university/universities",
virtù "virtue/virtues", resulting in occasional minimal or near-minimal pairs such as
parlo "I speak" ≠
parlò "s/he spoke",
capi "heads, bosses" ≠
capì "s/he understood",
gravita "it, s'/he gravitates" ≠
gravità "gravity, seriousness". •
Homophones: words (especially monosyllables) that are pronounced exactly or nearly the same way and are spelled identically, but have different meanings, can be differentiated by a diacritic. Typically, if one of the pair is stressed and the other isn't, the stressed word gets the diacritic, using the appropriate diacritic for notating stressed syllables (see above). Portuguese does this consistently as part of notating stress in certain monosyllables, whether or not there is an unstressed homophone (see examples above). Spanish also has many pairs of identically pronounced words distinguished by an acute accent on the stressed word:
si "if" vs.
sí "yes",
mas "but" vs.
más "more",
mi "my" vs.
mí "me",
se "oneself" vs.
sé "I know",
te "you (object)" vs.
té "tea",
que/quien/cuando/como "that/who/when/how" vs.
qué/quién/cuándo/cómo "what?/who?/when?/how?", etc. A similar strategy is common for monosyllables in writing Italian, but not necessarily determined by stress: stressed
dà "it, s/he gives" vs. unstressed
da "by, from", but also
tè "tea" and
te "you", both capable of bearing phrasal stress. Catalan has some pairs where both words are stressed, and one is distinguished by a vowel-quality diacritic, e.g.
os "bone" vs.
ós "bear". When no vowel-quality needs distinguishing, French and Catalan use a
grave accent: French
ou "or" vs.
où "where", French
la "the" vs.
là "there", Catalan
ma "my" vs.
mà "hand".
Upper and lowercase Most languages are written with a mixture of two distinct but phonetically identical variants or "
cases" of the alphabet:
majuscule ("uppercase" or "capital letters"), derived from Roman stone-carved letter shapes, and
minuscule ("lowercase"), derived from
Carolingian writing and Medieval
quill pen handwriting which were later adapted by printers in the fifteenth and sixteenth centuries. In particular, all Romance languages capitalize (use uppercase for the first letter of) the following words: the first word of each complete
sentence, most words in names of people, places, and organizations, and most words in titles of books. The Romance languages do not follow the German practice of capitalizing all nouns including common ones. Unlike English, the names of months, days of the weeks, and derivatives of proper nouns are usually not capitalized: thus, in Italian one capitalizes
Francia ("France") and
Francesco ("Francis"), but not
francese ("French") or
francescano ("Franciscan"). However, each language has some exceptions to this general rule. == Vocabulary comparison ==