Some 7-bit character sets for non-Latin alphabets are derived from the ISO/IEC 646 standard: these do not themselves constitute ISO/IEC 646 due to not following its invariant code points (often replacing the letters of at least one case), due to supporting differing alphabets which the set of national code points provide insufficient encoding space for. Examples include: • 7-bit Turkmen (ISO-IR-230). • 7-bit Greek. • In
ELOT 927 (ISO-IR-088), maps the Greek alphabet over both letter cases using a different scheme (not in alphabetical order, but trying where possible to match Greek letters over Roman letters which correspond in some sense), and ISO-IR-019 maps the Greek uppercase alphabet over the Latin lowercase letters using the same scheme as ISO-IR-018. • The lower half of the
Symbol font character encoding uses its own scheme for mapping Greek letters of both cases over the ASCII Roman letters, also trying to map Greek letters over Roman letters which correspond in some sense, but making different decisions in this regard (see chart below). It also replaces invariant code points 0x22 and 0x27 and five national code points with mathematical symbols. Although not intended for use in typesetting Greek prose, it is sometimes used for that purpose. • ISO-IR-027 • 7-bit Cyrillic •
KOI-7 or Short KOI, used for
Russian. The Cyrillic characters are mapped to positions 0x60–0x7E, on top of the Latin lowercase letters, matching homologous letters where possible (where в is mapped to w, not v). Superseded by the
KOI-8 variants. •
SRPSCII and MAKSCII, Cyrillic variants of YUSCII (the Latin variant is YU/ISO-IR-141 in the chart above), used for
Serbian and
Macedonian respectively. Largely homologous to the Latin variant of YUSCII (following Serbian
digraphia rules), except for
Љ (lj),
Њ (nj),
Џ (dž), and
ѕ (dz), which correspond to digraphs in
Latin-script orthography, and are mapped over letters which are not used in Serbian or Macedonian (q, w, x, y). • The G0 sets for the
World System Teletext encodings for Russian/Bulgarian and Ukrainian use G0 sets similar to KOI-7 with some modifications. The corresponding G0 set for Serbian Cyrillic uses a scheme based on the Teletext encoding for Latin-script
Serbo-Croatian and
Slovene, as opposed to the significantly different YUSCII. • 7-bit Hebrew,
SI 960. The
Hebrew alphabet is mapped to positions 0x60–0x7A, on top of the lowercase Latin letters (and grave accent for aleph). 7-bit Hebrew was always stored in visual order. This mapping with the high bit set, i.e. with the Hebrew letters in 0xE0–0xFA, is
ISO/IEC 8859-8. The World System Teletext encoding for Hebrew uses the same letter mappings, but uses BS_Viewdata as its base encoding (whereas SI 960 uses US-ASCII) and includes a
shekel sign at 0x7B. • 7-bit Arabic,
ASMO 449 (ISO-IR-089). The
Arabic alphabet is mapped to positions 0x41–0x5A and 0x60–0x6A, on top of both uppercase and lowercase Latin letters. A comparison of some of these encodings is below. Only one case is shown, except in instances where the cases are mapped to different letters. In such instances, the mapping with the smallest code is shown first. Possible transcriptions are given for some letters; where this is omitted, the letter can be considered to correspond to the Roman one which it is mapped over. == See also ==