Scopes: • Individual languages •
Macrolanguages (Set 3) • Collections of languages (Sets 2, 5). Some collections were already in Set 2, and others were added only in Set 5: • Remainder groups: 36 collections in both Set 2 and 5 are of this kind — for compatibility with Set 2 when Set 5 was still not published, the remainder groups do not contain any language and collection that was already coded in Set 2 (however new applications compatible with Set 5 may treat these groups inclusively, as long they respect the containment hierarchy published in Set 5 and they use the most specific collection when grouping languages); • The only collection which previously assigned with two-alphabet code is
Bihari (bh) during the Part 1 era, which deprecated in June 2021. • Regular groups: 29 collections in both Sets 2 and 5 are of this kind — for compatibility with Set 2, they can not contain other groups; • Families: 50 new collections coded only in Set 5 (including one containing a regular group already coded in Set 2) — for compatibility with Set 2, they may contain other collections except remainder groups. •
Dialects: they were intended to be covered by former ISO 639-6 (proposed but now withdrawn). • Special situations (Sets 2, 3). • Reserved for local use (Sets 2, 3). Also used sometimes in applications needing a two-letter code like standard codes in Sets 1 and 2 (where the special code mis is not suitable), or a three-letter code for collections like standard codes in Set 5. Types (for individual languages): •
Living languages (Sets 2, 3) (except Sanskrit, all other macrolanguages are living languages) •
Extinct languages (Sets 2, 3) (599, 5 of them are in Set 2: chb, chg, cop, lui, sam; none are in Set 1) •
Historical languages (Sets 1, 2, 3) (213, 35 of them are in Set 2; and 5 of them, namely ave, chu, lat, pli and san, also have a code in Set 1: ae, cu, la, pi, sa) • 124 of those were categorised as
Ancient languages, this type has merged into Historical since about 2024 •
Constructed languages (Sets 1, 2, 3) (23, 9 of them in Set 2: afh, epo, ido, ile, ina, jbo, tlh, vol, zbl; 5 of them in Set 1: eo, ia, ie, io, vo) Individual languages and macrolanguages with two distinct three-letter codes in Set 2: • Bibliographic (some of them were deprecated, none were defined in Set 3): these are legacy codes (based on language names in English). • Terminologic (also defined in Set 3): these are the preferred codes (based on native language names, romanized if needed). • All others (including collections of languages and special/reserved codes) only have a single three-letter code for both uses. ==Relations between the sets==