A
genetic relationship between Chinese, Tibetan, Burmese, and other languages was first proposed in the early 19th century and is now broadly accepted. The initial focus on languages of civilizations with long literary traditions has been broadened to include less widely spoken languages, some of which have only recently, or never, been written. However, the reconstruction of the family is much less developed than for families such as
Indo-European or
Austroasiatic. Difficulties have included the great diversity of the languages, the lack of inflection in many of them, and the effects of language contact. In addition, many of the smaller languages are spoken in mountainous areas that are difficult to reach and are often also sensitive border zones. There is no consensus regarding the date and location of their origin.
Early work During the 18th century, several scholars noticed parallels between Tibetan and Burmese, both languages with extensive literary traditions. Early in the following century,
Brian Houghton Hodgson and others noted that many non-literary languages of the highlands of northeast India and Southeast Asia were also related to these. The name "Tibeto–Burman" was first applied to this group in 1856 by
James Richardson Logan, who added
Karen in 1858. The third volume of the
Linguistic Survey of India, edited by
Sten Konow, was devoted to the Tibeto–Burman languages of
British India. Studies of the "Indo-Chinese" languages of Southeast Asia from the mid-19th century by Logan and others revealed that they comprised four families: Tibeto-Burman,
Tai,
Mon–Khmer and
Malayo-Polynesian.
Julius Klaproth had noted in 1823 that Burmese, Tibetan, and Chinese all shared common basic
vocabulary but that
Thai,
Mon, and
Vietnamese were quite different.
Ernst Kuhn envisaged a group with two branches, Chinese–Siamese and Tibeto-Burman.
August Conrady called this group Indo-Chinese in his influential 1896 classification, though he had doubts about Karen. Conrady's terminology was widely used, but there was uncertainty regarding his exclusion of Vietnamese.
Franz Nikolaus Finck in 1909 placed Karen as a third branch of Chinese–Siamese.
Jean Przyluski introduced the French term
sino-tibétain as the title of his chapter on the group in
Meillet and
Cohen's
Les langues du monde in 1924. He divided them into three groups: Tibeto-Burman, Chinese and Tai, and was uncertain about the affinity of Karen and
Hmong–Mien. The English translation "Sino-Tibetan" first appeared in a short note by Przyluski and
Luce in 1931.
Shafer and Benedict In 1935, the anthropologist
Alfred Kroeber started the Sino-Tibetan Philology Project, funded by the
Works Project Administration and based at the
University of California, Berkeley. The project was supervised by Robert Shafer until late 1938, and then by
Paul K. Benedict. Under their direction, the staff of 30 non-linguists collated all the available documentation of Sino-Tibetan languages. The result was eight copies of a 15-volume typescript entitled
Sino-Tibetan Linguistics. This work was never published but furnished the data for a series of papers by Shafer, as well as Shafer's five-volume
Introduction to Sino-Tibetan and Benedict's
Sino-Tibetan, a Conspectus. Benedict completed the manuscript of his work in 1941, but it was not published until 1972. Instead of building the entire family tree, he set out to reconstruct a
Proto-Tibeto-Burman language by comparing five major languages, with occasional comparisons with other languages. He reconstructed a two-way distinction on initial consonants based on voicing, with aspiration conditioned by pre-initial consonants that had been retained in Tibetic but lost in many other languages. Thus, Benedict reconstructed the following initials: Although the initial consonants of cognates tend to have the same
place and
manner of articulation, voicing and aspiration are often unpredictable. This irregularity was attacked by
Roy Andrew Miller, though Benedict's supporters attribute it to the effects of prefixes that have been lost and are often unrecoverable. The issue remains unsolved today. It was cited together with the lack of reconstructable shared morphology, and evidence that much shared lexical material has been borrowed from
Chinese into
Tibeto-Burman, by
Christopher Beckwith, one of the few scholars still arguing that Chinese is not related to Tibeto-Burman. Benedict also reconstructed, at least for Tibeto-Burman, prefixes such as the
causative s-, the
intransitive m-, and
r-,
b- g- and
d- of uncertain function, as well as suffixes
-s,
-t and
-n.
Study of literary languages Old Chinese is by far the oldest recorded Sino-Tibetan language, with inscriptions dating from around 1250 BC and a huge body of literature from the first millennium BC. However, the Chinese script is logographic and does not represent sounds systematically; it is therefore difficult to reconstruct the phonology of the language from the written records. Scholars have sought to reconstruct the
phonology of Old Chinese by comparing the obscure descriptions of the sounds of
Middle Chinese in medieval dictionaries with phonetic elements in
Chinese characters and the rhyming patterns of early poetry. The first complete reconstruction, the
Grammata Serica Recensa of
Bernard Karlgren, was used by Benedict and Shafer. Karlgren's reconstruction was somewhat unwieldy, with many sounds having a highly non-uniform distribution. Later scholars have revised it by drawing on a range of other sources. Some proposals were based on cognates in other Sino-Tibetan languages, though workers have also found solely Chinese evidence for them. For example, recent
reconstructions of Old Chinese have reduced Karlgren's 15 vowels to a six-vowel system originally suggested by
Nicholas Bodman. Similarly, Karlgren's *l has been recast as *r, with a different initial interpreted as *l, matching Tibeto-Burman cognates, but also supported by Chinese transcriptions of foreign names. A growing number of scholars believe that Old Chinese did not use tones and that the tones of Middle Chinese developed from final consonants. One of these, *-s, is believed to be a suffix, with cognates in other Sino-Tibetan languages. text found at
Turfan Tibetic has extensive written records from the adoption of writing by the
Tibetan Empire in the mid-7th century. The earliest records of
Burmese (such as the 12th-century
Myazedi inscription) are more limited, but later an extensive literature developed. Both languages are recorded in
abugida scripts ultimately derived from the
Brahmi script of Ancient India. Most comparative work has used the conservative written forms of these languages, following the dictionaries of
Jäschke (Tibetan) and
Judson (Burmese), though both contain entries from a wide range of periods. There are also extensive records in
Tangut, the language of the
Western Xia (1038–1227). Tangut is recorded in a Chinese-inspired logographic script, whose interpretation presents many difficulties, even though multilingual dictionaries have been found.
Gong Hwang-cherng has compared Old Chinese, Written Tibetan, Burmese, and Tangut to establish sound correspondences between those languages, as he identifies these four languages as the most influencial literary languages within the family for their abundance in textual evidence. He found that Tibetic and Burmese correspond to two Old Chinese vowels, *a and *ə. While this has been considered evidence for a separate Tibeto-Burman subgroup, Hill (2014) finds that Burmese has distinct correspondences for Old Chinese rhymes
-ay : *-aj and
-i : *-əj, and hence argues that the development *ə > *a occurred independently in Tibetan and Burmese. Other languages with historical texts have also come to the attention of linguists more recently, such as
Classical Newar,
Loloish languages written in the
Yi script,
Lepcha, and
Meiteilon. But higher-grade reconstructions for these languages and their subbranches are yet to be established.
Fieldwork The descriptions of non-literary languages used by Shafer and Benedict were often produced by missionaries and colonial administrators of varying linguistic skills. Most of the smaller Sino-Tibetan languages are spoken in inaccessible mountainous areas, many of which are politically or militarily sensitive and thus closed to investigators. Until the 1980s, the best-studied areas were
Nepal and northern
Thailand. In the 1980s and 1990s, new surveys were published from the Himalayas and southwestern China. Of particular interest was the increasing literature on the
Qiangic languages of western
Sichuan and adjacent areas. ==Distribution==