Because of its rather large vocabulary for an
auxlang (as of April 2021, the full Sambahsa-English dictionary contained more than 19,500 entries), it is difficult to assess the share of each language in Sambahsa's eclectic wordstock. However, the main layers are (either reconstructed or extrapolated) Indo-European vocabulary, Greco-Roman scientific and technical vocabulary (which is not discussed below, as it is more or less comparable to what is found in English) and multiple sources extending from Western Europe to Eastern Asia.
Indo-European vocabulary The core of Sambahsa's vocabulary is undoubtedly of Indo-European origin. Only a few Sambahsa words can be traced back to pre-Indo-European times (like , '
chamois', cf.
Basque: ). Many basic Sambahsa words are thus very close to their reconstructed Indo-European counterparts. See (Sambahsa/Proto-Indo-European):
/ ('hedgehog'),
/ ('gland'),
/ ('to comb'),
/ ('to jump'),
/ ('fist'),
/ ('weevil'),
/ ('to go'),
/ ('yew wood' in Sambahsa; 'yew' in PIE),
/ ('dwelling'),
/ ('oath'),
/ ('Sir, lord'). But less attested Indo-European vocabulary is found in Sambahsa too. For example, the common Sambahsa word for
person is , as in , 'someone, somebody', and can be derived from PIE , only found in
Old Armenian ('person') and
Old Norse ('smell'). And ('hoe') may be a cognate of
Old Church Slavonic and English
mattock.
Further development from the Indo-European background Though Sambahsa, like any other conlang, has derivation rules, it sometimes uses
backformation too. For example, the relation between
Lithuanian ('companion'), Old Greek ('father-in-law') and Sanskrit ('companion') is uncertain; however Sambahsa "reconstructs" this root as from
behnd 'to bind'. PIE has 'earth' and (with
nasal infix) 'to shape, to make pottery'; accordingly, Sambahsa has and , but the latter can be understood as "to put earth on" if we refer to ('yoke') and ('to join'), both from PIE and . The Sambahsa word for 'ice pellet' is ; it rests on the word 'frozen snow', itself from
Old Norse , Lithuanian ('frost') and Russian . But the
suffix -it was abstracted from PIE words like 'grain of wheat' and 'grain of barley'; thus can be understood as 'a grain of frozen snow'.
Words common to different language families A characteristic of Sambahsa is to include words found in different language families, while the most famous
auxiliary languages tend to limit themselves to a compilation of
Romance vocabulary with some borrowings from the
Germanic languages. For example: • ('cupboard') has cognates both in Germanic and
Slavic languages: Russian , Polish , Ukrainian , Danish , Icelandic , Franconian dialect and Swedish . • ('count', as a nobility title) is a German word from Greek that has been borrowed into many languages including Azerbaijani , Bulgarian , Czech , Danish , Estonian , Croatian , Hungarian , Finnish , Lithuanian , Icelandic and Russian . • ('mug') is found in German and many other Germanic languages. It comes from
Low Latin and is at the origin of Hungarian , Italian and Romanian , all meaning 'glass'. • means 'big hall, palace' and has the same Turkish and Persian origin as English
seraglio but with a meaning closer to its etymology and to Russian ('barn'). ===The
Balkan sprachbund=== Though they belong to different language families, the languages spoken in Southeast Europe share a number of common grammatical features and of loanwords due to their historical background. That is why Sambahsa includes words from this region. • ('hornless') corresponds to Romanian , Bulgarian/Serbo-Croatian ; also Albanian 'hornless'. • ('pitcher') comes from Old Greek , like Serbo-Croatian , Russian , Romanian and Albanian . • ('coating') comes from Greek , which has given, among others, Romanian ('brick') and Arabic 'tile'.
Words from Arabic and Persian A significant part of Sambahsa's vocabulary comes from
Arabic and
Persian. Both languages have extensively provided loanwords to a lexical continuum ranging from the Atlantic Ocean to Indonesia because, respectively, of the spread of
Islam and the brilliance of the former Persian civilization. Sambahsa learning materials often call this stratum "Muslim". • ('assets') comes from Arabic and is found in Turkish ('estate') and Persian . • ('adultery') comes from Arabic and is found in Persian and many other languages spoken by a majority of Muslims. • ('merlon') comes from Spanish and Portuguese from Arabic and ultimately Persian which has its origin in PIE like Sambahsa ('door').
Sinitic vocabulary Classical Chinese has heavily influenced the wordstock of neighbouring languages, mostly Japanese, Korean and Vietnamese. As a result, Sambahsa incorporates some Sinitic vocabulary, but the phonetic differences between these various languages can be high. • ('goldfish') corresponds to , which is read in Mandarin Pinyin and in Japanese. • ('fortified palace') corresponds to the Han character read in Mandarin Pinyin, in Japanese
Go-on reading, in Korean, and in Vietnamese. • ('humankind') in an attempt to balance Mandarin , Japanese , Korean , and other renderings of . This word showcases some seeming flaws of the language's approach to be reminiscent of all targeted languages at once. Not all Sambahsa Sinitic words come from Classical Chinese. The
Min Nan language of Southern China provided loanwords to some South-East Asian languages, and some of these borrowings are, in turn, found in Sambahsa. • Sambahsa ('
wonton') is an Indonesian word from Min Nan , while Mandarin Chinese (Pinyin) has . • Likewise, Sambahsa ('attic') comes from Min Nan through Indonesian
loteng. ==Sample phrases==