Latin text Latin text in with typical capital letters, taken from
Cicero's
De finibus bonorum et malorum: • Which in modern punctuation is: • • "Nobody likes pain for its own sake, or looks for it and wants to have it, just because it is pain..." With ancient Latin punctuation is:
Greek text Greek text in with typical capital letters, taken from
Hesiod's
Theogony: • Which in modern punctuation is: • • "From the Heliconian Muses let us begin to sing, who hold the great and holy mount of Helicon, and dance on soft feet about the deep-blue spring and the altar of the almighty son of Cronos,"
Hebrew text Hebrew text is well known for lacking punctuation for many centuries. Modern versions of the language gradually amended those features.
Runic text The entire Swedish
Rök runestone is written in , which poses problems for scholars attempting to translate it. One example is a phrase repeated several times, . Interpretations proposed include 'let us say the memory to
Yggr', 'let us say the folk-memory', and 'let us say to the group of young men'.
Modern Latin script A form of has become common in internet e-mail addresses and
domain names where, because the "space" character is invalid, the address for a website for "Example Fake Website" is written as examplefakewebsite.com – without spaces between the separate words. However, the "underscore" or "dash" characters are often used as stand-ins for the "space" character when its use would be invalid and their use would not be. As another example, so-called
camel case—in which the first letter of each word is capitalized—has become part of the culture of many
computer programming languages. In this context, names of
variables and
subroutines as well as other
identifiers are rendered easier to read, as in MaxDataRate. Camel case can also eliminate ambiguity: CharTable might name a table of characters, whereas Chartable could ask or answer the question, "Can (something) be charted?"
Chinese language Chinese does not encounter the problem of incorporating spaces into text because, unlike most
writing systems, Chinese characters represent
morphemes and not phonemes. However, sentences can still be ambiguous due to a lack of punctuation and/or word breaks. One Chinese joke concerns a contract between a landlord and a poor scholar, which was written without punctuation and thus was interpreted in two different ways:
Japanese script Japanese implements extensive use of
Chinese characters—called
kanji in Japanese. However, due to the radical differences between the Chinese and Japanese languages, writing Japanese exclusively in kanji would make it extremely difficult to read. This can be seen in texts that predate the modern
kana system, in which Japanese was written entirely in kanji and , the latter of which are written solely to indicate a word's pronunciation as opposed to its meaning. For that reason, different
syllabary systems called kana were developed to differentiate phonetic
graphemes from
ideographic ones. Modern Japanese is typically written using three different types of graphemes, the first being kanji and the latter two being kana systems, the cursive
hiragana and the angular
katakana. While spaces are not normally used in writing, boundaries between words are often quickly perceived by Japanese speakers since kana are usually visually distinct from kanji. Japanese speakers also know that certain words, morphemes, and parts of speech are typically written using one of the three systems. Kanji is typically used for words of Japanese and Chinese origin as well as
content words (e.g. nouns, verbs, adjectives, adverbs). Hiragana is typically used for native Japanese words, as well as commonly known words, phrases, and
grammatical particles, as well as inflections of content words like verbs, adjectives, and adverbs. Katakana is typically used for loanwords from languages other than Chinese,
onomatopoeia, and emphasized words. Like Chinese, Japanese lacked any sort of punctuation until interaction with Western civilizations became more common. Punctuation was adopted during the
Meiji period.
Thai script Modern Thai script, which was said to have been created by King
Ram Khamhaeng in 1283, does not contain any spaces between words. Spaces indicate only the clear endings of clauses or sentences. Below is a sample sentence of Thai written first without spaces between words (with Thai romanization in parentheses), second in Thai with spaces between words (also with Thai romanization in parentheses), and then finally translated into English. For example, "ในน้ำมีปลา ในนามีข้าว" (pronounced "
nai nam mi pla nai na mi khao", meaning "In the water there are fish; in the paddy fields there is rice.") can also be written as "ใน น้ำ มี ปลา ใน นา มี ข้าว".
Javanese script This example shows the first line of the
Universal Declaration of Human Rights in
Javanese script, and a case of the text being divided, as in some modern writing, by spaces and dash signs, which look different. • • : • • : • All human beings are born free and equal in dignity and rights. Because of the absence of space, in computer typography, the line-break have to be inserted manually, otherwise a long sentence will not break into new lines. Some computer input methods have put
zero-width space (ZWS) instead for word break, which would then break the long sentences into multiple lanes, but the drawback of that method is it will not render the writing correctly. • ("incorrect" words include the first two words, which in joined form would looks like ) Another way to type it is to put ZWSs not after every word, but only in words that ended with a vowel (open syllable), or special final consonant -r, -h, and -ng. Thus it will look indistinguishable from the correct output above, but the machine will read it with some "spaces" (illustrated by the Latin transliteration below) that allows for line break. • • :
Arabic script Before typewriters, computers and smartphones changed the way of writing, Arabic was written continuously. That is easy because 22 letters in Arabic have final, medial and initial forms, which is comparable to the initial capital form for the Latin alphabet since the
Renaissance. Six or seven letters in Arabic have only a final form (namely , , , , and , as well as ) and whenever they occur in a word they are followed by space that was originally as wide as the space between words, creating a clear visual break. There was no hyphenation either. In the early Quranic manuscripts, all diacritics in the Arabic script were also omitted because pointing or other diacritics did not exist in the Arabic script until the early 2nd millennium, and this form is called
rasm. Rasm is also written continuously without spacing. In all early manuscripts, words were finished on the next line or, in many
Quranic manuscripts, even on the next page. The letter
hamza is the only letter of the
Arabic alphabet that lacks a final, initial or medial form, only its alone or isolated form, as it is an unlinked letter.
Punjabi (Gurmukhi) script Before the late 1960s and the early 1970s,
Gurbani and other
Sikh scriptures were written in the traditional method of writing the
Gurmukhi script known as
larivār where there were no spacing between words in the texts (interpuncts in the form of a dot were used by some to differentiate between words, such as by
Guru Arjan). This is opposed to the comparatively more recent method of writing in Gurmukhi known as
pad ched, which breaks the words by inserting spacing between them.
Numbers Before the invention of delimiters and other punctuation to set off groups of three digits in numbers above four digits, large numbers (e.g. numbers greater than 999) were written continuously. As of now, only numbers with fewer than four digits are written with no delimiter or other punctuation. This manner is somewhat similar how to separate a word in a sentence. While numbers up to four digits are recommended for separating three digits, there are some of them are not. These include most
Slavic languages,
Spanish,
Hungarian and
Swiss German. These languages do not use a delimiter to separate numbers in four digits.
English sometimes follows this practice. == See also ==