Written Chinese is not based on an alphabet or syllabary. Most characters can be analyzed as compounds of smaller components, which may be assembled according to several different principles. Characters and components may reflect aspects of meaning or pronunciation. The best-known exposition of Chinese character composition is the Shuowen Jiezi, compiled by Xu Shen. Xu did not have access to the earliest forms of Chinese characters, and his analysis is not considered to fully capture the nature of the writing system. Nevertheless, no later work has supplanted the Shuowen Jiezi in terms of breadth, and it is still relevant to etymological research today.
=== Derivation of characters ===
According to the Shuowen Jiezi, Chinese characters are developed on six basic principles. (These principles, though popularized by the Shuowen Jiezi, were developed earlier; the oldest known mention of them is in the Rites of Zhou, a text from .) The first two principles produce simple characters, known as 文 (wén):
• Pictographs (象形; xiàngxíng): in which the character is a graphical depiction of the object it denotes.
Examples: , , .
• Indicatives (指事; zhǐshì): in which the character represents an abstract notion.
Examples: , , .

The remaining four principles produce complex characters, historically called 字 (zì), though this term is now generally used to refer to all characters, whether simple or complex. Of these four, two construct characters from simpler parts:
• Ideographic compounds (會意; huìyì): in which two or more parts are used for their meaning. This yields a composite meaning, which is then applied to the new character.
Example: 東 (dōng, 'east'), which represents a sun rising in the trees.
• Phono-semantic compounds (形聲; xíngshēng): in which one part, often called the radical, indicates the general semantic category of the character, such as being related to water or eyes, with the other part being another character used for its phonetic value.
Example: , which is composed of , and , which is used for its pronunciation.

The last two principles do not produce new written forms; they instead transfer new meanings to existing forms:
• Transference (轉注; zhuǎnzhù): in which a character, often with a simple, concrete meaning, takes on an extended, more abstract meaning.
Example: 网 (wǎng), which was originally a pictograph depicting a fishing net. Over time, it has taken on an extended meaning, covering any kind of lattice: for instance, it is the word used to refer to computer networks.
• Loangraphs (假借; jiǎjiè): in which a character is used, either intentionally or accidentally, for some entirely different purpose.
Example: 哥 (gē) is not attested in formal writing prior to the Tang dynasty, and was created from the leftmost component of the more ancient character 歌. The ancient character 兄, meaning 'elder brother', continues to be used in idioms and formal writing, whereas 哥 is used in daily conversation in most Chinese dialects. Some dialects, such as Minnan, which retain features of spoken Old Chinese, continue to use 兄 exclusively for 'elder brother' in daily conversation.

In contrast to the popular conception of written Chinese as ideographic, the vast majority of characters (about 95% of those in the Shuowen Jiezi) either reflect elements of pronunciation or are logical aggregates. In fact, some phonetic complexes were originally simple pictographs that were later augmented by the addition of a semantic root. An example is 炷 (zhù), now archaic, which was originally a pictograph of a lamp stand, 主, a character that is now pronounced zhǔ and means 'host'; the character 火 ('fire') was added to indicate that the meaning is fire-related.

Chinese characters are written to fit into a square, even when composed of two simpler forms written side-by-side or top-to-bottom. In such cases, each form is compressed to fit the entire character into a square.
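The compression of side-by-side or top-to-bottom forms into a single square can be illustrated with a minimal sketch. It borrows the Unicode Ideographic Description Characters U+2FF0 and U+2FF1 to label the two arrangements. The `Compound` class, its field names, and the even split of the square are assumptions made purely for illustration; real typefaces proportion components unevenly, and the example characters 明 and 男 are chosen here, not taken from the text above.

```python
from dataclasses import dataclass

# Unicode Ideographic Description Characters (U+2FF0, U+2FF1).
LEFT_RIGHT = "\u2ff0"  # components written side by side
TOP_BOTTOM = "\u2ff1"  # components written top to bottom


@dataclass(frozen=True)
class Compound:
    """Hypothetical record for a character built from simpler forms."""
    char: str     # the composed character
    layout: str   # LEFT_RIGHT or TOP_BOTTOM
    parts: tuple  # component characters, in writing order

    def description(self) -> str:
        """Layout marker followed by the components, e.g. marker + '日月'."""
        return self.layout + "".join(self.parts)

    def part_boxes(self, size: float = 1.0) -> list:
        """Return (x, y, width, height) for each part inside one square.

        Assumption: the square is divided evenly among the parts.
        """
        n = len(self.parts)
        if self.layout == LEFT_RIGHT:
            return [(i * size / n, 0.0, size / n, size) for i in range(n)]
        if self.layout == TOP_BOTTOM:
            return [(0.0, i * size / n, size, size / n) for i in range(n)]
        raise ValueError(f"unsupported layout: {self.layout!r}")


ming = Compound("明", LEFT_RIGHT, ("日", "月"))
nan = Compound("男", TOP_BOTTOM, ("田", "力"))
print(ming.description(), ming.part_boxes())
print(nan.description(), nan.part_boxes())
```

Whatever the arrangement, every box stays inside the same unit square, which is the property the paragraph above describes.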
=== Strokes ===
Character components can be further subdivided into individual written strokes. The strokes of Chinese characters fall into eight main categories: "horizontal", "vertical", "left-falling", "right-falling", "rising", "dot", "hook", and "turning".

There are eight basic rules of stroke order in writing a Chinese character, which apply only generally and are sometimes violated:
• Horizontal strokes are written before vertical ones.
• Left-falling strokes are written before right-falling ones.
• Characters are written from top to bottom.
• Characters are written from left to right.
• If a character is framed from above, the frame is written first.
• If a character is framed from below, the frame is written last.
• Frames are closed last.
• In a symmetrical character, the middle is drawn first, then the sides.
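As a rough illustration of the stroke categories and the first two ordering rules, the sketch below models a character as a sequence of stroke categories and checks whether one category always precedes another. The `Stroke` enum, the `STROKE_ORDER` table, and `follows_precedence` are hypothetical names introduced here, and the four sample characters are chosen for illustration. The check is deliberately naive: it looks at the whole character, whereas the rules apply only generally, so a failed check does not by itself mean a character is written incorrectly.

```python
from enum import Enum


class Stroke(Enum):
    """The eight main stroke categories named in the text."""
    HORIZONTAL = "horizontal"
    VERTICAL = "vertical"
    LEFT_FALLING = "left-falling"
    RIGHT_FALLING = "right-falling"
    RISING = "rising"
    DOT = "dot"
    HOOK = "hook"
    TURNING = "turning"


# Illustrative stroke sequences for a few simple characters.
STROKE_ORDER = {
    "十": [Stroke.HORIZONTAL, Stroke.VERTICAL],
    "人": [Stroke.LEFT_FALLING, Stroke.RIGHT_FALLING],
    "大": [Stroke.HORIZONTAL, Stroke.LEFT_FALLING, Stroke.RIGHT_FALLING],
    "三": [Stroke.HORIZONTAL, Stroke.HORIZONTAL, Stroke.HORIZONTAL],
}


def follows_precedence(strokes, first, second) -> bool:
    """Return True if no `first` stroke appears after a `second` stroke."""
    seen_second = False
    for stroke in strokes:
        if stroke is second:
            seen_second = True
        elif stroke is first and seen_second:
            return False
    return True


for char, strokes in STROKE_ORDER.items():
    h_before_v = follows_precedence(strokes, Stroke.HORIZONTAL, Stroke.VERTICAL)
    l_before_r = follows_precedence(
        strokes, Stroke.LEFT_FALLING, Stroke.RIGHT_FALLING
    )
    print(char, len(strokes), h_before_v, l_before_r)
```

A sequence that lacks one of the two categories passes trivially, which is why 三 satisfies both checks.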
=== Layout ===
As characters are essentially rectilinear and are not joined with one another, written Chinese does not require a set orientation. Chinese texts were traditionally written in columns from top to bottom, which were laid out from right to left. Prior to the 20th century, Literary Chinese used little to no punctuation, with the breaks between sentences and phrases determined largely by context and the rhythms implied by patterns of syllables.

In the 20th century, the layout used in Western scripts, in which text is written in rows from left to right that are laid out from top to bottom, became predominant in mainland China, where it was mandated by the Chinese government in 1955. Vertical layouts are still used for aesthetic effect, or when space limitations require it, such as on signage or book spines. The government of Taiwan followed suit in 2004 for official documents, but vertical layouts have persisted in some books and newspapers. Less frequently, Chinese is written in rows from right to left, usually on signage or banners, though a left-to-right orientation remains more common.

The use of punctuation has also become more common. In general, punctuation occupies the width of a full character, such that text remains visually well-aligned in a grid. Punctuation used in simplified Chinese shows clear influence from that used in Western scripts, though some marks are particular to Asian languages. For example, there are double and single quotation marks (『 』 and 「 」), and a hollow full stop (。), which is used to separate sentences in the same way as a Western full stop. A special mark called an enumeration comma (、) is used to separate items in a list, as opposed to the clauses in a sentence.

== History ==