Robert Weber notes: "To make valid inferences from the text, it is important that the classification procedure be reliable in the sense of being consistent: Different people should code the same text in the same way". The validity, inter-coder reliability and intra-coder reliability are subject to intense methodological research efforts over long years. Lacy and Riffe identify the measurement of inter-coder reliability as a strength of quantitative content analysis, arguing that, if content analysts do not measure inter-coder reliability, their data are no more reliable than the subjective impressions of a single reader. According to today's reporting standards, quantitative content analyses should be published with complete codebooks and for all variables or measures in the codebook the appropriate inter-coder or
inter-rater reliability coefficients should be reported based on empirical pre-tests. Furthermore, the
validity of all variables or measures in the codebook must be ensured. This can be achieved through the use of established measures that have proven their validity in earlier studies. Also, the
content validity of the measures can be checked by experts from the field who scrutinize and then approve or correct coding instructions, definitions and examples in the codebook.
Kinds of text There are five types of texts in content analysis: •
written text, such as books and papers • oral text, such as speech and theatrical performance • iconic text, such as drawings, paintings, and icons • audio-visual text, such as TV programs, movies, and videos •
hypertexts, which are texts found on the Internet
History Content analysis is research using the categorization and classification of speech, written text, interviews, images, or other forms of communication. In its beginnings, using the first newspapers at the end of the 19th century, analysis was done manually by measuring the number of columns given a subject. The approach can also be traced back to a university student studying patterns in Shakespeare's literature in 1893. Over the years, content analysis has been applied to a variety of scopes.
Hermeneutics and
philology have long used content analysis to interpret sacred and profane texts and, in many cases, to attribute texts'
authorship and
authenticity. In recent times, particularly with the advent of
mass communication, content analysis has known an increasing use to deeply analyze and understand media content and media logic. The political scientist
Harold Lasswell formulated the core questions of content analysis in its early-mid 20th-century mainstream version: "Who says what, to whom, why, to what extent and with what effect?". The strong emphasis for a quantitative approach started up by Lasswell was finally carried out by another "father" of content analysis,
Bernard Berelson, who proposed a definition of content analysis which, from this point of view, is emblematic: "a research technique for the objective, systematic and quantitative description of the manifest content of communication". Quantitative content analysis has enjoyed a renewed popularity in recent years thanks to technological advances, being fruitfully applied in mass and personal communication research. Content analysis of textual
big data produced by
new media, particularly
social media and
mobile devices has become popular. These approaches take a simplified view of language that ignores the complexity of
semiosis, the process by which meaning is formed out of language. Quantitative content analysts have been criticized for limiting the scope of content analysis to simple counting, and for applying the measurement methodologies of the natural sciences without reflecting critically on their appropriateness to social science. Conversely, qualitative content analysts have been criticized for being insufficiently systematic and too impressionistic.
Latent and manifest content Manifest content is readily understandable at its face value. Its meaning is direct. Latent content is not as overt, and requires interpretation to uncover the meaning or implication. ==Uses==