Precursors to IQ testing Historically, there were attempts to classify people into
intelligence categories by observing their behaviour in daily life. He hypothesized that there should exist a correlation between intelligence and other observable traits such as
reflexes, muscle grip, and
head size. He set up the first mental testing centre in the world in 1882 and he published "Inquiries into Human Faculty and Its Development" in 1883, in which he set out his theories. After gathering data on a variety of physical variables, he was unable to show any such correlation, and he eventually abandoned this research. , co-developer of the
Stanford–Binet test French psychologist
Alfred Binet and psychiatrist
Théodore Simon, had more success in 1905, when they published the
Binet–Simon Intelligence test, which focused on verbal abilities. The score on the Binet–Simon scale would reveal the child's
mental age. For example, a six-year-old child who passed all the tasks usually passed by six-year-olds—but nothing beyond—would have a mental age that matched his chronological age, 6.0. (Fancher, 1985). Binet and Simon thought that intelligence was multifaceted, but came under the control of practical judgment. In Binet and Simon's view, there were limitations with the scale and they stressed what they saw as the remarkable diversity of intelligence and the consequent need to study it using qualitative, as opposed to quantitative, measures (White, 2000). American psychologist
Henry H. Goddard published a translation of it in 1910. American psychologist
Lewis Terman at
Stanford University revised the Binet–Simon scale, which resulted in the
Stanford revision of the Binet-Simon Intelligence Scale (1916). It became the most popular test in the United States for decades. The abbreviation "IQ" was coined by the
psychologist William Stern for the
German term , his term for a scoring method for
intelligence tests at
University of Breslau he advocated in a 1912 book.
General factor (g) The many different kinds of IQ tests include a wide variety of item content. Some test items are visual, while many are verbal. Test items vary from being based on abstract-reasoning problems to concentrating on arithmetic, vocabulary, or general knowledge. The British psychologist
Charles Spearman in 1904 made the first formal
factor analysis of
correlations between the tests. He observed that children's school grades across seemingly unrelated school subjects were positively correlated, and reasoned that these correlations reflected the influence of an underlying general mental ability that entered into performance on all kinds of mental tests. He suggested that all mental performance could be conceptualized in terms of a single general ability factor and a large number of narrow task-specific ability factors. Spearman named it
g for "general factor" and labelled the specific factors or abilities for specific tasks
s. In any collection of test items that make up an IQ test, the score that best measures
g is the composite score that has the highest correlations with all the item scores. Typically, the "
g-loaded" composite score of an IQ test battery appears to involve a common strength in abstract reasoning across the test's item content.
United States military selection in World War I During World War I, the Army needed a way to evaluate and assign recruits to appropriate tasks. This led to the development of several mental tests by
Robert Yerkes, who worked with major hereditarians of American psychometrics—including Terman, Goddard—to write the test. The testing generated controversy and much public debate in the United States. Nonverbal or "performance" tests were developed for those who could not speak English or were suspected of
malingering. Subsequently, there was an increase in jobs and funding in psychology in the United States. Group intelligence tests were developed and became widely used in schools and industry. The results of these tests, which at the time reaffirmed contemporary racism and nationalism, are considered controversial and dubious, having rested on certain contested assumptions: that intelligence was heritable, innate, and could be relegated to a single number, the tests were enacted systematically, and test questions actually tested for innate intelligence rather than subsuming environmental factors. played a significant role in the history and culture of the
United States during the
Progressive Era, from the late 19th century until US involvement in
World War II. The
American eugenics movement was rooted in the
biological determinist ideas of the British Scientist
Sir Francis Galton. In 1883, Galton first used the word eugenics to describe the biological improvement of human genes and the concept of being "well-born". He believed that differences in a person's ability were acquired primarily through genetics and that eugenics could be implemented through
selective breeding in order for the human race to improve in its overall quality, therefore allowing for humans to direct their own evolution.
Henry H. Goddard was a eugenicist. In 1908, he published his own version,
The Binet and Simon Test of Intellectual Capacity, and cordially promoted the test. He quickly extended the use of the scale to the public schools (1913), to immigration (
Ellis Island, 1914) and to a court of law (1914). Unlike Galton, who promoted eugenics through selective breeding for positive traits, Goddard went with the US eugenics movement to eliminate "undesirable" traits. Goddard used the term "
feeble-minded" to refer to people who did not perform well on the test. He argued that "feeble-mindedness" was caused by heredity, and thus feeble-minded people should be prevented from giving birth, either by institutional isolation or sterilization surgeries. California's sterilization program was so effective that the Nazis turned to the government for advice on how to prevent the birth of the "unfit". While the US eugenics movement lost much of its momentum in the 1940s in view of the horrors of Nazi Germany, advocates of eugenics (including Nazi geneticist
Otmar Freiherr von Verschuer) continued to work and promote their ideas in the United States. As it becomes possible to test for and correlate genes with IQ (and its proxies), ethicists and embryonic
genetic testing companies are attempting to understand the ways in which the technology can be ethically deployed.
Cattell–Horn–Carroll theory defined
fluid and crystallized intelligence and authored the
Cattell Culture Fair III IQ test.
Raymond Cattell (1941) proposed two types of cognitive abilities in a revision of Spearman's concept of general intelligence.
Fluid intelligence (Gf) was hypothesized as the ability to solve novel problems by using reasoning, and
crystallized intelligence (Gc) was hypothesized as a knowledge-based ability that was very dependent on education and experience. In addition, fluid intelligence was hypothesized to decline with age, while crystallized intelligence was largely resistant to the effects of aging. The theory was almost forgotten, but was revived by his student
John L. Horn (1966) who later argued Gf and Gc were only two among several factors, and who eventually identified nine or ten broad abilities. The theory continued to be called Gf-Gc theory. In 1999, a merging of the Gf-Gc theory of Cattell and Horn with Carroll's Three-Stratum theory led to the Cattell–Horn–Carroll theory (CHC Theory), with
g as the top of the hierarchy, ten broad abilities below, and further subdivisions into seventy narrow abilities on the third stratum. CHC Theory has greatly influenced many broad IQ tests. According to Vygotsky, the maximum level of complexity and difficulty of problems that a child is capable to solve under some guidance indicates their level of potential development. The difference between this level of potential and the lower level of unassisted performance indicates the child's zone of proximal development. Combination of the two indexesthe level of actual and the zone of the proximal developmentaccording to Vygotsky, provides a significantly more informative indicator of psychological development than the assessment of the level of actual development alone. His ideas on the zone of development were later expanded upon in various psychological and educational theories and practicesmost notably under the banner of
dynamic assessment, which seeks to measure developmental potential. This approach is exemplified in the work of
Reuven Feuerstein and his associates, who
criticized standard IQ testing for its assumption that intelligence or cognitive functioning is "fixed and immutable". Dynamic assessment has been further elaborated in the work of
Ann Brown, and
John D. Bransford and in theories of
multiple intelligences authored by
Howard Gardner and
Robert Sternberg.
J.P. Guilford's
Structure of Intellect (1967) model of intelligence used three dimensions, which, when combined, yielded a total of 120 types of intelligence. It was popular in the 1970s and early 1980s, but faded owing to both practical problems and
theoretical criticisms. It has influenced some recent IQ tests, and been seen as a complement to the Cattell–Horn–Carroll theory described above. The most commonly used individual IQ test series is the
Wechsler Adult Intelligence Scale (WAIS) for adults and the
Wechsler Intelligence Scale for Children (WISC) for school-age test-takers. Other commonly used individual IQ tests (some of which do not label their standard scores as "IQ" scores) include the updated
Stanford–Binet Intelligence Scales,
Woodcock–Johnson Tests of Cognitive Abilities, the
Kaufman Assessment Battery for Children, the
Cognitive Assessment System, and the
Differential Ability Scales. There are various other IQ tests, including: •
Raven's Progressive Matrices (RPM) •
Cattell Culture Fair III (CFIT) •
Reynolds Intellectual Assessment Scales (RIAS) •
Thurstone's Primary Mental Abilities •
Kaufman Brief Intelligence Test (KBIT) •
Multidimensional Aptitude Battery II •
Das–Naglieri Cognitive Assessment System (CAS) •
Naglieri Nonverbal Ability Test (NNAT) •
Wide Range Intelligence Test (WRIT) IQ scales are
ordinally scaled. The
raw score of the norming
sample is usually (rank order)
transformed to a
normal distribution with mean 100 and
standard deviation 15. While one standard deviation is 15 points, and two SDs are 30 points, and so on, this does not imply that mental ability is linearly related to IQ, such that IQ 50 would mean half the cognitive ability of IQ 100. In particular, IQ points are not percentage points. ==Reliability and validity==