Overview . This diagram represents one variant, and
there are many others. The scientific method is the process by which
science is carried out. As in other areas of inquiry, science (through the scientific method) can build on previous knowledge, and unify understanding of its studied topics over time. Historically, the development of the scientific method was critical to the
Scientific Revolution. The overall process involves making conjectures (
hypotheses), predicting their logical consequences, then carrying out experiments based on those predictions to determine whether the original conjecture was correct. It is generally recognized to develop advances in knowledge through the following elements, in varying combinations or contributions: In this sense, it is not a mindless set of standards and procedures to follow but is rather an
ongoing cycle, constantly developing more useful, accurate, and comprehensive models and methods. For example, when Einstein developed the Special and General Theories of Relativity, he did not in any way refute or discount Newton's
Principia. On the contrary, if the astronomically massive, the feather-light, and the extremely fast are removed from Einstein's theories – all phenomena Newton could not have observed – Newton's equations are what remain. Einstein's theories are expansions and refinements of Newton's theories and, thus, increase confidence in Newton's work. An iterative, • Define a question • Gather information and resources (observe) • Form an explanatory hypothesis • Test the hypothesis by performing an experiment and collecting data in a
reproducible manner • Analyze the data • Interpret the data and draw conclusions that serve as a starting point for a new hypothesis • Publish results • Retest (frequently done by other scientists) The iterative cycle inherent in this step-by-step method goes from point 3 to 6 and back to 3 again. While this schema outlines a typical hypothesis/testing method, many philosophers, historians, and sociologists of science, including
Paul Feyerabend, claim that such descriptions of scientific method have little relation to the ways that science is actually practiced.
Characterizations The basic elements of the scientific method are illustrated by the following example (which occurred from 1944 to 1953) from the discovery of the structure of DNA (marked with and indented). In 1950, it was known that
genetic inheritance had a mathematical description, starting with the studies of
Gregor Mendel, and that DNA contained genetic information (Oswald Avery's
transforming principle). But the mechanism of storing genetic information (i.e., genes) in DNA was unclear. Researchers in
Bragg's laboratory at
Cambridge University made
X-ray diffraction pictures of various
molecules, starting with
crystals of
salt, and proceeding to more complicated substances. Using clues painstakingly assembled over decades, beginning with its chemical composition, it was determined that it should be possible to characterize the physical structure of DNA, and the X-ray images would be the vehicle. The scientific method depends upon increasingly sophisticated characterizations of the subjects of investigation. (The
subjects can also be called
unsolved problems or the
unknowns.) For example,
Benjamin Franklin conjectured, correctly, that
St. Elmo's fire was
electrical in
nature, but it has taken a long series of experiments and theoretical changes to establish this. While seeking the pertinent properties of the subjects, careful thought may also
entail some definitions and
observations; these observations often demand careful
measurements and/or counting can take the form of expansive
empirical research. A
scientific question can refer to the explanation of a specific
observation, as in "Why is the sky blue?" but can also be open-ended, as in "How can I
design a drug to cure this particular disease?" This stage frequently involves finding and evaluating evidence from previous experiments, personal scientific observations or assertions, as well as the work of other scientists. If the answer is already known, a different question that builds on the evidence can be posed. When applying the scientific method to research, determining a good question can be very difficult and it will affect the outcome of the investigation. The systematic, careful collection of measurements or counts of relevant quantities is often the critical difference between
pseudo-sciences, such as alchemy, and science, such as chemistry or biology. Scientific measurements are usually tabulated, graphed, or mapped, and statistical manipulations, such as
correlation and
regression, performed on them. The measurements might be made in a controlled setting, such as a laboratory, or made on more or less inaccessible or unmanipulatable objects such as stars or human populations. The measurements often require specialized
scientific instruments such as
thermometers,
spectroscopes,
particle accelerators, or
voltmeters, and the progress of a scientific field is usually intimately tied to their invention and improvement.
Definition The scientific definition of a term sometimes differs substantially from its
natural language usage. For example,
mass and
weight overlap in meaning in common discourse, but have distinct meanings in
mechanics. Scientific quantities are often characterized by their
units of measure which can later be described in terms of conventional physical units when communicating the work. New theories are sometimes developed after realizing certain terms have not previously been sufficiently clearly defined. For example,
Albert Einstein's first paper on
relativity begins by defining
simultaneity and the means for determining
length. These ideas were skipped over by
Isaac Newton with, "I do not define
time, space, place and
motion, as being well known to all." Einstein's paper then demonstrates that they (viz., absolute time and length independent of motion) were approximations.
Francis Crick cautions us that when characterizing a subject, however, it can be premature to define something when it remains ill-understood. In Crick's study of
consciousness, he actually found it easier to study
awareness in the
visual system, rather than to study
free will, for example. His cautionary example was the gene; the gene was much more poorly understood before Watson and Crick's pioneering discovery of the structure of DNA; it would have been counterproductive to spend much time on the definition of the gene, before them.
Hypothesis development Linus Pauling proposed that DNA might be a
triple helix. This hypothesis was also considered by
Francis Crick and
James D. Watson but discarded. When Watson and Crick learned of Pauling's hypothesis, they understood from existing data that Pauling was wrong. and that Pauling would soon admit his difficulties with that structure. A
hypothesis is a suggested explanation of a phenomenon, or alternately a reasoned proposal suggesting a possible correlation between or among a set of phenomena. Normally, hypotheses have the form of a
mathematical model. Sometimes, but not always, they can also be formulated as
existential statements, stating that some particular instance of the phenomenon being studied has some characteristic and causal explanations, which have the general form of
universal statements, stating that every instance of the phenomenon has a particular characteristic. Scientists are free to use whatever resources they have – their own creativity, ideas from other fields,
inductive reasoning,
Bayesian inference, and so on – to imagine possible explanations for a phenomenon under study. Albert Einstein once observed that "there is no logical bridge between phenomena and their theoretical principles."
Charles Sanders Peirce, borrowing a page from
Aristotle (
Prior Analytics,
2.25) described the incipient stages of
inquiry, instigated by the "irritation of doubt" to venture a plausible guess, as
abductive reasoning. The history of science is filled with stories of scientists claiming a "flash of inspiration", or a hunch, which then motivated them to look for evidence to support or refute their idea.
Michael Polanyi made such creativity the centerpiece of his discussion of methodology.
William Glen observes that In general, scientists tend to look for theories that are "
elegant" or "
beautiful". Scientists often use these terms to refer to a theory that is following the known facts but is nevertheless relatively simple and easy to handle.
Occam's Razor serves as a rule of thumb for choosing the most desirable amongst a group of equally explanatory hypotheses. To minimize the
confirmation bias that results from entertaining a single hypothesis,
strong inference emphasizes the need for entertaining multiple alternative hypotheses, and avoiding artifacts.
Predictions from the hypothesis James D. Watson,
Francis Crick, and others hypothesized that DNA had a helical structure. This implied that DNA's X-ray diffraction pattern would be 'x shaped'. This prediction followed from the work of Cochran, Crick and Vand (and independently by Stokes). The Cochran-Crick-Vand-Stokes theorem provided a mathematical explanation for the empirical observation that diffraction from helical structures produces x-shaped patterns. In their first paper, Watson and Crick also noted that the
double helix structure they proposed provided a simple mechanism for
DNA replication, writing, "It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material".Any useful hypothesis will enable
predictions, by
reasoning including
deductive reasoning. It might predict the outcome of an experiment in a laboratory setting or the observation of a phenomenon in nature. The prediction can also be statistical and deal only with probabilities. It is essential that the outcome of testing such a prediction be currently unknown. Only in this case does a successful outcome increase the probability that the hypothesis is true. If the outcome is already known, it is called a consequence and should have already been considered while
formulating the hypothesis. If the predictions are not accessible by observation or experience, the hypothesis is not yet
testable and so will remain to that extent unscientific in a strict sense. A new technology or theory might make the necessary experiments feasible. For example, while a hypothesis on the existence of other intelligent species may be convincing with scientifically based speculation, no known experiment can test this hypothesis. Therefore, science itself can have little to say about the possibility. In the future, a new technique may allow for an experimental test and the speculation would then become part of accepted science. For example, Einstein's theory of
general relativity makes several specific predictions about the observable structure of
spacetime, such as that
light bends in a
gravitational field, and that the amount of bending depends in a precise way on the strength of that gravitational field.
Arthur Eddington's
observations made during a 1919 solar eclipse supported General Relativity rather than Newtonian
gravitation.
Experiments Watson and Crick showed an initial (and incorrect) proposal for the structure of DNA to a team from
King's College London –
Rosalind Franklin,
Maurice Wilkins, and
Raymond Gosling. Franklin immediately spotted the flaws which concerned the water content. Later Watson saw Franklin's
photo 51, a detailed X-ray diffraction image, which showed an X-shape and was able to confirm the structure was helical. Once predictions are made, they can be sought by experiments. If the test results contradict the predictions, the hypotheses which entailed them are called into question and become less tenable. Sometimes the experiments are conducted incorrectly or are not very well designed when compared to a
crucial experiment. If the experimental results confirm the predictions, then the hypotheses are considered more likely to be correct, but might still be wrong and continue to be subject to
further testing. The
experimental control is a technique for dealing with observational error. This technique uses the contrast between multiple samples, or observations, or populations, under differing conditions, to see what varies or what remains the same. We vary the conditions for the acts of measurement, to help isolate what has changed.
Mill's canons can then help us figure out what the important factor is.
Factor analysis is one technique for discovering the important factor in an effect. Depending on the predictions, the experiments can have different shapes. It could be a classical experiment in a laboratory setting, a
double-blind study or an archaeological
excavation. Even taking a plane from
New York to
Paris is an experiment that tests the
aerodynamical hypotheses used for constructing the plane. These institutions thereby reduce the research function to a cost/benefit, Current large instruments, such as CERN's
Large Hadron Collider (LHC), or
LIGO, or the
National Ignition Facility (NIF), or the
International Space Station (ISS), or the
James Webb Space Telescope (JWST), entail expected costs of billions of dollars, and timeframes extending over decades. These kinds of institutions affect public policy, on a national or even international basis, and the researchers would require shared access to such machines and their
adjunct infrastructure. Scientists assume an attitude of openness and accountability on the part of those experimenting. Detailed record-keeping is essential, to aid in recording and reporting on the experimental results, and supports the effectiveness and integrity of the procedure. They will also assist in reproducing the experimental results, likely by others. Traces of this approach can be seen in the work of
Hipparchus (190–120 BCE), when determining a value for the precession of the Earth, while
controlled experiments can be seen in the works of
al-Battani (853–929 CE) and
Alhazen (965–1039 CE).
Communication and iteration Watson and Crick then produced their model, using this information along with the previously known information about DNA's composition, especially Chargaff's rules of base pairing. After considerable fruitless experimentation, being discouraged by their superior from continuing, and numerous false starts, Watson and Crick were able to infer the essential structure of
DNA by concrete
modeling of the physical shapes of the
nucleotides which comprise it. They were guided by the bond lengths which had been deduced by
Linus Pauling and by
Rosalind Franklin's X-ray diffraction images. The scientific method is iterative. At any stage, it is possible to refine its
accuracy and precision, so that some consideration will lead the scientist to repeat an earlier part of the process. Failure to develop an interesting hypothesis may lead a scientist to re-define the subject under consideration. Failure of a hypothesis to produce interesting and testable predictions may lead to reconsideration of the hypothesis or of the definition of the subject. Failure of an experiment to produce interesting results may lead a scientist to reconsider the experimental method, the hypothesis, or the definition of the subject. This manner of iteration can span decades and sometimes centuries.
Published papers can be built upon. For example: By 1027,
Alhazen, based on his measurements of the
refraction of light, was able to deduce that
outer space was less dense than
air, that is: "the body of the heavens is rarer than the body of air". In 1079
Ibn Mu'adh's
Treatise On Twilight was able to infer that Earth's atmosphere was 50 miles thick, based on
atmospheric refraction of the sun's rays. This is why the scientific method is often represented as circular – new information leads to new characterisations, and the cycle of science continues. Measurements collected
can be archived, passed onwards and used by others. Other scientists may start their own research and
enter the process at any stage. They might adopt the characterization and formulate their own hypothesis, or they might adopt the hypothesis and deduce their own predictions. Often the experiment is not done by the person who made the prediction, and the characterization is based on experiments done by someone else. Published results of experiments can also serve as a hypothesis predicting their own reproducibility.
Confirmation Science is a social enterprise, and scientific work tends to be accepted by the scientific community when it has been confirmed. Crucially, experimental and theoretical results must be reproduced by others within the scientific community. Researchers have given their lives for this vision;
Georg Wilhelm Richmann was killed by
ball lightning (1753) when attempting to replicate the 1752 kite-flying experiment of
Benjamin Franklin. If an experiment cannot be
repeated to produce the same results, this implies that the original results might have been in error. As a result, it is common for a single experiment to be performed multiple times, especially when there are uncontrolled variables or other indications of
experimental error. For significant or surprising results, other scientists may also attempt to replicate the results for themselves, especially if those results would be important to their own work. Replication has become a contentious issue in social and biomedical science where treatments are administered to groups of individuals. Typically an
experimental group gets the treatment, such as a drug, and the
control group gets a placebo.
John Ioannidis in 2005 pointed out that the method being used has led to many findings that cannot be replicated.
Peer review—anonymous expert evaluation of research—assesses experimental soundness rather than certifying correctness. Some
journals request that the experimenter provide lists of possible peer reviewers, especially if the field is highly specialized. Specialists review methodology and design; if approved (sometimes requiring additional experiments), the prestige of the journal where the work is published indicates perceived quality. Scientists typically are careful in recording their data, a requirement promoted by
Ludwik Fleck (1896–1961) and others. Though not typically required, they might be requested to
supply this data to other scientists who wish to replicate their original results (or parts of their original results), extending to the sharing of any experimental samples that may be difficult to obtain. To protect against bad science and fraudulent data, government research-granting agencies such as the
National Science Foundation, and science journals, including
Nature and
Science, have a policy that researchers must archive their data and methods so that other researchers can test the data and methods and build on the research that has gone before.
Scientific data archiving can be done at several national archives in the U.S. or the
World Data Center. == Foundational principles ==