Early in his career Tukey worked on developing
statistical methods for computers at
Bell Labs, where he coined the word
bit in 1947.
Data analysis and foundations of data science John Tukey contributed greatly to statistical practice and data analysis in general. In fact, some regard John Tukey as the father of data science. At the very least, he pioneered many of the key foundations of what came later to be known as
data science. Making sense of data has a long history and has been addressed by statisticians, mathematicians, scientists, and others for many many years. During the 1960s, Tukey challenged the dominance at the time of what he called "confirmatory data analysis", statistical analyses driven by rigid mathematical configurations. Tukey emphasized the importance of having a more flexible attitude towards data analysis and of exploring data carefully to see what structures and information might be contained therein. He called this "exploratory data analysis" (EDA). In many ways, EDA was a precursor to data science. Tukey also realized the importance of computer science to EDA. Graphics are an integral part of EDA methodology and, while much of Tukey's work focused on static displays (such as box plots) that could be drawn by hand, he realized that computer graphics would be much more effective for studying multivariate data. PRIM-9, the first program for viewing multivariate data, was conceived by him during the early 1970s. This coupling of data analysis and computer science is what is now called data science. Tukey articulated the important distinction between
exploratory data analysis and
confirmatory data analysis, believing that much statistical methodology placed too great an emphasis on the latter. Though he believed in the utility of separating the two types of analysis, he pointed out that sometimes, especially in
natural science, this was problematic and termed such situations
uncomfortable science. A. D. Gordon of the
University of St. Andrews offered the following summary of Tukey's principles for statistical practice: Tukey's lectures were described to be unusual. McCullagh described his lecture given in London in 1977: ==Contributions to other fields==