The impact to date of the HMP may be partially assessed by examination of research sponsored by the HMP. Over 650 peer-reviewed publications were listed on the HMP website from June 2009 to the end of 2017, and had been cited over 70,000 times. At this point the website was archived and is no longer updated, although datasets do continue to be available. Major categories of work funded by HMP included: • Development of new database systems allowing efficient organization, storage, access, search and annotation of massive amounts of data. These include IMG, the
Integrated Microbial Genomes database and comparative analysis system; IMG/M, a related system that integrates metagenome data sets with isolate microbial genomes from the IMG system;
CharProtDB, a database of experimentally characterized protein annotations; and the
Genomes OnLine Database (GOLD), for monitoring the status of genomic and metagenomic projects worldwide and their associated metadata. • Development of tools for comparative analysis that facilitate the recognition of common patterns, major themes and trends in complex data sets. These include RAPSearch2, a fast and memory-efficient protein similarity search tool for next-generation sequencing data; Boulder ALignment Editor (ALE), a web-based RNA alignment tool; WebMGA, a customizable web server for fast metagenomic sequence analysis; and DNACLUST, a tool for accurate and efficient clustering of phylogenetic marker genes • Development of new methods and systems for assembly of massive sequence data sets. No single assembly algorithm addresses all the known problems of assembling short-length sequences, so next-generation assembly programs such as AMOS are modular, offering a wide range of tools for assembly. Novel algorithms have been developed for improving the quality and utility of draft genome sequences. • Assembly of a catalog of sequenced reference genomes of pure bacterial strains from multiple body sites, against which metagenomic results can be compared. The original goal of 600 genomes has been far surpassed; the current goal is for 3000 genomes to be in this reference catalog, sequenced to at least a high-quality draft stage. , 742 genomes have been cataloged. • Establishment of the Data Analysis and Coordination Center (DACC), which serves as the central repository for all HMP data. • Various studies exploring legal and ethical issues associated with whole genome sequencing research. Developments funded by HMP included: • New predictive methods for identifying active transcription factor binding sites. • Identification, on the basis of bioinformatic evidence, of a widely distributed, ribosomally produced electron carrier precursor • Time-lapse "moving pictures" of the human microbiome. • Identification of unique adaptations adopted by segmented filamentous bacteria (SFB) in their role as gut commensals. SFB are medically important because they stimulate
T helper 17 cells, thought to play a key role in
autoimmune disease. • Identification of factors distinguishing the microbiota of healthy and diseased gut. • Identification of a hitherto unrecognized dominant role of
Verrucomicrobiota in soil bacterial communities. • Identification of factors determining the virulence potential of
Gardnerella vaginalis strains in
vaginosis. • Identification of a link between oral microbiota and atherosclerosis. • Demonstration that pathogenic species of
Neisseria involved in
meningitis,
sepsis, and
sexually transmitted disease exchange virulence factors with commensal species. ==Milestones==