The term was coined as early as 1972 by the Austrian
linguist Edgar W. Schneider, in a discussion of how to build modular instructional systems for courses. In the late 1980s, the
University of Groningen and
University of Twente jointly began a project called Knowledge Graphs, focusing on the design of
semantic networks with edges restricted to a limited set of relations, to facilitate
algebras on the graph. In subsequent decades, the distinction between semantic networks and knowledge graphs was blurred. Some early knowledge graphs were topic-specific. In 1985,
Wordnet was founded, capturing semantic relationships between words and meaningsan application of this idea to language itself. In 2005, Marc Wirk founded
Geonames to capture relationships between different geographic names and locales and associated entities. In 1998, Andrew Edmonds of Science in Finance Ltd in the UK created a system called ThinkBase that offered
fuzzy-logic based reasoning in a graphical context. In 2007, both
DBpedia and
Freebase were founded as graph-based knowledge
repositories for general-purpose knowledge. DBpedia focused exclusively on data extracted from
Wikipedia, while Freebase also included a range of public datasets. Neither described themselves as a 'knowledge graph' but developed and described related concepts. In 2012, Google introduced their
Knowledge Graph, building on DBpedia and Freebase among other sources. They later incorporated
RDFa,
Microdata,
JSON-LD content extracted from indexed web pages, including the
CIA World Factbook,
Wikidata, and Wikipedia. Entity and relationship types associated with this knowledge graph have been further organized using terms from the
schema.org vocabulary. The Google Knowledge Graph became a complement to string-based search within Google, and its popularity online brought the term into more common use. In 2019,
IEEE combined its annual international conferences on "Big Knowledge" and "Data Mining and Intelligent Computing" into the International Conference on Knowledge Graph. The development of large language models expanded interest in knowledge graphs as a way to structure information from unstructured text, with advances in language processing enabling their automatic or semi-automatic generation and expansion. The term knowledge graph has since broadened to include the dynamically constructed and adaptive graph structures, which support retrieval, reasoning, and summarization in generative systems. Microsoft Research's GraphRAG (2024) exemplified this development by integrating LLM-generated graphs into retrieval-augmented generation. == Definitions ==