Lexical choice modules must be informed by linguistic knowledge of how the system's input data maps onto words. This is a question of
semantics, but it is also influenced by
syntactic factors (such as
collocation effects) and
pragmatic factors (such as context). Hence NLG systems need linguistic models of how meaning is mapped to words in the target domain (
genre) of the NLG system. Genre tends to be very important; for example the verb
veer has a very specific meaning in weather forecasts (wind direction is changing in a clockwise direction) which it does not have in general English, and a weather-forecast generator must be aware of this genre-specific meaning. In some cases there are major differences in how different people use the same word; for example, some people use
by evening to mean 6PM and others use it to mean midnight. Psycholinguists have shown that when people speak to each other, they agree on a common interpretation via lexical alignment; this is not something which NLG systems can yet do. Ultimately, lexical choice must deal with the fundamental issue of how language relates to the non-linguistic world. For example, a system which chose colour terms such as
red to describe objects in a digital image would need to know which RGB pixel values could generally be described as
red; how this was influenced by visual (lighting, other objects in the scene) and linguistic (other objects being discussed) context; what pragmatic connotations were associated with
red (for example, when an apple is called
red, it is assumed to be ripe as well as have the colour red); and so forth. ==Algorithms and models==