NooJ’s dictionaries are represented by
finite-state transducers and can represent simple words (e.g. table), compound words (e.g. as a matter of fact) as well as discontinuous expressions such as phrasal verbs (e.g. to turn … off), idiomatic expressions (e.g. to take the bull by the horns) as well as support verb/predicative noun associations (e.g. to take a nap). NooJ allows linguists to create, edit, debug and maintain a large number of grammars that belong to the four classes of generative grammars in the Chomsky-Schützenberger hierarchy: finite-state grammars, context-free grammars, context-sensitive grammars and unrestricted grammars. NooJ can often apply grammars to texts in linear time: for instance, most NooJ Context-Free Grammars can often be derecursived. NooJ Context-Sensitive Grammars are made of two parts: one part is a Context-Free (or even a Finite-State Grammar) that is applied to texts very efficiently, the second consists in a set of constraints applied to matching sequences, each one performed in constant time. NooJ unrestricted grammars are context-sensitive grammars that can contain variables and can modify the text input. They are typically used to perform transformational analysis & generation (see
Zellig Harris), but several teams of linguists have shown that, when used in conjunction with multilingual lexicons, they can be used to perform
Machine Translation == References ==