Every InChI starts with the string InChI= followed by the version number, currently 1. For a standard InChI, this is followed by the letter S. Standard InChI is a fully standardized InChI flavor that maintains the same level of attention to structure details and the same conventions for drawing perception. The remaining information is structured as a sequence of layers and sublayers, with each layer providing one specific type of information. The layers and sublayers are separated by the delimiter / and start with a characteristic prefix letter (except for the chemical formula sublayer of the main layer). The six layers with important sublayers are:

• Main layer (always present)
  • Chemical formula (no prefix). This is the only sublayer that must occur in every InChI. Atom numbers used throughout the InChI follow the element order of the formula, excluding hydrogen atoms. For example, /C10H16N5O13P3 implies that atoms numbered 1–10 are carbons, 11–15 are nitrogens, 16–28 are oxygens, and 29–31 are phosphorus.
  • Atom connections (/c). The atoms in the chemical formula (except for hydrogens) are numbered in sequence; this sublayer describes which atoms are connected by bonds to which other ones. The configuration of double bonds is specified later in the stereochemical layer (/b).
  • Hydrogen atoms (/h). Describes how many hydrogen atoms are connected to each of the other atoms.
• Charge layer
  • Charge sublayer (/q)
  • Proton sublayer (/p)
• Stereochemical layer
  • Double bonds and cumulenes (/b).
  • Tetrahedral stereochemistry of atoms and allenes. First, /t describes the relative configuration, which implies a preference for one of the mirror forms. Then /m is used to choose whether to mirror the molecule described by /t, if an absolute configuration is requested.
  • Type of stereochemistry information (/s): /s1 for absolute, /s2 for relative (unspecified mix of chiralities), /s3 for racemic (equal mix of both chiralities).
• Isotopic layer (/i), which may include sublayers:
  • Sublayer /h for isotopic hydrogen
  • Sublayers /b, /t, /m, /s for isotopic stereochemistry
• Fixed-H layer (/f) for tautomeric hydrogens; contains some or all of the above types of layers except atom connections; may end with an /o sublayer.
• Reconnected layer (/r); contains the whole InChI of a structure with reconnected metal atoms.

The delimiter-prefix format has the advantage that a user can easily use a
wildcard search to find identifiers that match only in certain layers.

Standard InChI adds the following constraints:

• The /f, /o, and /r (sub)layers are never included in standard InChI.
• If stereochemistry is specified, it can only be absolute (/s1). Unknown stereo designations are treated as undefined.
• Organometallic connectivity does not include bonds to the metal.

== InChIKey ==
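As an illustration of the layered format described in the preceding section, the following Python sketch splits an InChI string on the / delimiter and shows how a wildcard pattern can match identifiers on selected layers only. The function name split_inchi and the returned dictionary layout are hypothetical choices made for this example and are not part of any InChI software; the sketch does no chemical validation and only uses the Python standard library.

```python
from fnmatch import fnmatchcase

# Example identifiers (ethanol and dimethyl ether share the same formula layer).
ETHANOL = "InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3"
DIMETHYL_ETHER = "InChI=1S/C2H6O/c1-3-2/h1-2H3"


def split_inchi(inchi):
    """Split an InChI string into version, standard flag, formula, and layers.

    Hypothetical helper for illustration only; it performs no validation
    beyond checking the leading "InChI=" string.
    """
    prefix = "InChI="
    if not inchi.startswith(prefix):
        raise ValueError("not an InChI string")

    head, *segments = inchi[len(prefix):].split("/")

    # The version number is followed by "S" for standard InChI.
    is_standard = head.endswith("S")
    version = head[:-1] if is_standard else head

    # The chemical formula has no prefix letter; every other (sub)layer
    # starts with a lowercase prefix letter such as c, h, q, p, b, t, m, s.
    formula = ""
    if segments and not segments[0][:1].islower():
        formula = segments.pop(0)

    # Prefixes can repeat (for example /h inside the isotopic layer),
    # so the layers are kept as an ordered list rather than a dictionary.
    layers = [(seg[0], seg[1:]) for seg in segments if seg]

    return {
        "version": version,
        "standard": is_standard,
        "formula": formula,
        "layers": layers,
    }


def matches(inchi, pattern):
    """Case-sensitive shell-style wildcard match against a whole InChI."""
    return fnmatchcase(inchi, pattern)


parsed = split_inchi(ETHANOL)
same_formula = [s for s in (ETHANOL, DIMETHYL_ETHER)
                if matches(s, "InChI=1S/C2H6O/*")]
same_connectivity = [s for s in (ETHANOL, DIMETHYL_ETHER)
                     if matches(s, "InChI=1S/C2H6O/c1-2-3/*")]
```

Here same_formula contains both identifiers, because the pattern fixes only the formula, while same_connectivity contains only the ethanol identifier, because the pattern also fixes the /c sublayer. Keeping the layers as a list of (prefix, content) pairs preserves their order, which matters because the same prefix letter can appear in more than one layer.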