MarketANSI escape code
Company Profile

ANSI escape code

ANSI escape sequences are a standard for in-band signaling to control cursor location, color, font styling, and other options on video text terminals and terminal emulators. Certain sequences of bytes, most starting with an ASCII escape character and a bracket character, are embedded into text. The terminal interprets these sequences as commands, rather than text to display verbatim.

History
Almost all manufacturers of video terminals added vendor-specific escape sequences to perform operations such as placing the cursor at arbitrary positions on the screen. One example is the VT52 terminal, which allowed the cursor to be placed at an x,y location on the screen by sending the character, a character, and then two characters representing numerical values equal to the x,y location plus 32 (thus starting at the ASCII space character and avoiding the control characters). The Hazeltine 1500 had a similar feature, invoked using , and then the X and Y positions separated with a comma. While the two terminals had identical functionality in this regard, different control sequences had to be used to invoke them. As these sequences were different for different terminals, elaborate libraries such as termcap ("terminal capabilities") and utilities such as tput had to be created so programs could use the same API to work with any terminal. In addition, many of these terminals required sending numbers (such as row and column) as the binary values of the characters; for some programming languages, and for systems that did not use ASCII internally, it was often difficult to turn a number into the correct character. The ANSI standard attempted to address these problems by making a command set that all terminals would use and requiring all numeric information to be transmitted as ASCII numbers. The first standard in the series was ECMA-48, adopted in 1976. Others included the Qume QVT-108, Televideo TVI-970, Wyse WY-99GT as well as optional "VT100" or "VT103" or "ANSI" modes with varying degrees of compatibility on many other brands. The popularity of these gradually led to more and more software (especially bulletin board systems and other online services) assuming the escape sequences worked, leading to almost all new terminals and emulator programs supporting them. In 1981, ANSI X3.64 was adopted for use in the US government by FIPS publication 86. Later, the US government stopped duplicating industry standards, so FIPS pub. 86 was withdrawn. ECMA-48 has been updated several times and is currently at its 5th edition, from 1991. It is also adopted by ISO and IEC as standard ISO/IEC 6429. A version is adopted as a Japanese Industrial Standard, as JIS X 0211. Related standards include ITU T.61, the Teletex standard, and the ISO/IEC 8613, the Open Document Architecture standard (mainly ISO/IEC 8613-6 or ITU T.416). The two systems share many escape codes with the ANSI system, with extensions that are not necessarily meaningful to computer terminals. Both systems quickly fell into disuse, but ECMA-48 does mark the extensions used in them as reserved. == Platform support ==
Platform support
In the early 1980s, large amounts of software directly used these sequences to update screen displays. This included everything on VMS (which assumed DEC terminals), most software designed to be portable on CP/M home computers, and significant amounts of Unix software, as it was easier to use than the termcap libraries. Terminal emulators for communicating with remote machines almost always implement ANSI escape codes. This includes anything written to communicate with bulletin-board systems on home and personal computers. On Unix terminal emulators such as xterm also can communicate with software running on the same machine, and thus software running in X11 under a terminal emulator could assume the ability to write these sequences. As computers got more powerful even built-in displays started supporting them, allowing software to be portable between CP/M systems. There were attempts to extend the escape sequences to support printers and as an early PDF-like document storage format, the Open Document Architecture. The IBM PC, introduced in 1981, did not support these or any other escape sequences for updating the screen. Only a few control characters (BEL, CR, LF, BS) were interpreted by the underlying BIOS. Any display effects had to be done with BIOS calls, which were notoriously slow, or by directly manipulating the IBM PC hardware. This made any interesting software non-portable and led to the need to duplicate details of the display hardware in PC Clones. DOS version 2.0 included an optional device driver named . Poor performance, and the fact that it was not installed by default, meant software rarely (if ever) took advantage of it. The Windows Console did not support ANSI escape sequences, nor did Microsoft provide any method to enable them. Some replacements such as JP Software's TCC (formerly 4NT), Michael J. Mefford's ANSI.COM, Jason Hood's and Maximus5's ConEmu enabled ANSI escape sequences. Software such as the Python colorama package or Cygwin modified text in-process as it was sent to the console, extracting the ANSI Escape sequences and emulating them with Windows calls. In 2016, Microsoft released the Windows 10 version 1511 update which unexpectedly implemented support for ANSI escape sequences, over three decades after the debut of Windows. This was done alongside Windows Subsystem for Linux, apparently to allow Unix-like terminal-based software to use the Windows Console. Windows PowerShell 5.1 enabled this by default, and PowerShell 6 made it possible to embed the necessary ESC character into a string with . Windows Terminal, introduced in 2019, supports the sequences by default. Since Windows 11 22H2 and Windows Terminal 1.15, Windows Terminal replaces Windows Console as the default. == C0 control codes ==
C0 control codes
Almost all users assume some functions of some single-byte characters. Initially defined as part of ASCII, the default C0 control code set is now defined in ISO 6429 (ECMA-48), making it part of the same standard as the C1 set invoked by the ANSI escape sequences (although ISO 2022 allows the ISO 6429 C0 set to be used without the ISO 6429 C1 set, and vice versa, provided that 0x1B is always ESC). This is used to shorten the amount of data transmitted, or to perform some functions that are unavailable from escape sequences: Escape sequences vary in length. The general format for an ANSI-compliant escape sequence is defined by ANSI X3.41 (equivalent to ECMA-35 or ISO/IEC 2022). The escape sequences consist only of bytes in the range (all the non-control ASCII characters), and can be parsed without looking ahead. The behavior when a control character, a byte with the high bit set, or a byte that is not part of any valid sequence is encountered before the end is undefined. == Fe Escape sequences ==
Fe Escape sequences
If the #ESC| is followed by a byte in the range 0x40 to 0x5F, the escape sequence is of type . Its interpretation is delegated to the applicable C1 control code standard. Accordingly, all escape sequences corresponding to C1 control codes from ANSI X3.64 / ECMA-48 follow this format. The standard says that, in 8-bit environments, the control functions corresponding to type escape sequences (those from the set of C1 control codes) can be represented as single bytes in the 0x80–0x9F range. This is possible in character encodings conforming to the provisions for an 8-bit code made in ISO 2022, such as the ISO 8859 series. However, in character encodings used on modern devices such as UTF-8 or CP-1252, those codes are often used for other purposes, so only the 2-byte sequence is typically used. In the case of UTF-8, representing a C1 control code via the C1 Controls and Latin-1 Supplement block results in a different two-byte code (e.g. for ), but no space is saved this way. == Control Sequence Introducer commands ==
{{anchor|CSIsection}} Control Sequence Introducer commands
For Control Sequence Introducer, or CSI, commands, the (written as , or in several programming languages) is followed by any number (including none) of "parameter bytes" in the range 0x30–0x3F (ASCII 0–9:;?), then by any number of "intermediate bytes" in the range 0x20–0x2F (ASCII space and ), then finally by a single "final byte" in the range 0x40–0x7E (ASCII @A–Z[\]^_`a–z~) are private. The behavior of the terminal is undefined in the case where a CSI sequence contains any character outside of the range 0x20–0x7E. These illegal characters are either C0 control characters (the range 0–0x1F), DEL (0x7F), or bytes with the high bit set. Possible responses are to ignore the byte, to process it immediately, and furthermore whether to continue with the CSI sequence, to abort it immediately, or to ignore the rest of it. == Select Graphic Rendition parameters ==
Select Graphic Rendition parameters
The control sequence CSI m, named Select Graphic Rendition (SGR), sets display attributes. Several attributes can be set in the same sequence, separated by semicolons. Each display attribute remains in effect until a following occurrence of SGR resets it. ESC[38;5;m Select foreground color where n is a number from the table below ESC[48;5;m Select background color 0- 7: standard colors (as in ESC [ 30–37 m) 8- 15: high intensity colors (as in ESC [ 90–97 m) 16-231: 6 × 6 × 6 cube (216 colors): 16 + 36 × r + 6 × g + b (0 ≤ r, g, b ≤ 5) 232-255: grayscale from dark to light in 24 steps The colors displayed by these values vary across terminal/emulator implementations as the recognized ECMA-48 and ITU's T.416 specifications do not define a specific color palette for this lookup table. While it is common to use the above formula for the color palette, in particular the algorithm and choice of colors for the 16-231 cube values differs between implementations. The color palette and algorithm used by XTerm is specified below as a sample. uses ":" as separator characters instead: ESC[38:5:m Select foreground color where n is a number from the table below ESC[48:5:m Select background color 9) else ' %d ' % code)) code += 1 for r in range(len(rgbs)): for g in range(len(rgbs)): for b in range(len(rgbs)): triad = rgbs[r] + rgbs[g] + rgbs[b] outs[1].append(out % (fgs[g To calculate the RGB values of the colors in the table above, the following Python script can be used: • print a list of the 256-color red/green/blue values used by xterm. • • reference: • https://github.com/ThomasDickey/ncurses-snapshots/blob/master/test/xterm-16color.dat • https://github.com/ThomasDickey/xterm-snapshots/blob/master/XTerm-col.ad • https://github.com/ThomasDickey/xterm-snapshots/blob/master/256colres.pl print("colors 0-15 correspond to the ANSI and aixterm naming") for code in range(0, 16): if code > 8: level = 255 elif code == 7: level = 229 else: level = 205 r = 127 if code == 8 else level if (code & 1) != 0 else 92 if code == 12 else 0 g = 127 if code == 8 else level if (code & 2) != 0 else 92 if code == 12 else 0 b = 127 if code == 8 else 238 if code == 4 else level if (code & 4) != 0 else 0 print(f"{code:3d}: {r:02X} {g:02X} {b:02X}") print("colors 16-231 are a 6x6x6 color cube") for red in range(0, 6): for green in range(0, 6): for blue in range(0, 6): code = 16 + (red * 36) + (green * 6) + blue r = red * 40 + 55 if red != 0 else 0 g = green * 40 + 55 if green != 0 else 0 b = blue * 40 + 55 if blue != 0 else 0 print(f"{code:3d}: {r:02X} {g:02X} {b:02X}") print("colors 232-255 are a grayscale ramp, intentionally leaving out black and white") for gray in range(0, 24): level = gray * 10 + 8 code = 232 + gray print(f"{code:3d}: {level:02X} {level:02X} {level:02X}") There has also been a similar but incompatible 88-color encoding using the same escape sequence, seen in and . It uses a 4×4×4 color cube. 24-bit As "true color" graphic cards with 16 to 24 bits of color became common, applications began to support 24-bit colors. Terminal emulators supporting setting 24-bit foreground and background colors with escape sequences include Xterm, and iTerm, as well as all libvte based terminals, including GNOME Terminal. ESC[38;2;;;m Select RGB foreground color ESC[48;2;;;m Select RGB background color This syntax, initially implemented in XTerm, is based on a reading of the ISO/IEC 8613-6 specification, specifically that SGR 38 / SGR 48 followed by the parameter "2" can specify a "direct color" in RGB space.. As the first widely-used implementation, this specification of RGB values using semicolon separators is widely supported by terminal emulators that include 24-bit color support. As with the 8-bit color codes, there is a very similar specification of 24-bit color in the ITU's T.416 Open Document Architecture (ODA) and interchange format: Character content architectures, Unix environment variables relating to color support Rather than using the color support in termcap and terminfo introduced in SVr3.2 (1987), the S-Lang library (version 0.99-32, June 1996) used a separate environment variable to indicate whether a terminal emulator could use colors at all, and later added values to indicate if it supported 24-bit color. This system, although poorly documented, became widespread enough for Fedora and RHEL to consider using it as a simpler and more universal detection mechanism compared to querying the now-updated libraries. Some terminal emulators (urxvt, Konsole) set to report the color scheme of the terminal (mainly light vs. dark background). This behavior originated in S-Lang It has become conventional to use the environment variable to disable colors unconditionally. == Operating System Command sequences ==
Operating System Command sequences
Most Operating System Command sequences were defined by Xterm, but many are also supported by other terminal emulators. For historical reasons, Xterm can end the command with Bell character| (0x07) as well as the standard (0x9C or 0x1B 0x5C). iTerm2, among others. The Linux console uses to change the palette, which, if hard-coded into an application, may hang other terminals. However, appending will be ignored by Linux and form a proper, ignorable sequence for other terminals. == Fs Escape sequences ==
Fs Escape sequences
If the is followed by a byte in the range , the escape sequence is of type . This type is used for control functions individually registered with the ISO-IR registry. A table of these is listed under ISO/IEC 2022. == Fp Escape sequences ==
Fp Escape sequences
If the is followed by a byte in the range , the escape sequence is of type , which is set apart for up to sixteen private-use control functions. == nF Escape sequences ==
nF Escape sequences
If the is followed by a byte in the range , the escape sequence is of type . Said byte is followed by any number of additional bytes in this range, and then a byte in the range . These escape sequences are further subcategorised by the low two bits of the first byte, e.g. "type " for sequences where the first byte is ; and by whether the final byte is in the range indicating private use (e.g. "type ") or not (e.g. "type "). Most of the sequences are for changing the current character set, and are listed in ISO/IEC 2022. Some others: If the first byte is '#' the public sequences are reserved for additional ISO-IR registered individual control functions. No such sequences are presently registered. Type sequences (which includes ones starting with '#') are available for private-use control functions. == Examples ==
Examples
— This clears the screen and, on some devices, locates the cursor to the y,x position 1,1 (upper left corner). — This makes text green. The green may be a dark, dull green, so you may wish to enable Bold with the sequence which would make it bright green, or combined as . Some implementations use the Bold state to make the character Bright. — This reassigns the key F10 to send to the keyboard buffer the string "DIR" and ENTER, which in the DOS command line would display the contents of the current directory. (MS-DOS ANSI.SYS only) This was sometimes used for ANSI bombs. This is a private-use code (as indicated by the letter p), using a non-standard extension to include a string-valued parameter. Following the letter of the standard would consider the sequence to end at the letter D. — This saves the cursor position. Using the sequence will restore it to the position. Say the current cursor position is 7(y) and 10(x). The sequence will save those two numbers. Now you can move to a different cursor position, such as 20(y) and 3(x), using the sequence or . Now if you use the sequence CSI u the cursor position will return to 7(y) and 10(x). Some terminals require the DEC sequences / instead which is more widely supported. In shell scripting ANSI escape codes are often used in UNIX and UNIX-like terminals to provide syntax highlighting. For example, on compatible terminals, the following list command color-codes file and directory names by type. ls --color Users can employ escape codes in their scripts by including them as part of standard output or standard error. For example, the following GNU sed command embellishes the output of the make command by displaying lines containing words starting with "WARN" in reverse video and words starting with "ERR" in bright yellow on a dark red background (letter case is ignored). The representations of the codes are highlighted. make 2>&1 | sed -e 's/.*\bWARN.*/\x1b[7m&\x1b[0m/i' -e 's/.*\bERR.*/\x1b[93;41m&\x1b[0m/i' The following Bash function flashes the terminal (by alternately sending reverse and normal video mode codes) until the user presses a key. It can be used to alert a programmer when a lengthy command terminates, such as with . {{codett|2=bash|1=flasher () { while true; do printf}} '\e[?5h' '\e[?5l'{{codett|2=bash|1=; read -s -n1 -t1 && break; done; } }} The following command will reset the console, similar to the command on modern Linux systems; however it should work even on older Linux systems and on other (non-Linux) UNIX variants. In C This following program creates a table of numbers from 0 to 109, each of which is displayed in the format specified by the Select Graphic Rendition escape sequence using that number as the graphic rendition code. • include int main(void) { int row, col, n; for (row = 0; row 109) break; printf("\033[%dm %3d\033[m", n, n); } printf("\n"); } return 0; } == Terminal input sequences ==
Terminal input sequences
Pressing special keys on the keyboard, as well as outputting many xterm CSI, DCS, or OSC sequences, often produces a CSI, DCS, or OSC sequence, sent from the terminal to the computer as though the user typed it. When typing input on a terminal keypresses outside the normal main alphanumeric keyboard area can be sent to the host as ANSI sequences. For keys that have an equivalent output function, such as the cursor keys, these often mirror the output sequences. However, for most keypresses there isn't an equivalent output sequence to use. There are several encoding schemes, and unfortunately most terminals mix sequences from different schemes, so host software has to be able to deal with input sequences using any scheme. To complicate the matter, the VT terminals themselves have two schemes of input, normal mode and application mode that can be switched by the application. (draft section) -> char -> esc -> Alt-esc -> Alt-keypress or keycode sequence '[' -> Alt-[ '[' () -> keycode sequence, is a decimal number and defaults to 1 (xterm) '[' () (';') '~' -> keycode sequence, and are decimal numbers and default to 1 (vt) If the terminating character is '~', the first number must be present and is a keycode number, the second number is an optional modifier value. If the terminating character is a letter, the letter is the keycode value, and the optional number is the modifier value. The modifier value defaults to 1, and after subtracting 1 is a bitmap of modifier keys being pressed: . So, for example, <esc>[4;2~ is , <esc>[20~ is function key , <esc>[5C is . In other words, the modifier is the sum of the following numbers: vt sequences: [1~ - Home [16~ - [31~ - F17 [2~ - Insert [17~ - F6 [32~ - F18 [3~ - Delete [18~ - F7 [33~ - F19 [4~ - End [19~ - F8 [34~ - F20 [5~ - PgUp [20~ - F9 [35~ - [6~ - PgDn [21~ - F10 [7~ - Home [22~ - [8~ - End [23~ - F11 [9~ - [24~ - F12 [10~ - F0 [25~ - F13 [11~ - F1 [26~ - F14 [12~ - F2 [27~ - [13~ - F3 [28~ - F15 [14~ - F4 [29~ - F16 [15~ - F5 [30~ - xterm sequences: [A - Up [K - [U - [B - Down [L - [V - [C - Right [M - [W - [D - Left [N - [X - [E - [O - [Y - [F - End OP - F1 [Z - [G - Keypad 5 OQ - F2 [H - Home OR - F3 [I - OS - F4 [J - [T - <esc>[A to <esc>[D are the same as the ANSI output sequences. The <modifier> is normally omitted if no modifier keys are pressed, but most implementations always emit the <modifier> for . (draft section) Xterm has a comprehensive documentation page on the various function-key and mouse input sequence schemes from DEC's VT terminals and various other terminals it emulates. Thomas Dickey has added a lot of support to it over time; he also maintains a list of default keys used by other terminal emulators for comparison. • On the Linux console, certain function keys generate sequences of the form CSI [ char. The CSI sequence should terminate on the [. • Old versions of Terminator generate SS3 1; modifiers char when are pressed with modifiers. The faulty behavior was copied from GNOME Terminal. • xterm replies CSI row ; column R if asked for cursor position and CSI 1 ; modifiers R if the key is pressed with modifiers, which collide in the case of row == 1. This can be avoided by using the ? private modifier as CSI ? 6 n, which will be reflected in the response as CSI ? row ; column R. • many terminals prepend ESC to any character that is typed with the alt key down. This creates ambiguity for uppercase letters and symbols @[\]^_, which would form C1 codes. • Konsole generates SS3 modifiers char when are pressed with modifiers. • Some terminals, including iTerm2 and kitty, support reporting additional keys via an enhanced CSI u mode. == See also ==
tickerdossier.comtickerdossier.substack.com