MarketHTML element
Company Profile

HTML element

An HTML element is a type of HTML document component, one of several types of HTML nodes. The first used version of HTML was written by Tim Berners-Lee in 1993 and there have since been many versions of HTML. The current de facto standard is governed by the industry group WHATWG and is known as the HTML Living Standard.

Concepts
Elements vs. tags As is generally understood, the position of an element is indicated as spanning from a start tag and is terminated by an end tag. This is the case for many, but not all, elements within an HTML document. The distinction is explicitly emphasised in HTML 4.01 Specification: Similarly the W3C Recommendation HTML 5.1 2nd Edition explicitly says: and: its parsing also depends on the Document Type Definition (DTD), specifically an HTML DTD (e.g. HTML 4.01). The DTD specifies which element types are possible (i.e. it defines the set of element types) and also the valid combinations in which they may appear in a document. It is part of general SGML behavior that, where only one valid structure is (per the DTD), its explicit statement in any given document is not generally required. As a simple example, the tag indicating the start of a paragraph element should be complemented by a tag indicating its end. But since the DTD states that paragraph elements cannot be nested, an HTML document fragment is thus inferred to be equivalent to . (If one paragraph element cannot contain another, any currently open paragraph must be closed before starting another.) Because this implication is based on the combination of the DTD and the individual document, it is not usually possible to infer elements from document tags but only by using an SGML—or HTML—aware parser with knowledge of the DTD. HTML5 creates a similar result by defining what tags can be omitted. SGML vs. XML SGML is complex, which has limited its widespread understanding and adoption. XML was developed as a simpler alternative. Although both can use the DTD to specify the supported elements and their permitted combinations as document structure, XML parsing is simpler. The relation from tags to elements is always that of parsing the actual tags included in the document, without the implied closures that are part of SGML. HTML as used on the current web is likely to be either treated as XML, by being XHTML, or as HTML5; in either case the parsing of document tags into Document Object Model (DOM) elements is simplified compared to legacy HTML systems. Once the DOM of elements is obtained, behavior at higher levels of interface (example: screen rendering) is identical or nearly so. %block; vs. box Part of this CSS presentation behavior is the notion of the "box model". This is applied to those elements that CSS considers to be "block" elements, set through the CSS declaration. HTML also has a similar concept, although different, and the two are very frequently confused. %block; and %inline; are groups within the HTML DTD that group elements as being either "block-level" or "inline". This is used to define their nesting behavior: block-level elements cannot be placed into an inline context. This behavior cannot be changed; it is fixed in the DTD. Block and inline elements have the appropriate and different CSS behaviors attached to them by default, ==Overview==
Overview
Syntax {{Image frame|caption=Parts of an HTML container element \overbrace{ \overbrace{\mathtt{\color{BrickRed}}}^\mathsf{Start\ tag} \overbrace\mathtt{\color{Green}This\ is\ a\ paragraph.}^\mathsf{Content} \overbrace\mathtt{\color{BrickRed}}^\mathsf{End \atop tag} }^\mathsf{Element} }} In the HTML syntax, most elements are written with a start tag and an end tag, with the content in between. An HTML tag is composed of the name of the element, surrounded by angle brackets. An end tag also has a slash after the opening angle bracket. For example, a paragraph, which is represented by the element, would be written as: In the HTML syntax, most elements are written ... However, not all of these elements the end tag, or even the start tag, to be present. The canonical form was upper-case until HTML 4, and was used in HTML specifications, but in recent years, lower-case has become more common. Types of element There are three kinds of HTML elements: normal elements, raw text elements, and void elements. '''''' usually have both a start tag and an end tag, although for some elements the end tag, or both tags, can be omitted. It is constructed in a similar way: • a start tag () marking the beginning of an element, which may incorporate any number of HTML attributes; • some amount of content, including text and other elements; • an end tag, in which the element name is prefixed with a slash: . '''''' (also known as text or text-only elements) are constructed with: • a start tag (in the form ) marking the beginning of an element, which may incorporate any number of HTML attributes; • some amount of text content, but no elements (all tags, apart from the applicable end tag, will be interpreted as content); • an end tag, in which the element name is prefixed with a slash: . In some versions of HTML, the end tag is optional for some elements. The end tag is required in XHTML. An example is the element, which must not contain other elements (including markup of text), only text. '''''' (also sometimes called empty elements, single elements or stand-alone elements) only have a start tag (in the form ), which contains any HTML attributes. They may not contain any children, such as text or other elements. For compatibility with XHTML, the HTML specification allows an optional space and slash ( is permissible). The slash is required in XHTML and other XML applications. Two common void elements are (for a hard line-break, such as in a poem or an address) and (for a thematic break). Other such elements are often place-holders which reference external files, such as the image () element. The attributes included in the element will then point to the external file in question. Another example of a void element is , for which the syntax is: This element points the browser at a style sheet to use when presenting the HTML document to the user. In the HTML syntax attributes do not have to be quoted if they are composed only of certain characters: letters, digits, the hyphen-minus and the period. When using the XML syntax (XHTML), on the other hand, all attributes must be quoted, and a spaced trailing slash is required before the last angle bracket: Attributes HTML attributes define desired behavior or indicate additional element properties. Most attributes require a value. In HTML, the value can be left unquoted if it does not include spaces (=), or it can be quoted with single or double quotes (='' or =""). In XML, those quotes are required. Boolean attributes, on the other hand, do not require a value to be specified. An example is the checked for checkboxes: In the XML (and thus XHTML) syntax, though, a value is required, and the name should be repeated as the value: Element standards HTML elements are defined in a series of freely available open standards issued since 1995, initially by the IETF and subsequently by the W3C. During the browser wars of the 1990s, developers of user agents (e.g. web browsers) often developed their own elements, some of which have been adopted in later standards. Other user agents may not recognize non-standard elements, and they will be ignored, possibly causing the page to be displayed improperly. In 1998, XML (a simplified form of SGML) introduced mechanisms to allow anyone to develop their own elements and incorporate them in XHTML documents, for use with XML-aware user agents. Subsequently, HTML 4.01 was rewritten in an XML-compatible form, XHTML 1.0 (eXtensible HTML). The elements in each are identical, and in most cases valid XHTML 1.0 documents will be valid or nearly valid HTML 4.01 documents. This article mainly focuses on real HTML, unless noted otherwise; however, it remains applicable to XHTML. See HTML for a discussion of the minor differences between the two. Element status Since the first version of HTML, several elements have become outmoded, and are deprecated in later standards, or do not appear at all, in which case they are invalid (and will be found invalid, and perhaps not displayed, by validating user agents). In HTML 4.01 / XHTML 1.0, the status of elements is complicated by the existence of three types of DTD: • Transitional, which contain deprecated elements, but which were intended to provide a transitional period during which authors could update their practices; • Frameset, which are versions of the Transitional DTDs which also allow authors to write frameset documents; • Strict, which is the up-to-date (as at 1999) form of HTML. HTML5 instead provides a listing of obsolete features to go along with the standardized normative content. They are broken down into "obsolete but conforming" for which implementation instructions exist and "non-conforming" ones that should be replaced. The first Standard (HTML 2.0) contained four deprecated elements, one of which was invalid in HTML 3.2. All four are invalid in HTML 4.01 Transitional, which also deprecated a further ten elements. All of these, plus two others, are invalid in HTML 4.01 Strict. While the frame elements are still current in the sense of being present in the Transitional and Frameset DTDs, there are no plans to preserve them in future standards, as their function has been largely replaced, and they are highly problematic for user accessibility. (Strictly speaking, the most recent XHTML standard, XHTML 1.1 (2001), does not include frames at all; it is approximately equivalent to XHTML 1.0 Strict, but also includes the Ruby markup module.) A common source of confusion is the loose use of deprecated to refer to both deprecated and invalid status, and to elements that are expected to be formally deprecated in the future. Content vs. presentation and behavior Since HTML 4, HTML has increasingly focused on the separation of content (the visible text and images) from presentation (like color, font size, and layout). This is often referred to as a separation of concerns. HTML is used to represent the structure or content of a document, its presentation remains the sole responsibility of CSS style sheets. A default style sheet is suggested as part of the CSS standard, giving a default rendering for HTML. Behavior (interactivity) is also kept separate from content, and is handled by scripts. Images are contained in separate graphics files, separate from text, though they can also be considered part of the content of a page. Separation of concerns allows the document to be presented by different user agents according to their purposes and abilities. For example, a user agent can select an appropriate style sheet to present a document by displaying on a monitor, printing on paper, or to determine speech characteristics in an audio-only user agent. The structural and semantic functions of the markup remain identical in each case. Historically, user agents did not always support these features. In the 1990s, as a stop-gap, presentational elements (like and ) were added to HTML, at the cost of creating problems for interoperability and user accessibility. This is now regarded as outmoded and has been superseded by style sheet-based design; most presentational elements are now deprecated. External image files are incorporated with the or elements. (With XHTML, the SVG language can also be used to write graphics within the document, though linking to external SVG files is generally simpler.) Where an image is not purely decorative, HTML allows replacement content with similar semantic value to be provided for non-visual user agents. An HTML document can also be extended through the use of scripts to provide additional behaviors beyond the abilities of HTML hyperlinks and forms. The elements and , with related HTML attributes, provide style sheets and scripts. • In the document head, and may link to shared external documents, or and may contain embedded instructions. (The element can also be used to link style sheets.) • or can occur at any point in the document (head or body). • The style attribute is valid in most document body elements (e.g. ) for inclusion of inline style instructions. • Event-handling attributes, which provide links to scripts, are optional in most elements. • For user agents which do not operate scripts, the element provides embedded alternative content where appropriate; however, it can only be used in the document head and in the body as a block-level element. ==Document structure elements==
Document head elements
3.0. ==Document body elements==
Document body elements
In visual browsers, displayable elements can be rendered as either block or inline. While all elements are part of the document sequence, block elements appear within their parent elements: • as rectangular objects which do not break across lines; • with block margins, width, and height properties which can be set independently of the surrounding elements. Conversely, inline elements are treated as part of the flow of document text; they cannot have margins, width, or height set, and do break across lines. Block elements Block elements, or block-level elements, have a rectangular structure. By default, these elements will span the entire width of its parent element, and will thus not allow any other element to occupy the same horizontal space as it is placed on. The rectangular structure of a block element is often referred to as the CSS box model, and is made up of several parts. Each element contains the following: • The content of an element is the actual text (or other media) placed between the opening and closing tags of an element. • The padding of an element is the space around the content but which still forms part of the element. Padding should not be used to create white space between two elements. Any background style assigned to the element, such as a background image or color, will be visible within the padding. Increasing the size of an element's padding increases the amount of space this element will take up. • The border of an element is the absolute end of an element and spans the perimeter of that element. The thickness of a border increases the size of an element. • The margin of an element is the white space that surrounds an element. The content, padding, and border of any other element will not be allowed to enter this area unless forced to do so by some advanced CSS placement. Using most standard DTDs, margins on the left and right of different elements will push each other away. Margins on the top or bottom of an element, on the other hand, will not stack or will intermingle. This means that the white space between these elements will be as big as the larger margin between them. The above section refers only to the detailed implementation of CSS rendering and has no relevance to HTML elements themselves. Basic text Lists {{defn|1= An ordered (enumerated) list. The type attribute can be used to specify the kind of marker to use in the list, but style sheets give more control. The default is Arabic numbering. In an HTML attribute: ; or in a CSS declaration: {{nobr|{{code|lang=css|code=ol { list-style-type: foo; } }}}} – replacing foo with one of the following: • A, B, C ... – HTML value: A; CSS value: upper-alpha • a, b, c ... – HTML value: a; CSS value: lower-alpha • I, II, III ... – HTML value: I; CSS value: upper-roman • i, ii, iii ... – HTML value: i; CSS value: lower-roman • 1, 2, 3 ... – HTML value: 1; decimal CSS provides several other options not available as pure-HTML markup, including none, and options for CJK, Hebrew, Georgian, and Armenian script. The attribute is deprecated in HTML 3.2 and 4.01, but not in HTML 5. }} {{defn|1= An unordered (bulleted) list. The type of list item marker can be specified in an HTML attribute: ; or in a CSS declaration: {{nobr|{{code|lang=css|code=ul { list-style-type: foo; } }}}} – replacing foo with one of the following (the same values are used in HTML and CSS): disc (the default), square, or circle. Only the CSS method is supported in HTML5; the attribute is deprecated in HTML 3.2 and 4.01. CSS also provides none, and the ability to replace these bullets with custom images.}} Other block elements {{defn|1= Pre-formatted text. Text within this element is typically displayed in a non-proportional font exactly as it is laid out in the file (see ASCII art). Whereas browsers ignore white-space for other HTML elements, in , white-space should be rendered as authored. (With the CSS properties: {{code|lang=css|code={ white-space: pre; font-family: monospace; }}}, other elements can be presented in the same way.) This element can contain any inline element except: #image|, #object|, #big|, #small|, #sup|, and #sub|.}} Inline elements Inline elements cannot be placed directly inside the element; they must be wholly nested within block-level elements. Anchor Phrase elements Phrase elements are used for marking up phrases and adding structure or semantic meaning to text fragments. For example, the and tags can be used for adding emphasis to text. General Computer phrase elements These elements are useful primarily for documenting computer code development and user interaction through differentiation of source code (), variables (), user input (), and terminal or other output (). Presentation As visual presentational markup only applies directly to visual browsers, its use is discouraged. Style sheets should be used instead. Several of these elements are deprecated or invalid in HTML 4 / XHTML 1.0, and the remainder are invalid in the current draft of XHTML 2.0. The current draft of HTML5, however, re-includes , , and , assigning new semantic meaning to each. In an HTML5 document, the use of these elements is no longer discouraged, provided that it is semantically correct. {{defn|1= In HTML 4, set font to boldface where possible. Equivalent CSS: {{code|lang=css|code={ font-weight: bold; } }}. The element usually has the same effect in visual browsers, as well as having more semantic meaning, under HTML 4.01. In HTML5, however, has its own meaning, distinct from that of . It denotes "text to which attention is being drawn for utilitarian purposes without conveying any extra importance and with no implication of an alternate voice or mood." }} {{defn|1= In HTML 4, set font to italic where possible. Equivalent CSS: {{code|lang=css|code={ font-style: italic; } }}. Using has the same visual effect in most browsers, as well as having a semantic meaning as emphasis, under HTML 4.01. (Purely typographic italics have many non-emphasis purposes, as HTML 5 more explicitly recognized.) In HTML5, however, has its own semantic meaning, distinct from that of . It denotes "a different quality of text" or "an alternate voice or mood" e.g., a thought, a ship name, a binary species name, a foreign-language phrase, etc. }} {{defn|1= In HTML 4, underlined text. Equivalent CSS: {{code|lang=css|code={ text-decoration: underline; } }}. Deprecated in HTML 4.01. Restored in HTML5. In HTML5, the element denotes "a span of text with an unarticulated, though explicitly rendered, non-textual annotation, such as labelling the text as being a proper name in Chinese text (a Chinese proper name mark), or labelling the text as being misspelt." The HTML5 specification reminds developers that other elements are almost always more appropriate than and admonishes designers not to use underlined text where it could be confused for a hyper-link. }} {{defn|1= In HTML 4, decreased font size (smaller text). Equivalent CSS: {{code|lang=css|code={ font-size: smaller; } }} In HTML5, the element denotes "side comments such as small print." This has caused some confusion with the element. }} {{defn|1= Increased font size (bigger text). Equivalent CSS: {{code|lang=css|code={ font-size: larger; } }}}} {{defn|1= Strike-through text (Strikethrough), (Equivalent CSS: {{code|lang=css|code={ text-decoration: line-through; } }})}} {{defn|1= Fixed-width font (), also known as teletype, thus "tt". (Equivalent CSS: {{code|lang=css|code={ font-family: monospace; } }})}} {{defn|1= Can specify the font color with the color attribute (note the American spelling), typeface with the face attribute, and absolute or relative size with the size attribute. Examples (all uses are deprecated, use CSS equivalents if possible): • creates . • creates text with hexadecimal color #114499. • creates text with size 4. Sizes are from 1 to 7. The standard size is 3, unless otherwise specified in the <body> or other tags. • creates text with size 1 bigger than the standard. is opposite. • makes text with Courier font. Equivalent CSS for font attributes: • <font size="N"> corresponds to {font-size: Yunits} (the HTML specification does not define the relationship between size N and unit-size Y, nor does it define a unit). • corresponds to {{code|lang=css|code={ color: red; } }} • corresponds to {{code|lang=css|code={ font-family: 'Times New Roman', Times, serif; } }} – CSS supports a font stack, of two or more alternative fonts. }} Span Other inline elements {{defn|1= Mark subscripted or superscripted text. (Equivalent CSS: {{code|lang=css|code={ vertical-align: sub; } }} and {{code|lang=css|code={ vertical-align: super; } }}, respectively.)}} Images and objects Forms These elements can be combined into a form or in some instances used separately as user-interface controls; in the document, they can be simple HTML or used in conjunction with Scripts. HTML markup specifies the elements that make up a form, and the method by which it will be submitted. However, some form of scripts (server-side, client-side, or both) must be used to process the user's input once it is submitted. (These elements are either block or inline elements, but are collected here as their use is more restricted than other inline or block elements.) Tables The format of HTML Tables was proposed in the HTML 3.0 Drafts and the later RFC 1942 HTML Tables. They were inspired by the CALS Table Model. Some elements in these proposals were included in HTML 3.2; the present form of HTML Tables was standardized in HTML 4. (Many of the elements used within tables are neither block nor inline elements.) ==Frames==
Frames
Frames allow a visual HTML browser window to be split into segments, each of which can show a different document. This can lower bandwidth use, as repeating parts of a layout can be used in one frame, while variable content is displayed in another. This may come at a certain usability cost, especially in non-visual user agents, due to separate and independent documents (or websites) being displayed adjacent to each other and being allowed to interact with the same parent window. Because of this cost, frames (excluding the element) are only allowed in HTML 4.01 Frame-set. Iframes can also hold documents on different servers. In this case the interaction between windows is blocked by the browser. Sites like Facebook and Twitter use iframes to display content (plugins) on third party websites. Google AdSense uses iframes to display banners on third party websites. In HTML 4.01, a document may contain a and a a and a , but not both a and a . However, can be used in a normal document body. longdesc attribute In HTML, longdesc is an attribute used within the , , or elements. It is supposed to be a URL to a document that provides a long description for the image, frame, or iframe in question. This attribute should contain a URL, – as is commonly mistaken – the text of the description itself. longdesc was designed to be used by screen readers to display image information for computer users with accessibility issues, such as the blind or visually impaired, and is widely implemented by both web browsers and screen readers. Some developers object that it is actually seldom used for this purpose because there are relatively few authors who use the attribute and most of those authors use it incorrectly; thus, they recommend deprecating longdesc. The publishing industry has responded, advocating the retention of longdesc. Example Content of description.html: This is an image of a two-layered birthday cake. ... Linking to the long description in the text Since very few graphical browsers support making the link available natively (Opera and iCab being the exceptions), it is useful to include a link to the description page near the element whenever possible, as this can also aid sighted users. Example [D] ==Historic elements==
Historic elements
The following elements were part of the early HTML developed by Tim Berners-Lee from 1989 to 1991; they are mentioned in HTML Tags, but deprecated in HTML 2.0 and were never part of HTML standards. ==Non-standard elements==
Non-standard elements
This section lists some widely used obsolete elements, which means they are not used in valid code. They may not be supported in all user agents. {{defn|1= Causes text to blink. Introduced in imitation of the ANSI escape codes. Can be done with CSS where supported: {{code|lang=css|code={text-decoration: blink} }} (This effect may have negative consequences for people with photosensitive epilepsy; its use on the public Internet should follow the appropriate guidelines.)}} {{defn|1= Causes text to not break at end of line, preventing word wrap where text exceeds the width of the enclosing object. Adjacent text may break before and after it. Can be done with CSS: {{code|lang=css|code={white-space: nowrap;} }}}} ==Comments==
Comments
{{defn|1= A comment in HTML (and related XML, SGML and SHTML) uses the same syntax as the SGML comment or XML comment, depending on the doctype. Unlike most HTML tags, comments do not nest. More generally, there are some strings that are not allowed to appear in the comment text. Those are (the beginning of a comment),(this ends the comment so it trivially follows it can not appear inside it) and . Additionally, the strings and cannot appear at the beginning of a comment and cannot appear at the end. As a result, the markup is ill-formed and will yield the comment and the text after it, or sometimes just , depending on the browser. Comments can appear anywhere in a document, as the HTML parser is supposed to ignore them no matter where they appear so long as they are not inside other HTML tag structures (i.e., they cannot be used next to attributes and values; this is invalid markup: "extension ). Comments can even appear before the doctype declaration; no other tags are permitted to do this. However, not all browsers and HTML editors are fully compliant with the HTML syntax framework and may do unpredictable things under some syntax conditions. Defective handling of comments only affects about 5% of all browsers and HTML editors in use, and even then only certain versions are affected by comment mishandling issues (Internet Explorer 6 accounts for most of this high percentage). There are a few compatibility quirks involving comments: • Placing comments – or indeed any characters except for white-space – before the doctype will cause Internet Explorer 6 to use quirks mode for the HTML page. None of the doctype information will be processed. • For compatibility with some pre-1995 browsers, the contents of and elements are still sometimes surrounded by comment delimiters, and CSS- and script-capable browsers are written to specifically ignore that comment markup as not actually a comment. This means that attempts to actually comment out CSS and script markup by change the elements inside the comment to not be recognized, e.g. [/script] . • The BlueGriffon HTML editor, in versions 1.7., makes comments that are not embedded in the syntax structure; {{code|lang=html|code= ... {comment tags} ...}} will show up on-screen. Other HTML editors may have this same defect. }} ==See also==
tickerdossier.comtickerdossier.substack.com