Below are the standard tag types that apply to tagged PDFs. These standard tags provide assistive software and devices with semantic and structural elements to use to interpret document structure and present content in a useful manner.
Container elements
Container elements are the highest level of element and provide hierarchical grouping for other block-level elements.
Document
Document element. The root element of a document’s tag tree.
Part
Part element. A large division of a document; may group smaller units of content together, such as division elements, article elements, or section elements.
Div
Division element. A generic block-level element or group of block-level elements.
Art
Article element. A self-contained body of text considered to be a single narrative.
Sect
Section element. A general container element type, comparable to Division (DIV) in HTML, which is usually a component of a part element or an article element.
Heading and paragraph elements
Heading and paragraph elements are paragraph-like, block-level elements that include specific level heading and generic paragraph (P) tags. A heading (H) element should appear as the first child of any higher-level division. Six levels of headings (H1to H6) are available for applications that don’t hierarchically nest sections.
Label and list elements
Label and list elements are block-level elements used for structuring lists.
L
List element. Any sequence of items of similar meaning or other relevance; immediate child elements should be list item elements.
LI
List item element. Any one member of a list; may have a label element (optional) and a list body element (required) as a child.
LBL
Label element. A bullet, name, or number that identifies and distinguishes an element from others in the same list.
LBody
List item body element. The descriptive content of a list item.
Special text elements
Special text elements identify text that isn’t used as a generic paragraph (P).
BlockQuote
Block quote element. One or more paragraphs of text attributed to someone other than the author of the immediate surrounding text.
Caption
Caption element. A brief portion of text that describes a table or a figure.
Index
Index element. A sequence of entries that contain identifying text and reference elements that point out the occurrence of the text in the main body of the document.
TOC
Table of contents element. An element that contains a structured list of items and labels identifying those items; has its own discrete hierarchy.
TOCI
Table of contents item element. An item contained in a list associated with a table of contents element.
Table elements
Table elements are special elements for structuring tables.
Table
Table element. A two-dimensional arrangement of data or text cells that contains table row elements as child elements and may have a caption element as its first or last child element.
TR
Table row element. One row of headings or data in a table; may contain table header cell elements and table data cell elements.
TD
Table data cell element. A table cell that contains nonheader data.
TH
Table header cell element. A table cell that contains header text or data describing one or more rows or columns of a table.
Inline-level elements
Inline-level elements identify a span of text that has specific formatting or behavior. They are differentiated from block-level elements. Inline-level elements may be contained in or contain block-level elements.
BibEntry
Bibliography entry element. A description of where some cited information may be found.
Quote
Quote entry element. An inline portion of text that is attributed to someone other than the author of the text surrounding it; different from a block quote, which is a whole paragraph or multiple paragraphs, as opposed to inline text.
Span
Span entry element. Any inline segment of text; commonly used to delimit text that is associated with a set of styling properties.
Special inline-level elements
Similar to inline-level elements, special inline-level elements describe an inline portion of text that has special formatting or behavior.
Code
Code entry element. Computer program text embedded within a document.
Figure
Figure entry element. A graphic or graphic representation associated with text.
Form
Form entry element. A PDF form annotation that can be or has been filled out.
Formula
Formula entry element. A mathematical formula.
Link
Link entry element. A hyperlink that is embedded within a document. The target can be in the same document, in another PDF document, or on a website.
Note
Note entry element. Explanatory text or documentation, such as a footnote or endnote, that is referred to in the main body of text.
Reference
Reference entry element. A citation to text or data that is found elsewhere in the document.