Does it matter to AI whether I use <section> vs <div>?

Yes. <section> and <article> carry semantic meaning that AI extractors use to identify the page's main content. A <div> is generic and provides no structural cue. Use <section> for thematic groupings inside an article and <article> for the article itself.

Should every H2 be phrased as a question?

No, but every H2 that is supposed to be extractable as a direct answer should be. Mix question-shaped H2s for canonical Q&A sections with descriptive H2s for narrative or workflow sections. A 100% question-only outline reads awkwardly to humans and gives diminishing returns.
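One way to see how an outline mixes question-shaped and descriptive H2s is to list them from the raw HTML. The sketch below uses Python's standard html.parser and treats any H2 that ends in a question mark as question-shaped. That heuristic, and the names H2Collector and classify_h2s, are assumptions made for illustration; they do not describe how any particular AI extractor works.

```python
from html.parser import HTMLParser


class H2Collector(HTMLParser):
    """Collect the text of every <h2> in a document (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.in_h2 = False
        self.h2_texts = []

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self.in_h2 = True
            self.h2_texts.append("")

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_h2 = False

    def handle_data(self, data):
        # Text may arrive in several chunks, so append to the current H2.
        if self.in_h2:
            self.h2_texts[-1] += data


def classify_h2s(html):
    """Return (heading text, 'question' | 'descriptive') for each H2.

    Assumption: a heading counts as question-shaped if it ends with '?'.
    """
    parser = H2Collector()
    parser.feed(html)
    parser.close()
    result = []
    for raw in parser.h2_texts:
        text = " ".join(raw.split())  # normalize whitespace
        kind = "question" if text.endswith("?") else "descriptive"
        result.append((text, kind))
    return result


sample = (
    "<h2>What is semantic HTML?</h2><p>Answer paragraph.</p>"
    "<h2>Migration workflow</h2><p>Narrative section.</p>"
)
for heading, kind in classify_h2s(sample):
    print(f"{kind:12} {heading}")
```

A page where every heading comes back as "question" may be worth a second look for readability; a page with none may be missing extractable Q&A sections.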

Are description lists (<dl>) really used by AI systems?

Yes. The <dt>/<dd> pairing is a strong, machine-readable signal that the <dt> element is a term being defined. Glossaries, parameter references, and FAQ-style term lists extract more reliably from <dl> than from styled <div> blocks.
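To illustrate why a description list is easy for a machine to read, the sketch below pulls term/definition pairs out of raw HTML with Python's standard html.parser. The class name DescriptionListExtractor and the function extract_terms are hypothetical, and the parsing is deliberately simple (one definition per term); real extractors are likely more elaborate.

```python
from html.parser import HTMLParser


class DescriptionListExtractor(HTMLParser):
    """Collect (term, definition) pairs from <dt>/<dd> elements."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._current = None  # "dt", "dd", or None
        self._term = ""
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in ("dt", "dd"):
            self._current = tag
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "dt" and self._current == "dt":
            self._term = self._text()
            self._current = None
        elif tag == "dd" and self._current == "dd":
            self.pairs.append((self._term, self._text()))
            self._current = None

    def handle_data(self, data):
        if self._current:
            self._buffer.append(data)

    def _text(self):
        # Join buffered chunks and normalize whitespace.
        return " ".join("".join(self._buffer).split())


def extract_terms(html):
    parser = DescriptionListExtractor()
    parser.feed(html)
    parser.close()
    return parser.pairs


semantic = (
    "<dl><dt>Semantic HTML</dt>"
    "<dd>HTML markup whose tags convey meaning and structure.</dd></dl>"
)
styled = (
    '<div class="term">Semantic HTML</div>'
    '<div class="def">HTML markup whose tags convey meaning and structure.</div>'
)
print(extract_terms(semantic))
print(extract_terms(styled))
```

The styled version carries the same visible text, but nothing in the markup says which block is the term and which is the definition, so this extractor finds no pairs in it.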

What about ARIA roles — do they help AI?

They can. ARIA roles like role="main" or role="article" reinforce semantic intent when you cannot use the corresponding HTML5 element. Prefer the native HTML5 element first; use ARIA only when the native element is impractical for layout or framework reasons.
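As a rough illustration of the native-first, ARIA-as-fallback idea, the sketch below reports where a page declares its main and article regions, whether through the native element or a role attribute. It uses Python's standard html.parser; the names LandmarkFinder and find_landmarks are hypothetical, and limiting the check to these two roles is an assumption made to keep the example short.

```python
from html.parser import HTMLParser

# Assumption: only the two roles named in the text are checked.
LANDMARKS = {"main", "article"}


class LandmarkFinder(HTMLParser):
    """Record each main/article region and how it was declared."""

    def __init__(self):
        super().__init__()
        self.found = []  # list of (landmark, how) tuples

    def handle_starttag(self, tag, attrs):
        role = dict(attrs).get("role")  # None when no role attribute
        if tag in LANDMARKS:
            self.found.append((tag, "native element"))
        elif role in LANDMARKS:
            self.found.append((role, f"role on <{tag}>"))


def find_landmarks(html):
    finder = LandmarkFinder()
    finder.feed(html)
    finder.close()
    return finder.found


native = "<main><article><h1>Title</h1></article></main>"
aria = '<div role="main"><div role="article"><h1>Title</h1></div></div>'
plain = "<div><div><h1>Title</h1></div></div>"

for label, doc in (("native", native), ("aria", aria), ("plain", plain)):
    print(label, find_landmarks(doc))
```

Both the native and the ARIA version expose the two regions; the plain version exposes none, which is the case the fallback is meant to prevent.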

How do I check whether my page is AI-readable?

View the raw HTML source (not the rendered DOM). Confirm: one <h1>, ordered <h2>/<h3>, real lists and tables, <main>/<article> wrappers, and the article body present in the server response. A simple curl check (curl -sL <URL> | grep -E '<h1|<h2|<main|<article') catches most JavaScript-only rendering issues.
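The same checks can be scripted once the raw server response is saved to a string (for example, from the curl output). The sketch below is a minimal audit using Python's standard html.parser; the names StructureAudit and audit, the exact wording of the messages, and the choice to check only h1 count, skipped heading levels, and the main and article wrappers are all assumptions for illustration.

```python
from html.parser import HTMLParser


class StructureAudit(HTMLParser):
    """Record heading levels and which tags appear in raw HTML."""

    def __init__(self):
        super().__init__()
        self.heading_levels = []
        self.tags_seen = set()

    def handle_starttag(self, tag, attrs):
        self.tags_seen.add(tag)
        # Match h1..h6 only (not hr, head, header, and so on).
        if len(tag) == 2 and tag[0] == "h" and tag[1] in "123456":
            self.heading_levels.append(int(tag[1]))


def audit(html):
    """Return a list of human-readable problems; empty means none found."""
    parser = StructureAudit()
    parser.feed(html)
    parser.close()
    levels = parser.heading_levels
    problems = []
    h1_count = levels.count(1)
    if h1_count != 1:
        problems.append(f"expected exactly one <h1>, found {h1_count}")
    # A heading may go deeper by at most one level at a time.
    for prev, cur in zip(levels, levels[1:]):
        if cur > prev + 1:
            problems.append(f"heading level skipped: <h{prev}> followed by <h{cur}>")
    for landmark in ("main", "article"):
        if landmark not in parser.tags_seen:
            problems.append(f"missing <{landmark}> wrapper")
    return problems


good = "<main><article><h1>T</h1><h2>A</h2><h3>B</h3><h2>C</h2></article></main>"
bad = "<div><h1>A</h1><h1>B</h1><h3>C</h3></div>"

print(audit(good))
for problem in audit(bad):
    print("-", problem)
```

Because the audit only sees the string it is given, running it on the unrendered response also surfaces JavaScript-only pages: an empty shell reports a missing h1 and missing wrappers.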

AI systems read HTML the way assistive technologies do: they look for semantic landmarks [...]

[...] the <dt>/<dd> pairing as a strong signal that the <dt> is a defined term.

Anti-patterns

- A series of paragraphs that visually look like a list but carry no semantic grouping.
- A <ul> whose items are unrelated multi-paragraph essays. Lists imply parallelism; if items aren't parallel, use sub-headings instead.
- A definition list rendered as a styled two-column <div> grid. The term/definition pairing is invisible to AI.

Definition patterns

For reference and definition pages, two patterns extract reliably:

Pattern A: H2 question + immediate answer paragraph ("answer target"). As eSEOspace describes it, "an answer target is a concise, standalone paragraph designed specifically to directly answer a targeted query. It usually sits immediately below an H2 or H3 heading."

    <h2>What is semantic HTML?</h2>
    <p>Semantic HTML is the practice of using HTML elements according to their intended meaning rather than for visual appearance.</p>

Pattern B: <dl> with <dt> term and <dd> definition.

    <dl>
      <dt>Semantic HTML</dt>
      <dd>HTML markup whose tags convey meaning and structure, not just visual presentation.</dd>
    </dl>

Mix them: use Pattern A for the canonical first answer on the page, and Pattern B for additional terms in a glossary block at the bottom.

Tables

Real <table> elements (not <div> grids) carry structural meaning AI extractors use to align rows and columns.

Rules

- Wrap header cells in <thead> with <th>.
- Wrap row labels in <th scope="row"> when the first column is a label.
- Use <caption> for a one-line summary of what the table shows. AI extractors and screen readers both pick this up.
- Keep one logical concept per table. Two unrelated comparisons belong in two tables.

Anti-patterns

- A <table> used purely for layout (use CSS Grid or Flexbox).
- Headerless tables (no <thead>). AI cannot reliably tell which row is the header.
- Merged cells used decoratively. Merge only when the data semantically warrants it.
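The header and caption rules for tables can be checked mechanically. The sketch below walks raw HTML with Python's standard html.parser and records, for each table, whether it contains any header cell and whether it has a caption. The name TableHeaderCheck, the function check_tables, and the decision to treat "has at least one th" as "has headers" are assumptions for illustration only.

```python
from html.parser import HTMLParser


class TableHeaderCheck(HTMLParser):
    """Record, per <table>, whether it has header cells and a caption."""

    def __init__(self):
        super().__init__()
        self.tables = []  # one dict per table, in document order
        self._stack = []  # currently open tables, to handle nesting

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            info = {"has_th": False, "has_caption": False}
            self.tables.append(info)
            self._stack.append(info)
        elif self._stack and tag == "th":
            self._stack[-1]["has_th"] = True
        elif self._stack and tag == "caption":
            self._stack[-1]["has_caption"] = True

    def handle_endtag(self, tag):
        if tag == "table" and self._stack:
            self._stack.pop()


def check_tables(html):
    checker = TableHeaderCheck()
    checker.feed(html)
    checker.close()
    return checker.tables


data_table = (
    "<table><caption>Plan limits</caption>"
    "<thead><tr><th>Plan</th><th>Seats</th></tr></thead>"
    '<tbody><tr><th scope="row">Basic</th><td>5</td></tr></tbody></table>'
)
layout_table = "<table><tr><td>Sidebar</td><td>Content</td></tr></table>"

print(check_tables(data_table))
print(check_tables(layout_table))
```

A table that reports no header cells is either a layout table or a data table whose header row is indistinguishable from its data rows; both cases match the anti-patterns listed above.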
Document landmarks (HTML5)

Wrap the page in HTML5 landmarks so AI systems can ignore boilerplate and zoom in on the article content:

    <header>…site nav…</header>
    <main>
      <article>
        <h1>…page title…</h1>
        …answer-first content…
      </article>
      <aside>…related links…</aside>
    </main>
    <footer>…</footer>

The <main> and <article> pair is the strongest single "the answer is in here" signal for LLM extractors. Without them, an AI parser must guess which block is the article, and often guesses wrong on <div>-soup pages.

Answer-first chunking

LLM-driven AI search retrieves passage-level chunks, not whole pages. To make each chunk self-contained:

- Put the direct answer in the first 1-2 sentences after the H2 or H3. Do not bury it after a long preamble.
- Repeat the entity name in each section. Avoid pronouns like "it" or "this" as the section opener; a chunk extracted out of context loses the antecedent.
- Keep paragraphs short. Two to four sentences each, so the chunker can pick up clean boundaries.
- Use bold sparingly. Bold the term being defined, not whole sentences; over-bolding flattens the signal.

Quick reference: do / don't

Concern | Do | Don't
Headings | Single <h1>, ordered <h2>/<h3> | Multiple <h1>s, skipped levels
Lists | Real <ul>, <ol>, <dl> | <div> blocks styled to look like lists
Tables | <table> with <thead>/<th> | <div> grid masquerading as a table
Landmarks | <main>, <article>, <section> | Generic <div> everywhere
Definitions | H2 question + answer paragraph or <dl> | Bolded term in middle of a paragraph
Bold/italic | Emphasize the entity term | Bold whole paragraphs
Pronouns | Repeat entity name per section | "It..." / "This..." as section opener

Common mistakes

- Building the page in a visual editor that emits <div> soup. Audit the source HTML; if you don't see real headings, lists, and landmarks, rebuild the template.
- Using a heading tag as a style hook for any large bold text. Style with CSS classes; reserve heading tags for actual section breaks.
- Treating <dl> as deprecated. It isn't: it remains the correct semantic for term/definition pairs and is well supported by AI extractors.
- Rendering content client-side without server-side fallback. Many AI crawlers do not execute JavaScript; if the article body is rendered only by JS, the AI sees an empty page.

Validation checklist

[ ] Exactly one <h1>, matching (or closely reflecting) the page title.
[ ] No skipped heading levels.
[ ] H2/H3 phrased as questions or direct answers where extraction matters.
[ ] Lists use <ul>/<ol>/<dl>, never simulated with <div> blocks.
[ ] Tables use <thead> and (when helpful) <caption>.
[ ] Page wrapped in <article> and HTML5 landmarks.
[ ] Article content present in the initial server-rendered HTML.
[ ] Each section opens by repeating the entity name, not a pronoun.
[ ] Answer paragraphs sit immediately under their H2/H3 heading.

References

- Barry Adams, "Why Semantic HTML matters for SEO and AI." https://www.seoforgooglenews.com/p/why-semantic-html-matters-for-seo
- Microsoft Advertising, "Optimizing Your Content for Inclusion in AI Search Answers" (October 2025). https://about.ads.microsoft.com/en/blog/post/october-2025/optimizing-your-content-for-inclusion-in-ai-search-answers
- Gur, Furuta et al., "Understanding HTML with Large Language Models," arXiv:2210.03945. https://arxiv.org/abs/2210.03945
- r/SEO, "Headings in the age of AI crawlers." https://www.reddit.com/r/SEO/comments/1opffly/headings_in_the_age_of_ai_crawlers/
- W3C Web Accessibility Initiative, "Content Structure." https://www.w3.org/WAI/tutorials/page-structure/content/
- eSEOspace, "How to Structure a Page So AI Can Extract Answers Instantly." https://eseospace.com/blog/ai-content-structure-extraction/
- Franco Folini, "The Curious Case of the Vanishing Definition List: Why DL Deserves Your Love" (March 2026). https://francofolini.com/2026/03/15/the-curious-case-of-the-vanishing-definition-list-why-dl-deserves-your-love/

HTML semantic structure for AI readability: headings, lists, and tables

TL;DR

One <h1>, ordered <h2>/<h3> headings phrased as questions or direct answers, real <ul>/<ol>/<dl> lists, properly headered tables [...]
