Insights/Tutorial
How to Write Content AI Engines Actually Cite
AI engines cite chunks, not pages. The writing rules that make content extractable: answer-first sections, fact density, question headings, and the formats that fail.
AI engines do not cite pages — they cite chunks: a paragraph or small group of paragraphs under a single heading, extracted and attributed to your URL. Writing for citation means making every section of a page a clean, liftable chunk. The rules are specific and testable.
Rule 1 — Answer in the first 40–60 words
Every section must open with a complete, self-contained answer to the question its heading poses. No throat-clearing, no "in today's rapidly evolving landscape." Extraction analyses consistently find that the opening passage of a section is the most-lifted unit — ChatGPT Search in particular often quotes a page's summary verbatim when the query matches the topic.
Fails: "When it comes to estate planning, there are many factors to consider, and costs can vary widely depending on your situation."
Works: "Most Austin estate plans cost $2,000–$6,000. A basic will package sits at the low end; revocable trusts and asset-protection structures at the high end. Three factors set the price: asset complexity, family structure, and whether you need ongoing administration."
Rule 2 — Phrase headings as questions
H2s should mirror how users phrase queries — "How much does a CPA cost for a small business?" not "Pricing considerations." Engines match retrieval at the passage level, and a heading that restates the query is the strongest possible relevance signal for the chunk beneath it. Keep hierarchy strict: one H1, question-shaped H2s, H3s only beneath their H2, no skipped levels.
Rule 3 — One specific fact per section
Fact density is a measured citation signal. Research on generative engine optimization (including the original Princeton GEO study) found that adding statistics, named entities, and citations measurably increases the probability of being sourced. Replace every vague quantifier with a number: not "many searches end without a click" but "68% of U.S. searches ended without a click in early 2026 (SparkToro)." Specific numbers anchor the model and make the chunk verifiable — which makes it safer to cite.
Rule 4 — Keep sections under 150 words
The chunk is the unit. Sections of roughly 120–180 words align with the paragraph size of AI responses; longer sections force the engine to truncate, which usually means it picks a cleaner competitor instead. A useful self-test: score each H2 section on three criteria — complete on its own (can it be understood without the rest of the page?), under 150 words, contains a specific claim with numbers or names. Pages scoring 3/3 on every section get cited dramatically more often.
Rule 5 — Cite primary sources outbound
Pages that cite government data, academic research, and named studies outrank pages that cite nothing — particularly on Perplexity, which rewards inline outbound citations. Linking to sources positions your page inside a high-trust neighborhood. Citing other blogs does not have the same effect; go to the primary source.
Rule 6 — Write in neutral, declarative prose
Engines prefer sources that match their own register: neutral, structured, objective. Salesy or fragmented writing requires the model to rephrase heavily, which raises the cost of citing you. Declarative sentences, inverted-pyramid structure, tables for comparisons, lists for sequences.
What this looks like as a checklist
| Check | Target |
|---|---|
| Section opens with complete answer | First 40–60 words |
| Heading phrased as a user query | Every H2 |
| Specific stat or named entity | Every section |
| Section length | Under 150 words |
| Outbound citations | Primary sources only |
| FAQ block | 4–6 questions, matching FAQPage schema |
| Dates | Visible, in schema, honestly maintained |
Format alone does not earn citations — authority and entity work decide whether engines trust the domain at all. But between two trusted sources, the extractable one wins. See how to get cited by Perplexity for the engine where these rules pay off first.