Research Pipeline¶

180+ Perplexity prompts, automated browser-driven search, synthesis scripts that extract stats and quotes with credibility scores, and a citation management system that kept 775 footnotes consistent across 81 sections. This section covers how research runs before writing starts -- and why that order matters.

Contents¶

Research Architecture -- The 4-phase pipeline: preparation (web research pre-search), execution (Perplexity automation), processing (synthesis extraction), and integration (writer and reviewer agents). Covers the folder structure that mirrors prompts to answers, two-phase prompt design, prompt types, and how research-first flipped the economics of evidence. Includes a Mermaid diagram of the full flow.
Perplexity Automation -- Playwright-driven browser automation that submits prompts to Perplexity Pro, waits for responses, and saves them to the matching answers folder. Key details: context bleed prevention (delete thread every 4 prompts), 12-second delays to avoid rate limits, failure handling with skip-existing logic, and practical tips from running 180+ prompts.
Citation Management -- The citation format (named footnote keys with source URLs), the one-tag-per-source-URL rule, the audit and standardization scripts, citation density benchmarks (1 per 105 words achieved), internal research tracking blocks, and the fact verification protocol. Covers the 3-stage citation workflow from pre-staged through reviewer-caught.
Synthesis and Extraction -- How raw Perplexity output gets transformed into writer-ready material. Credibility scoring for statistics (HIGH/MEDIUM/LOW), confidence scoring for quotes, the synthesize-research skill, output format, and integration with the research reader's 9 extraction scripts. The gap between a 1,500-word research dump and the 3 specific pieces a writer actually needs.

Key Takeaways¶

Research before writing. Always. The pipeline made citation cheap, which made citation ubiquitous -- 775 citations weren't a goal, they were a side effect.
Two-phase prompt design (pre-research web search, then informed prompts) produces dramatically sharper results than writing prompts cold.
Not all research is equal. Credibility and confidence scoring at extraction time offloads source evaluation from the creative process.

Previous: Agent System | Next: Obsidian Vault | Back to AI Writing Process