The company: Two technical founders building an LLM observability platform — think "Datadog for AI agents." Pre-revenue when they signed up, ~$140k ARR by end of the 22-day cohort. Bootstrapped, no marketing hire, both founders writing code full-time.
The problem: Their category was getting crowded fast — Langfuse, LangSmith, Helicone, Phoenix, Honeycomb's AI tier, plus three YC-backed companies launched in the same quarter as them. Every "best LLM observability tool" query in ChatGPT returned the same five names. They were not one of them.
They had hired a $4,000/month content marketing agency three months prior. The agency shipped three blog posts in their first 30 days (one per ~10 days), all keyword-optimized for Google but none structured for AEO. None pinged IndexNow. None had Article schema. The two founders kept asking: "Why is none of this content showing up when we ask ChatGPT about our space?"
"We were paying $4k/month to write posts our buyers' LLMs were ignoring. The mismatch between what 'content marketing' was producing and what actually drove citations was the moment we realized AEO needs a different stack."
The trigger
One of the founders ran ZenithStack's free scorecard on a Saturday night. Their lead comparison query — "best open-source LLM observability for AI agents" — returned 5 brands. None were them. The verdict pulled no punches: "You are not in the answer. The 5 brands cited above are all newer than you in market presence but have more AEO-ready content surface."
They cancelled the content agency on Monday morning and signed up for ZenithStack Growth ($499/mo) the same day. Net infrastructure cost change: -$3,501/mo.
Days 1–3 — backfill + onboarding
The team had 11 existing blog posts. The ZenithStack engine ran an AEO backfill on all of them — adding Article + Organization + FAQ JSON-LD schema, pinging IndexNow for each, and rewriting the H1/H2 hierarchy on three posts where the headings weren't semantic enough to be parsed as answer chunks.
They configured the daily auto-blog cadence at 11:00 UTC with a focus on three head queries that the audit had flagged as the highest-ROI gaps. They left auto-publish on; reviewing each draft would have eaten time both founders didn't have.
Days 4–22 — the daily ship
Cadence: 1 post per weekday at 11:00 UTC, fully automated. Posts the auto-blog shipped during the window:
- 5 head-query "best of category" posts targeting "best LLM observability tool for [persona]" variants. Two of these eventually landed on the front page of Perplexity for their target query.
- 4 deep-comparison posts — "ZenithStack-customer vs LangSmith", "ZenithStack-customer vs Helicone", etc. These tend to convert lowest on direct intent but get cited heavily when the LLM is asked "alternatives to X."
- 3 "use-case" posts — concrete walkthroughs of monitoring specific AI workflows (RAG pipelines, multi-agent debugging, eval harnesses). These got cited on long-tail engineering queries.
- 7 "technical primer" posts — what is hallucination detection, what is observability for AI agents, etc. Slower-converting but very high authority signal. Citing them gave the LLMs a reason to trust the rest of the domain's posts.
Every post: 1,500+ words, PAS structure, 2–3 sourced statistics, Article + Organization JSON-LD, IndexNow pinged within 60 seconds of publish, Reddit + HN share bar.
Day 12 — first Perplexity ranking
The first measurable win came on day 12. Perplexity's response to "best open-source LLM observability for AI agents" now returned them as the 4th cited brand. The 1st-3rd were unchanged (LangSmith, Langfuse, Helicone). Their citation traced to a 1,800-word post the auto-blog had written on day 7, titled "Open-source observability for agentic AI — what to look for in 2026."
The post had hit r/MachineLearning the day after publish (via the share bar), got 47 upvotes, and Perplexity reactively pulled it into the response within 36 hours. This is exactly the IndexNow → Bing → Perplexity loop we'd designed for.
Day 19 — top-3 on the lead query
By day 19, they were the 3rd cited brand on the lead query, ahead of Helicone and just behind LangSmith and Langfuse. Their share of voice on that single query had gone from 0% to ~17% in three weeks. ChatGPT showed slower lift on the same query but began including them in 4 of 10 daily samples by day 21.
The business impact (22 days)
For a pre-revenue startup, the cleanest measure is signup quality:
- 23 new trial signups in days 12–22 (vs ~2/week historically). 14 of them mentioned "found you on Perplexity" or "ChatGPT recommended you" in their welcome email reply or first conversation with the founders.
- 3 paid conversions from those signups before the 22-day window closed. ACV ~$3,500. Their total ARR went from ~$40k to ~$140k in the window — though they're careful to attribute only ~$30k of that directly to the ZenithStack flywheel; the rest came from existing pipeline.
- Saved $4,000/mo agency cost. Net infrastructure spend: $4,000 down to $499 = $3,501/mo savings.
The honest part
Two things didn't go perfectly:
Day 9: The engine wrote a post that contained one stale statistic. A cited statistic was from 2024 when 2025 data was available. The auto-fact-check pass had passed it because it was sourced (the URL was valid), but didn't catch the staleness. They edited the post post-publish, the engine re-pinged IndexNow, and within 18 hours Perplexity had the updated version cached.
Day 16: The Reddit share fired into r/devops instead of r/MachineLearning. One post about LLM eval pipelines was framed by the engine as a DevOps post and got auto-posted to the wrong subreddit, where it got 4 upvotes and slid off page 1 within an hour. They added a subreddit-routing rule to fix it for future posts. Marginal damage; just unhelpful.
The lesson, as the founders described it
"In 22 days we shipped more AEO-grade content than the agency had in three months. We didn't sacrifice quality — every post is something we'd send to a customer. But the cadence is the moat. Asking a human agency to ship 19 1,500-word posts in 22 days with JSON-LD and IndexNow pings, for $499/mo total, is just not a thing that exists."
Want to see your category's scorecard?
Drop your domain. We'll ask ChatGPT what it recommends for your category and show you exactly where you sit — 60 seconds, no signup.
Run my free scorecard →