The company: A 38-employee, ~$4.2M ARR B2B project-collaboration SaaS competing with Asana, Monday, ClickUp, and a half-dozen newer entrants. North American + European customer base. Series-A backed, runway-conscious in 2026.
The problem: Their founder kept hearing prospects open demo calls with "I asked ChatGPT for the best tool in your category and it gave me Asana, ClickUp, and Monday — what makes you different?" They weren't being cited at all. Their existing SEO playbook (six retainer blogs/month on Ahrefs-mined keywords) ranked them well on Google but didn't move the needle in any LLM answer.
Day 0 — baseline audit
On the day they ran the ZenithStack scorecard, here's what came back across their 10 head queries × 3 engines:
- ChatGPT (live web): 2 of 31 citations included their domain. The two hits were on long-tail engineering-team queries — the meat of their TAM was invisible.
- Perplexity Sonar: 0 of 27 citations. Perplexity preferred Asana/Monday/ClickUp + two newer YC companies they hadn't even mapped as competitors.
- Gemini search: 0 of 22. Gemini leaned almost entirely on G2 leaderboards and Forbes Cloud 100 — neither of which they were on.
The ZenithStack audit's gap analysis flagged the single highest-ROI issue: they had blog content for almost every comparison query their buyers were typing into ChatGPT, but those posts hadn't been pinged to IndexNow, didn't have Article + ItemList JSON-LD, and weren't cross-cited from the 4 sources the LLMs actually crawl most heavily. The content existed; the AEO scaffolding around it didn't.
"We'd been writing for SEO since 2021. None of it was structured for AEO. Watching the audit show us why Perplexity was ignoring posts we'd spent $40k commissioning was, frankly, the wake-up call."
Days 1–14 — the flywheel turns on
They flipped on the daily auto-blog ("Content Factory" in the dashboard), set the cadence to one post per weekday, and gave the system access to their existing blog database so it could read context from prior posts.
What the engine shipped in the first two weeks:
- 11 new authority-grade posts (target: 1500 words, PAS structure, 2–3 third-party stats sourced inline). Each targeted the lowest-cited tracker prompt the day it was generated.
- JSON-LD Article + Organization schema embedded on every post, including their existing 89 blog posts (one-time backfill the engine ran on day 2).
- IndexNow ping on every publish — pushing Bing and Yandex to re-crawl within minutes instead of weeks. Bing's index is the substrate ChatGPT and Copilot lean on heaviest.
- Reddit + Hacker News share bar on the high-value comparison posts. Two ranked on r/projectmanagement and one landed on the HN front page, which Perplexity picked up within 36 hours.
Day 22 — first measurable lift
Three weeks in, the Monday morning attribution digest showed the first real movement:
- Perplexity: 0/27 → 4/27 citations. Three of the four were on comparison queries ("alternatives to Asana for engineering teams" and similar). All three traced to the 11 new posts.
- ChatGPT: 2/31 → 5/31. The new hits were on category-positioning queries ("best project management tool for distributed teams in 2026").
- Gemini: 0/22 → 2/22. Slower — Gemini relies on its training cutoff and doesn't react as quickly to new content. The two new citations both pointed to backfilled JSON-LD posts.
Days 23–47 — compounding
From day 22 onward, the team didn't change anything. The flywheel was identical: one autoblog/day at 11:00 UTC, the gap-fill engine picked the worst-cited prompt, wrote against it, pushed it, and tracked the lift. The founders looked at the Monday digest, occasionally rewrote a draft if the engine missed a positioning nuance, and otherwise let it run.
End-of-window results (day 47):
- ChatGPT: 9/31 citations. Net +7. The biggest gain came on the three "team type" queries (engineering / design / agency teams) where they out-cited Monday and Asana on at least two of the daily samples.
- Perplexity: 11/27 citations. Net +11. Perplexity rewarded the IndexNow + JSON-LD pattern most aggressively.
- Gemini: 4/22. Net +4. Still trailing the other engines but moving.
The business impact (month 2)
Citation share is leading; pipeline is lagging. The team waited until day 30+ to start measuring downstream metrics so the attribution lookback was clean.
- 14 inbound demo requests that mentioned, unprompted, "ChatGPT suggested I check you out" or "Perplexity put you in a comparison." Inbound demos averaged 0–2 per month historically.
- 2 closed deals in month 2 attributable to AI-search referrals — ACV $42k and $41k respectively, totaling $83k in new ARR.
- Cancellations: They cancelled their content agency retainer ($4,200/mo), Profound monitoring ($299/mo), and a separate content-distribution tool ($129/mo). Net infrastructure savings: $4,628/mo against ZenithStack's $499/mo (Growth tier).
What stopped working
It wasn't all up-and-to-the-right. Two things they had to fix mid-cohort:
Day 18: The auto-blog briefly drifted off positioning. The engine wrote a post comparing them to a tool they'd deliberately stopped competing with. They caught it on review (the dashboard shows every draft pre-publish), edited it back into the right framing, and added a "do not target this query" rule. After that, no further drift.
Day 31: A high-value tracker prompt was being "won" without lift. They were cited on it, but the cite was on a 2022 blog post that no longer represented their current product. The fix-it-now button generated a refreshed version, pinged IndexNow, and the next day's tracker showed the updated URL replacing the stale one in the engine's response. Recover-time on stale citations: about 26 hours.
The honest caveats
This was one company, one window. We don't claim every customer will see this trajectory — outcomes depend on category competition, prior content footprint, and how much human review you do on auto-blog output. What we will claim, because we observed it on every cohort we've run: publishing AEO-structured content into the surfaces those engines crawl produces measurable citation lift inside 30 days. The size of that lift is mostly a function of how crowded your category already is.
Want this for your brand?
Run a free scorecard against your top category query. 60 seconds, no signup, real live results from ChatGPT.
Run my free scorecard →