Briefs · 2026-05-19 · Customer care AI landscape · Companion to the 2026-05-15 Agentic AI Primer §4.2-4.3

Customer Care-as-a-Service — A Working Primer

0. Audience + anchor question

Pre-call orienting material on AI-driven customer care. The anchor question: a mid-market company wants to "rent" tiered AI customer care — Tier 1 (password resets, balance checks, order status) fully AI-handled, Tier 2 AI with supervisor escalation, Tier 3 human at the company. Is that a ServiceNow service today? Is anyone selling the whole stack as a managed service, or is it still piecemeal? After reading, the reader should be able to read an Intercom, ServiceNow, Sierra, or Concentrix earnings transcript and place the disclosed metrics — automation rate, per-resolution price, FTE-equivalent savings — against the realistic operating range, and know which of the named vendors actually offers a "rent the whole stack" service versus which sells one layer of it.

Sister deliverable: docs/briefs/2026-05-15-agentic-ai-primer.md §4.2 (voice customer service) and §4.3 (non-voice / tickets). Read those first for the flow mechanics — speech-to-text → LLM → text-to-speech latency budgets, RAG over a help-desk knowledge base, the deflection-rate metric. This primer builds on that with the landscape, unit economics, and named players, and ends on whether the "rent-the-whole-stack" gap is real.

A note on vocabulary before the arc begins: CCaaS stands for contact-center-as-a-service — the cloud version of the on-premises call-center platform that emerged in the early 2010s (Five9, NICE CXone, Genesys Cloud, Twilio Flex). It handles call routing, queueing, IVR ("press 1 for billing"), and CRM integration. BPO stands for business process outsourcing — the labor-side outsourcer that staffs the human agents who sit on the other end of the CCaaS platform (Teleperformance, Concentrix, TaskUs, Foundever). Deflection rate is the share of incoming interactions that the AI handles end-to-end without human involvement; automation rate is Intercom's preferred metric and decomposes deflection into involvement rate (share of eligible conversations the bot touches) times resolution rate (share of those that get resolved without escalation). Customer care-as-a-service, the topic of this primer, is the emergent category sitting on top of CCaaS plus BPO — the AI-first vendor that promises to absorb a defined chunk of the support workload at a per-resolution price.


1. The shift

Customer service is being re-architected from a labor-priced service into a transaction-priced service. In the old shape — call center plus help-desk software plus tickets-per-agent staffing — a company bought CCaaS seats ($75–$200/seat/month for Five9 or NICE CXone), staffed them with in-house or BPO agents (US fully-loaded $35–$65/hour; offshore $8–$18/hour), and measured throughput in tickets-per-agent-per-hour. The unit was the agent-hour. Pricing was set by labor, and the labor was the headcount line on the P&L.

The AI-driven layer prices the unit of output, not the unit of input. Intercom's Fin charges $0.99 per resolution (snapshot); Zendesk's AI agents charge $1.50–$2.00 per automated resolution since August 2024; HubSpot's Breeze Customer Agent moved to $0.50 per resolved conversation in April 2026; Salesforce's Agentforce launched at $2 per conversation in late 2024, added Flex Credits at $0.10/action in May 2025, then layered a $125/user/month "Agentic Enterprise License Agreement" on top. Sierra (Bret Taylor's company) and Decagon — the two best-capitalized AI-first pure-plays — sell exclusively on outcome pricing. The mechanics of how an LLM-driven agent does the resolving — RAG over the knowledge base, tool calls to look up order status or process a refund, escalation handoff to a human — are covered in docs/briefs/2026-05-15-agentic-ai-primer.md §4.3. The shift this primer cares about is the commercial one: where the agent-hour used to be the unit of sale, the resolution is now.


2. Today's landscape by archetype

The market sorts into four archetypes by where the vendor started: BPO incumbents now selling AI on top of human labor, SaaS incumbents now selling AI on top of help-desk software, AI-first pure-plays who never sold either, and the voice infrastructure vendors who supply the speech-to-text and text-to-speech that the voice-channel vendors all run on. CCaaS legacy incumbents (Five9, NICE, Genesys) get their own treatment in §7 because they're the at-risk leg of the investment angle.

2.1 BPO incumbents moving to AI

The labor-side outsourcers carry the deepest exposure to AI substitution, and each is building the same defense: AI tooling on top of human-agent productivity, with the human agent as the wrapper rather than the deliverable. The numbers below are FY2025 unless stated.

Teleperformance — France-listed, €10.2B revenue, 14.6% recurring EBITA margin per the FY25 results press release (Feb 26 2026). Committed €100M to AI investment in 2025 per Markets Herald's summary of the FY25 results. The signature partnership is with Sanas, whose real-time speech-understanding platform is described in the joint announcement as "softening TP Experts' accents to more closely align with clear spoken English" (snapshot), with planned enhancement for "TP Experts in India, Philippines, Latin American, Africa and Asia." Teleperformance has invested $13M in Sanas and secured exclusive resale rights. The Sanas deal frames Teleperformance's AI defense: keep the agent on the call, use AI to make the agent easier to understand to US-bound callers. Note the bet — Teleperformance is not yet pricing AI resolutions directly to its clients.

Concentrix — combined entity (Concentrix + Webhelp), Nasdaq-listed, FY25 revenue $9.8B at 2.1% constant-currency growth, FY26 guidance $10.04–$10.18B (Concentrix Q4 2025 results press release; see Concentrix Q4 slides commentary for FY26 guidance figures). The flagship AI product is iX Hero, an agent-assist platform that consolidates customer context from multiple back-end systems into one pane and writes suggested responses for the human agent. Concentrix's iX Hero feature announcement (snapshot) discloses aggregate performance "based on studies in production": up to 11% improvement in first-contact resolution, up to 9% improvement in CSAT, and up to 15% improvement in speed-to-proficiency. The press release does not attach these aggregate metrics to specific industries or named customers. Like Teleperformance, the iX Hero pitch is "make the human agent faster" rather than "remove the human agent."

TaskUs — Nasdaq-listed, Q3 2025 record revenue $298.7M at 17.0% YoY growth, FY25 guidance $1.173–1.175B (TaskUs Q3 2025 8-K; snapshot). The disclosure that matters: AI Services was TaskUs's fastest-growing service line in Q3 2025 with >60% YoY growth (third consecutive quarter), and 63.7% YTD growth through the first nine months of 2025 per CFO Balaji Sekar — with Trust + Safety growth at nearly 20% and total Q3 revenue growth at 17%. TaskUs is the cleanest disclosure inside the BPO complex of the labor-to-AI mix shift inside the business: AI Services is the fastest-growing line by a wide margin, while the legacy Digital Customer Experience line is the slower-growing remainder of the book.

Foundever (formerly Sitel) — privately held (Sitel Group + Sykes merger). Foundever's own Cognigy-partnership announcement (snapshot) cites 150,000 associates across 45 countries, supporting interactions in over 60 languages daily. The AI strategy is structured around the strengthened Cognigy partnership — the same Cognigy that NICE bought for $955M two months later (see §7) — putting Foundever in the awkward position of building its AI strategy on top of a stack now owned by a CCaaS competitor. Foundever launched an AI Lab in Barcelona in March 2025 and has staffed a CEO succession (Chris Halbard) explicitly tied to AI transformation.

The BPO incumbents share a common AI strategy: layer agent-assist on top of human labor, partner with an AI-first vendor for the conversational layer rather than build it. The strategic risk is that the AI-first vendor either gets acquired by a competitor (Cognigy → NICE) or starts selling direct to enterprise (Sierra, Decagon), bypassing the BPO entirely.

2.2 SaaS incumbents

The help-desk and CRM incumbents already own the workflow seat — the queue, the macro library, the customer record. Their AI play is to add the resolution layer on top of seats they're already billing for, and to price the resolution layer outcome-based on top of the seat fee.

Intercom Fin — the published per-resolution anchor for the industry. $0.99 per outcome (snapshot) — where an outcome is "Fin resolves the issue end to end, or successfully executes a Procedure you've configured to end in a handoff to a human or a workflow" — with a 50-outcomes-per-month minimum commitment. The pricing page does not list a separate monthly subscription fee beyond the per-outcome charge and the minimum. Intercom's separate help-center article on automation rate defines automation rate as Involvement × Resolution and presents it as a measurement framework, not as a guaranteed-rate offering. Fin.ai's published headline number: 67% average resolution rate across 7,000+ customers, with top-performing customers reaching 80–84%. The $0.99 anchor is the industry reference point — when an enterprise procurement team looks at a Sierra or Decagon quote, the Intercom price is the comp.

Zendesk AI — moved to outcome-based pricing per Hiver's pricing breakdown: per-resolution committed pricing in the $1.50 range, with usage-based rates often cited around $2.00 per resolution depending on contract terms, plus an Advanced AI add-on priced separately through sales (the article notes AI costs scale per agent rather than as a flat tier). Zendesk acquired Ultimate.ai (an AI customer-service pure-play) in early 2024, and as of March 2026 acquired Forethought — making Zendesk the most acquisitive SaaS-incumbent in the category.

Salesforce Agentforce — the most-marketed agent platform from the largest CRM incumbent, launched in late 2024 at $2 per conversation. Enterprise pushback (one disclosed analysis projected a five-agent team running ~$900/day) forced Salesforce to add Flex Credits at $0.10 per action in May 2025, and a $125+/user/month "Agentic Enterprise License Agreement" in late 2025. Adoption is below Salesforce's marketing tempo: ~8% of Salesforce's 150,000+ customer base has adopted Agentforce. The three-pricing-model setup signals Salesforce is still searching for product-market fit on pricing in a category where Intercom's single $0.99 is the consensus anchor.

ServiceNow Now Assist — the closest answer to the "is that a ServiceNow service?" question. CEO Bill McDermott on the Q4 FY2025 call (January 28 2026): "Now Assist NNACV outperformed expectations in Q4 and surpassed $600,000,000 in ACV" — more than doubled year-over-year (ServiceNow Q4 2025 earnings call transcript; snapshot). Q4 subscription revenue $3.47B (+21% YoY). CFO Gina Mastantuono: "In Q4, deals greater than $1,000,000 nearly tripled quarter over quarter" — 35 deals over $1M in Q4 alone, with customers spending more than $1M annually on Now Assist growing >40% year-over-year, and deals attaching 5+ Now Assist products up >10× YoY. The 2026 guide is to significantly exceed $1B in Now Assist ACV. ServiceNow's case-study disclosures: Orica (mining) reported its IT-service-desk virtual-agent deflection rate moved from 18% to 94% within two months of Now Assist production use (iTnews coverage; snapshot). ServiceNow's structural advantage is that the AI agents sit inside the same workflow platform that already runs IT service management, HR case management, and customer service management for ~80% of the Fortune 500 — the "tier 3" knowledge base, ticketing system, and integration layer are already there. Now Assist is the closest thing in the public market to the "rent the whole stack" answer, but only for ServiceNow workflow customers (see §3).

HubSpot Breeze Customer Agent — moved to $0.50 per resolved conversation on April 14 2026, the most aggressive per-resolution price among the SaaS incumbents. Disclosed performance: resolves 65% of conversations and cuts resolution time 39% across 8,000+ users. HubSpot's bet is that the small- and mid-market customer base it already serves on the marketing/CRM hub will absorb the customer-service agent at the lowest-friction price point in the category.

2.3 AI-first pure-plays

The pure-plays never owned the seat, the BPO labor, or the CCaaS infrastructure — they sell the agent and the price-per-resolution as the entire deliverable. The two best-capitalized are Sierra and Decagon; both have raised in the last six months at valuations that imply the public-market incumbents are paying for the right defense.

Sierra — founded early 2024 by Bret Taylor (ex-Salesforce co-CEO, ex-Twitter chair) and Clay Bavor (ex-Google VR). Latest round: $950M Series E in May 2026 at a post-money valuation above $15B, with Tiger Global and GV leading — up from a $10B valuation in September 2025 when Greenoaks led a $350M round (snapshot). Per the May 4 2026 TechCrunch coverage, the Series E brings Sierra's cumulative capital to north of $1 billion (live URL; no dated local snapshot at time of authoring). ARR: $100M in late November 2025, $150M+ by early February 2026 — among the fastest enterprise-software ramps disclosed publicly. Customers explicitly named in the May 2026 and September 2025 TechCrunch coverage: SoFi, Ramp, Brex; Sierra claims more than 40% of the Fortune 50 as customers and that agents on its platform handle billions of interactions. Pricing is primarily outcome-based — Sierra only bills when the agent resolves on the outcome-priced portion of the contract; per Sierra's own blog, "outcome-based pricing may not always be the best fit," and for routing/greeter-style interactions or other non-resolution work Sierra layers a consumption-based component into a blended pricing approach. Public anchor from a third-party analyst writeup: a consumer brand running 40,000 monthly tickets at a 65% deflection rate pays ~26,000 × $1.50 = ~$39,000/month, plus platform minimums and implementation fees that put first-year cost at $200–350K+. Sierra is the highest-conviction private-market vote that the AI-first pure-play category captures enterprise share from the SaaS incumbents.

Decagon — founded 2023; Series C $131M at $1.5B in June 2025 (Accel and a16z Growth co-led); a follow-on $250M Series D closed January 28 2026, tripling the valuation to $4.5B (snapshot), led by Coatue Management and Index Ventures with ChemistryVC, Definition Capital, and Starwood Capital joining and existing backers a16z, Accel, Bain Capital Ventures, and Ribbit Capital continuing. Customer disclosures: Notion, Duolingo, Rippling, Bilt, Eventbrite, Substack, Oura Health, Affirm, Chime, Avis Budget Group, Mercado Libre, Deutsche Telekom; 100+ enterprise customers added in 2025. Pricing is not publicly listed — Featurebase's pricing breakdown (snapshot) explicitly states "Decagon doesn't publish list pricing. The site pushes you toward a demo, and pricing happens through a sales process." Decagon explicitly frames itself as "concierge customer experience" — positioning more upmarket than Intercom Fin's SMB anchor and against Sierra's Fortune-50 positioning.

Maven AGI — Series B $50M in June 2025 (snapshot) — Dell Technologies Capital lead, with Cisco Investments, SE Ventures, Lux Capital, M13, and E14 Fund; $78M total funding. Customers named in the press release: Ibex, Tripadvisor, SS&C, Rho, Check, Papaya Pay, Clio, Parivie Beauty. The notable customer is OpenAI itself — Maven AGI's case study with OpenAI (Wayback snapshot) reports across organizations implementing MavenAGI: 93% of customer support questions answered autonomously, average time to resolve reduced by 60%, support-representative productivity 2x, and cost per ticket decreased from $40 to $8 (an 80% reduction). 100% customer renewal rate disclosed; $0–$7M ARR in 2024 across roughly 0–40 enterprise customers per the press release framing.

Cresta — Series D $125M in November 2024 (snapshot); $270M+ total funding. Cresta's positioning differs from Sierra/Decagon — it sells agent-assist primarily, not fully autonomous agents — same product shape as Concentrix iX Hero, just sold standalone instead of inside a BPO. Customers named in the press release: Intuit, Verizon, Brinks Home, Cox Communications, Hilton, CarMax (Brinks disclosed: 50% lower QM costs, 75% first-call resolution, +30 NPS points, call transfers cut from 30% to 8%). Cresta launched a fully autonomous Cresta AI Agent in 2025 — the same convergence trajectory as the agent-assist incumbents (Genesys, NICE) moving into autonomous, but from the other direction.

Cognigy — Germany-based conversational-AI platform; NICE announced acquisition for ~$955M (including a ~$50M time-bound holdback) on July 28 2025, expected to close in Q4 2025. Pre-acquisition customers: Mercedes-Benz, Nestlé, Lufthansa Group. The $955M price signals NICE's view that the conversational layer is the missing piece on top of CCaaS — see §7 on the incumbent CCaaS defensive playbook.

Replicant — voice-channel AI pure-play; raised $78M Series B in April 2022 (snapshot) (Stripes, Salesforce Ventures), $110M total per TechCrunch. The 2022 vintage is structurally awkward — Replicant predates the LLM-driven voice-agent generation and has not raised again at scale, while ElevenLabs / Cartesia / Deepgram (§2.4) have absorbed the runtime layer Replicant once owned. A 2024 funding event was an "incubator/accelerator" round — a structural downgrade in the venture market.

Hyro$45M strategic growth round in 2025 (Healthier Capital lead, with ServiceNow Ventures and Norwest); healthcare-vertical focus, branded as the "Responsible AI Agent Platform" — built around HIPAA constraints and the specifics of healthcare patient-facing communications.

Ada — Canadian AI customer-service vendor; $130M Series C at $1.2B valuation (Wayback snapshot) — Spark Capital lead with Tiger Global, Bessemer, Accel, FirstMark, Burst Capital participating. The blog post discloses 6x revenue growth in under three years and that Ada has "automated more than 1.5 billion brand interactions for hundreds of digital leaders"; named customers in the post include Facebook and Square. The post does not disclose 2023/2024 revenue figures, a 5,000-customer count, an 80%+ resolution rate, or an underlying model-provider dependency — those claims appear in third-party sources rather than Ada's own funding disclosure. Ada is the established player from the pre-LLM chatbot generation that has transitioned to LLM agents; funding has lagged the Sierra/Decagon vintage.

PolyAI — UK-based voice AI vendor; $86M Series D announced December 15 2025 (snapshot) at a $750M valuation per SiliconANGLE coverage. Co-led by Georgian, Hedosophia, and Khosla Ventures, with NVentures (NVIDIA's venture arm), British Business Bank, Citi Ventures, Squarepoint Ventures, Sands Capital, Zendesk Ventures, and Point72 Ventures participating. Total funding now exceeds $200M. The press release discloses: 100+ enterprise customers, 2,000+ live deployments across 45 languages and 25+ countries; PolyAI customers achieve 391% ROI and an average $10.3M in savings; agentic AI does the work of "1,000+ full-time employees at multiple enterprises." PolyAI sits in the voice-channel-only segment alongside Replicant.

Forethought — Series D $25M strategic round in 2025; $115M total funding. Acquired by Zendesk in March 2026 — second AI-first pure-play absorbed by the help-desk incumbent (Ultimate.ai first, in early 2024).

2.4 Voice-stack vendors (the infrastructure layer the voice channel runs on)

Voice channels (covered in agentic primer §4.2) require three real-time components: streaming speech-to-text (STT), the LLM in the middle, streaming text-to-speech (TTS). Three vendors dominate at the infrastructure layer; their pricing is the floor for any voice-channel deliverable above.

Deepgram — bundled STT + LLM + TTS + orchestration as the "Voice Agent API" at $4.50/hour all-in (snapshot), with sub-200ms TTS time-to-first-byte for Aura-2; the Voice Agent launch piece describes Nova-3 STT transcripts as "incrementally assembled during the user's turn" but does not publish a discrete STT latency figure. Deepgram explicitly positions against incumbent voice stacks: "24% less than ElevenLabs Conversational AI, 75% less than OpenAI's Realtime API." The $4.50/hour rate is the cleanest infrastructure floor for what a fully AI-handled voice channel costs, with the human agent equivalent at US fully-loaded $35–$65/hour or offshore $8–$18/hour — the AI cost is 1–2 orders of magnitude below the labor cost of the channel it's replacing.

ElevenLabs — Series C $180M at $3.3B in January 2025; Series D $500M at $11B in early 2026, with $330M+ ARR at YE2025 (snapshot) — 3.3× valuation increase in 12 months, Sequoia-led with a16z and ICONIQ increasing positions. As of the ElevenLabs Agents platform launch on March 6 2026, customers had created "more than 2 million agents and 33 million conversations" (snapshot) on the platform (third-party aggregation of the ElevenLabs Agents launch). The Series D round signals that the voice-stack vendor will be a pure infrastructure layer (analogous to Stripe in payments) rather than a vertically integrated voice-agent vendor.

Cartesia — newer entrant; raised $64M Series A in March 2025 led by Kleiner Perkins per Cartesia's Series A blog; followed by a $100M round on October 29 2025 (Kleiner Perkins, Index Ventures, Lightspeed, NVIDIA) per SiliconANGLE/Threads coverage of the Sonic-3 launch. Cartesia's Sonic 3.5 documentation (snapshot) discloses streaming TTS support for "42 languages supported, including English, Hindi, Spanish, French, German, Japanese, Hebrew, and more." The technical positioning is "state-space model" architecture (a different sequence-model class than transformer-based TTS). The Series A blog discloses Sonic-2 latency of 90ms (full) and 40ms (turbo); Sonic-3 followed at the $100M raise.

AssemblyAI — STT specialist (covered in agentic primer §4.2); $115M Series C in early 2025. Less directly competitive with the bundled Voice Agent API category but a frequent component vendor inside it.

The voice-stack layer is consolidating around the bundled API (Deepgram's Voice Agent API model) rather than the unbundled-component model (separately bought STT, LLM, TTS, orchestration). The investment implication: the voice-channel vendor above (Sierra, Decagon, PolyAI, Cognigy/NICE) is the thicker margin layer; the voice infrastructure layer (ElevenLabs, Cartesia, Deepgram) is the higher-volume but commoditizing layer.


3. Tiered customer-care orchestration — the anchor question

The framing: "a small company that wants to rent the whole stack — Tier 1 fully AI, Tier 2 AI with supervisor escalation, Tier 3 a human at the company. Is that a ServiceNow service or is that somebody else?"

The answer in May 2026: no single vendor sells the whole stack as a packaged managed service, and that's the actual gap. The closest answers are uneven across the four archetypes:

The architectural absence: no vendor today sells a packaged "you give us the company, we run Tier 1 AI, Tier 2 supervised AI, and provide human Tier 3 escalation specialists on-demand, billed per-resolution" to a mid-market business. The closest in spirit are the BPO incumbents (Teleperformance, Concentrix, TaskUs, Foundever) — they have the human escalation labor, the queue management, and the technology partnerships — but they sell agent-hours wrapped in AI tooling, not a packaged per-resolution subscription. The Concentrix + Cognigy partnerships, Foundever + Cognigy, and TaskUs's growing AI Services line are the closest signals that this packaging is coming. What's missing is the per-resolution price commitment from a BPO that owns the human labor. That's a structural gap.

The most likely-to-fill-it candidates: (a) a BPO incumbent that re-prices its offering on per-resolution terms with a defined SLA on human escalation hours included — TaskUs's 60%+ AI Services growth is the closest signal; (b) Sierra or Decagon adding a human-escalation labor wrapper through a BPO partnership (Sierra's 40% Fortune-50 footprint makes this commercially viable); (c) ServiceNow extending Now Assist with a managed-service offering through Accenture / TCS / Genpact partners that bring the human Tier 3 labor against ServiceNow's workflow platform. The last is the most likely structural answer and the most consistent with ServiceNow's existing partner GTM.


4. Unit economics — what the numbers actually look like

Three layers — per-resolution price, automation/deflection rate, FTE-equivalent savings — convert into the headline economic case. The honest range is below.

Per-resolution pricing — current anchors:

Vendor List price Source
HubSpot Breeze Customer Agent $0.50/resolved conversation HubSpot company news, April 2026
Decagon List pricing not published — quoted through sales process Featurebase Decagon pricing breakdown
Intercom Fin $0.99/outcome (50/month minimum) Fin.ai pricing page
Sierra (illustrative third-party) ~$1.50/resolution at committed volume Lorikeet's Sierra pricing analysis
Zendesk AI (committed) ~$1.50/resolution + Advanced AI add-on priced per agent Hiver Zendesk pricing breakdown
Zendesk AI (usage-based) ~$2.00/resolution Hiver Zendesk pricing breakdown
Salesforce Agentforce (Flex Credits) $0.10/action (multiple actions per resolution) Salesforce May 2025 press release
Salesforce Agentforce (legacy) $2/conversation SaaStr Agentforce pricing analysis
Voice channel — Deepgram Voice Agent API $4.50/hour (all-in STT + LLM + TTS) Deepgram pricing page

The non-voice category with disclosed list pricing is converging on $0.50–$2.00 per resolved conversation; the AI-first pure-plays (Sierra, Decagon) quote through sales and do not publish list rates. The voice category prices in hours, not conversations — $4.50/hour for the infrastructure layer alone, with the vendor-managed full voice agent (Sierra, Decagon voice-channel deployments, PolyAI) priced on top in custom enterprise quotes.

Automation rate / deflection rate — what fraction of incoming volume the AI handles end-to-end:

The realistic range for a mid-market deployment is 50–70% automation rate in year one; 70–80% is achievable for the upper quartile, with documented knowledge bases, narrow policy variance, and tech-literate end users; 80–93% appears at top-decile customers and tends to come with a narrow scope (one product, one customer segment) rather than the full inbound queue. Below ~40% the unit economics start to break down — the vendor charges a platform fee and the customer is still paying for nearly the full human queue.

FTE-equivalent savings — the load-bearing economic claim, and the one that needs the most caution.

The headline anchor in the market is Klarna's February 2024 disclosure: 700 FTE-equivalent on a base of 2.3M conversations in the first month, $40M annualized profit improvement. Working backwards: 2.3M conversations/month implies ~27.6M/year; if 700 FTEs at $50K loaded cost would cover that, that's ~$35M of labor displaced — consistent with the $40M profit-improvement framing. Klarna walked the framing back fifteen months later — covered in §5.

The realistic ranges for FTE-equivalent savings, mid-market through enterprise:

The pattern across the disclosures: the deflection-rate number is reasonable to take at face value; the FTE-equivalent claim is the one to scrutinize. Resolving 65% of conversations and replacing 65% of FTEs are different statements — the second requires the customer to actually rebalance staffing against the new volume, which most enterprises do conservatively (and which Klarna did aggressively, with documented consequences). Gartner research released February 3 2026 (snapshot) predicts that by 2027, 50% of companies that attributed headcount reductions to AI will rehire staff to perform similar functions, with Gartner noting that "AI simply isn't mature enough to fully replace the expertise, empathy, and judgment that human agents provide." Forrester's Christina McAllister, in her own analysis of Klarna's reversal (snapshot), argues AI agents "will struggle with novel scenarios, nuanced exceptions, high emotionality — all areas where humans excel" and that the low-quality service experience came from what she calls an "overzealous pursuit of cost reduction" (the Silicon Republic piece frames the same dynamic as "overeager" headcount reduction in its own editorial voice). The Gartner 50% rehire prediction and the Forrester complexity argument together support a working planning assumption that realized FTE savings sit ~30–50% below the deflection-rate-times-seat-count math, with the haircut framed as author-derived from those analyst sources rather than a directly published Gartner/Forrester figure.


5. The Klarna case study — both sides

Klarna is the most-cited deployment in this category and the cleanest case study because both the bull case and the bear case are documented in primary sources fifteen months apart.

The bull case (February 27, 2024). Klarna's press release announced its OpenAI-powered AI assistant, one month into operation, had handled 2.3 million conversations — two-thirds of Klarna's customer-service chats globally. The headline claims: - "Doing the equivalent work of 700 full-time agents" - "On par with human agents in regard to customer satisfaction score" - "More accurate in errand resolution, leading to a 25% drop in repeat inquiries" - Customer resolution time "less than 2 mins compared to 11 mins previously" - Available in 23 markets, 24/7, communicating in more than 35 languages - "Estimated to drive a $40 million USD in profit improvement to Klarna in 2024"

CEO Sebastian Siemiatkowski, on the launch: "This AI breakthrough in customer interaction means superior experiences for our customers at better prices." OpenAI COO Brad Lightcap: "Klarna is at the very forefront among our partners in AI adoption and practical application." The release became the most-cited customer-care AI deployment of 2024 — including in OpenAI's own marketing materials.

The bear case (May 9, 2025). Fifteen months later, Bloomberg and Fortune reported Klarna's reversal. The disclosed framing from Siemiatkowski (multiple outlets reporting the same quotes): - "Really investing in the quality of the human support is the way of the future for us" - "From a brand perspective, a company perspective, I just think it's so critical that you are clear to your customer that there will always be a human if you want" (per Entrepreneur's reporting of the same Bloomberg interview) - Acknowledged that while AI was cheaper, it produced "lower quality" output

The operational reversal: Klarna began recruiting human customer-service employees again, targeting students, rural populations, and dedicated Klarna users for remote-only roles at pay starting at 400 Swedish krona (~$41.17) per Fortune's reporting (snapshot). Context: an AI-driven hiring freeze reduced Klarna's headcount by 22% to ~3,500 employees over twelve months, primarily through attrition. External evaluation: tech writer Gergely Orosz, in his Pragmatic Engineer post on Klarna's bot (snapshot), described the system as "a well-built" implementation but argued that "automating L1 support is not that revolutionary, and has been done previously on different systems"; he also noted that "the bot transfers anything more complex than adding basic context to a human" — consistent with the pattern of an AI agent that handles easy cases at high deflection rates and degrades on the harder ones.

What survived, what didn't.

What survived: the deflection-rate disclosure (~67% of chats handled by the AI), the resolution-time improvement (sub-2-min vs 11-min), the $40M annualized profit framing as a labor cost reduction, the 25% drop in repeat inquiries.

What didn't: the framing that AI permanently replaced 700 customer-service agents. Siemiatkowski's February 2025 statement ("I am of the opinion that AI can already do all of the jobs that we, as humans, do") proved over-strong; by May 2025 the public framing had shifted to a hybrid model where AI handles routine cases and humans absorb the complexity.

The honest read. Klarna's deployment was not a failure — the deflection rates, resolution speeds, and labor cost savings were real. The failure was treating the AI as a labor substitute rather than a labor complement. The 700-FTE-equivalent number worked for routine tickets at high volume; the customer-experience degradation showed up where the AI hit edge cases, complaint escalations, and the brand-affecting interactions where empathy matters more than speed. The Klarna lesson for any mid-market or enterprise deployment: assume realized FTE savings are 30–50% lower than the deflection rate × seat count math suggests, because the human queue still needs to staff for escalation surge, complex cases, retention saves, and brand-affecting moments.


6. Where it's headed — 24-36 months

Four trajectories are visible in current disclosures and funding patterns:

Voice + text multimodal. The boundary between the voice channel (currently $4.50/hour-priced infrastructure) and the text channel (currently $0.50–$2.00/resolution-priced workflow) is collapsing. Sierra, Decagon, and the SaaS incumbents are building toward customers who route inbound calls into the same agent that handles inbound tickets — same knowledge base, same tool-call infrastructure, same per-resolution accounting. The voice-stack vendors (Deepgram, ElevenLabs, Cartesia) are positioning as infrastructure under the workflow vendor rather than direct-to-enterprise — analogous to how Twilio sits under the workflow vendor in the pre-AI era.

Proactive support — the agent calls the customer first. Currently customer service is reactive — the customer contacts the company. The next architectural shift is the agent reaching out before the customer does: "your shipment is delayed, your card expired, your subscription renews in 7 days." Sierra and Decagon are both building this into their roadmap framings. The unit economics shift: the per-resolution price becomes the per-proactive-outcome price, and the comparable cost is the marketing / retention / churn-mitigation budget rather than the customer-service budget.

Agentic resolution across tools, not just answering. Agentic primer §4.3 covered the architectural shift: the AI agent doesn't just answer the customer, it calls the order-management tool to issue a refund, calls the billing system to switch a plan, calls the carrier API to reroute a shipment. The customer-care vendor that wires the most tool integrations beneath the agent layer captures the most resolution share. ServiceNow's Now Assist has a structural advantage here because the tool layer (CRM, ITSM, HRSM) is already inside the ServiceNow platform; for Sierra, Decagon, Intercom, the integration layer is the build-out cost.

Embedded support inside SaaS products. The customer no longer leaves the SaaS product to file a ticket — the AI agent is inside the product itself, with context on what the user is doing. HubSpot, Intercom, Zendesk, and Salesforce are all building in this direction. The SaaS-incumbent advantage at point-of-need is the structural reason their per-resolution prices ($0.50–$2.00) can defend against the AI-first pure-plays even at the latter's lower marginal cost — the seat the SaaS incumbent already owns is the integration cost the pure-play has to build.


7. Investment angle — what to own / avoid

The category sorts into four investment buckets. Names below are the cleanest exposures; this is not a recommendation list — it is a landscape map for John's continuing fan-out.

Incumbents at structural risk (CCaaS legacy). Five9 (FIVN), NICE (NICE), Genesys (private, $21B IPO pending). The threat: the AI-first vendor doesn't need the CCaaS queue, the IVR menu, the call-routing logic — when the AI resolves the call end-to-end, the CCaaS infrastructure is bypassed. The defense: each is acquiring AI capability — NICE bought Cognigy for $955M; Five9 built its own AI revenue line (+41% YoY in Q3 2025 per their Q3 2025 earnings call); Genesys announced $1.5B in investment commitments from Salesforce and ServiceNow on July 31 2025 (snapshot), with each company investing an equal amount, used to repurchase shares from existing equity holders (Hellman & Friedman and Permira remain majority owners). Genesys Cloud reached ARR near $2.1B in Q1 FY26 with year-over-year growth above 35%. The market is pricing the risk: Five9 trades at ~$21.64 (May 2026) with a 52-week range of $13.29–$30.38 and ~$1.7B market cap — recovered off the autumn 2025 lows but still well below where it traded pre-LLM-customer-care wave, even as Q3 2025 AI revenue grew 41% YoY. The market is signaling that AI commoditizes the legacy CCaaS seat faster than the AI-revenue line can replace it. The Five9 / NICE / Genesys trajectory is the cleanest "incumbent at risk" exposure for John's broader AI-infrastructure thesis.

Incumbents adapting well. ServiceNow (NOW) and Intercom (private; Atlassian comp) are the cleanest cases. ServiceNow's Now Assist passing $600M in ACV with >100% YoY growth and >130% YoY growth in $1M+ customers is the clearest disclosed case in the public market of a workflow incumbent successfully layering AI revenue on the existing workflow seat. Intercom Fin's $0.99-per-resolution becoming the industry anchor signals it has set the commercial template the SaaS incumbents are converging on. Salesforce's Agentforce is the uncertain case — 8% customer adoption on three pricing models suggests the product has not yet locked the commercial template the way ServiceNow has.

Pure-plays with momentum. Sierra (private; $15B), Decagon (private; $4.5B), Maven AGI (private; ~$250M valuation post-Series B). The structural question for each is whether the per-resolution outcome-pricing model holds margin as the SaaS incumbents converge on the same pricing — Intercom Fin at $0.99 is already structurally below where Sierra is reportedly pricing (~$1.50). Sierra's bet is enterprise-grade complexity and the brand of the Fortune 50 logo; Decagon's bet is concierge-quality positioning; Maven AGI's bet is the developer-platform integration (OpenAI as the lighthouse customer). The risk to all three: if Intercom Fin and HubSpot Breeze can deliver 65%+ deflection at $0.50–$0.99, the pure-play premium price doesn't survive procurement scrutiny outside the very-high-complexity enterprise tier.

BPO incumbents — split exposure. TaskUs (TASK) is the cleanest pure-play exposure to the AI-services-on-BPO bet — 60%+ AI Services growth from a 20% revenue-share base is the disclosed leading indicator. Concentrix (CNXC) carries Webhelp integration overhang but has the iX Hero platform working at scale; the FY26 1.5–3% revenue-growth guidance signals the AI-tooling-on-labor model is defending revenue, not yet expanding it. Teleperformance (TEP) — €100M AI commitment and the Sanas partnership are the right defense; the 0–2% like-for-like 2026 guidance disclosed in the FY25 annual results suggests the same defending-not-expanding posture. The BPO sub-sector is the AI-substitution-risk hedge inside John's broader infrastructure thesis: if the FTE-savings claims are realized, the BPO labor line is the one absorbing the loss.

Voice-stack vendors — infrastructure picks. ElevenLabs (private; $11B), Cartesia (private; $122M total funding), Deepgram (private). Same investment shape as the broader AI-infra picks-and-shovels thesis — the infrastructure layer under the customer-care vendor captures volume but commoditizes faster than the workflow layer above. ElevenLabs at $11B with $500M ARR signals a Stripe-like trajectory in the voice channel; Deepgram's 24%-and-75% pricing-below-incumbents positioning suggests the infrastructure layer is converging on price faster than the workflow layer above.

Vendor-consolidation prediction. The pattern is already visible: NICE bought Cognigy at $955M (Sep 2025), Zendesk bought Ultimate.ai (2024) and Forethought (Mar 2026), ServiceNow + Salesforce invested $1.5B in Genesys (July 2025). The likely next-12-months consolidation moves: (a) one of Five9, Genesys, or Twilio acquires a voice-channel AI pure-play (PolyAI, Replicant) for $500M–$1B; (b) a major BPO (Teleperformance or Concentrix) acquires an AI-first pure-play (likely Maven AGI, Forethought, or Hyro tier) to package the missing per-resolution layer onto the existing labor footprint; (c) Sierra or Decagon attempts a roll-up of a smaller voice-channel vendor to extend voice + text multimodal. The structural argument: the pure-play category has more competitors than the mid-market enterprise procurement complexity can absorb, and consolidation closes that gap.

Counter-thesis (the Klarna lesson, generalized; supported by Gartner Feb 3 2026 research projecting 50% of AI-driven layoff companies will rehire by 2027, and by Forrester's Christina McAllister on Klarna's "overzealous pursuit of cost reduction" — see §4 for full citations): the realistic FTE-savings range from a customer-care AI deployment is plausibly 30–50% lower than the deflection-rate-times-seat-count math implies, with the haircut framed as author-derived from the Gartner/Forrester argument rather than a directly published industry figure. If the savings are smaller than the implied math, the per-resolution price the SaaS incumbents and AI-first pure-plays charge has less defensible margin against a procurement counter — and the BPO labor line is less under threat than the consensus AI-substitution framing suggests. The bull case for the AI-first pure-plays rests on the 60%+ deflection rate translating to 60%+ labor displacement; the Klarna walkback plus the Gartner rehire prediction together say that translation has a meaningful haircut built in. If the haircut holds, TaskUs, Concentrix, and Teleperformance are less at risk than their multiples imply, and Sierra / Decagon are more at risk than their multiples imply.


8. Open questions

The questions that should drive follow-up research, not all of which the available disclosures answer:

  1. Is there a rent-the-whole-stack mid-market managed service today, and if not, is it a gap worth filling? The anchor question, sharpened. Today's answer: no. The closest candidates (TaskUs + Maven AGI / Sierra + a BPO partner / ServiceNow + Accenture managed-service) all have pieces of the answer but none has packaged it. The follow-up question: who has the right corporate shape to package it? TaskUs has the labor and the AI-services growth rate; the gap is the per-resolution pricing commitment. Sierra has the AI-first stack and the enterprise logo footprint; the gap is the human escalation labor. The most likely answer 18 months out is a TaskUs-acquires-an-AI-first-pure-play move, or a Sierra-buys-a-mid-market-BPO move — the strategic combination is more obvious than the standalone winner.

  2. What's the realized realistic FTE-savings range for a non-Klarna-scale mid-market deployment? Most public disclosures (Klarna, Intercom, Sierra) come from large enterprises with the resources to run a sophisticated implementation. The mid-market 5–25 FTE-displacement range above is interpolated from the smaller disclosed customer cohorts (HubSpot Breeze at 8,000 users; Maven AGI's 40 enterprise customers; Fin's 7,000+ Intercom customers). Surfaced through John's next set of channel calls or expert-network calls: actual realized savings from a mid-market deployment 12 months in, vs the at-implementation projections.

  3. How does the per-resolution price evolve in a market with $0.50 floor and $2.00 ceiling? The SaaS incumbents have moved toward $0.50 (HubSpot, Decagon committed); the AI-first pure-plays sit at $1.00–$1.50 (Intercom, Sierra, Zendesk). The pricing range has compressed 50% in 12 months. Does the AI-first premium ($1.00–$1.50) survive procurement comparison once the SaaS incumbents converge to $0.50, or does the pure-play category have to differentiate on quality / scope / managed service to defend the price? The Klarna lesson (realized quality matters more than headline deflection rate) is the underlying reason the price can defend — but only if the quality differentiation is real and provable in procurement.

  4. What does the Tier-2 supervised-AI layer actually look like operationally? The framing assumes a Tier-2 where AI handles the case but a supervisor approves the action — refund issuance, plan change, account credit. The current vendor disclosures don't yet describe Tier-2 supervision as a discrete capability — they describe Tier-1 deflection plus Tier-2/3 human escalation. The structural question for the next 18 months: which vendor first packages a supervised-AI tier where the AI's autonomy is bounded by a per-action approval threshold? Cresta's agent-assist trajectory plus Sierra's autonomous-action trajectory converging here is the most likely vendor pattern.

  5. The voice-channel pricing model — does it converge with the text channel? Voice prices in hours today ($4.50/hour Deepgram floor); text prices in resolutions ($0.50–$2.00). For multimodal customers (voice and text in one queue), is the right unit price-per-resolution regardless of channel, or hours-per-channel? The vendor that moves first on a converged unit (likely Sierra or Decagon, given their multi-channel positioning) sets the commercial template, the way Intercom did on the text side at $0.99.

  6. Where does Microsoft sit? Microsoft is conspicuously absent from the above named-vendor landscape but owns the underlying model layer (Azure OpenAI Service) that Ada and many others run on, and ships Copilot for Service inside Dynamics 365. The probable answer: Microsoft is sitting at the platform layer, harvesting model-tokens from every vendor above, while letting the workflow vendors compete on the customer interface. Worth tracking specifically against the broader AI-infrastructure margin thesis John already carries.


Sources

All URLs verified as accessible 2026-05-19 unless otherwise noted; key vendor pages snapshotted locally to research/2026-05-19-customer-care-aaas-primer/snapshots/.

  1. Klarna press release, Klarna AI assistant handles two-thirds of customer service chats in its first month, Feb 27 2024. https://www.klarna.com/international/press/klarna-ai-assistant-handles-two-thirds-of-customer-service-chats-in-its-first-month/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-02-27-klarna-ai-press-release.html
  2. Fortune, Klarna plans to hire humans again, as new landmark survey reveals most AI projects fail to deliver, May 9 2025. https://fortune.com/2025/05/09/klarna-ai-humans-return-on-investment/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-fortune-klarna-reversal.html
  3. Entrepreneur, Klarna Is Hiring Customer Service Agents After AI Couldn't Cut It on Calls, May 9 2025. https://www.entrepreneur.com/business-news/klarna-ceo-reverses-course-by-hiring-more-humans-not-ai/491396 — local (Wayback-rebuilt): research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-entrepreneur-klarna-reversal.html
  4. Pragmatic Engineer (Gergely Orosz), Klarna's AI chatbot: how revolutionary is it, really? https://blog.pragmaticengineer.com/klarnas-ai-chatbot/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-pragmatic-engineer-klarna.html
  5. OpenAI case study, Klarna's AI assistant does the work of 700 full-time agents. https://openai.com/index/klarna/ — local (Wayback): research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-openai-klarna-casestudy.html
  6. Fin.ai pricing page (Intercom). https://fin.ai/pricing — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-finai-pricing.html
  7. Intercom help center, Fin AI Agent automation rate. https://www.intercom.com/help/en/articles/13533623-fin-ai-agent-automation-rate
  8. Fin.ai, AI agent KPIs: enterprise performance metrics framework. https://fin.ai/learn/ai-agent-kpis-enterprise-performance-metrics-framework
  9. Hiver, Zendesk Pricing (2025): Plans & Add-Ons Explained. https://hiverhq.com/blog/zendesk-pricing
  10. TechCrunch, Zendesk acquires agentic customer service startup Forethought, Mar 11 2026. https://techcrunch.com/2026/03/11/zendesk-acquires-agentic-customer-service-startup-forethought/
  11. Salesforce, Salesforce Introduces New Flexible Agentforce Pricing, May 15 2025. https://www.salesforce.com/news/press-releases/2025/05/15/agentforce-flexible-pricing-news/
  12. SaaStr, Salesforce Now Has 3+ Pricing Models for Agentforce. https://www.saastr.com/salesforce-now-has-3-pricing-models-for-agentforce-and-maybe-right-now-thats-the-way-to-do-it/
  13. ServiceNow Q4 2025 earnings call transcript (Motley Fool), Jan 28 2026. https://www.fool.com/earnings/call-transcripts/2026/01/28/servicenow-now-q4-2025-earnings-call-transcript/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-fool-servicenow-q4-transcript.html
  14. iTnews, Orica brings GenAI to its IT service desk (Orica 18% → 94% deflection). https://www.itnews.com.au/news/orica-brings-genai-to-its-it-service-desk-612715 — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-itnews-orica-servicenow.html
  15. HubSpot, Customer Agent and Prospecting Agent: now you pay when the task is complete, April 2026. https://www.hubspot.com/company-news/hubspots-customer-agent-and-prospecting-agent-now-you-pay-when-the-task-is-complete
  16. Markets Herald, Teleperformance Overcomes AI Concerns With Strong Results and Strategic Investments. https://marketsherald.com/teleperformance-overcomes-ai-concerns-with-strong-results-and-strategic-investments/
  17. Teleperformance press release, TP partners with Sanas to accelerate AI. https://www.tp.com/en-us/insights-list/press-releases/teleperformance-forms-strategic-partnership-with-real-time-speech-understanding-provider-sanas-as-part-its-growth-strategy-to-accelerate-ai-development-and-reinvent-customer-experience/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-teleperformance-sanas.html
  18. Concentrix Q4 FY25 results press release (FY25 full-year revenue and FY26 guidance; SEC 8-K URL not located at time of authoring — corroborated via Investing.com Q4 slides commentary cited below).
  19. Concentrix Q4 2025 slides commentary. https://www.investing.com/news/company-news/concentrix-q4-2025-slides-modest-growth-continues-ai-strategy-takes-center-stage-93CH-4445107
  20. Concentrix press release, New iX Hero feature transforms how customer-facing teams learn to deliver industry-leading experiences. https://www.concentrix.com/about/news/new-ix-hero-feature-transforms-customer-facing-teams-learn-deliver-industry-leading-experiences/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-concentrix-ix-hero.html
  21. TaskUs 8-K, Q3 2025 earnings release, SEC. https://www.sec.gov/Archives/edgar/data/0001829864/000182986425000129/earningsreleaseex991q32025.htm — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-taskus-q3-2025-8k.html
  22. Foundever press release, Foundever and Cognigy Strengthen Strategic Partnership. https://foundever.com/news/foundever-and-cognigy-strengthen-strategic-partnership-to-deliver-scalable-agentic-ai-solutions-across-the-globe/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-foundever-cognigy.html
  23. NICE press release, NiCE to Acquire Cognigy, Jul 28 2025. https://www.nice.com/press-releases/nice-to-acquire-cognigy-advancing-the-leading-cx-ai-platform-to-accelerate-ai-first-customer-experience
  24. TechCrunch, Sierra raises $950M as the race to own enterprise AI gets serious, May 4 2026. https://techcrunch.com/2026/05/04/sierra-raises-950m-as-the-race-to-own-enterprise-ai-gets-serious/
  25. TechCrunch, Bret Taylor's Sierra raises $350M at a $10B valuation, Sep 4 2025. https://techcrunch.com/2025/09/04/bret-taylors-sierra-raises-350m-at-a-10b-valuation/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-techcrunch-sierra-350m.html
  26. Sierra blog, Outcome-based pricing for AI Agents. https://sierra.ai/blog/outcome-based-pricing-for-ai-agents
  27. Lorikeet analysis, Sierra AI Pricing in 2026: What They Charge and 4 Cheaper Alternatives. https://www.lorikeetcx.ai/articles/sierra-ai-pricing-alternatives
  28. Decagon announcement, Decagon raises Series C at $1.5B valuation, June 2025. https://decagon.ai/resources/series-c-announcement
  29. MobiHealthNews, Decagon raises $250M for AI agents, triples valuation to $4.5B. https://www.mobihealthnews.com/news/decagon-raises-250m-ai-agents-triples-valuation-45b — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-mobihealth-decagon-250m.html
  30. Featurebase, Decagon Pricing Explained (2026). https://www.featurebase.app/blog/decagon-pricing — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-featurebase-decagon-pricing.html
  31. Maven AGI press release, Maven AGI Raises $50M, Jun 18 2025. https://www.prnewswire.com/news-releases/maven-agi-raises-50m-to-meet-surging-demand-for-enterprise-grade-ai-302484913.html — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-prnewswire-maven-50m.html
  32. OpenAI case study, MavenAGI launches automated customer support agents powered by OpenAI. https://openai.com/index/mavenagi/ — local (Wayback): research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-openai-mavenagi-casestudy.html
  33. Cresta press release, Cresta Closes $125M Series D. https://www.prnewswire.com/news-releases/cresta-closes-125m-series-d-to-accelerate-adoption-of-human-centric-ai-in-the-contact-center-302309858.html — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-prnewswire-cresta-125m.html
  34. PolyAI press release, PolyAI raises $86M, Dec 2025. https://www.prnewswire.com/news-releases/polyai-raises-86m-to-transform-how-enterprises-talk-to-their-customers-302641889.html — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-prnewswire-polyai-86m.html
  35. SiliconANGLE, Call center chatbot startup PolyAI raises $86M at a $750M valuation, Dec 15 2025. https://siliconangle.com/2025/12/15/call-center-chatbot-startup-polyai-raises-86m-750m-valuation/
  36. Ada blog, We've joined the unicorn club: USD130M raised in Series C round at a USD1.2B valuation. https://www.ada.cx/blog/we-ve-joined-the-unicorn-club-usd130m-raised-in-series-c-round-at-a-usd1-2b-valuation/ — local (Wayback): research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-ada-series-c-blog.html
  37. TechCrunch, Call center automation software vendor Replicant raises $78M, Apr 26 2022. https://techcrunch.com/2022/04/26/call-center-automation-software-vendor-replicant-raises-78m/ — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-techcrunch-replicant-78m.html
  38. Fierce Healthcare, Hyro raises $45M growth round to expand AI agents. https://www.fiercehealthcare.com/ai-and-machine-learning/hyro-raises-45m-growth-round-expand-ai-agents
  39. BusinessWire, Forethought Pioneers First Multi-Agent, Omnichannel AI for Customer Experience — Raises Strategic Round. https://www.businesswire.com/news/home/20250513210776/en/Forethought-Pioneers-First-Multi-Agent-Omnichannel-AI-for-Customer-Experience-Raises-Strategic-Round-to-Scale-Breakthrough-Innovation
  40. Deepgram pricing page. https://deepgram.com/pricing — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-deepgram-pricing.html
  41. Deepgram blog, Inside Deepgram's Voice Agent API: Real-Time STT, TTS, and Orchestration in One API. https://deepgram.com/learn/voice-agent-api-generally-available
  42. TechCrunch, ElevenLabs confirms $180M in Series C funding at a $3.3B valuation, Jan 30 2025. https://techcrunch.com/2025/01/30/elevenlabs-raises-180-million-in-series-c-funding-at-3-3-billion-valuation/
  43. ElevenLabs blog, Series D announcement (Sequoia-led $500M @ $11B, $330M+ ARR). https://elevenlabs.io/blog/series-d — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-elevenlabs-series-d-blog.html. (CNBC's Feb 4 2026 article on the same round at https://www.cnbc.com/2026/02/04/nvidia-backed-ai-startup-elevenlabs-11-billion-valuation.html is paywall/Wayback-only and was not used as the primary cite.)
  44. Cartesia blog, Series A and the future of voice AI (March 2025 Series A; not the October 2025 $100M round). https://cartesia.ai/blog/series-a
  45. Threads post by The Rundown AI summarizing Cartesia $100M Oct 2025 round + Sonic-3 launch. https://www.threads.com/@therundownai/post/DQcwXPPiE0w/cartesia-the-company-behind-the-sonic-ai-voice-model-has-raised-million-from
  46. Five9 Q3 2025 earnings call transcript. https://www.investing.com/news/transcripts/earnings-call-transcript-five9-q3-2025-beats-eps-stock-dips-93CH-4340688
  47. Genesys press release, Genesys Announces $1.5 Billion Investment by Salesforce and ServiceNow, Jul 31 2025. https://www.genesys.com/company/newsroom/announcements/genesys-announces-1-5-billion-investment-by-salesforce-and-servicenow — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-genesys-1-5b-investment.html
  48. CMS Wire, Genesys IPO: A Litmus Test for AI's Future in CX. https://www.cmswire.com/contact-center/genesys-ipo-a-litmus-test-for-ais-future-in-cx/
  49. Metaintro, Half of AI-Driven Layoffs Will Reverse by 2027, Gartner says (citing Gartner research released Feb 3 2026). https://www.metaintro.com/blog/ai-job-cuts-reverse-2027 — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-metaintro-gartner-ai-rehire.html
  50. Silicon Republic, Did Klarna's AI reliance affect customer service quality? (Forrester analyst Christina McAllister on Klarna). https://www.siliconrepublic.com/machines/klarna-ai-hiring-humans-forrester — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-siliconrepublic-klarna-forrester.html
  51. Cartesia documentation, Sonic 3.5 / latest TTS model (42 languages disclosed). https://docs.cartesia.ai/build-with-cartesia/tts-models/latest — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-cartesia-docs-sonic.html
  52. AIVidPipeline, ElevenLabs Agents Guide 2026 — third-party aggregation citing the ElevenLabs Agents launch (March 6 2026) for the 2M+ agents / 33M+ conversations figures. https://aividpipeline.com/blog/elevenlabs-agents-guide-2026 — local: research/2026-05-19-customer-care-aaas-primer/snapshots/2026-05-19-elevenlabs-agents-guide.html
  53. Teleperformance, 2025 Annual Results press release, Feb 26 2026 (FY25 €10.2B revenue, 14.6% recurring EBITA margin, 0–2% like-for-like 2026 guidance). https://www.tp.com/media/pxmfy4dy/tp-press-release-2025-annual-results.pdf

Sister deliverables: - docs/briefs/2026-05-15-agentic-ai-primer.md §4.2 (voice customer service) and §4.3 (non-voice / tickets) — flow mechanics, latency budgets, RAG over knowledge base - docs/briefs/2026-05-15-token-primer.md — token-pricing mechanics that underlie the per-resolution unit economics