Chatbot containment rate
Containment rate (also called chatbot containment rate) is the percentage of customer interactions that enter an automated channel — a chatbot, virtual agent, or IVR system — and are fully resolved within that channel without being escalated to a human agent. A session is “contained” when the customer’s issue is addressed, the customer indicates satisfaction, or the session ends without a transfer request. Containment rate is the primary efficiency metric for any self-service or AI automation investment, measuring the channel’s ability to independently resolve the contacts that enter it.
The formula is: Containment Rate = (Contained Sessions ÷ Total Sessions Entering the Channel) × 100. If 1,000 customers start a chat with an AI agent and 730 complete their interaction without requesting a human, the containment rate is 73%. Industry benchmarks for AI-powered chatbots in customer support: best-in-class deployments achieve 70–80% containment; average deployments reach 40–55%; rule-based bots without AI fall below 35%.
How containment rate is measured
Measuring containment rate accurately requires defining what “contained” means, and the definition matters significantly for the resulting number. A conservative, outcome-based definition counts a session as contained only if the customer explicitly confirmed resolution, or if no human contact was initiated within a defined window (typically 24 hours) after the automated session ended. A looser definition counts any session that ended without an escalation button being pressed — this overstates true containment because it includes frustrated abandonments.
Best-practice measurement combines both signals: an automated session that ends without escalation is provisionally counted as contained, but sessions followed by a new contact on the same issue within 24 hours are reclassified as unresolved. This “24-hour re-contact” adjustment is the most important correction for accurate containment measurement and can lower reported containment rates by 5–15 percentage points relative to naive measurement. Teams that skip this adjustment tend to over-report performance and under-invest in quality improvement.
Why containment rate matters
- Cost savings: Every percentage point of containment rate on a channel handling 50,000 monthly contacts represents 500 contacts per month shifted from human agents (cost: $8–15 each) to automation (cost: $0.10–1.00). At scale, even a 5-point improvement in containment rate generates six-figure annual savings.
- Agent capacity: Higher containment means human agents handle a smaller, more complex slice of volume — the interactions that genuinely require judgment, empathy, or policy authority. This improves agent experience and allows headcount to remain stable as contact volume grows.
- Speed: Contained sessions typically resolve 5–10x faster than human-handled equivalents. Customers asking “What is my order status?” get an answer in under 10 seconds through AI; the same query in a human queue averages 4–8 minutes including wait time.
Containment rate vs. deflection rate
Containment rate and deflection rate are closely related but measure different things. Containment rate is a channel-level metric: it measures performance within a specific automated channel, starting from when a customer enters that channel. Deflection rate is a portfolio-level metric: it measures the fraction of all support contacts (across all channels) that never reached a human agent. A high deflection rate can be achieved through high containment in the AI channel, through effective self-service (help center articles, FAQ pages) that prevents contacts altogether, or through IVR containment in voice channels.
A company with a 70% containment rate in its chatbot and a 40% rate of customers who never reach any support channel at all might report an overall deflection rate of 82% — because 40% of contacts were deflected before reaching the chatbot, and of the 60% that did reach it, 70% were contained. The two metrics should always be reported together for a complete picture of automation performance.
Containment rate in AI customer support
For AI customer support teams, containment rate is the north-star metric for chatbot and virtual agent performance. It should be tracked at the intent level, not just in aggregate: a chatbot that achieves 80% overall containment but only 30% containment on refund requests is failing on a high-value, high-stakes intent that can erode customer trust rapidly. Intent-level containment analysis is where AI support teams should focus their optimization efforts.
The levers for improving containment rate include: expanding the intent taxonomy to cover more contact reasons with automated workflows; improving intent detection accuracy so the right workflow fires for each message; enriching the knowledge base with content that covers long-tail queries; and enabling transactional actions (order cancellation, address update, refund initiation) that allow the AI to fully resolve issues rather than just provide information. Each of these levers typically yields 3–8 percentage points of containment improvement when applied systematically. Containment rate also connects to escalation rate inversely: every percentage point improvement in containment corresponds to an equivalent drop in escalations.

