---
title: "What AI Customer Support Actually Costs in 2026"
slug: "what-ai-customer-support-actually-costs-2026"
author: "ibl.ai Engineering"
date: "2026-05-30 21:00:00"
category: "Premium"
topics: "AI customer support cost, Intercom Fin pricing, Ada AI cost, Drift AI, Ultimate.ai cost, AI agent per conversation, customer service AI per ticket, self-hosted customer support AI, small business AI"
summary: "Per-ticket token math across the latest models, monthly bills at small / mid-market / enterprise scale, and why the per-conversation customer-support AI vendors (Intercom Fin at $0.99/conversation) are the wrong shape — especially at scale."
banner: ""
thumbnail: ""
---

## Customer Support Is Where the Per-Conversation Pricing Trap Really Bites

Of all the AI use cases this site covers, customer support is the most visibly competitive. Intercom Fin charges $0.99 per AI-resolved conversation. Ada, Drift, Ultimate.ai, Forethought, Cresta — every legacy customer-service vendor has a per-conversation or per-resolution AI add-on, with prices in the same neighborhood.

That neighborhood is roughly 50–200× what the underlying token cost is. The vendor's pitch is alignment to value — you only pay when the AI resolves a ticket. The reality is that the price doesn't drop with volume, doesn't drop as the cheap models improve, and includes a healthy margin on top of the actual compute.

The actual compute is fractions of a cent. The math is the post.

## What a Customer-Support Conversation Actually Costs Per Token

A typical customer-support exchange — customer asks a question, agent responds with order/account context, customer follows up — is about **600 input tokens** (ticket + order context + knowledge-base hit) and **400 output tokens** (drafted response). Cost per ticket on the major models:

<table style="width:100%; border-collapse:collapse; margin:1.5rem 0; font-size:0.95rem;">
  <thead>
    <tr style="background:#f5f5f0; border-bottom:2px solid #2175C5;">
      <th style="text-align:left; padding:0.75rem; color:#5f6368;">Model</th>
      <th style="text-align:right; padding:0.75rem; color:#5f6368;">Input ($/MTok)</th>
      <th style="text-align:right; padding:0.75rem; color:#5f6368;">Output ($/MTok)</th>
      <th style="text-align:right; padding:0.75rem; color:#5f6368;">$ per ticket</th>
      <th style="text-align:left; padding:0.75rem; color:#5f6368;">When to use it</th>
    </tr>
  </thead>
  <tbody>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Claude Sonnet 4.6</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$3</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$15</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$0.008</td>
      <td style="padding:0.75rem;">Standard support workhorse</td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>GPT-5 mini</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$1.50</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$6</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$0.003</td>
      <td style="padding:0.75rem;">General-purpose mid-volume</td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Claude Haiku 4.5</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$1</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$5</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$0.003</td>
      <td style="padding:0.75rem;">High-volume FAQ / order status</td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Gemini 3 Flash</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$0.35</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$1.05</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">$0.0006</td>
      <td style="padding:0.75rem;">Cheapest hosted (sub-cent)</td>
    </tr>
    <tr style="background:#f0f9ff; border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Llama 4 (self-hosted on small VPS)</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#15803d;">~$0</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#15803d;">~$0</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#15803d;"><strong>~$0</strong></td>
      <td style="padding:0.75rem;">Flat-rate; whole-company workloads</td>
    </tr>
  </tbody>
</table>

Standard workhorse: **less than a penny per ticket.** Cheapest hosted: **six hundredths of a cent.** Self-hosted on a small VPS: **flat-rate, marginal cost rounds to zero.**

For comparison, Intercom Fin charges $0.99 per AI-resolved conversation. That's **120–1,500× the underlying token cost** depending on which model the vendor is actually running.

## Monthly Bills at Three Scale Tiers

- **Small business** (20 employees, e-commerce): ~5,000 tickets/month
- **Mid-market SaaS** (200 employees): ~50,000 tickets/month
- **Enterprise / consumer brand**: ~500,000 tickets/month

Monthly cost using **Claude Haiku 4.5** (the standard cheap-and-fast model for high-volume customer support) vs the per-conversation alternatives:

<table style="width:100%; border-collapse:collapse; margin:1.5rem 0; font-size:0.95rem;">
  <thead>
    <tr style="background:#f5f5f0; border-bottom:2px solid #2175C5;">
      <th style="text-align:left; padding:0.75rem; color:#5f6368;">Approach</th>
      <th style="text-align:left; padding:0.75rem; color:#5f6368;">Pricing shape</th>
      <th style="text-align:right; padding:0.75rem; color:#5f6368;">SMB (5K/mo)</th>
      <th style="text-align:right; padding:0.75rem; color:#5f6368;">Mid-market (50K/mo)</th>
      <th style="text-align:right; padding:0.75rem; color:#5f6368;">Enterprise (500K/mo)</th>
    </tr>
  </thead>
  <tbody>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Intercom Fin</strong></td>
      <td style="padding:0.75rem;">$0.99 per AI-resolved conversation</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;">$4,950</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;">$49,500</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;"><strong>$495,000</strong></td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Ada / Ultimate.ai / Forethought</strong></td>
      <td style="padding:0.75rem;">~$0.30–0.80 per resolution</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;">~$1,500–4,000</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;">~$15K–40K</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;"><strong>~$150K–400K</strong></td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>Per-agent seat (legacy support SaaS)</strong></td>
      <td style="padding:0.75rem;">~$50/agent/mo × team</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;">~$500</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;">~$5,000</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#b91c1c;"><strong>~$30,000</strong></td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;">Direct API — Claude Haiku 4.5</td>
      <td style="padding:0.75rem;">Token-based</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$15</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$150</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$1,500</td>
    </tr>
    <tr style="border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;">Direct API — Gemini 3 Flash</td>
      <td style="padding:0.75rem;">Token-based (cheapest)</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$3</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$30</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums;">~$300</td>
    </tr>
    <tr style="background:#f0f9ff; border-bottom:1px solid #e5e7eb;">
      <td style="padding:0.75rem;"><strong>ibl.ai self-hosted (Llama 4)</strong></td>
      <td style="padding:0.75rem;">Flat license + VPS/GPU</td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#15803d;"><strong>~$100–250</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#15803d;"><strong>~$1,000–2,500</strong></td>
      <td style="text-align:right; padding:0.75rem; font-variant-numeric:tabular-nums; color:#15803d;"><strong>~$5,000–10,000</strong></td>
    </tr>
  </tbody>
</table>

At enterprise scale, Intercom Fin is **~50× more expensive** than self-hosted on ibl.ai, and **~330× more expensive** than running the cheapest hosted model directly — for the same conversations resolved.

## Why the Per-Conversation Pricing Trap Is Different Here

Customer-support AI vendors have a specific argument for per-conversation pricing: "we only charge when we deliver value." It sounds aligned. It's not. Three reasons:

**1. The marginal cost is fractions of a cent.** A vendor charging $0.99 per resolution and running Haiku 4.5 underneath has a 99%+ margin on the actual model spend. The $0.99 is value capture, not cost recovery.

**2. The unit price doesn't drop with scale.** A 500K-ticket/month enterprise pays the same $0.99/ticket as a 5K-ticket SMB. With self-hosted infrastructure, the per-ticket cost drops to near-zero as volume grows.

**3. "Resolution" is the vendor's definition.** What counts as an AI-resolved ticket (vs handed off to a human, vs escalated, vs re-opened) is the vendor's call — and changes the bill. Token-based or self-hosted has no such ambiguity.

## What Stays the Same, What Changes

Self-hosting customer-support AI doesn't mean rebuilding the support stack. The customer-facing chat widget, the agent dashboards, the ticket routing, the integrations with the existing CRM (Salesforce, HubSpot, Zendesk, Front), the order system (Shopify, WooCommerce, Stripe), the audit logs — all stays managed by ibl.ai. The compute, the model, and the customer conversation data move inside the company's infrastructure (which for SMB is often a $20–50/month VPS).

What disappears: the per-conversation bill that scales linearly with ticket volume — and that's a $500K/month line item at enterprise scale.

What appears: a self-hosted customer-support AI with a model-routing recipe the support team designed:

- **Sonnet** for complex multi-turn cases requiring reasoning across several systems
- **Haiku** for standard support (the bulk)
- **Gemini Flash** for high-volume order-status / FAQ / routing
- **Llama 4 self-hosted** for the bulk of routine queries on flat-rate infrastructure

## Run the Numbers for Your Business

For SMB-specific cost modeling, two interactive tools:

- **[AI Cost Calculator — Small Business](/solutions/small-business/ai-cost-calculator)** — compare flat-rate vs per-seat / per-conversation subscriptions
- **[AI Readiness Assessment — Small Business](/solutions/small-business/ai-readiness-assessment)** — 5-question quiz to score deployment readiness

For the segment-wide cost-math context, see **[AI Cost Math for Small Business: Per-Seat vs Usage-Based in 2026](/blog/ai-cost-math-for-small-business-per-seat-vs-usage)**.

For higher-volume / mid-market / enterprise contexts, the **[AI Help Desk Cost Savings Calculator](/resources/calculators/ai-help-desk-savings-calculator)** generalizes to customer-support workloads.

For the broader pricing landscape across every model and per-seat vendor, the hub: **[What Does AI Actually Cost in 2026?](/blog/what-does-ai-actually-cost-in-2026)**.

## Why Family-Owned and New York Matters Here

For a business of any size, the customer-support AI vendor relationship is a multi-year commitment that affects every conversation customers have with the brand. ibl.ai is family-owned and operated from New York, NY — a long-term partner with a perpetual platform license and no investor exit pressure. The runtime is open source. The customer conversation data stays inside the business's infrastructure. The math works at a 5-person startup or a 50,000-employee multi-brand enterprise.
