What's a good first-reply time for an SME?

For email and ticketing, two hours during business hours and overnight under 14 hours places you at median. Under one hour business-hours and under four overnight places you in the top quartile. For chat the bar is much higher: under two minutes business-hours, under fifteen overnight or with auto-acknowledgement.

Should I report median or 90th-percentile first-reply time?

Both. Median tells you the typical customer experience. 90th percentile tells you the worst-case experience for one in ten customers. The median can be excellent while the 90th percentile is awful, which is the situation that produces angry social-media posts.

How much does AI auto-resolution affect first-reply time?

For tickets that auto-resolution handles, first-reply time becomes seconds rather than minutes or hours. For tickets it does not handle, first-reply time on the human queue improves because the queue is shorter. The compound effect on the headline median is typically a 40 to 60 percent improvement.

First-Reply Time Benchmarks for SME Support Teams

Why FRT is the metric to focus on

Of all the operational metrics a support team can track (resolution time, CSAT, contact rate, FCR, NPS, queue depth), first-reply time is the one most directly tied to customer perception of the experience. A customer can wait three days for a complete resolution if the first reply lands in 90 minutes acknowledging the issue. The opposite (a fast resolution preceded by a 30-hour silence) feels like a worse experience even though the resolution time was shorter.

The other reason to focus on FRT is that it is the most actionable metric. The levers that move it (pre-drafted replies, routing, auto-acknowledgement, AI assistance) are all configurable, with effects that show up in days rather than quarters. Resolution time and CSAT are downstream of dozens of factors; FRT is one or two changes away from a measurable improvement.

This post walks through the actual benchmarks I have seen across SME teams in 2026, where to position your own team, and the five operational levers that move the dial fastest.

Source data

The numbers below are aggregated from three sources:

KimonDesk customer dashboards (anonymised, n=87 organisations across 4 to 30 agents, Q1 to Q2 2026).
A 2025 Forrester report on customer service software for mid-market teams, where FRT figures were segmented by team size.
A 2025 Klaus benchmark report covering 200+ support teams in the SME and mid-market segments.

I have triangulated the three to produce the percentile bands below. Where they disagreed, I used the Forrester numbers because they segmented most cleanly by team size.

The 2026 benchmarks

For SME teams of 3 to 25 agents on email and ticket-based channels, the percentile bands look like this.

Median first-reply time (business hours):

Top quartile: under 30 minutes
Median: 1 hour 50 minutes
Bottom quartile: over 4 hours 30 minutes
Bottom decile: over 12 hours

Median first-reply time (out of business hours):

Top quartile: under 4 hours (auto-acknowledgement plus a real human reply on the next shift)
Median: 13 hours 40 minutes
Bottom quartile: over 22 hours
Bottom decile: over 36 hours

For chat as a separate channel:

Top quartile: under 90 seconds (live response)
Median: 4 to 8 minutes during business hours
Bottom quartile: over 30 minutes (effectively asynchronous)

The chat numbers are tighter because chat customers expect synchronous interaction. A 30-minute wait on chat is functionally a different experience from a 30-minute wait on email; the customer has typically given up and switched channels by minute 10.

Where most SME teams actually sit

Aggregated across the 87 KimonDesk customers, the distribution looks like this:

22 percent are in the top quartile on business-hours FRT.
41 percent are around the median.
27 percent are in the bottom quartile.
10 percent are in the bottom decile.

The teams in the bottom decile typically share three characteristics: no automated routing, no pre-drafted templates, and at least one shift gap (i.e. tickets that arrive at 6pm sit untouched until 9am). Fix any one of those three and the team typically moves out of the bottom decile within a quarter.

The five levers that move FRT

In rough order of impact for an SME team starting from zero:

Lever 1: pre-drafted reply templates (macros)

Effect on business-hours median: typically -30 to -45 percent.

Most inbound questions are not unique. Of a typical SME inbound, 60 to 80 percent fall into 10 to 20 recurring patterns: refund queries, password resets, billing questions, shipping queries, common technical issues. A library of 20 well-written macros, with personalisation tokens for the customer's name and order, lets an agent send a high-quality reply in 30 seconds instead of 4 minutes.

Macros are the cheapest, fastest improvement. The setup cost is one focused day per support lead. The ongoing maintenance is adding two or three new macros a month as new patterns emerge.

Lever 2: automated routing by intent

Effect on business-hours median: typically -15 to -25 percent.

Without routing, every agent triages every ticket. With routing, billing questions go to the agent who is fastest at billing replies, technical questions to the technical agent, escalations to the team-lead. The savings come from removing the triage step and from each agent operating in their highest-throughput category.

Routing rules can be simple keyword-based ("if the email contains 'refund' or 'invoice', route to billing queue") or model-based (intent classifier reads the message and chooses the queue). Either works. The simple version is sufficient for most teams up to 15 agents.

Lever 3: shift coverage gaps

Effect on out-of-hours median: typically -50 to -70 percent on the affected shifts.

Most SMEs run business-hours support only. The 14-hour overnight gap drives the headline median FRT up sharply. Two interventions help.

First, auto-acknowledgement. Within seconds of a ticket arriving overnight, the customer gets an automatic reply ("we got your message, ticket #1234, our team will reply at 09:00 GMT"). This does not reduce the time-to-first-human-reply, but it changes the customer experience materially. Most measurement frameworks treat the auto-ack as a separate metric, but customers do not.

Second, lightweight overnight cover. A part-time agent handling an hour at 11pm and an hour at 6am can clear the easy queue and triage the harder cases for the morning shift. The cost is small for the FRT gain.

Lever 4: AI assistance for drafting

Effect on business-hours median: typically -20 to -30 percent on top of macros.

AI draft replies (covered in What AI-Native Helpdesk Actually Means) reduce the time an agent spends composing replies for non-templated tickets. The gain is smaller than the macro gain because templated tickets were already fast; the AI gain accrues to the longer-tail tickets that did not fit a macro.

The AI draft also eliminates the "I have no idea where to start" delay on unusual tickets. The agent reads the draft, decides whether to send as-is, edit, or rewrite, all of which is faster than starting from a blank reply box.

Lever 5: AI auto-resolution for the easy categories

Effect on headline median: typically -40 to -60 percent.

This is the largest single lever once it is well-configured (see How AI Auto-Resolution Works). Tickets that auto-resolve have an FRT measured in seconds. Tickets that do not auto-resolve have a faster FRT than before because the human queue is shorter.

The order matters: do macros first, routing second, shift coverage third, AI drafts fourth, auto-resolution fifth. Each lever depends on the previous ones being in place to deliver its full effect.

Measuring without a helpdesk

If you are still on shared Gmail (covered in Why Your Support Shouldn't Run on Gmail), you can approximate FRT with a Google Sheet sampling exercise. Once a week, pull 20 random threads from the past seven days. For each, calculate the time between the customer's first message and the team's first reply. Take the median.

It is not a real benchmark and the sample size is too small to track week-on-week trends, but it gives you a directional number. Once you cross median FRT of 4 hours business-hours, the right move is to switch to a helpdesk that measures it automatically.

Reporting cadence

For an SME support team, the right reporting cadence is:

Real-time queue depth and breaching-tickets, visible to all agents and the team-lead.
Daily FRT median and 90th percentile, segmented by channel and queue.
Weekly FRT trend, segmented by agent and queue.
Monthly distribution analysis, looking at where in the percentile distribution the worst tickets sit.

The 90th percentile is the underrated metric. A team can have a median of 45 minutes and a 90th percentile of 18 hours. The 90th percentile is what produces complaints. Track it, and investigate the slowest tickets each week to find the root cause (no-one assigned, language barrier, complex issue that needed a specialist).

Setting your own targets

A reasonable target ladder for an SME team starting from no measurement:

Quarter 1: Get to median, business-hours under 2 hours, overnight under 14 hours. Achievable with macros, routing and basic auto-acknowledgement.
Quarter 2: Top quartile, business-hours under 30 minutes, overnight under 4 hours. Requires AI drafting plus lightweight overnight cover.
Quarter 3: Top decile, business-hours under 15 minutes, overnight under 2 hours. Requires AI auto-resolution well-configured.

Past the top decile the gains are diminishing and the cost of further improvement rises sharply. Most SME teams find a steady state in the top quartile is the right place to settle: significantly better than the median, well-defended in any customer conversation about response times, without the overhead of a 24-hour rotation.

Where KimonDesk fits

The analytics features page covers how KimonDesk measures FRT, the dashboards, and the segmentation options. Macros, routing, auto-acknowledgement and AI drafts are all included at every paid tier; auto-resolution is included at every tier including Free.

For pricing on the tier that fits your team size, the pricing page is the source of truth.

References

Forrester, "The Total Economic Impact of Customer Service Software for Mid-Market Teams," 2025 edition.
Klaus, "2025 Customer Service Quality Benchmark Report," public release.
KimonDesk anonymised customer dashboard data, Q1 to Q2 2026 (n=87 organisations).
Intercom 2025 Conversational Support Funnel report, response-time benchmarks.

First-Reply Time Benchmarks for SME Support Teams

Why FRT is the metric to focus on

Source data

The 2026 benchmarks

Where most SME teams actually sit

The five levers that move FRT

Lever 1: pre-drafted reply templates (macros)

Lever 2: automated routing by intent

Lever 3: shift coverage gaps

Lever 4: AI assistance for drafting

Lever 5: AI auto-resolution for the easy categories

Measuring without a helpdesk

Reporting cadence

Setting your own targets

Where KimonDesk fits

References

The True Cost of Zendesk in 2026: What an 8-Agent Team Actually Pays

What 'AI-Native Helpdesk' Actually Means in 2026

Why Your Support Shouldn't Run on Gmail (And What to Switch To)

Try every feature for 14 days. No card.

First-Reply Time Benchmarks for SME Support Teams

Why FRT is the metric to focus on

Source data

The 2026 benchmarks

Where most SME teams actually sit

The five levers that move FRT

Lever 1: pre-drafted reply templates (macros)

Lever 2: automated routing by intent

Lever 3: shift coverage gaps

Lever 4: AI assistance for drafting

Lever 5: AI auto-resolution for the easy categories

Measuring without a helpdesk

Reporting cadence

Setting your own targets

Where KimonDesk fits

References

Read more

The True Cost of Zendesk in 2026: What an 8-Agent Team Actually Pays

What 'AI-Native Helpdesk' Actually Means in 2026

Why Your Support Shouldn't Run on Gmail (And What to Switch To)

Try every feature for 14 days. No card.