Limited Time
50% off all plans
01d06h53m00s
CaptionX logo
BusinessApril 22, 2026 · 9 min read

How Much to Charge for Video Captioning

Whether you're a video editor adding captions as a service, a dedicated captioning specialist, or a freelancer trying to figure out how to price a new client's request — this guide gives you the frameworks, the real numbers, and the mistakes to avoid.

Quick Reference — Captioning Rate Ranges

AI-Assisted

$1–$3/min

You generate with a tool, review, correct, deliver

Manual / Expert

$2–$6/min

Transcribed or captioned from scratch, no AI

Compliance-Grade

$5–$15+/min

Broadcast, legal, certified accuracy requirements

These are market reference ranges, not guaranteed rates. Actual pricing depends on your market, client relationship, turnaround speed, and audio complexity.

Per-Minute vs Per-Hour: Which Model to Use

The captioning industry uses per-minute-of-video as its standard pricing unit — not per-hour of your time. Understanding why matters for how you quote.

Per-minute of video (recommended)

  • Client knows the video length — easy to quote before seeing the file
  • Industry-standard — clients shopping around will recognize it
  • Predictable for both parties — no billing surprises
  • Rewards efficiency — faster workflow, same revenue

Per-hour of your time (limited use cases)

  • Client can't estimate cost without an estimate from you first
  • Penalizes your efficiency gains from tools
  • Creates friction on invoice approval
  • Can work for highly complex editing jobs billed alongside captioning

The per-minute model also scales correctly with what captioning actually involves. Captioning a dense 10-minute interview with fast-talking speakers takes more work than captioning a 10-minute explainer video with a single clear speaker at a moderate pace — but the client-facing metric (10 minutes of video) is the same. Use modifiers for complexity rather than switching to hourly billing.

What Actually Affects Your Rate

The per-minute rate is your base. Adjust it based on factors that change the actual work involved:

Audio Quality

Clean studio audio from a single speaker with a professional microphone: AI-assisted captioning will be 90–95%+ accurate. Your review time is minimal — a fast pass for proper nouns and the occasional mishear.

Noisy room audio, phone-recorded audio, or heavily accented speech: AI accuracy drops, and your correction time increases significantly. A 10-minute video with poor audio quality can easily require 30–45 minutes of review and correction — the same as a 30-minute video with clean audio.

Practical approach: Always ask for a 1–2 minute sample before quoting. A quick AI test run reveals audio quality before you commit to a rate. If the audio is difficult, quote a complexity surcharge of 25–50%.

Number of Speakers

Single speaker: straightforward. AI caption blocks rarely need speaker labels, and timing corrections are usually minimal.

Two speakers (interview format): moderate additional work. You may need to add speaker labels at transitions, and cross-talk moments require manual timing adjustment.

Three or more speakers (panel discussions, roundtables): substantially more work. Speaker identification across multiple simultaneous voices is a significant review burden. Rate accordingly — or quote separately from your standard captioning rate.

Turnaround Time

Standard turnaround (2–5 business days): your standard rate applies.

Rush turnaround (same day or next day): add a 25–50% rush surcharge. This is standard across professional services — rush work disrupts your workflow and often requires priority over other clients.

Don't absorb rush requests at standard rates. If a client needs captions for a 60-minute conference recording by tomorrow morning, that is not a standard job. Charge for it accordingly or decline if the timeline is unrealistic.

Delivery Format Requirements

Basic SRT delivery: standard rate. SRT is universally supported and requires no additional formatting work beyond standard captioning.

Multiple format delivery (SRT + burned-in video + VTT): add a format conversion fee. Each additional deliverable is additional work.

Broadcast-grade formats (MCC, TTML for Netflix/streaming compliance): higher rate. These formats have strict specifications and require verification against compliance standards. If you're not familiar with broadcast caption specifications, don't offer this service without proper training — errors in broadcast caption delivery can result in chargebacks and damaged client relationships.

Volume and Ongoing Work

One-off projects: your standard rate, no discount. The overhead of client onboarding, quoting, and delivery communication is the same regardless of project size.

Retainer or recurring volume clients: 10–20% discount is reasonable in exchange for predictable, recurring work. A client who sends you 8 hours of podcast content per month with no selling overhead on your end is more valuable than a client who sends 8 hours once. Build retainer pricing into your offering once you have a track record with the client.

If You're a Video Editor Adding Captions as a Service

Most video editors who add captioning to their services make two mistakes: they underprice it, and they hide it.

Mistake 1: Bundling captions into your editing rate

"I just include captions — it's part of my editing package" seems like a differentiator. It is actually an invisibility problem. When captions are bundled, the client has no idea what they're paying for them. They don't value them. And when they ask for their next project without captions (because they're posting a clip to a platform where they plan to burn in their own), you can't price the non-captioned version differently.

Line-item captioning. Every time. Even if the total rate is the same as your old bundled rate. It trains clients to see captions as a service — which makes them willing to pay for it when they eventually realize they want to add it to a project where they hadn't originally requested it.

Mistake 2: Pricing captions at the cost of your tool

Captioning is a professional skill — even with AI assistance. You are responsible for accuracy. You review. You correct. You deliver a professional product. The fact that your tool (like CaptionX) is free or low-cost does not determine what the service is worth to the client.

Price what the client is getting, not what your tool costs. A 20-minute corporate training video captioned accurately and delivered as SRT + burned-in MP4 is worth $40–$80 in most markets. Charge that, regardless of whether your tool cost you $0 or $4 to generate the captions.

What AI-assisted captioning actually looks like in practice

With a tool like CaptionX generating captions inside Premiere Pro, a typical caption workflow for a 10-minute video looks like:

  1. Generate captions: ~2 minutes (tool processing time)
  2. Review and correct: 5–15 minutes depending on audio quality
  3. Export SRT and burned-in version: ~3 minutes
  4. Deliver with notes: ~5 minutes

Total: 15–25 minutes of your time for a 10-minute video. At $2/min, that's $20 for 15–25 minutes of work — effective rate of $48–$80/hour. At $3/min it's $30, or $72–$120/hour. This is why captioning as a service is worth offering.

Sample Rate Card for Captioning Services

Adjust these numbers for your market, client type, and experience level. These are a starting framework, not a final answer.

ServiceRateNotes
Standard captioning (AI-assisted, clean audio)$1.50–$2.50/minSingle speaker, studio-quality audio, SRT delivery
Standard captioning (AI-assisted, mixed audio)$2–$3.50/minRemote guests, some background noise, moderate correction needed
Captioning + burned-in video delivery+$0.50–$1/minAdd to base rate for burned-in (open caption) export
Multi-speaker / panel discussion+$0.50–$1/minAdd to base rate for 3+ speakers or heavy cross-talk
Rush delivery (same or next day)+25–50% surchargeApplied to total base rate
Multilingual captions (translated)$3–$6/minIncludes translation — rate varies by language pair
Compliance-grade broadcast captioning$6–$15+/minRequires broadcast caption training and format certification
Minimum charge (any job)$35–$50 minimumRegardless of video length
Retainer discount (recurring monthly volume)10–20% off baseFor clients with 60+ min/month of regular work

What Clients Actually Expect

Pricing is half the equation. The other half is knowing what clients expect so you don't deliver something that creates a revision cycle.

98%+ accuracy is the professional standard

Clients don't count errors, but they notice them when they review. A caption track with multiple errors per minute looks unprofessional — even if the total error rate is below 5%. Review your work before delivery. At a minimum, scan the first two minutes, the last two minutes, and a few minutes from the middle.

Timing matters as much as text

Captions that appear early or late feel wrong even when the words are correct. The text should appear at the exact moment the word is spoken. If captions run noticeably behind the audio, the viewer is reading the previous sentence while the speaker has moved on — this is more disruptive than a single misspelled word.

Line length affects readability

Caption blocks with 3+ lines are hard to read, especially on mobile. Aim for a maximum of 2 lines per block, and keep each line under 42 characters. Premiere Pro's caption tools let you set these as defaults — configure them once and they apply to every job.

Clients don't always know what format they need

A client who says 'I need captions for my YouTube video' may not know whether they want a closed caption SRT file or a burned-in video file. Ask two questions: what platform is this going to, and do you want to be able to toggle captions on/off? That tells you what to deliver.

Under-promise on accuracy, over-deliver

Tell clients upfront that AI-assisted captions are reviewed and corrected for accuracy but may not achieve 100% accuracy on technical terminology or proper nouns — and that you welcome any corrections they find. This sets expectations and protects you from revision requests that amount to clients line-editing every word.

How CaptionX Affects Your Captioning Economics

If you're offering captioning as a service and billing at $1.50–$3 per minute of video, your tool cost is a direct line item against your margin. Here's how CaptionX affects that:

Per-minute tool services

Services that charge per minute of video (like Rev's AI tier) add a cost-of-goods to every caption job. On a 60-minute podcast episode billed at $2/min ($120 revenue), a $0.50/min tool cost is $30 — 25% of your revenue going to tooling.

Scale that across multiple clients and the tool cost becomes a meaningful expense that compresses your margin on every job.

CaptionX free tier

CaptionX gives you free captions every month inside Premiere Pro — with no per-minute charge. For a freelancer doing regular captioning work, this means your tool cost for AI caption generation is effectively $0 on jobs within the free tier.

On a $120 podcast episode job: $0 in tool cost. On a $60 client video job: $0 in tool cost. Every dollar of revenue is yours to keep.

Caption Inside Premiere Pro

Keep More of What You Charge — $0 Tool Cost on Free Tier

CaptionX generates captions inside Premiere Pro — no browser upload, no per-minute fee, no monthly quota that your client work will exhaust. Free captions every month, starting now.

Get CaptionX Free

Free captions every month. No trial. No credit card. No catch.

Common Questions

I'm a video editor — should I offer captioning?

Yes, if you're working in Premiere Pro and your clients deliver to YouTube, social media, or corporate platforms. Captioning is a natural adjacent service — the video is already in your timeline. Adding a captioning service line lets you capture revenue that clients would otherwise spend with a separate captioning service, and it differentiates your offering against editors who don't offer captions.

How do I tell a client why captioning costs extra?

You don't need to justify it — it's a service with a price, like color grading or audio mixing. If a client asks, the honest answer is: accurate captions require review and correction time beyond just generating a transcript. AI tools speed up the process but don't eliminate the need for a professional pass. You're billing for that professional review.

Should I charge less because I use AI?

No. You're billing for the deliverable (accurate, timed captions in the client's required format), not for how long it took you to produce it. A surgeon who completes an operation in 30 minutes doesn't charge less than one who takes 90 minutes — skill and tooling are what enable efficiency. Use AI to become more profitable, not to race to the bottom.

What's a good starting rate if I'm new to captioning services?

Start at $1.50/min with a $35 minimum charge. This is at the lower end of market rates — appropriate for building a client base and a portfolio. Once you have 5–10 completed jobs and client references, move to $2/min. Rates above $2.50/min require demonstrated expertise or a specialization (e.g., medical content, legal depositions, broadcast compliance).

Do I need to offer human captioning as well as AI-assisted?

Only if you're targeting clients with high accuracy requirements (legal, broadcast, compliance-grade). For the majority of web video, corporate content, and social media captioning, reviewed AI-assisted captions meet the client's needs. If a client specifically requests 99%+ accuracy with certification, that's a different service tier with different pricing and workflow.