How Much to Charge for Video Captioning
Whether you're a video editor adding captions as a service, a dedicated captioning specialist, or a freelancer trying to figure out how to price a new client's request — this guide gives you the frameworks, the real numbers, and the mistakes to avoid.
Quick Reference — Captioning Rate Ranges
AI-Assisted
$1–$3/min
You generate with a tool, review, correct, deliver
Manual / Expert
$2–$6/min
Transcribed or captioned from scratch, no AI
Compliance-Grade
$5–$15+/min
Broadcast, legal, certified accuracy requirements
These are market reference ranges, not guaranteed rates. Actual pricing depends on your market, client relationship, turnaround speed, and audio complexity.
Per-Minute vs Per-Hour: Which Model to Use
The captioning industry uses per-minute-of-video as its standard pricing unit — not per-hour of your time. Understanding why matters for how you quote.
Per-minute of video (recommended)
- Client knows the video length — easy to quote before seeing the file
- Industry-standard — clients shopping around will recognize it
- Predictable for both parties — no billing surprises
- Rewards efficiency — faster workflow, same revenue
Per-hour of your time (limited use cases)
- Client can't estimate cost without an estimate from you first
- Penalizes your efficiency gains from tools
- Creates friction on invoice approval
- Can work for highly complex editing jobs billed alongside captioning
The per-minute model also scales correctly with what captioning actually involves. Captioning a dense 10-minute interview with fast-talking speakers takes more work than captioning a 10-minute explainer video with a single clear speaker at a moderate pace — but the client-facing metric (10 minutes of video) is the same. Use modifiers for complexity rather than switching to hourly billing.
What Actually Affects Your Rate
The per-minute rate is your base. Adjust it based on factors that change the actual work involved:
Audio Quality
Clean studio audio from a single speaker with a professional microphone: AI-assisted captioning will be 90–95%+ accurate. Your review time is minimal — a fast pass for proper nouns and the occasional mishear.
Noisy room audio, phone-recorded audio, or heavily accented speech: AI accuracy drops, and your correction time increases significantly. A 10-minute video with poor audio quality can easily require 30–45 minutes of review and correction — the same as a 30-minute video with clean audio.
Practical approach: Always ask for a 1–2 minute sample before quoting. A quick AI test run reveals audio quality before you commit to a rate. If the audio is difficult, quote a complexity surcharge of 25–50%.
Number of Speakers
Single speaker: straightforward. AI caption blocks rarely need speaker labels, and timing corrections are usually minimal.
Two speakers (interview format): moderate additional work. You may need to add speaker labels at transitions, and cross-talk moments require manual timing adjustment.
Three or more speakers (panel discussions, roundtables): substantially more work. Speaker identification across multiple simultaneous voices is a significant review burden. Rate accordingly — or quote separately from your standard captioning rate.
Turnaround Time
Standard turnaround (2–5 business days): your standard rate applies.
Rush turnaround (same day or next day): add a 25–50% rush surcharge. This is standard across professional services — rush work disrupts your workflow and often requires priority over other clients.
Don't absorb rush requests at standard rates. If a client needs captions for a 60-minute conference recording by tomorrow morning, that is not a standard job. Charge for it accordingly or decline if the timeline is unrealistic.
Delivery Format Requirements
Basic SRT delivery: standard rate. SRT is universally supported and requires no additional formatting work beyond standard captioning.
Multiple format delivery (SRT + burned-in video + VTT): add a format conversion fee. Each additional deliverable is additional work.
Broadcast-grade formats (MCC, TTML for Netflix/streaming compliance): higher rate. These formats have strict specifications and require verification against compliance standards. If you're not familiar with broadcast caption specifications, don't offer this service without proper training — errors in broadcast caption delivery can result in chargebacks and damaged client relationships.
Volume and Ongoing Work
One-off projects: your standard rate, no discount. The overhead of client onboarding, quoting, and delivery communication is the same regardless of project size.
Retainer or recurring volume clients: 10–20% discount is reasonable in exchange for predictable, recurring work. A client who sends you 8 hours of podcast content per month with no selling overhead on your end is more valuable than a client who sends 8 hours once. Build retainer pricing into your offering once you have a track record with the client.
If You're a Video Editor Adding Captions as a Service
Most video editors who add captioning to their services make two mistakes: they underprice it, and they hide it.
Mistake 1: Bundling captions into your editing rate
"I just include captions — it's part of my editing package" seems like a differentiator. It is actually an invisibility problem. When captions are bundled, the client has no idea what they're paying for them. They don't value them. And when they ask for their next project without captions (because they're posting a clip to a platform where they plan to burn in their own), you can't price the non-captioned version differently.
Line-item captioning. Every time. Even if the total rate is the same as your old bundled rate. It trains clients to see captions as a service — which makes them willing to pay for it when they eventually realize they want to add it to a project where they hadn't originally requested it.
Mistake 2: Pricing captions at the cost of your tool
Captioning is a professional skill — even with AI assistance. You are responsible for accuracy. You review. You correct. You deliver a professional product. The fact that your tool (like CaptionX) is free or low-cost does not determine what the service is worth to the client.
Price what the client is getting, not what your tool costs. A 20-minute corporate training video captioned accurately and delivered as SRT + burned-in MP4 is worth $40–$80 in most markets. Charge that, regardless of whether your tool cost you $0 or $4 to generate the captions.
What AI-assisted captioning actually looks like in practice
With a tool like CaptionX generating captions inside Premiere Pro, a typical caption workflow for a 10-minute video looks like:
- Generate captions: ~2 minutes (tool processing time)
- Review and correct: 5–15 minutes depending on audio quality
- Export SRT and burned-in version: ~3 minutes
- Deliver with notes: ~5 minutes
Total: 15–25 minutes of your time for a 10-minute video. At $2/min, that's $20 for 15–25 minutes of work — effective rate of $48–$80/hour. At $3/min it's $30, or $72–$120/hour. This is why captioning as a service is worth offering.
Sample Rate Card for Captioning Services
Adjust these numbers for your market, client type, and experience level. These are a starting framework, not a final answer.
| Service | Rate | Notes |
|---|---|---|
| Standard captioning (AI-assisted, clean audio) | $1.50–$2.50/min | Single speaker, studio-quality audio, SRT delivery |
| Standard captioning (AI-assisted, mixed audio) | $2–$3.50/min | Remote guests, some background noise, moderate correction needed |
| Captioning + burned-in video delivery | +$0.50–$1/min | Add to base rate for burned-in (open caption) export |
| Multi-speaker / panel discussion | +$0.50–$1/min | Add to base rate for 3+ speakers or heavy cross-talk |
| Rush delivery (same or next day) | +25–50% surcharge | Applied to total base rate |
| Multilingual captions (translated) | $3–$6/min | Includes translation — rate varies by language pair |
| Compliance-grade broadcast captioning | $6–$15+/min | Requires broadcast caption training and format certification |
| Minimum charge (any job) | $35–$50 minimum | Regardless of video length |
| Retainer discount (recurring monthly volume) | 10–20% off base | For clients with 60+ min/month of regular work |
What Clients Actually Expect
Pricing is half the equation. The other half is knowing what clients expect so you don't deliver something that creates a revision cycle.
98%+ accuracy is the professional standard
Clients don't count errors, but they notice them when they review. A caption track with multiple errors per minute looks unprofessional — even if the total error rate is below 5%. Review your work before delivery. At a minimum, scan the first two minutes, the last two minutes, and a few minutes from the middle.
Timing matters as much as text
Captions that appear early or late feel wrong even when the words are correct. The text should appear at the exact moment the word is spoken. If captions run noticeably behind the audio, the viewer is reading the previous sentence while the speaker has moved on — this is more disruptive than a single misspelled word.
Line length affects readability
Caption blocks with 3+ lines are hard to read, especially on mobile. Aim for a maximum of 2 lines per block, and keep each line under 42 characters. Premiere Pro's caption tools let you set these as defaults — configure them once and they apply to every job.
Clients don't always know what format they need
A client who says 'I need captions for my YouTube video' may not know whether they want a closed caption SRT file or a burned-in video file. Ask two questions: what platform is this going to, and do you want to be able to toggle captions on/off? That tells you what to deliver.
Under-promise on accuracy, over-deliver
Tell clients upfront that AI-assisted captions are reviewed and corrected for accuracy but may not achieve 100% accuracy on technical terminology or proper nouns — and that you welcome any corrections they find. This sets expectations and protects you from revision requests that amount to clients line-editing every word.
How CaptionX Affects Your Captioning Economics
If you're offering captioning as a service and billing at $1.50–$3 per minute of video, your tool cost is a direct line item against your margin. Here's how CaptionX affects that:
Per-minute tool services
Services that charge per minute of video (like Rev's AI tier) add a cost-of-goods to every caption job. On a 60-minute podcast episode billed at $2/min ($120 revenue), a $0.50/min tool cost is $30 — 25% of your revenue going to tooling.
Scale that across multiple clients and the tool cost becomes a meaningful expense that compresses your margin on every job.
CaptionX free tier
CaptionX gives you free captions every month inside Premiere Pro — with no per-minute charge. For a freelancer doing regular captioning work, this means your tool cost for AI caption generation is effectively $0 on jobs within the free tier.
On a $120 podcast episode job: $0 in tool cost. On a $60 client video job: $0 in tool cost. Every dollar of revenue is yours to keep.
Caption Inside Premiere Pro
Keep More of What You Charge — $0 Tool Cost on Free Tier
CaptionX generates captions inside Premiere Pro — no browser upload, no per-minute fee, no monthly quota that your client work will exhaust. Free captions every month, starting now.
Get CaptionX FreeFree captions every month. No trial. No credit card. No catch.
Common Questions
I'm a video editor — should I offer captioning?
Yes, if you're working in Premiere Pro and your clients deliver to YouTube, social media, or corporate platforms. Captioning is a natural adjacent service — the video is already in your timeline. Adding a captioning service line lets you capture revenue that clients would otherwise spend with a separate captioning service, and it differentiates your offering against editors who don't offer captions.
How do I tell a client why captioning costs extra?
You don't need to justify it — it's a service with a price, like color grading or audio mixing. If a client asks, the honest answer is: accurate captions require review and correction time beyond just generating a transcript. AI tools speed up the process but don't eliminate the need for a professional pass. You're billing for that professional review.
Should I charge less because I use AI?
No. You're billing for the deliverable (accurate, timed captions in the client's required format), not for how long it took you to produce it. A surgeon who completes an operation in 30 minutes doesn't charge less than one who takes 90 minutes — skill and tooling are what enable efficiency. Use AI to become more profitable, not to race to the bottom.
What's a good starting rate if I'm new to captioning services?
Start at $1.50/min with a $35 minimum charge. This is at the lower end of market rates — appropriate for building a client base and a portfolio. Once you have 5–10 completed jobs and client references, move to $2/min. Rates above $2.50/min require demonstrated expertise or a specialization (e.g., medical content, legal depositions, broadcast compliance).
Do I need to offer human captioning as well as AI-assisted?
Only if you're targeting clients with high accuracy requirements (legal, broadcast, compliance-grade). For the majority of web video, corporate content, and social media captioning, reviewed AI-assisted captions meet the client's needs. If a client specifically requests 99%+ accuracy with certification, that's a different service tier with different pricing and workflow.