How to Add Subtitles to a Video for Free (and Get an Editable .SRT)
By Mario · Founder of PixPipe
How to Add Subtitles to a Video for Free (and Get an Editable .SRT)
Most people watch video with the sound off. On social feeds, the majority of views happen muted — on a commute, in a meeting, next to a sleeping baby. If your video relies on its audio to make sense, you are losing a large share of your audience before they ever hear a word.
Captions fix that. They also improve watch time, make your content accessible, and give the platform's algorithm text to understand what your video is about. The catch has always been the work: typing captions by hand is tedious, and most "free" auto-caption tools either watermark the result, cap you at a few minutes, or hand you a locked file you can't edit.
This guide shows how to generate an accurate, fully editable subtitle file — a .srt — from any video for free, then use it anywhere.
SRT vs. VTT vs. burned-in captions
Before generating anything, it helps to know what you're making. There are three common ways captions reach a viewer:
.srt(SubRip): the universal subtitle file. It's plain text with timestamps and works almost everywhere — YouTube, CapCut, Premiere, DaVinci Resolve, VLC. If you only make one file, make this one..vtt(WebVTT): the web-native version, used for HTML5<video>and some platforms. Nearly identical content, slightly different format.- Burned-in (hardcoded) captions: text rendered permanently onto the video pixels. Great for TikTok/Reels where you want styled, always-on captions — but you can't turn them off or fix a typo later.
The smart workflow is to generate an editable .srt first, correct it, and then decide whether to upload it as a separate track (YouTube, web) or burn it in (short-form). The editable file is the source of truth.
How an SRT file actually works
Open an .srt in any text editor and it's refreshingly simple:
1
00:00:00,000 --> 00:00:03,200
Most people watch video with the sound off.
2
00:00:03,200 --> 00:00:06,800
Captions fix that — and improve watch time.
Each block is a number, a start-and-end timestamp, and the line of text. Because it's just text, you can open it, fix a misspelled name, merge two short lines, or adjust timing by hand. That editability is exactly what locked, watermarked tools take away from you.
Creating subtitles from a video for free
The modern way to do this without uploading your footage is a browser-based transcriber. Using PixPipe's Video to Text:
- Drop your video in. The audio is extracted and the speech is transcribed by an on-device AI model (OpenAI's Whisper) — your file is never uploaded to a server.
- Get the transcript with timestamps. The tool segments the speech into timed lines automatically.
- Download the
.srt(or.vtt). You also get a plain.txtif you just want the words. - Review and edit. Open the
.srt, fix any proper nouns, and tidy line breaks.
There's no signup, no minute limit, and no watermark on the output.
Cleaning up subtitles so they actually read well
Auto-generated timing is a starting point, not the finish line. A few quick edits make captions feel professional:
- Keep lines short. Aim for roughly 32–42 characters per line and no more than two lines on screen. Long captions get cut off and are hard to read.
- Break at natural pauses. Split lines where a speaker would breathe, not mid-phrase.
- Fix the names first. Brand names, people, and jargon are where every speech model errs. These are also the words viewers most notice getting wrong.
- Don't caption filler. You can trim "um," "uh," and false starts for readability — subtitles are allowed to be a cleaner version of the audio.
Putting your captions to work
Once you have a polished .srt:
- YouTube: In Studio, open the video → Subtitles → upload the
.srt. YouTube indexes the text, which can help discovery. - CapCut / Premiere / DaVinci: Import the
.srtas a caption track. You can then restyle, reposition, or burn it in for short-form. - Web video: Reference the
.vttin a<track>element on your HTML5 player. - Repurposing: The transcript behind your captions is also the raw material for show notes, a blog post, or social copy — the words are already written.
Why do this locally instead of with an online service
Two reasons. First, privacy: client work, course content, and unreleased footage shouldn't be uploaded to a third party just to get captions. A browser tool keeps the file on your machine. Second, cost and friction: there's no account to make, no minutes to ration, and no watermark to pay to remove. For the everyday job of captioning your own videos, that's the whole game.
FAQ
How do I get an editable subtitle file, not just burned-in captions?
Use a tool that exports .srt or .vtt. Those are plain-text, timestamped files you can open and edit in any text editor — unlike captions rendered onto the video pixels.
Is it really free with no watermark?
Yes — browser-based transcribers like PixPipe run on your own device, so there's no server cost, no account, and no watermark on the file.
Will my video be uploaded to make the subtitles?
No. The audio is processed locally in your browser; only a one-time AI model download happens, not an upload of your video.
Can I make subtitles for a non-English video?
Yes. Whisper supports 90+ languages and can auto-detect the spoken one — and some tools can translate the speech into English subtitles as well.
What's the difference between .srt and .vtt?
Both are timestamped subtitle files. .srt is the most widely supported (YouTube, editors, players); .vtt is the web standard for HTML5 video. When in doubt, use .srt.
