6 minutes of dead air.
Gone in 30 seconds.
Sapari transcribes your video, finds every pause longer than your threshold, and cuts them. You watch the timeline, not a stopwatch.
7 days · 30 AI minutes · No credit card
The problem
The worst kind
of editing labor.
Manual silence cutting is scrub, find a pause, cut, cross-fade, repeat — for hours. A 30-minute recording with natural pauses is 90 minutes of work before you've started on captions or audio.
Most of the "edit four hours per hour of footage" trap, right there in one feature.
How it works
Two signals.
Better than either alone.
Word-level audio
Sapari runs the audio through speech-to-text with word-level timing.
Word gap + acoustic
Combines gaps between spoken words and acoustic silence. Catches paused speech and buried filler.
Reviewable cards
Every silence becomes a card on the timeline. Keep it, dismiss it, or drag the boundary.
Controls
One slider.
Off to aggressive.
You also get edge padding — how much silence to leave around each speech segment so cuts don't sound choppy. Higher pacing reduces padding; lower pacing keeps it.
The numbers
Fast, long-form,
frame-tight.
to analyze.
to analyze.
Podcasters run 2-hour episodes through it.
Cuts land between words, never mid-syllable.
In the pipeline
One step of one analysis.
Silence removal runs alongside false start detection, caption generation, audio cleanup, and B-roll placement — all from the same pass.
See the full pipeline →Before you ask
Common questions.
What if I want natural pauses? +
Set the slider to Natural/Podcast or turn silence removal off entirely. You keep full control.
Will it cut mid-sentence? +
No. Cuts land in gaps between words, not during them. If the speaker paused inside a sentence, Sapari detects the gap but knows the sentence isn't over — you can dismiss that specific cut.
What about breath sounds? +
Breath is usually below the word-gap threshold at most pacing settings. At Hyper, it gets cut — which most short-form creators want.
Can I remove silence from a video that's already edited? +
Yes. Upload the edited version as a new project and Sapari treats it like any recording.
Does it work on non-English audio? +
Yes. Transcription supports English, Spanish, Portuguese, and French. Silence detection is language-independent.
What if I want natural pauses?
Set the slider to Natural/Podcast or turn silence removal off entirely. You keep full control.
Will it cut mid-sentence?
No. Cuts land in gaps between words, not during them. If the speaker paused inside a sentence, Sapari detects the gap but knows the sentence isn't over — you can dismiss that specific cut.
What about breath sounds?
Breath is usually below the word-gap threshold at most pacing settings. At Hyper, it gets cut — which most short-form creators want.
Can I remove silence from a video that's already edited?
Yes. Upload the edited version as a new project and Sapari treats it like any recording.
Does it work on non-English audio?
Yes. Transcription supports English, Spanish, Portuguese, and French. Silence detection is language-independent.