A clip is not a highlight.
The most common mistake in clipping is pulling "the moment where I said the smart thing" and expecting it to work as a clip. It usually doesn't, because a 30–60 second clip on TikTok needs to work without the surrounding context that made the long-form moment land. A clip needs:
A self-contained setup
The first 3–5 seconds tell the viewer what's about to happen.
A payoff that lands in the clip itself
Not a reference to something earlier in the podcast.
A reason to finish watching
A specific number, a revealed answer, a closing claim, a punchline.
The two-minute moment in the podcast where you and your guest laughed isn't a clip. The 30-second story with a setup, turn, and punchline is.
Where to find the clips.
Long recordings have predictable clip shapes. One long recording usually yields 3–8 viable clips by these rules. Fewer if the conversation was meandering, more if it was debate-style.
Length and format.
Recommendations across the major short-form platforms cluster in a tight range:
Captions are non-negotiable across all of them. Most short-form is watched muted.
Cutting the clip.
Once you've identified a clip, the cut matters:
- Start inside the action.
Don't start with "so let me tell you about the time when..." Start with the first concrete image or claim. Cut the ramp-up.
- End on the punchline.
Don't trail into "anyway, that's what I was going to say." Cut the wind-down.
A clip that's 90 seconds of content plus 15 seconds of setup and 10 seconds of wind-down is really a 90-second clip. Publish it as one.
How to do it in Sapari.
Today the workflow is manual range selection on the full timeline. Automatic long-to-short extraction (where the AI surfaces clip candidates and scores them) is on the roadmap.
Edit the full recording
Silence, false starts, captions, and audio cleanup all run in one pass.
Identify clip ranges
Use the criteria above. The transcript view helps you find phrases by keyword.
Mark range markers on the timeline
One marker per clip you intend to publish.
Export each range as a separate 9:16 render
Captions auto-resize for the new aspect. 72px short-line is the smart default for vertical.
For creators whose only use case is clip extraction (no long-form publishing), dedicated clip-extraction tools are purpose-built for that one job and faster at it. For creators who publish both long-form and clips from the same source, Sapari handles both from the same edit.
Common questions.
How many clips should I expect from one long recording?
Varies with content density. Interview-style conversation with two engaged speakers usually yields more than monologue vlogs. It's easy to overestimate how many clips an episode will produce. Aim for quality over quantity.
Should I caption clips differently than the long-form?
Yes. Clip captions need to be the 9:16 defaults (larger, sized for mobile). Long-form captions are smaller and positioned for desktop. Tools that handle both aspect ratios from the same edit do this automatically at export. Sapari's smart defaults set 72px for vertical, 40px for horizontal.
Do I need to re-record intros for clips?
No. The hook is inside the first 3–5 seconds of the clip if you picked it right. Recording a separate intro usually makes the clip feel more produced and less native.
Can I use the same clip on multiple platforms?
Yes. Export 9:16 for TikTok, Reels, and Shorts. For LinkedIn and X, consider exporting 1:1. It displays better on desktop feeds.