Use case · How-to

How to pull short clips
from a long recording.

One recorded hour can feed a week of short-form output, if you know how to find the clips. Most don't, and they end up pulling "interesting moments" that don't work as standalone content, because a clip has different requirements than a long-form moment.

A clip is not a highlight.

The most common mistake in clipping is pulling "the moment where I said the smart thing" and expecting it to work as a clip. It usually doesn't, because a 30–60 second clip on TikTok needs to work without the surrounding context that made the long-form moment land. A clip needs:

01

A self-contained setup

The first 3–5 seconds tell the viewer what's about to happen.

02

A payoff that lands in the clip itself

Not a reference to something earlier in the podcast.

03

A reason to finish watching

A specific number, a revealed answer, a closing claim, a punchline.

The two-minute moment in the podcast where you and your guest laughed isn't a clip. The 30-second story with a setup, turn, and punchline is.

Where to find the clips.

Long recordings have predictable clip shapes. One long recording usually yields 3–8 viable clips by these rules. Fewer if the conversation was meandering, more if it was debate-style.

The disagreement
A moment where you and a guest (or your own earlier position) disagree strongly. Conflict holds attention.
The specific number
Any moment quoting a concrete stat, dollar amount, or timeline. Specificity holds attention.
The counterintuitive claim
"Most people think X, but actually Y" is the shape of a lot of high-performing short-form. If your long-form has one of these, it's a clip.
The story
Any tight 60–90 second anecdote with a beginning, middle, end. Stories are the most naturally clip-shaped format.
The hot take
A moment where the speaker makes a strong claim they're willing to defend.

Length and format.

Recommendations across the major short-form platforms cluster in a tight range:

Platform Length Aspect Captions
TikTok · Instagram Reels 15–60s 9:16 Required
YouTube Shorts Up to 60s 9:16 Required
LinkedIn Up to 90s 1:1 or 9:16 Required
X (Twitter) Varies 1:1 outperforms 9:16 Required

Captions are non-negotiable across all of them. Most short-form is watched muted.

Cutting the clip.

Once you've identified a clip, the cut matters:

  • Start inside the action.

    Don't start with "so let me tell you about the time when..." Start with the first concrete image or claim. Cut the ramp-up.

  • End on the punchline.

    Don't trail into "anyway, that's what I was going to say." Cut the wind-down.

A clip that's 90 seconds of content plus 15 seconds of setup and 10 seconds of wind-down is really a 90-second clip. Publish it as one.

How to do it in Sapari.

Today the workflow is manual range selection on the full timeline. Automatic long-to-short extraction (where the AI surfaces clip candidates and scores them) is on the roadmap.

01

Edit the full recording

Silence, false starts, captions, and audio cleanup all run in one pass.

02

Identify clip ranges

Use the criteria above. The transcript view helps you find phrases by keyword.

03

Mark range markers on the timeline

One marker per clip you intend to publish.

04

Export each range as a separate 9:16 render

Captions auto-resize for the new aspect. 72px short-line is the smart default for vertical.

For creators whose only use case is clip extraction (no long-form publishing), dedicated clip-extraction tools are purpose-built for that one job and faster at it. For creators who publish both long-form and clips from the same source, Sapari handles both from the same edit.

Common questions.

How many clips should I expect from one long recording? +

Varies with content density. Interview-style conversation with two engaged speakers usually yields more than monologue vlogs. It's easy to overestimate how many clips an episode will produce. Aim for quality over quantity.

Should I caption clips differently than the long-form? +

Yes. Clip captions need to be the 9:16 defaults (larger, sized for mobile). Long-form captions are smaller and positioned for desktop. Tools that handle both aspect ratios from the same edit do this automatically at export. Sapari's smart defaults set 72px for vertical, 40px for horizontal.

Do I need to re-record intros for clips? +

No. The hook is inside the first 3–5 seconds of the clip if you picked it right. Recording a separate intro usually makes the clip feel more produced and less native.

Can I use the same clip on multiple platforms? +

Yes. Export 9:16 for TikTok, Reels, and Shorts. For LinkedIn and X, consider exporting 1:1. It displays better on desktop feeds.

One recording.
A week of clips.

7 days. 30 AI minutes. No credit card.

Start free trial