Skip to main content
AHO

How to Script a 60-Second Listing Video (For Reels, TikTok, and Shorts)

By Michał Babula · ~8 min read · 2026-06-26

Real estate agent filming a vertical listing video on a smartphone inside a bright modern apartment

Why 60 Seconds Is the Right Constraint

Sixty seconds feels short until you try to fill it badly. Most listing videos I see from agents run two to four minutes, and they spend the first thirty seconds on a slow pan of the front door while elevator music plays. Nobody is watching that. Not on TikTok, not on Reels, not anywhere.

The 60-second cap forces a discipline that actually improves the video. You cannot afford a slow intro. You cannot afford to show every room. You have to decide, before you press record, what the three best things about this property are — and that decision-making process is also good listing strategy.

From agents I've spoken to who started cutting their videos to under 90 seconds, the consistent feedback is that inquiry rates went up, not down. Less is more is a cliché but it holds here. Buyers who want more will click through to the full listing. That's the whole point.

The 3-Act Structure That Actually Works

Every 60-second listing video should follow the same three-beat rhythm. It doesn't matter if you're selling a studio in Warsaw on Otodom or a condo in Miami that'll end up on Zillow — the structure is the same because human attention works the same.

Act 1: The Hook (0–2 Seconds)

Two seconds. That's your entire window before someone scrolls. The hook is not a title card. It is not your logo. It is not "Hi, I'm Sarah from XYZ Realty." It is a single visual or a single line of text that creates an immediate emotional response or question in the viewer's mind.

Good hooks tend to fall into three categories:

  • The price shock — a price that's lower than the viewer expects for what they're seeing. "Under €280k in the city centre" over a shot of a clean, modern kitchen.
  • The lifestyle image — a shot so visually specific it makes someone think "that's my life." Morning light through floor-to-ceiling windows. A coffee cup on a balcony with a skyline behind it.
  • The bold claim — "The best garden in the street" or "Every room faces south." Something concrete, not fluffy.

What you are trying to do in two seconds is stop the scroll. Everything else is secondary to that.

Act 2: Three Selling Features (3–47 Seconds)

You have roughly 45 seconds for three features. That's about 15 seconds each, which is one short clip plus one or two lines of caption text. Do not try to do four or five features. You'll rush all of them and land none.

How to pick the three: think about the buyer profile for this specific property. A 2-bedroom near a school? Lead with the garden, the second bedroom's size, and the proximity to the school. A city-centre studio? Lead with the view, the storage (buyers always worry about storage in studios), and the commute time to a major employer or transport hub.

Each feature should follow the same mini-structure: show it visually, name it in the caption, add one specific detail. "South-facing garden — 40 ft, fully enclosed." Not "lovely outdoor space." Specific beats vague every time.

Act 3: Address + CTA (48–60 Seconds)

The last 12 seconds are functional. You need: the address (or at minimum the street name and neighbourhood), the asking price if you haven't shown it yet, and one clear call to action. One. Not "like, follow, share, comment, and click the link in bio." Pick one. In my experience, "link in bio for full listing" outperforms everything else for actual lead generation, because it sends people somewhere you control.

End on a clean static frame or a slow exterior shot — something that doesn't demand attention but holds the screen while the viewer decides whether to act.

What Footage to Shoot and in What Order

Shoot in reverse order from how you'll edit. That sounds counterintuitive, but here's the logic: you want to shoot the exterior and neighbourhood context last, when you've already warmed up and know which angles work. The interior shots — especially the hook shot — need the most care, so shoot those first when your eye is fresh and the light is still where you planned it.

A practical shooting list for a 2-bedroom property:

  1. Hook shot — the single best room, best angle, best light. Shoot 5–6 takes. Vertical framing, phone steady or on a small gimbal.
  2. Feature 1 clip — 15–20 seconds of usable footage, slow walk or static wide shot.
  3. Feature 2 clip — same approach, different room or detail.
  4. Feature 3 clip — could be a detail shot (built-in storage, underfloor heating controls, anything specific).
  5. Exterior + street — a slow pan or a walk-up shot showing the building and immediate surroundings.
  6. Neighbourhood B-roll — 10 seconds of the nearest park, café, or transport link. You may not use it on every platform, but have it.

Shoot everything vertical. Editing a horizontal clip into a vertical video for Reels is painful and the quality shows. If you're also creating a portal listing with horizontal images, that's a separate shoot or a separate crop — don't compromise the vertical video to get both at once.

Writing Captions That Hold Attention Without Sound

Roughly 70–85% of social video is watched without sound, depending on the platform and placement. I've seen figures thrown around everywhere, but the honest answer is: assume your viewer has no audio. Your captions are not subtitles for your voiceover. They are the primary communication channel.

A few rules that work in practice:

  • One idea per caption frame. Don't stack three facts into one text overlay. Show one, let it breathe for 2–3 seconds, then cut to the next.
  • Use numbers wherever possible. "Large bedroom" means nothing. "14 × 12 ft bedroom" means something. "5-minute walk to the tube" beats "great transport links."
  • Contrast matters more than font choice. White text with a subtle dark drop shadow reads on almost any background. Thin sans-serif fonts in pale colours do not.
  • Don't caption the obvious. If you're showing the kitchen, you don't need a caption that says "Kitchen." Use that caption real estate for a detail the viewer can't see: "Quartz worktops, installed 2023."

Which Feature Wins on Each Platform

The same 60-second video will perform differently depending on which feature you lead with — and the algorithm differences between TikTok, Reels, and Shorts are real enough that it's worth thinking about this before you post.

TikTok: Lead with the lifestyle hook. TikTok's audience skews younger and responds to aspiration and identity. The question in their mind is "could I live like this?" A morning-light kitchen shot, a rooftop terrace, a view — these outperform price-led hooks on TikTok in my experience. Price can come in Act 2, but it shouldn't be the opening frame.

Instagram Reels: Aesthetic quality and price together. Reels users are more accustomed to polished visual content, and they respond to the combination of "this looks beautiful" and "this is what it costs." A clean, well-lit wide shot of the best room with the price overlaid in the first two seconds works well here. Reels also rewards consistency — agents who post regularly tend to see compounding reach in a way that TikTok's pure discovery model doesn't always replicate.

YouTube Shorts: Neighbourhood context wins. Shorts viewers often have higher purchase intent — they searched for something, or they were already on YouTube researching an area. Leading with neighbourhood — "2 minutes from Thornfield station, 12 minutes to the city centre" — captures that intent better than a lifestyle shot. Use your neighbourhood B-roll here.

The Full Example Script: A 2-Bedroom in Thornfield

Here's a line-by-line script for a fictional 2-bedroom flat in Thornfield, listed at £285,000. I've written it for a Reels-first post, with notes on what to adjust per platform.

[0:00–0:02] — HOOK SHOT
Visual: Wide shot of the open-plan living room, south light flooding in, clean and staged.
Caption overlay: "£285,000 · Thornfield · 2 bed"
No voiceover needed.

[0:03–0:17] — FEATURE 1: The Kitchen
Visual: Slow pan across the kitchen — quartz worktops, integrated appliances, island with bar stools.
Caption: "Full island kitchen · quartz worktops · integrated throughout"

[0:18–0:32] — FEATURE 2: The Main Bedroom
Visual: Static wide shot of the bedroom, showing the built-in wardrobe wall and the window.
Caption: "Main bedroom · 14 × 11 ft · full wall of built-in storage"

[0:33–0:47] — FEATURE 3: The Private Terrace
Visual: Walk out through the doors onto the terrace, pan to show the size and the view.
Caption: "Private terrace · south-facing · no overlooking"

[0:48–0:55] — ADDRESS + CONTEXT
Visual: Exterior shot, street visible, building looking clean.
Caption: "14 Maple Row, Thornfield · 8 min walk to station"

[0:56–1:00] — CTA
Visual: Static frame, clean background.
Caption: "Full listing + floor plan → link in bio"
Optional voiceover: "Click the link in bio for the full listing and floor plan."

Platform adjustments: For TikTok, swap the opening frame to the terrace shot — lead with aspiration, not price. For Shorts, open with the station walk time and neighbourhood context before going inside.

Automation handles distribution. Scripting is still yours. Tools like AHO can push the finished video to Instagram, Facebook, LinkedIn, and WhatsApp automatically — but the 60 seconds of content that makes someone stop scrolling? That still comes from understanding the property and the buyer. No scheduler writes that for you.

Editorial review by Michał Babula (also the author) on 2026-06-26. Author and reviewer are the same person in this version — I'll flag if that changes.

How to Script a 60-Second Listing Video (For Reels, TikTok, and Shorts) — AHO Blog | AHO