Songbird
Songbird is an AI director that transforms lyrics and images into flowing cinematic sequences and generates detailed music production notes for AI music generators. Whether you need dance videos, music videos, COLORFUL iPhone photography, or production notes for Suno/Udio, Songbird creates chains of invisible-cut clips that flow together as a single continuous experience—not isolated shots, but sequences with emotional and spatial continuity.
What Is Songbird?
Songbird is a custom GPT that generates sequenced prompts for music videos, dance videos, and photographic sequences, plus detailed production notes for AI music generators. Unlike tools that create isolated images, Songbird thinks in continuous flows—every clip inherits momentum from the previous one, creating invisible cuts hidden inside camera movement, subject motion, or environmental rhythm.
The tool automatically sequences your prompts (A0, A1, A2... B0, B1, B2...) so when you generate images or video, they sort in narrative order. Each prompt captures one 5-second clip with three components: entry (inheriting motion), beat (one clear micro-event), and exit (forward momentum toward the next clip). Type session to generate production notes you can paste into AI music generators.
🎬 Core Philosophy:
Songbird maintains emotional, spatial, temporal, character, and camera-path continuity across every sequence. Nothing is static. Nothing is disconnected. Every clip pushes something forward—reveals, emotion, escalation, discovery, spatial progression.
Music Production Notes
AI Music Generators can't read traditional music notation, but they can read production notes. Songbird generates detailed session notes that you can paste directly into the style section of your AI music generator (Suno, Udio, etc.).
These production notes specify everything from instrumentation and tempo to chord progressions and performance style. Type session in Songbird to generate production notes for your current song, then copy them into your AI music generator's style field.
Example: "Up on the Housetop" - Southern Black Gospel Choir Version
Global Setup
- Key: A♭ Major (classic gospel brightness)
- Tempo: 104 BPM
- Time Signature: 4/4 (gospel shout swing pocket)
- Feel: Southern Black gospel choir energy; full-church celebration; big choir, powerful call-and-response, Hammond B3 swells, syncopated claps, driving shout groove
Form Map
- Intro: 4 bars (drum fill → choir "Ho! Ho! Ho!" + B3 swell)
- Verse 1-6: 8 bars each (lead vocal with choir responses)
- Final Refrain + Shout Vamp: 24 bars (choir shouts, modulated ad-libs, rising B3 intensity)
- Outro: 4 bars (tag: big choir "Good Saint Nick!" → held A♭ chord)
Instrumentation
- Lead Vocal: powerhouse tone; heavy vibrato; ad-libs; melismatic turns
- Full Gospel Choir (SATB): strong block harmony, call-and-response, dynamic swells
- Hammond B3 Organ: swelling pads, gliss runs, shout patterns, Leslie speaker movement
- Piano (Gospel): bluesy grace notes, pentatonic licks, rhythmic comping
- Drums (Gospel Shout): tight kick, snappy snare, busy fills, hi-hat accents
- Bass Guitar: syncopated pentatonic movement, octave jumps, approach tones
- Handclaps: syncopated, layered, continuous in refrains
- Tambourine: bright, on backbeats and shouted accents
Production Notes
- Choir wide in stereo; sopranos L, altos center-L, tenors center-R, basses R
- B3 thick in mix; slow Leslie on verses, fast Leslie on refrains/vamp
- Piano bright and present; slight saturation for gospel bite
- Overall tone: live worship energy + Christmas celebration
Copy these production notes into Suno, Udio, or your AI music generator's style section for authentic gospel arrangements. Details and link to Songbird tool at https://www.humanitarians.ai/songbird
Three Creative Engines
Songbird has three primary output modes, each producing sequenced prompts that flow together:
🕺 boogie
Dance-video sequences. Consistent dancer, clothing, environment, and lighting. Footwork, torso/arm dynamics, and emotional phrasing tied to lyrics. Each clip is a loopable mini-scene with camera logic that flows across clips.
🎵 song
Music-video sequences driven by performance, mood, and narrative. Artist performance (lip-sync, gestures, emotional beats), visual storytelling tied to lyrics, cinematic but seamless camera flow, with mini-events per clip.
📱 colorful
COLORFUL iPhone sequences with hypercolor, snapshot, Southern Gothic palette. Handheld, imperfect, real perspective. Deadpan, accidental framing. Mundane American settings treated with mythic reverence.
Visual Styles
- Colorful: Inspired by William Eggleston—hyper-saturated colors, Southern Gothic atmosphere, mundane spaces treated as sacred. Gas stations, parking lots, and strip malls become stages for the extraordinary.
- Tiffany: Inspired by Saul Leiter—rain-streaked windows, reflections, soft color stains, quiet urban poetry. Everything seen through glass, distance, and weather.
- Unreal: Standard iPhone/dashcam/security camera aesthetic with authentic artifacts, grain, and that "caught on camera" feeling.
Special Modes
- Xmas: Holiday mode that switches session to Christmas logic. Can be combined with boogie or song for holiday-appropriate styling while maintaining sequencing rules.
- muzak: General (non-Christmas) session logic for standard music video work.
Flowing Cinematic Sequences
Songbird creates chains of invisible-cut clips where every transition is hidden inside camera movement, subject movement, or environmental rhythm. This approach transforms how you think about AI-generated visual content:
- Dance visualization: Create choreographed sequences that maintain character, outfit, and environmental continuity across every clip
- Music videos: Generate performance-driven narratives with seamless emotional and spatial flow
- Photo sequences: Produce COLORFUL iPhone photography that feels like consecutive screenshots from someone wandering through haunting, magical ordinary moments
- Devotional content: Visualize mantras, chants, and spiritual practices with reverent continuity
- Experimental cinema: Storyboard impossible scenes with documentary realism and perfect sequencing
Every prompt Songbird generates is designed as part of a larger flow. Clips inherit momentum, maintain continuity, and push something forward—whether that's a reveal, an emotion, an escalation, or a spatial progression. Nothing is static. Nothing is disconnected.
Session Logic
A Songbird session maintains lyrics, mood, style, camera flow, and environment continuity. New images continue the session. Style stays fixed unless you change it. Type new song to reset everything. Every clip inherits momentum from the previous one, creating seamless visual narratives.
Example Projects
Songbird can transform any lyrics into flowing visual sequences. From devotional Sanskrit mantras to Christmas carols to original protest songs, the tool creates visually cohesive narrative sequences with perfect continuity.
Each sequence demonstrates how Songbird maintains emotional, spatial, and temporal continuity—whether creating dance choreography, music video narratives, or COLORFUL iPhone photography. The tool adapts to any musical style while preserving the core principle: invisible cuts, flowing motion, continuous experience.
Get Started
Ready to create flowing cinematic sequences? Type boogie for dance videos, song for music videos, or colorful for COLORFUL iPhone sequences. Use Xmas for holiday mode. Type session to generate music production notes. Type list or commands to see all available options.
Paste your lyrics, upload images if desired, and Songbird will generate sequenced prompts that maintain perfect continuity. Each session preserves your lyrics, mood, style, and camera flow—just keep adding images or type new song to start fresh.
Note: You'll need ChatGPT Plus to access custom GPTs. Songbird automatically sequences prompts (A0, A1, A2...) so your generated content sorts in narrative order.