Gonna Win This Race

Creative Experiment #001

Rain slashed across asphalt, engines screamed through a storm, and in the chaos, a figure floated above it all — a red cloth trailing like fire in the wind. This is the moment that inspired Gonna Win This Race, a three-minute vertical music video exploring persistence through emotional change.

It began with a lyric from Tember, sent over Instagram as a video. Her words captured the rhythm of emotional cycles: the push and pull of persistence against forces you cannot control. I captured her original video with a screen recorder, converted the MP4 to MP3, and transcribed the audio with Turboscribe.ai (free). I then adapted the transcript into a full song (verses, pre-chorus, chorus, bridge, and final lift), shaping her emotional core into something that could carry melody and momentum across a full track.
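For anyone repeating the MP4-to-MP3 step from a script, here is a minimal sketch. It assumes ffmpeg is installed and on the PATH; this write-up does not say which converter was actually used, so treat the tool choice, the function names, and the file name in the test as illustrative assumptions.

```python
import shutil
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path: str, audio_path: str) -> list[str]:
    """Build an ffmpeg command that extracts the audio track as MP3."""
    return [
        "ffmpeg",
        "-i", video_path,          # input: the screen-recorded MP4
        "-vn",                     # drop the video stream
        "-codec:a", "libmp3lame",  # encode audio with the LAME MP3 encoder
        "-q:a", "2",               # variable bitrate quality (lower is better)
        audio_path,                # output: the MP3 to send for transcription
    ]


def extract_audio(video_path: str) -> Path:
    """Write an MP3 next to the given MP4 (assumes ffmpeg is on PATH)."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg not found on PATH")
    audio_path = Path(video_path).with_suffix(".mp3")
    subprocess.run(build_ffmpeg_command(video_path, str(audio_path)), check=True)
    return audio_path
```

Calling extract_audio on the screen recording would leave an MP3 beside it, ready to upload to a transcription service.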


Original Lyric (Tember)

I’ve been taking two steps back
trying to move in time with the changes that don’t relax.
Changing the tune of my song
to something I want to hear all day long.
Why is it never easy?
Changes come in seasons,
and my feelings come in waves.
I see my brightness fading,
I’m picking up these pieces.
I’m gonna win this race.


Song Structure Adaptation (Steve with ChatGPT)

Verse 1
Two steps back, I try to keep the time
Life keeps shifting, beats don’t align
Turning up my song, I chase the light
Somewhere in the music, I take flight

Pre-Chorus
Seasons turn, waves rise high
Every fall won’t break my stride

Chorus
Why’s it never easy, why’s it always change
I feel my light fading, standing in the rain
Holding all the pieces, running through the pain
I’m gonna win this race, yeah, I’m gonna win this race

Verse 2
Some days drag, some slip away
Every broken moment shows a way
I keep the rhythm, even when it shakes
Every step I take lifts me from the gray

Pre-Chorus
Seasons turn, waves rise high
Every fall won’t break my stride

Chorus
Why’s it never easy, why’s it always change
I feel my light fading, standing in the rain
Holding all the pieces, running through the pain
I’m gonna win this race, yeah, I’m gonna win this race

Bridge (Cinematic Lift)
If the tide pulls me under, I will rise
Every storm behind me lights the skies
Hear my heart, hear it loud and clear
Nothing can stop me, the finish line is near

Final Chorus
Why’s it never easy, why’s it always change
Holding all the pieces, keeping up the pace
I’m gonna win this race, yeah, I’m gonna win this race
I’m gonna win this race, yeah, I’m gonna win this race


Music generation brought its own challenges. The first prompts were blocked by the system because of the word “intimate,” intended purely as a musical descriptor. Swapping it for “close-mic, personal” allowed the track to generate successfully, though achieving the exact emotional tone took about 25 renders in Producer.ai. The initial prompt included detailed instructions for cinematic indie style, lush strings, driving acoustic guitar, soft synth pads, dynamic percussion, soaring female vocals, and a motivational, reflective tone, all carefully tuned to preserve the emotional lift.
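The word swap can be captured as a small substitution table so the same fix applies to every later prompt. This is only a sketch: the one entry shown comes from this project, while the function name and the example prompt are hypothetical and not the actual prompt sent to Producer.ai.

```python
import re

# Only the "intimate" entry comes from this project; other flagged
# words would be added as they are discovered.
SAFE_SWAPS = {
    "intimate": "close-mic, personal",
}


def soften_prompt(prompt: str, swaps: dict[str, str] = SAFE_SWAPS) -> str:
    """Replace flagged whole words (case-insensitive) with safer wording."""
    for flagged, replacement in swaps.items():
        pattern = re.compile(rf"\b{re.escape(flagged)}\b", flags=re.IGNORECASE)
        # A function replacement keeps the text literal (no escape handling).
        prompt = pattern.sub(lambda _match, text=replacement: text, prompt)
    return prompt


# Hypothetical prompt fragment, for illustration only.
example = "cinematic indie, intimate female vocals, lush strings"
print(soften_prompt(example))
```

Matching on whole words means a term like "intimately" is left alone, which keeps the table from rewriting more than intended.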

Meanwhile, the visual concept emerged alongside the music. Attempts to generate racing footage directly in Grok Imagine failed — the cars looked surreal or toy-like. The solution was to reverse the workflow: first generate realistic still images in Midjourney, then convert them into motion clips using Grok Imagine.

  • 26 still images were generated in Midjourney, each with four variations to give a large pool of candidate frames
  • Only the strongest images were selected; outright rejections were minimal, at roughly 1%
  • The motion prompt for Grok Imagine conversion was simple: “High-speed race”

Between 12-second bursts of racing came the 6-second red fabric sequences — a floating woman moving through air, a visual metaphor for freedom and release. These clips were generated using a layered prompt workflow:

  1. A reference artwork was cropped using GIMP to isolate the subject
  2. The cropped image was analyzed using Midjourney’s Describe function to produce a detailed prompt
  3. The prompt evolved across iterations to produce 15 six-second video clips, forming the repeating motif of floating sequences

Final prompt for the woman in red:
“dark-haired, beautiful Caucasian woman in red cloth flying on the wind, wearing minimal clothing, full-body, full-length, dancing, detailed background, gray color scheme.”

The final video alternates these sequences with racing footage, establishing a pulse: velocity and suspension, struggle and freedom, storm and air.

Even the opening moment was a stroke of serendipity. Grok Imagine generated a short clip of the female race driver beside her car, unexpectedly complete with lip-synced voiceover narration:

“This one is for my team. The conditions were tough, but we pushed through together.”

A quick pass through Audacity cleaned the audio, removing background noise, and it became the perfect five-second introduction to the piece.
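The timings mentioned so far (a five-second introduction, 12-second racing bursts, 6-second floating sequences, and a runtime of about three minutes) can be laid out as a rough edit plan. The sketch below assumes a strict race/float alternation and a runtime of exactly 180 seconds; the real cut in Kdenlive may not follow either assumption, and the names Segment and build_timeline are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    kind: str        # "intro", "race", or "float"
    start: float     # seconds from the start of the video
    duration: float  # seconds


def build_timeline(total: float = 180.0, intro: float = 5.0,
                   race: float = 12.0, floating: float = 6.0) -> list[Segment]:
    """Alternate race and float segments after the intro until time runs out."""
    timeline = [Segment("intro", 0.0, intro)]
    cursor = intro
    pattern = [("race", race), ("float", floating)]
    index = 0
    while True:
        kind, duration = pattern[index % 2]
        if cursor + duration > total:  # stop before overrunning the runtime
            break
        timeline.append(Segment(kind, cursor, duration))
        cursor += duration
        index += 1
    return timeline


plan = build_timeline()
races = sum(1 for seg in plan if seg.kind == "race")
floats = sum(1 for seg in plan if seg.kind == "float")
end = plan[-1].start + plan[-1].duration
print(races, floats, end)
```

Under these assumptions the plan holds ten racing bursts and nine floating sequences and ends at 179 seconds, so the 15 generated floating clips leave room to pick only the best ones.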

The project was full of creative friction — AI moderation blocking words, direct video generation failing, floating imagery requiring prompt reconstruction. Each obstacle forced iteration:

  • Lyrics: ~10 AI-assisted revisions
  • Music: ~25 renders
  • Race imagery: 26 Midjourney stills (each with four variations)
  • Floating sequences: 15 generated video clips
  • Numerous prompt adjustments and rejected outputs

All these small technical choices — prompt tweaks, clip selection, workflow reversals, noise cleanup — became part of the creative storytelling. The final film exists not as a single act of creation but as the cumulative result of persistent experimentation.

In the finished video, the storm races forward, engines roar, wheels spray water, and through it all, the red figure floats: a moment of release punctuating effort and struggle. It’s a reminder that persistence and freedom exist in tandem, and that even amid chaos we can find lift and momentum and, ultimately, win this race.


Creative Stack

  • Lyrics: Tember (original concept)
  • Song adaptation and structure: Steve (AI-assisted drafting)
  • Music generation: Producer.ai
  • Race imagery: Midjourney → Grok Imagine video conversion
  • Floating sequences: Grok Imagine
  • Editing: Kdenlive (open source video editor)
  • Audio cleanup: Audacity
  • Transcription: Turboscribe.ai