AI video editing using Runway: Part Six

We continue our series introducing one of the world’s most popular online generative AI video production toolkits, as we create a short video for a brand new immersive experience.

The AI tools and models used in this series

We’ll use runway.ml to generate video clips and assemble them into a sequence. Additional AI tools will help with initial ideation and with writing the text prompts that Runway uses to generate high-quality video clips.

The main steps in the project:

Generate an overview for an ‘immersive experience’ promotional video using perplexity.ai
Generate still images using the flux-1.1-pro model at replicate.com to use as ‘first frame’ prompts at runway
Generate short video clips using runway’s latest Gen-4 model
Use runway’s online video editor to sequence the promotional video
Use the speech-02-hd AI model at replicate.com to generate a realistic-sounding voiceover
Use udio.com to generate a background music track
Add text titles and export the video file

Part 6: Use Runway to generate an AI voiceover

Generative AI has been capable of producing ‘realistic’ human voices for a few years now, and AI-powered ‘text to speech’ can be found in many creative applications. In part one of this series we asked perplexity.ai to block out the main scenes for the video promo, and it suggested the bare bones of a script which can go with each short scene:

“What if you could step inside the story?”

“Welcome to [Experience Name] - where reality blurs with imagination.”

“Explore, interact, and lose yourself in a world designed to ignite your senses.”

“Every moment is unforgettable. Every step, a new adventure.”

“Tickets are limited. Don’t miss your chance.”

“Book now at [website]. Experience the extraordinary.”

We’ve manually refined the above suggestions to better match the style and content of the ad:

Where we’re going, we don’t need goggles…

Welcome to total immersion, where reality blurs with your imagination.

Lose yourself in a world created to ignite your senses.

Experience the extraordinary. Experience the future.

This is reality remixed. Book now.

Step 1: On the Runway dashboard select ‘Generate Audio’:

Sidebar menu with options: Start a session, Generate Video, Generate Image, Generate Audio, and All Tools.

Step 2: Audition the available voices by selecting a category, then playing back the audio sample provided for each matching voice:

A software interface displays voice options with tags; "Maggie," "Ella," and "Frank" are listed with play buttons.

Step 3: Enter the text for the voice generation at the top left, then click the ‘Generate’ button:

Screenshot of a generative audio interface with text for generating speech and options for selecting voices and narration style.

Here’s the generated audio for Benjamin, as selected above:

To add the generated voiceover to the video project, open the project, then open the ‘Audio’ assets folder. Select the audio clip and drag it onto the timeline, underneath the existing video track:

A video editing software interface displays a neon-lit scene with people and an audio waveform in the timeline.

To time the narration correctly, it needs to be separated into individual sentences. Move the vertical playhead to the end of each sentence in turn and click the ‘Split layer at playhead’ button, as highlighted below:

A video editing timeline with video and audio tracks, showing the playhead at 2:37 of a 35-minute project.

Separate out the individual audio clips along the timeline and place them roughly where they need to be:
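
Before dragging the clips around, it can help to estimate roughly how long each line takes to say and whether it fits its scene. Below is a minimal Python sketch for that; it is not part of Runway. The speaking rate, the five-second scene length and the half-second lead-in are our own assumptions for illustration, so swap in values that match your project.

```python
# Rough timing planner for the voiceover lines.
# All numbers here are illustrative assumptions, not Runway settings.
WORDS_PER_SECOND = 2.0  # assumed pace for a slow, deliberate promo read

script = [
    "Where we're going, we don't need goggles...",
    "Welcome to total immersion, where reality blurs with your imagination.",
    "Lose yourself in a world created to ignite your senses.",
    "Experience the extraordinary. Experience the future.",
    "This is reality remixed. Book now.",
]


def estimate_seconds(line, words_per_second=WORDS_PER_SECOND):
    """Estimate spoken duration from the word count alone."""
    return len(line.split()) / words_per_second


def plan_timeline(lines, scene_length=5.0, lead_in=0.5):
    """Place each line lead_in seconds into its own scene.

    scene_length is a hypothetical clip length in seconds; 'fits' flags
    whether the estimated narration ends before the scene does.
    """
    plan = []
    for index, line in enumerate(lines):
        start = index * scene_length + lead_in
        duration = estimate_seconds(line)
        plan.append({
            "line": line,
            "start": start,
            "end": start + duration,
            "fits": duration <= scene_length - lead_in,
        })
    return plan


if __name__ == "__main__":
    for entry in plan_timeline(script):
        flag = "ok" if entry["fits"] else "too long"
        print(f'{entry["start"]:5.1f}s  {flag:8}  {entry["line"]}')
```

Lines flagged as too long are candidates for a faster read, a longer scene, or a trim to the script. The estimate is only a starting point; the final placement is still done by ear on the timeline.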

A video editing software interface displaying a group selfie in the preview window and audio settings on the right panel.

This is a rough cut. Here’s the current project exported to a 1080p video:

Despite trying many of the voices, we’re not completely happy with the result: we want a deep, gravelly voiceover. We went back to Replicate and used the speech-02-hd model to generate a slower, deeper male voice. Dedicated audio models such as this can offer more control over the final audio than Runway, so bear this in mind if your project includes a lot of text-to-speech.
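
For readers who prefer to script this step, here is a hedged sketch of how a slower, deeper voice might be requested through Replicate's Python client. This is not the exact call we made. The model slug, the input field names (`text`, `voice_id`, `speed`, `pitch`), the voice name and the values shown are all assumptions, so check them against the model page on replicate.com before use. The sketch also assumes version 1.x of the `replicate` package, where the output is a file-like object.

```python
import os

# Assumption: confirm the exact owner/name slug on replicate.com.
MODEL = "minimax/speech-02-hd"


def build_voice_input(text, voice_id="Deep_Voice_Man", speed=0.85, pitch=-3):
    """Build the input dict for the text-to-speech call.

    The field names, the voice_id and the value ranges are assumptions;
    verify them against the model's input schema on Replicate.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "text": text,
        "voice_id": voice_id,  # hypothetical choice of a deep male voice
        "speed": speed,        # below 1.0 is assumed to slow the read
        "pitch": pitch,        # negative is assumed to lower the pitch
    }


def generate_voiceover(text, out_path="voiceover.mp3"):
    """Call Replicate and save the audio file.

    Needs `pip install replicate` and a REPLICATE_API_TOKEN in the environment.
    """
    import replicate  # imported lazily so the helper above works without it

    output = replicate.run(MODEL, input=build_voice_input(text))
    with open(out_path, "wb") as audio_file:
        audio_file.write(output.read())  # assumes a file-like output (client 1.x)
    return out_path


if __name__ == "__main__" and os.environ.get("REPLICATE_API_TOKEN"):
    generate_voiceover("This is reality remixed. Book now.")
```

Generating one file per sentence this way would also avoid having to split a single long clip on the timeline afterwards.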