Getting Started: How to use Infinity AI
There are several text-to-video or image-to-video AI models available (Luma, Runway, Pika, etc) but not many that allow your characters to speak.
People are at the center of stories. And if you have expressive, talking characters, you can tell good stories even if you don’t have anything else.
To that end, we launched the Infinity V2 model, which allows you to animate ✨infinite✨ actors. See example user videos in the Infinity Gallery.
This is a step-by-step tutorial showing how to use Infinity to generate AI talking head videos.
Step 0. Navigate to the Infinity Studio
Infinity V2 is a video foundation model focused on expressive characters that talk: infinity.ai/studio. Login to get access to the full set of features.
Step 1. Generate an image
The first step is to pick the image of your character. Any style of image - photos, paintings, 3D cartoons, etc - works with Infinity. Please note that animals and non-humanoid characters (e.g. a toaster with a face) do not work very well at the moment (though you can still make it work with a lot of persistence). There are 3 options for the image:
Tip: Infinity works with input images of ANY aspect ratio. Make sure "Crop face" is OFF so that the image's original aspect ratio is used.
Option 1. Select a character from the library
Option 2. Generate a character using the text-to-image tool
Option 3. Upload an image file
Step 2. Generate audio
Next, we need to define the audio that we want our character to say. Similarly, there are a few options for the audio:
Option 1. Type out the script
Go to the “text-to-speech” tab and write out the exact script you want your character to say. Select a voice from the dropdown menu and then press “generate audio.”
Tip: Use punctuation to control the delivery of your script. “-” and “...” for pauses. “!” and CAPITALIZATION for emphasis. Each time you press “generate audio” the audio will be different.
Option 2. Record your own voice
Go to the "record" tab of the audio section and record yourself directly.
Option 3. Upload an audio file
Upload any audio file with talking or singing in it. Note: long sections of silence or instrumentals will lead to weird behavior. Use active speaking/singing audio only.
Step 3. Generate the video
Finally... generate your video! Here’s an explanation of the generation parameters:
- Resolution: 100k, 250k, 400k (experimental). These refer to the TOTAL number of pixels in a single frame. For example, if your video aspect ratio is 1:1 square, then a “250k resolution video” will be 512 x 512px. If it’s 16:9, then a “250k resolution video” will be 672 x 384px.
- Crop Face: ON or OFF. If OFF, then your entire image will be uploaded. If ON, then your image will be cropped to a square around the character’s face.
- Stability Mode: Expressive, Medium, Stable.
- Num Videos: 1-3. Number of videos to generate at once with these generation parameters.
Our recommendation is to use: 250k resolution, Crop Face OFF, Expressive (or Medium)
Tip: Every time you generate a video, it will be different. Start with the recommended generation parameters and then adjust accordingly. Increase stability if you’re getting a lot of blurring. Increase resolution if the video looks good but you just want more details.
See gallery with more examples: https://infinity.ai/gallery
Additional Studio Tips [advanced]
Save a character
Saving a character (i.e. an image + voice combination) makes it easy to re-use that person in the future. To save a character from a clip you’ve already generated just click the “Replay” icon and then “Save character.” This character will now appear permanently in the Character Library.
Your characters are only visible to you. You can organize and edit your characters by going to the “My characters” page.
Replay
Replay makes it easy to generate the same video clip again. Press the “replay” icon and then “Replay.” This will load the same image, audio, and video generation parameters into the studio.
Clone a voice
Clone your own voice by uploading a short audio sample. These voices will be private to you and you can manage them in the "My voices" section.
Advanced Tutorials
Turn any blog into a video podcast with NotebookLM and Infinity
Generate daily videos with Perplexity and Infinity
Generate an AI Influencer with ChatGPT and Infinity
Create a cinematic video ad using Runway and Infinity
Questions? Reach out at founders@infinity.ai 👋