create_lipsync
Synchronize video mouth movements with audio using text-to-speech or custom audio upload. Works for real, 3D, or 2D human characters with a clear face. Video length limited to 10 seconds.
Instructions
Create a lip-sync video by synchronizing mouth movements with audio. Supports both text-to-speech (TTS) with various voice options or custom audio upload. The original video must contain a clear, steady human face with visible mouth. Works with real, 3D, or 2D human characters (not animals). Video length limited to 10 seconds.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
audio_url | No | URL of custom audio file (mp3, wav, flac, ogg; max 20MB, 60s). If provided, TTS parameters are ignored | |
model_name | No | Model version to use (default: kling-v2-master) | |
tts_speed | No | Speech speed for TTS (0.5-2.0, default: 1.0) | |
tts_text | No | Text for text-to-speech synthesis (used only if audio_url is not provided) | |
tts_voice | No | Voice style for TTS (default: male-warm). Includes Chinese and English voice options | |
video_url | Yes | URL of the video to apply lip-sync to (must contain clear human face) |