Experience our talking photo technology in action by exploring our interactive demo on GitHub: AKool Talking Photo Demo.
Important Notes
- Image Quality: Use high-resolution images with clearly visible faces for better results
- Audio Format: Standard audio formats (MP3 recommended)
- Prompt: Use the
promptparameter to control hand gestures for more natural and professional-looking videos - Resolution: Choose between 720 or 1080 based on your needs - higher resolution may take longer to process
- Resource Expiration: Generated videos are valid for 7 days, save them promptly
- Webhook: Use
webhookUrlto receive notifications when video generation is complete - Save the
_idfrom the response to check video status using the Get Video Info Result API
Authorizations
Your API Key used for request authorization. If both Authorization and x-api-key have values, Authorization will be used first and x-api-key will be discarded.
Body
application/json
Resource address of the talking picture
Example:
"https://drz0f01yeq1cx.cloudfront.net/1688098804494-e7ca71c3-4266-4ee4-bcbb-ddd1ea490e75-9907.jpg"
Resource address of the talking audio
Example:
"https://drz0f01yeq1cx.cloudfront.net/1710752141387-e7867802-0a92-41d4-b899-9bfb23144929-4946.mp3"
Prompt words for controlling gestures and movements
Example:
"Throughout the entire video, maintain natural and smooth hand movements."
Output video resolution. Currently supported formats - 720, 1080
Available options:
720, 1080 Example:
"720"
Callback url address based on HTTP request
Example:
""