POST /api/open/v3/content/video/createbytalkingphoto
Create By Talking Photo
curl --request POST \
  --url https://openapi.akool.com/api/open/v3/content/video/createbytalkingphoto \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: <api-key>' \
  --data '{
  "talking_photo_url": "https://drz0f01yeq1cx.cloudfront.net/1688098804494-e7ca71c3-4266-4ee4-bcbb-ddd1ea490e75-9907.jpg",
  "audio_url": "https://drz0f01yeq1cx.cloudfront.net/1710752141387-e7867802-0a92-41d4-b899-9bfb23144929-4946.mp3",
  "prompt": "Throughout the entire video, maintain natural and smooth hand movements. When speaking, use appropriate hand gestures to emphasize key points, such as opening your hands to express welcome or explanation, pointing your fingers forward to emphasize, and placing your hands together to summarize. The gestures should be coherent, not stiff, with moderate amplitude, and the frequency should be coordinated with the speaking speed. During pauses, the gestures naturally return to a relaxed state, presenting a professional and friendly presentation style.",
  "resolution": "720",
  "webhookUrl": ""
}'
{
  "code": 1000,
  "msg": "OK",
  "data": {
    "faceswap_quality": 2,
    "storage_loc": 1,
    "_id": "64dd90f9f0b6684651e90d60",
    "create_time": 1692242169057,
    "uid": 378337,
    "type": 5,
    "from": 2,
    "video_lock_duration": 0.8,
    "deduction_lock_duration": 10,
    "external_video": "",
    "talking_photo": "https://***.cloudfront.net/1692242161763-4fb8c3c2-018b-4b84-82e9-413c81f26b3a-6613.jpeg",
    "video": "",
    "__v": 0,
    "video_status": 1,
    "prompt": "Throughout the entire video, maintain natural and smooth hand movements. When speaking, use appropriate hand gestures to emphasize key points, such as opening your hands to express welcome or explanation, pointing your fingers forward to emphasize, and placing your hands together to summarize. The gestures should be coherent, not stiff, with moderate amplitude, and the frequency should be coordinated with the speaking speed. During pauses, the gestures naturally return to a relaxed state, presenting a professional and friendly presentation style.",
    "resolution": "720"
  }
}
Resources (images, videos, voice) generated by our API remain valid for 7 days. Download and store them before they expire.
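
To keep track of the 7-day window, a client can compute a deadline from the `create_time` field in the response. The following is a minimal sketch, not part of the API: the helper name `expiry_from_create_time` is hypothetical, and it assumes that `create_time` is milliseconds since the Unix epoch and that the 7 days are counted from that moment, neither of which this page states explicitly.

```python
from datetime import datetime, timedelta, timezone

RETENTION = timedelta(days=7)  # validity window stated on this page
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def expiry_from_create_time(create_time_ms: int) -> datetime:
    """Estimate when a generated resource stops being available.

    Assumption: create_time is in milliseconds since the Unix epoch and
    the 7-day window starts at that instant.
    """
    created = EPOCH + timedelta(milliseconds=create_time_ms)
    return created + RETENTION


# create_time taken from the sample response on this page
deadline = expiry_from_create_time(1692242169057)
print(deadline.isoformat())
```

Treat the result as a conservative reminder to download the video, not as an exact server-side deadline.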
Experience our talking photo technology in action by exploring our interactive demo on GitHub: AKool Talking Photo Demo.

Important Notes

  • Image Quality: Use high-resolution images with clearly visible faces for better results
  • Audio Format: Use a standard audio format (MP3 recommended)
  • Prompt: Use the prompt parameter to control hand gestures for more natural and professional-looking videos
  • Resolution: Choose 720 or 1080 based on your needs; higher resolution may take longer to process
  • Resource Expiration: Generated videos are valid for 7 days, so save them promptly
  • Webhook: Use webhookUrl to receive notifications when video generation is complete
  • Video Status: Save the _id from the response to check video status using the Get Video Info Result API
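
As the last note suggests, the `_id` is the value to persist after a successful submission. A minimal Python sketch of that step follows. The helper name `extract_task` is hypothetical, and the error branch assumes that failed requests use the same `code`/`msg` envelope as the success response, which this page does not confirm.

```python
import json

SUCCESS_CODE = 1000  # documented business status code for success


def extract_task(response_text: str) -> dict:
    """Pull out the fields worth saving from a create response."""
    body = json.loads(response_text)
    if body.get("code") != SUCCESS_CODE:
        # Assumption: error responses also carry "code" and "msg".
        raise RuntimeError(
            f"request failed: code={body.get('code')} msg={body.get('msg')}"
        )
    data = body["data"]
    return {
        "id": data["_id"],  # needed later for the Get Video Info Result API
        "video_status": data.get("video_status"),
        "video": data.get("video", ""),  # empty until generation finishes
    }


# Trimmed copy of the sample response shown on this page
sample = (
    '{"code": 1000, "msg": "OK", "data": '
    '{"_id": "64dd90f9f0b6684651e90d60", "video_status": 1, "video": ""}}'
)
task = extract_task(sample)
print(task["id"])
```

The meaning of each `video_status` value is not defined on this page, so the sketch stores it without interpreting it.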

Authorizations

x-api-key
string
header
required

Your API key, used to authorize the request. If both the Authorization and x-api-key headers are set, Authorization takes precedence and x-api-key is ignored.
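
Because Authorization overrides x-api-key when both are present, a client is simplest if it sends exactly one of them. The sketch below prepares the same request as the curl sample using only the Python standard library and sets x-api-key alone. The function name `build_request` is hypothetical, and the request is constructed but not sent.

```python
import json
import urllib.request

ENDPOINT = (
    "https://openapi.akool.com/api/open/v3/content/video/createbytalkingphoto"
)


def build_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Prepare the POST request, authenticating with x-api-key only."""
    headers = {
        "Content-Type": "application/json",
        "x-api-key": api_key,  # no Authorization header, so this one is used
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


req = build_request(
    "<api-key>",  # placeholder, replace with a real key
    {
        "talking_photo_url": "https://drz0f01yeq1cx.cloudfront.net/1688098804494-e7ca71c3-4266-4ee4-bcbb-ddd1ea490e75-9907.jpg",
        "audio_url": "https://drz0f01yeq1cx.cloudfront.net/1710752141387-e7867802-0a92-41d4-b899-9bfb23144929-4946.mp3",
        "resolution": "720",
    },
)
# To send it: urllib.request.urlopen(req, timeout=30), then read the body.
print(req.get_method(), req.full_url)
```

Building the request separately from sending it makes the headers easy to inspect in tests before any network call is made.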

Body

application/json
talking_photo_url
string
required

URL of the talking photo image

Example:

"https://drz0f01yeq1cx.cloudfront.net/1688098804494-e7ca71c3-4266-4ee4-bcbb-ddd1ea490e75-9907.jpg"

audio_url
string
required

URL of the audio for the talking photo to speak

Example:

"https://drz0f01yeq1cx.cloudfront.net/1710752141387-e7867802-0a92-41d4-b899-9bfb23144929-4946.mp3"

prompt
string

Prompt text that controls gestures and movements

Example:

"Throughout the entire video, maintain natural and smooth hand movements."

resolution
enum<string>

Output video resolution. Supported values: 720, 1080

Available options: 720, 1080

Example:

"720"

webhookUrl
string

Callback URL that is called over HTTP when video generation is complete

Example:

""

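The body fields above can be assembled and checked on the client before the request is sent. This is a minimal sketch, assuming that optional fields may simply be omitted when unset (the page marks only talking_photo_url and audio_url as required). The function name `build_payload` and its snake_case arguments are hypothetical; the JSON keys are the ones documented above.

```python
from typing import Optional

ALLOWED_RESOLUTIONS = ("720", "1080")  # documented enum values


def build_payload(
    talking_photo_url: str,
    audio_url: str,
    prompt: Optional[str] = None,
    resolution: Optional[str] = None,
    webhook_url: Optional[str] = None,
) -> dict:
    """Assemble the JSON body, validating it against the documented schema."""
    if not talking_photo_url or not audio_url:
        raise ValueError("talking_photo_url and audio_url are required")
    payload = {"talking_photo_url": talking_photo_url, "audio_url": audio_url}
    if prompt is not None:
        payload["prompt"] = prompt
    if resolution is not None:
        resolution = str(resolution)  # the schema types this enum as a string
        if resolution not in ALLOWED_RESOLUTIONS:
            raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
        payload["resolution"] = resolution
    if webhook_url is not None:
        payload["webhookUrl"] = webhook_url  # note the camelCase key
    return payload


payload = build_payload(
    "https://drz0f01yeq1cx.cloudfront.net/1688098804494-e7ca71c3-4266-4ee4-bcbb-ddd1ea490e75-9907.jpg",
    "https://drz0f01yeq1cx.cloudfront.net/1710752141387-e7867802-0a92-41d4-b899-9bfb23144929-4946.mp3",
    prompt="Throughout the entire video, maintain natural and smooth hand movements.",
    resolution=720,
)
print(sorted(payload))
```

Rejecting an unsupported resolution locally gives a clearer error than waiting for the server to respond, although the server remains the authority on what it accepts.
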
Response

200 - application/json

Video creation request submitted successfully

code
integer
required

Business status code returned by the API (1000: success)

Example:

1000

msg
string
required

Status message returned by the API

Example:

"OK"

data
object