Method and apparatus for synthesizing image and voice using deep learning

US11861962B2

Method and apparatus for synthesizing image and voice using deep learning

US11861962B2

About

This patent describes a comprehensive method and apparatus for synthesizing both images and voices using deep learning, specifically for creating virtual human video content. It covers techniques for generating realistic facial expressions, lip movements, and synchronized speech, crucial for compelling AI video production.

Patent IDUS11861962B2

GrantedJanuary 2, 2024

FiledOctober 5, 2022

Assignee / startup

AI Studios

Loading Patent...