This patent describes a comprehensive method and apparatus for synthesizing both images and voices using deep learning, specifically for creating virtual human video content. It covers techniques for generating realistic facial expressions, lip movements, and synchronized speech, crucial for compelling AI video production.
Patent IDUS11861962B2
GrantedJanuary 2, 2024
FiledOctober 5, 2022