This patent describes a system and method for synthesizing image and voice using deep learning. It focuses on creating realistic video content with virtual humans by generating facial images and synchronized voices based on input text or audio, enabling efficient production of various media.
Patent IDUS10922880B2
GrantedFebruary 23, 2021
FiledSeptember 2, 2019