VoiceDub: Automatic Lip-Synchronization
From HLT@INESC-ID
Revision as of 18:39, 8 July 2007
Film dubbing is a common and expensive process. It becomes necessary when the originally recorded sound is of insufficient quality, when the actor's voice does not convey the intended effect, or when the film's original language must be translated. In all of these situations, studio sound recordings are made after the images have been captured and must then be synchronized with the movements of the actor's lips. Language translation makes the process more difficult, since synchronization with the lip movements is harder to achieve. This work analyzes this last situation and develops tools to help minimize the synchronization problem.
The VoiceDub Application
Currently, the lip-synchronization process most commonly used by recording studios is trial and error: several recordings are made until the durations of the dubbed utterances are as close as possible to the original ones. To automate this procedure, we use the concept of distance between visemes, together with a cost function, to measure the synchronization error. Using this metric, the alignment between the phone sequences of the original and the dubbed utterances can be computed with an approach based on finite state transducers. Based on this alignment, the durations of the phones in the dubbed utterance can be adjusted to match the original ones, producing better synchronization with the actor's lip movements. The time-scaling transformations for a few examples from the test corpus are presented below, where the effectiveness of the application can be verified.
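As a rough illustration of the alignment and duration-adjustment steps described above, the following Python sketch aligns two phone sequences by minimum viseme distance and derives a time-scaling factor for each matched phone of the dubbed utterance. It is not the VoiceDub implementation: it uses plain dynamic programming in place of finite state transducers, and the viseme classes in `VISEME`, the 0/1 distance, the `GAP_COST` value, and the example phones and durations are all assumptions made for illustration only.

```python
# Hypothetical phone-to-viseme classes; the inventory used by VoiceDub is not given in the text.
VISEME = {
    "p": "bilabial", "b": "bilabial", "m": "bilabial",
    "f": "labiodental", "v": "labiodental",
    "t": "alveolar", "d": "alveolar", "s": "alveolar", "n": "alveolar",
    "a": "open", "e": "mid", "i": "spread",
    "o": "rounded", "u": "rounded",
}

GAP_COST = 1.0  # assumed cost of leaving a phone without a partner


def viseme_distance(p, q):
    """Assumed distance: 0 if both phones share a viseme class, 1 otherwise."""
    return 0.0 if VISEME[p] == VISEME[q] else 1.0


def align(orig, dub):
    """Minimum-cost alignment of two phone sequences.

    Returns (total_cost, pairs); pairs holds (orig_index, dub_index) tuples,
    with None marking a phone that has no partner in the other sequence.
    """
    n, m = len(orig), len(dub)
    cost = [[0.0] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0], back[i][0] = i * GAP_COST, "skip_orig"
    for j in range(1, m + 1):
        cost[0][j], back[0][j] = j * GAP_COST, "skip_dub"
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            options = [
                (cost[i - 1][j - 1] + viseme_distance(orig[i - 1], dub[j - 1]), "match"),
                (cost[i - 1][j] + GAP_COST, "skip_orig"),
                (cost[i][j - 1] + GAP_COST, "skip_dub"),
            ]
            cost[i][j], back[i][j] = min(options, key=lambda o: o[0])

    # Follow the stored moves back from the end to recover the alignment.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "match":
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif move == "skip_orig":
            pairs.append((i - 1, None))
            i -= 1
        else:
            pairs.append((None, j - 1))
            j -= 1
    pairs.reverse()
    return cost[n][m], pairs


def scale_factors(pairs, orig_dur, dub_dur):
    """Time-scaling factor (target / current duration) for each matched dub phone."""
    return {
        j: orig_dur[i] / dub_dur[j]
        for i, j in pairs
        if i is not None and j is not None
    }


# Made-up example: phone sequences with durations in milliseconds.
orig_phones, orig_ms = ["b", "o", "m"], [80, 120, 100]
dub_phones, dub_ms = ["p", "u", "m"], [100, 100, 100]

total, pairs = align(orig_phones, dub_phones)
factors = scale_factors(pairs, orig_ms, dub_ms)
print(total, pairs, factors)
```

A factor below 1 means the dubbed phone would be compressed and a factor above 1 means it would be stretched. Phones left unmatched by the alignment get no factor here; how such phones should be handled is a design decision that this sketch leaves open.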
A Treta Continua
This scene is from the Portuguese comedy play A Treta Continua, performed by the actors José Pedro Gomes and António Feio, with non-professional dubbing and very poor lip-synchronization.
|   |   |   | 
|---|---|---|
| avi mpeg4 | avi mpeg4 | avi mpeg4 | 
| Original (Portuguese) | Non-Professional Dubbing (English) | After VoiceDub (English) | 
Harry Potter and the Prisoner of Azkaban
This scene is from the Warner Bros. Pictures movie Harry Potter and the Prisoner of Azkaban, with professional dubbing and traditional lip-synchronization.
|   |   |   | 
|---|---|---|
| avi mpeg4 | avi mpeg4 | avi mpeg4 | 
| Original (Portuguese) | Professional Dubbing (English) | After VoiceDub (English) | 
Brother Bear - Kenai and Koda
This scene is from the Walt Disney movie Brother Bear - Kenai and Koda, with professional dubbing and traditional lip-synchronization.