Line 7: | Line 7: | ||
* Ferramenta de pesquisa em áudio para advogados | * Ferramenta de pesquisa em áudio para advogados | ||
− | * Microphone network selection and calibration approaches in multi-room environments | + | * Microphone network selection and calibration approaches in multi-room environments (Aluno: Miguel Matos Orientador: Alberto Abad) |
* Mobile alert: combining human mobile motion detection and voice analysis | * Mobile alert: combining human mobile motion detection and voice analysis | ||
Line 13: | Line 13: | ||
* Os arquivos de áudio de ontem e de hoje | * Os arquivos de áudio de ontem e de hoje | ||
− | * Query-by-example speaker search in large data speech collections | + | * Query-by-example speaker search in large data speech collections (Aluno: ??? Orientador: Alberto Abad) |
− | * Spoken term detection in speech collections using spoken queries | + | * Spoken term detection in speech collections using spoken queries (Aluno: ??? Orientador: Alberto Abad) |
− | * Towards a universal language recognition system | + | * Towards a universal language recognition system (Aluno: ??? Orientador: Alberto Abad) |
* Using Shazam-like audio fingerprinting to block advertisements in audio podcasts | * Using Shazam-like audio fingerprinting to block advertisements in audio podcasts |
Advertisement-blocking applications in internet browsers (e.g. AdBlock in Firefox or Chrome) block advertisements appearing on websites so that they do not display on screen. Audio fingerprinting applications like Shazam recognize a song from small segments by using audio fingerprinting. The objective of this thesis is to design a program which is able to block advertisements found on audio available for download on the web (podcast) on semi-supervised form. The method should exploit fingerprinting techniques like those of Shazam and the fact that Ads do not change from podcast to podcast, unlike the conventional content. The work will include
- Gathering a small set of audio examples from the web containing advertisements (i.e. from BBC podcasts) which is representative of the problem to be solved.
- Using fingerprinting algorithms to identify similar segments in the audio podcasts, store fingerprints and filter out advertisements.