Downloads: Difference between revisions
From HLT@INESC-ID
No edit summary |
No edit summary |
||
Line 9: | Line 9: | ||
== Corpora Resources == | == Corpora Resources == | ||
=== Speech === | |||
* '''[[VoxCeleb-PT]]''' - annotated corpus of European Portuguese celebrities. | |||
=== Automatic Key Phrase Extraction === | === Automatic Key Phrase Extraction === |
Revision as of 14:34, 7 June 2022
These are tools and resources made available by the L²F.
Tools
- Just.Ask is a Question-Answering system for English
- Eugenio is a word predictor for European Portuguese
Corpora Resources
Speech
- VoxCeleb-PT - annotated corpus of European Portuguese celebrities.
Automatic Key Phrase Extraction
- 110-PT-BN-KP - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.
Translation
- Golden collection of parallel multi-language word alignments - Manually annotated word alignments between six european languages taken from the Europarl common test set
(more information on the Speech-to-speech Translation information page)
Recommendation
- Fairy tale corpus - Corpus of fairy tales: the corpus is divided in semantically related clusters.
Lexical Resources
Other
- Portuguese Dictionary for Excalibur, a spell checker for the Macintosh.
Assembled by Nuno Mamede in association with LabEL