Downloads: Difference between revisions
From HLT@INESC-ID
No edit summary |
No edit summary |
||
Line 9: | Line 9: | ||
=== Automatic Key Phrase Extraction === | === Automatic Key Phrase Extraction === | ||
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases | * '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases. | ||
* '''[[500N-KPCrowd]]''' - | * '''[[500N-KPCrowd]]''' - Corpus of English News articles annotated with Key Phrases using Amazon's Mechanical Turk. | ||
=== Translation === | === Translation === |
Revision as of 10:00, 11 September 2013
These are tools and resources made available by the L²F.
Tools
- Eugenio is a word predictor for European Portuguese
Corpora Resources
Automatic Key Phrase Extraction
- 110-PT-BN-KP - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.
- 500N-KPCrowd - Corpus of English News articles annotated with Key Phrases using Amazon's Mechanical Turk.
Translation
- Golden collection of parallel multi-language word alignments - Manually annotated word alignments between six european languages taken from the Europarl common test set
(more information on the Speech-to-speech Translation information page)
Recommendation
- Fairy tale corpus - Corpus of fairy tales: the corpus is divided in semantically related clusters.
Lexical Resources
Other
- Portuguese Dictionary for Excalibur, a spell checker for the Macintosh.
Assembled by Nuno Mamede in association with LabEL