Downloads: Difference between revisions

From HLT@INESC-ID

Revision as of 14:34, 7 June 2022

These are tools and resources made available by the L²F.

Tools

  • Just.Ask is a question-answering system for English.
  • Eugenio is a word predictor for European Portuguese.

Corpora Resources

Speech

  • VoxCeleb-PT - Annotated corpus of European Portuguese celebrities.

Automatic Key Phrase Extraction

  • 110-PT-BN-KP - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.

Translation

Recommendation

  • Fairy tale corpus - Corpus of fairy tales, divided into semantically related clusters.

Lexical Resources

Other