Difference between revisions of "Downloads"

From HLT@INESC-ID

 
Line 9: Line 9:
  
 
== Corpora Resources ==
 
== Corpora Resources ==
 
=== Speech ===
 
* '''[[VoxCeleb-PT]]''' - annotated corpus of European Portuguese celebrities.
 
  
 
=== Automatic Key Phrase Extraction ===
 
=== Automatic Key Phrase Extraction ===

Latest revision as of 15:32, 7 June 2022

These are tools and resources made available by the L²F.

Tools

  • Just.Ask is a Question-Answering system for English
  • Eugenio is a word predictor for European Portuguese

Corpora Resources

Automatic Key Phrase Extraction

  • 110-PT-BN-KP - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.

Translation

Recommendation

  • Fairy tale corpus - Corpus of fairy tales: the corpus is divided in semantically related clusters.

Lexical Resources

Other