Difference between revisions of "L²F Day 2006"

From HLT@INESC-ID

(Lexicons)
(Lexicons)
Line 26: Line 26:
 
* DicPro: 6.2k anthroponyms
 
* DicPro: 6.2k anthroponyms
 
* SMorph: 26k root forms (morphology + inflection paradigm)
 
* SMorph: 26k root forms (morphology + inflection paradigm)
* EPLexIC: 80k word forms (morphology + pronunciation)
+
* EPLexIC: 80k word forms (morphology + pronunciation); in construction
 
* ONOMASTICA: 85k proper names (people, streets, cities, companies); 11 languages and cross-lingual information; pronunciation
 
* ONOMASTICA: 85k proper names (people, streets, cities, companies); 11 languages and cross-lingual information; pronunciation
 
* Broadcast News: 64k entries (pronunciation)
 
* Broadcast News: 64k entries (pronunciation)

Revision as of 10:27, 17 February 2006

Integrated Tools and Ontologies

Presentation by Joana Paulo.

  • Integrated tools:
    • ATA
    • JaVaLi!
    • DID
    • SAF
  • 3rd Party
    • Intex
  • Ontologies:
    • OntoWine (wine domain ontology)
    • OntoChef (cooking domain ontology)

Lexicons

Presentation by Ricardo Daniel Ribeiro.

  • PAROLE/SIMPLE: 20k root forms + inflection paradigms (morphology + syntax + semantics)
  • LUSOlex: 65k root forms (morphology + gramcat)
  • BRASILex: 68k root forms (morphology + gramcat)
  • Integração do LUSOlex + EPLexIC: ~8-10x EPLexIC phonetic forms
  • DicPro: 6.2k anthroponyms
  • SMorph: 26k root forms (morphology + inflection paradigm)
  • EPLexIC: 80k word forms (morphology + pronunciation); in construction
  • ONOMASTICA: 85k proper names (people, streets, cities, companies); 11 languages and cross-lingual information; pronunciation
  • Broadcast News: 64k entries (pronunciation)