Resources

From HLT@INESC-ID

Revision as of 02:04, 13 February 2006 by Root (Talk | contribs)


L²F has been particularly active in the creation of linguistic resources for European Portuguese. The cooperation with CLUL has been of paramount importance in this activity. The resources are listed in reverse chronological order. The corresponding webpages are in Portuguese.

  • IPSOM (aligned spoken books)
  • ALERT
  • CORAL
  • BD-PÚBLICO
  • SPEECHDAT
  • BDFALA
  • EUROM.1

Pronunciation lexica (besides the ones included in the above corpora documentation):

  • ONOMASTICA (proper names in 11 European languages, in cooperation with TLP - Telefones de Lisboa e Porto): ~100,000 names of people, streets, towns and companies
  • PF (Português Fundamental): ~26,000 citation forms

The pronunciation lexica developed at L²F use the SAMPA phonetic alphabet. See the SAMPA table for European Portuguese and some comments about its design.

See also:

If you have time, surf the links ...