Resources: Difference between revisions
From HLT@INESC-ID
m (→Lexica) |
m (→See Also) |
||
Line 24: | Line 24: | ||
* [http://www.icp.grenet.fr/ELRA/home.html ELRA] (European Language Resources Association) | * [http://www.icp.grenet.fr/ELRA/home.html ELRA] (European Language Resources Association) | ||
* [http://morph.ldc.upenn.edu/ LDC] (Linguistic Data Consortium) | * [http://morph.ldc.upenn.edu/ LDC] (Linguistic Data Consortium) | ||
* [http://crnvmc.cern.ch/FIND/DICTIONARY? English/Technical Dictionary] | * Dictionaries: | ||
* [gopher://uts.mcc.ac.uk/77/gopherservices/enquire.english American English Dictionary] | ** [http://crnvmc.cern.ch/FIND/DICTIONARY? English/Technical Dictionary] | ||
* [gopher://gopher.princeton.edu:5003/7 Webster's Dictionary] | ** [gopher://uts.mcc.ac.uk/77/gopherservices/enquire.english American English Dictionary] | ||
* [gopher://info.mcc.ac.uk/77/miscellany/acronyms/.index/index Acronyms Dictionary] | ** [gopher://gopher.princeton.edu:5003/7 Webster's Dictionary] | ||
* [http://www.fmi.uni-passau.de/htbin/lt/lte English-German Dictionary] | ** [gopher://info.mcc.ac.uk/77/miscellany/acronyms/.index/index Acronyms Dictionary] | ||
* [http://www.fmi.uni-passau.de/htbin/lt/ltd German-English Dictionary] | ** [http://www.fmi.uni-passau.de/htbin/lt/lte English-German Dictionary] | ||
* [http://nova.sti.nasa.gov/nasa-thesaurus.html NASA Thesaurus] | ** [http://www.fmi.uni-passau.de/htbin/lt/ltd German-English Dictionary] | ||
** [http://nova.sti.nasa.gov/nasa-thesaurus.html NASA Thesaurus] | |||
[[category:Resources]] | [[category:Resources]] |
Revision as of 13:32, 3 June 2006
L²F has been particularly active in the creation of linguistic resources for European Portuguese. The cooperation with CLUL has been of paramount importance in this activity. The resources listed are in inverse chronological order. The corresponding webpages are in Portuguese.
Corpora
- IPSOM - aligned spoken books
- ALERT
- CORAL
- BD-PÚBLICO- large vocabulary, speaker-independent, continuous speech corpus
- SPEECHDAT - corpus for training and assessment of isolated and continuous speech utterances
- BDFALA
- EUROM.1
Lexica
Pronunciation lexica (besides the ones included in the above corpora documentation):
- ONOMASTICA (Proper names of 11 European languages, in cooperation with TLP - Telefones de Lisboa e Porto): ~ 100.000 names of people, streets, towns and companies
- PF (Português Fundamental): ~ 26.000 citation forms
The pronunciation lexica developed by L²F use the SAMPA phonetic alphabet. See the SAMPA table for European Portuguese and some comments about its design.
See Also
- Resource Links
- List of Newspapers on the Internet produced by Isabel Trancoso and maintained jointly with IMS Stuttgart.
- Linguateca (Distributed language resource center for Portuguese)
- ELRA (European Language Resources Association)
- LDC (Linguistic Data Consortium)
- Dictionaries: