Line 3: | Line 3: | ||
== Corpora == | == Corpora == | ||
− | * [[LECTRA Corpus|LECTRA]] | + | * [[LECTRA Corpus|LECTRA]] - Classroom lectures |
− | * [[IPSOM Pilot Corpus|IPSOM]] - | + | * [[IPSOM Pilot Corpus|IPSOM]] - Aligned spoken books |
− | * [[ALERT Corpus|ALERT]] | + | * [[ALERT Corpus|ALERT]] - Broadcast news |
− | * [[CORAL Corpus|CORAL]] | + | * [[CORAL Corpus|CORAL]] - Spoken dialogues (map task) |
− | * [[BD-PÚBLICO Corpus|BD-PÚBLICO]]- | + | * [[BD-PÚBLICO Corpus|BD-PÚBLICO]]- Large vocabulary, speaker-independent, continuous speech |
− | * [[SPEECHDAT Corpus|SPEECHDAT]] - | + | * [[SPEECHDAT Corpus|SPEECHDAT]] - Multi-purpose telephone speech database |
− | * [[BDFALA Corpus|BDFALA]] | + | * [[BDFALA Corpus|BDFALA]] - Speech analysis / synthesis (extension of EUROM.1) |
− | * [[EUROM.1 Corpus|EUROM.1]] | + | * [[EUROM.1 Corpus|EUROM.1]] - Multi-Lingual speech corpus for phonetic comparison |
== Lexica == | == Lexica == |
L²F has been particularly active in the creation of linguistic resources for European Portuguese. The cooperation with CLUL has been of paramount importance in this activity. The resources listed are in inverse chronological order. The corresponding webpages are in Portuguese.
Pronunciation lexica (besides the ones included in the above corpora documentation):
The pronunciation lexica developed by L²F use the SAMPA phonetic alphabet. See the SAMPA table for European Portuguese and some comments about its design.