ATA (system)

The development of terminologies is a hard work, especially when it is done manually. ATA eases this task by extracting the terms contained in a specialized text. The main purpose of this tool is to suggest a list of words likely to be terms giving the linguistic and statistic information collected about them. The results can be used for example in topic detection.

Goals

The main goal of the ATA project is to automatically extract the terms in a specialized text. To do this, the text is feed into a chain of NLP tools: SMorph, PAsMo, MARv, SuSAna, GeTerms, Anota, PrintTerms. The first four modules will do the linguistic analysis. The two in the middle are responsible for the statistical enrichments of the text, counting the occurrences of each candidate. Also they add information about the word count on reference text. The last module is responsible for the decision on which words to suggest.

Features

Team / Authors

Joana L. Paulo

Platforms

Windows / Linux / MacOSX (as long as all the tools needed are still available in both platforms)

Developing status

Stable
Project ended in 2004/07

ATA (system)

From HLT@INESC-ID

Revision as of 18:33, 25 June 2008 by Joana (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Goals

Features

ATA (system)

From HLT@INESC-ID

Revision as of 18:33, 25 June 2008 by Joana (talk | contribs)(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Goals

Features

Revision as of 18:33, 25 June 2008 by Joana (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)