ATA (system)


Revision as of 18:33, 25 June 2008 by Joana (Talk | contribs) (Features)

The development of terminologies is a hard work, especially when it is done manually. ATA eases this task by extracting the terms contained in a specialized text. The main purpose of this tool is to suggest a list of words likely to be terms giving the linguistic and statistic information collected about them. The results can be used for example in topic detection.


The main goal of the ATA project is to automatically extract the terms in a specialized text. To do this, the text is feed into a chain of NLP tools: SMorph, PAsMo, MARv, SuSAna, GeTerms, Anota, PrintTerms. The first four modules will do the linguistic analysis. The two in the middle are responsible for the statistical enrichments of the text, counting the occurrences of each candidate. Also they add information about the word count on reference text. The last module is responsible for the decision on which words to suggest.


Team / Authors
  • Joana L. Paulo
  • Windows / Linux / MacOSX (as long as all the tools needed are still available in both platforms)
Developing status
  • Stable
  • Project ended in 2004/07