PAsMo: Difference between revisions

From HLT@INESC-ID

No edit summary
 
No edit summary
Line 1: Line 1:
Performs a morphological analysis over unrestricted text and easily integrates a Natural Language Processing System as it regroups words and their constituents according to rewriting rules; changes the tags according to correspondence rules; and splits phrases according to a list of separators. Works over real texts, such as newspapers and books.
Performs a morphological analysis over unrestricted text and easily integrates a Natural Language Processing System as it regroups words and their constituents according to rewriting rules; changes the tags according to correspondence rules; and splits phrases according to a list of separators. Works over real texts, such as newspapers and books.


= Goals =
== Goals ==
PAsMo rewrites the text according to the morphological features of is words and a set of given rewriting rules. PAsMo is also used to translate tags, easing the eventual integration with the syntactic analyzer and the proceeding modules in a Natural Language chain. Separation in phrases is preformed according to a list of separator words. The possibility of adding initial (and final) junks to use in the rules is possible.
PAsMo rewrites the text according to the morphological features of is words and a set of given rewriting rules. PAsMo is also used to translate tags, easing the eventual integration with the syntactic analyzer and the proceeding modules in a Natural Language chain. Separation in phrases is preformed according to a list of separator words. The possibility of adding initial (and final) junks to use in the rules is possible.


= Features =
== Features ==
Platforms
Platforms
* Windows / Linux / MacOSX
* Windows / Linux / MacOSX

Revision as of 18:27, 25 June 2008

Performs a morphological analysis over unrestricted text and easily integrates a Natural Language Processing System as it regroups words and their constituents according to rewriting rules; changes the tags according to correspondence rules; and splits phrases according to a list of separators. Works over real texts, such as newspapers and books.

Goals

PAsMo rewrites the text according to the morphological features of is words and a set of given rewriting rules. PAsMo is also used to translate tags, easing the eventual integration with the syntactic analyzer and the proceeding modules in a Natural Language chain. Separation in phrases is preformed according to a list of separator words. The possibility of adding initial (and final) junks to use in the rules is possible.

Features

Platforms

  • Windows / Linux / MacOSX

Developing status

  • Stable
  • Current version: 3.5

Integration in other systems

As PAsMo writes the tags and regroups the words it is mainly used to ease the integration of a morphological analyzer with the following module in the NLP chain.

Implementation issues

PAsMo is written in C++ and the algorithm was enhanced, reducing processing time. Also XML input and output is available in order to ease communication with other modules and allow data verification.

Client/Server platforms

A distributed version of PAsMo is available allowing the use of the system in a client/server platform through Sun Remote Procedure Call (RPC).