Difference between revisions of "Tools"

From HLT@INESC-ID

 
(9 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
{{TOCright}}
 
{{TOCright}}
Language processing can take advantage of several tools, such as syntactic and semantic analyzers. In order to perform their tasks, some of these  tools use linguistic information (for instance, dictionaries and grammars), making natural language processing by computers closer to the human process.
+
{| align="center" style="border-style: solid; border-width: 1px; border-color: gray80;"
 +
|[[Image:tools1.jpg|197px]]
 +
|[[Image:tools2.jpg|197px]]
 +
|[[Image:tools3.jpg|197px]]
 +
|}
 +
Language processing uses tools such as syntactic and semantic analyzers. In order to perform their tasks, some of these  tools use linguistic information (for instance, dictionaries and grammars), making natural language processing by computers closer to the human process.
  
 
We are using natural language processing  tools in many of our applications, namely in dialog management, automatic summarization, information retrieval, question answering, discourse analysis, term and emotion extraction.
 
We are using natural language processing  tools in many of our applications, namely in dialog management, automatic summarization, information retrieval, question answering, discourse analysis, term and emotion extraction.
Line 12: Line 17:
 
* [[RuDriCo]] - a rule-based morphology processor;
 
* [[RuDriCo]] - a rule-based morphology processor;
 
* [[SMorph]] - a morphological analyser;
 
* [[SMorph]] - a morphological analyser;
* [[XA]] - a morphological analyser similar to [[wikipedia:ispell|ispell]] and [[jspell]];
+
* [[XA]] - a morphological analyser similar to ispell and jspell;
 
* [[YAH]] - yet another hyphenator.
 
* [[YAH]] - yet another hyphenator.
 +
* [[VerbForms]] - a verbal form generator;
  
 
== Syntax ==
 
== Syntax ==
  
 
* [[Algas]] -- establishes dependency relations between chunks and words
 
* [[Algas]] -- establishes dependency relations between chunks and words
* [[ParVO]] -- a [[wikipedia:C_Plus_Plus|C++]] implementation of [[wikipedia:Earley parser|Earley's algorithm]] with attribute [[wikipedia:Unification|unification]] (as in an [[wikipedia:attribute grammar|attribute grammar]]).
+
* [[ParVO]] -- a C++ implementation of Earley's algorithm with attribute unification (as in an attribute grammar).
* [[SuSAna]]
+
* [[SuSAna]] -- susrface syntactic analyser and chunker
 
* [[TiraTeimas]] -- verifies if a set of chunks satisfies a set of constraints
 
* [[TiraTeimas]] -- verifies if a set of chunks satisfies a set of constraints
  
 
== Syntax/Semantics Interface ==
 
== Syntax/Semantics Interface ==
  
* [[FUF Unification Engine]] -- a funcional unification formalism unification engine
 
 
* [[AsDeCopas]] -- applies contextual rules (possibly hierarchically organized) to a graph
 
* [[AsDeCopas]] -- applies contextual rules (possibly hierarchically organized) to a graph
 
* [[Ogre]] -- transforms a structure where both chunks and words are connected into a dependency structure
 
* [[Ogre]] -- transforms a structure where both chunks and words are connected into a dependency structure
Line 30: Line 35:
 
== Semantics ==
 
== Semantics ==
  
* [[ATA]] - automatic term extraction (semantics-like processing)
+
* [[ATA (system)|ATA]] - automatic term extraction (semantics-like processing)
  
 
== Discourse Analysis ==
 
== Discourse Analysis ==
Line 52: Line 57:
 
* [[AUDIMUS]] - Automatic Large Vocabulary Continuous Speech Recognition for the European Portuguese language
 
* [[AUDIMUS]] - Automatic Large Vocabulary Continuous Speech Recognition for the European Portuguese language
  
== Other tools ==
+
== Speech Search ==
  
* [[SAF]]
+
* [[DTW]] - DTW system developed for the Mediaeval 2013 Spoken Web Search challenge
 +
* [[STDfusion]] - Tools for discriminative calibration/fusion of heterogeneous spoken term detection systems
  
 
[[category:Tools]]
 
[[category:Tools]]

Latest revision as of 12:22, 2 October 2013

Tools1.jpg Tools2.jpg Tools3.jpg

Language processing uses tools such as syntactic and semantic analyzers. In order to perform their tasks, some of these tools use linguistic information (for instance, dictionaries and grammars), making natural language processing by computers closer to the human process.

We are using natural language processing tools in many of our applications, namely in dialog management, automatic summarization, information retrieval, question answering, discourse analysis, term and emotion extraction. Besides applying these tools to text we are applying them to automatic transcriptions of spoken documents, leading to new challenges.

Morphology

  • MARv - a morphossyntactic disambiguation tool;
  • monge - a word form generator;
  • PAsMo - a rule-based morphology processor, tag converter, and sentence splitter;
  • RuDriCo - a rule-based morphology processor;
  • SMorph - a morphological analyser;
  • XA - a morphological analyser similar to ispell and jspell;
  • YAH - yet another hyphenator.
  • VerbForms - a verbal form generator;

Syntax

  • Algas -- establishes dependency relations between chunks and words
  • ParVO -- a C++ implementation of Earley's algorithm with attribute unification (as in an attribute grammar).
  • SuSAna -- susrface syntactic analyser and chunker
  • TiraTeimas -- verifies if a set of chunks satisfies a set of constraints

Syntax/Semantics Interface

  • AsDeCopas -- applies contextual rules (possibly hierarchically organized) to a graph
  • Ogre -- transforms a structure where both chunks and words are connected into a dependency structure

Semantics

  • ATA - automatic term extraction (semantics-like processing)

Discourse Analysis

  • DID - a discourse indentifier.

Multi-purpose

  • Galinha - a portal for building and running applications.
  • LRDB - a language resources database and access framework.
  • FSTk - a finite-state transducer library.
  • ShReP - A Framework for constructing NLP systems.

Speech Annotation

  • L2F_PhoneAlign - A DTW-based phonetic aligner.
  • L2F_MuLA - A tool to synchronize annotation of speech at various levels of granularity.

Speech Recognition

  • AUDIMUS - Automatic Large Vocabulary Continuous Speech Recognition for the European Portuguese language

Speech Search

  • DTW - DTW system developed for the Mediaeval 2013 Spoken Web Search challenge
  • STDfusion - Tools for discriminative calibration/fusion of heterogeneous spoken term detection systems
Retrieved from "