<p>https://www.hlt.inesc-id.pt/wiki/index.php?title=NLE_GRID_Project_-_State_of_the_Art_and_References&feed=atom&action=history</p>
<p>NLE GRID Project - State of the Art and References - Revision history (2024-03-29T11:35:47Z)</p>
<p>Revision history for this page on the wiki. MediaWiki 1.41.0</p>
<p>https://www.hlt.inesc-id.pt/wiki/index.php?title=NLE_GRID_Project_-_State_of_the_Art_and_References&diff=4632&oldid=prev</p>
<p>David at 11:01, 31 January 2008 (2008-01-31T11:01:29Z)</p>
<p></p>
<table style="background-color: #fff; color: #202122;" data-mw="interface">
<col class="diff-marker" />
<col class="diff-content" />
<col class="diff-marker" />
<col class="diff-content" />
<tr class="diff-title" lang="en">
<td colspan="2" style="background-color: #fff; color: #202122; text-align: center;">← Older revision</td>
<td colspan="2" style="background-color: #fff; color: #202122; text-align: center;">Revision as of 11:01, 31 January 2008</td>
</tr><tr><td colspan="2" class="diff-lineno" id="mw-diff-left-l20">Line 20:</td>
<td colspan="2" class="diff-lineno">Line 20:</td></tr>
<tr><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><br></td><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><br></td></tr>
<tr><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>To benefit from the use of a computational GRID, NLE applications need to subscribe to an architectural model that allows automated discovery of components and data, a flexible way to incorporate the different components in a working application, coordination of execution and storage of results. The goal is to allow NLE researchers to design their applications for a computational GRID without requiring expertise in GRID computing.</div></td><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>To benefit from the use of a computational GRID, NLE applications need to subscribe to an architectural model that allows automated discovery of components and data, a flexible way to incorporate the different components in a working application, coordination of execution and storage of results. The goal is to allow NLE researchers to design their applications for a computational GRID without requiring expertise in GRID computing.</div></td></tr>
<tr><td class="diff-marker" data-marker="−"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;"><div><del style="font-weight: bold; text-decoration: none;"></del></div></td><td colspan="2" class="diff-side-added"></td></tr>
<tr><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><br></td><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><br></td></tr>
<tr><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>== References ==</div></td><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>== References ==</div></td></tr>
</table>
<p>David</p>
<p>https://www.hlt.inesc-id.pt/wiki/index.php?title=NLE_GRID_Project_-_State_of_the_Art_and_References&diff=4631&oldid=prev</p>
<p>David at 11:01, 31 January 2008 (2008-01-31T11:01:18Z)</p>
<p></p>
<table style="background-color: #fff; color: #202122;" data-mw="interface">
<col class="diff-marker" />
<col class="diff-content" />
<col class="diff-marker" />
<col class="diff-content" />
<tr class="diff-title" lang="en">
<td colspan="2" style="background-color: #fff; color: #202122; text-align: center;">← Older revision</td>
<td colspan="2" style="background-color: #fff; color: #202122; text-align: center;">Revision as of 11:01, 31 January 2008</td>
</tr><tr><td colspan="2" class="diff-lineno" id="mw-diff-left-l20">Line 20:</td>
<td colspan="2" class="diff-lineno">Line 20:</td></tr>
<tr><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><br></td><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><br></td></tr>
<tr><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>To benefit from the use of a computational GRID, NLE applications need to subscribe to an architectural model that allows automated discovery of components and data, a flexible way to incorporate the different components in a working application, coordination of execution and storage of results. The goal is to allow NLE researchers to design their applications for a computational GRID without requiring expertise in GRID computing.</div></td><td class="diff-marker"></td><td style="background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;"><div>To benefit from the use of a computational GRID, NLE applications need to subscribe to an architectural model that allows automated discovery of components and data, a flexible way to incorporate the different components in a working application, coordination of execution and storage of results. The goal is to allow NLE researchers to design their applications for a computational GRID without requiring expertise in GRID computing.</div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">== References ==</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* J. Curran. Blueprint for a High Performance NLP Infrastructure. Proceedings of the HLT-NAACL 2003 Workshop on Software Engineering and Architecture of Language Technology Systems, Edmonton, Canada pp. 39-44. ACL. 2003.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* H. Cunningham, Y. Wilks, and R. J. Gaizauskas. GATE - a General Architecture for Text Engineering. In Proc. of the 16th Conf. on Computational Linguistics (COLING96), Copenhagen, 1996.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Rajkumar Buyya, David Abramson and Jonathan Giddy. NimrodG Resource Broker for Service Oriented Grid Computing. IEEE Distributed Systems Online, Vol. 2, N. 7, November 2001.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Ian Foster, Carl Kesselman, J Nick, Steven Tuecke. The Physiology of the Grid - An Open Grid Services Architecture for Distributed Systems Integration. Global Grid Forum, June 22, 2002.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Ian Foster, Carl Kesselman, Steven Tuecke. The anatomy of the grid: Enabling scalable virtual organisations. International Journal of Supercomputer Applications, 15(3), 2001</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Globus Project. The Globus Project. University of Chicago</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* David Graff. English Gigaword. Linguistic Data Consortium, 2002</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Baden Hughes and Steven Bird. A Grid Based Architecture for High-Performance NLP. Natural Language Engineering, 2003.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Ricardo Ribeiro, David M. de Matos, and Nuno Mamede. How to integrate data from different sources. Proc. of the INTERA Workshop "A Registry of Linguistic Data Categories within an Integrated Language Resources Repository Area". ELRA. May 2004.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* F. Batista and Nuno Mamede. Flexible Module for Shallow Parsing, Using Preferences. TASHA'2003 - Workshop on Tagging and Shallow Processing of Portuguese, Oct. 2003.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Ricardo Ribeiro, Nuno Mamede, and Isabel Trancoso. Reusing Linguistic Resources: a Case Study in Morphosyntactic Tagging. TASHA'2003 - Workshop on Tagging and Shallow Processing of Portuguese, Oct. 2003.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* R. Ribeiro, L. Oliveira, I. Trancoso. Using Morphossyntactic Information in TTS Systems: Comparing Strategies for European Portuguese. Proc. PROPOR'2003 Faro, Portugal, June 2003</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* S. Paulo, L. Oliveira. Multilevel Annotation Of Speech Signals Using Weighted Finite State Transducers. Proc. 2002 IEEE Workshop on Speech Synthesis Santa Monica, California, September 2002</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* D. M. de Matos, A. Mateus, J. Graca, and N. J. Mamede. Empowering the User: a Data-oriented Application-Building Framework. In Adj. Proc. of the 7th ERCIM Workshop "User Interfaces for All", pages 37-44, Chantilly, France, October 2002.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* M. C. Viana, Luis C. Oliveira, and A. I. Mata. Prosodic Phrasing: Machine and Human Evaluation. International Journal of Speech Technology, 6(1), pp. 83-94, Jan. 2003, Kluwer Academic Publishers.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* David M. de Matos, Ricardo Ribeiro, and Nuno Mamede. Rethinking Reusable Resources. Proc. of the International Conference on Language Resources and Evaluation, LREC'2004, ELRA. May 2004.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* Sergio Manuel Gaspar Ferreira Paulo and Luis C. Oliveira. DTW-based Phonetic Alignment Using Multiple Acoustic Features. EUROSPEECH'2003 - 8th European Conference on Speech Communication and Technology, Sep. 2003.</ins></div></td></tr>
<tr><td colspan="2" class="diff-side-deleted"></td><td class="diff-marker" data-marker="+"></td><td style="color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;"><div><ins style="font-weight: bold; text-decoration: none;">* David Matos, Joana Paulo, and Nuno Mamede. Managing Linguistic Resources and Tools. Proc. of the 6th Intl. Workshop, PROPOR 2003, Jun. 2003, pp. 135-142, Springer-Verlag, Heidelberg.</ins></div></td></tr>
</table>
<p>David</p>
<p>https://www.hlt.inesc-id.pt/wiki/index.php?title=NLE_GRID_Project_-_State_of_the_Art_and_References&diff=4629&oldid=prev</p>
<p>David at 11:00, 31 January 2008 (2008-01-31T11:00:14Z)</p>
<p></p>
<p><b>New page</b></p><div>The following sections contain the state of the art and bibliographic references of the NLE GRID project proposal.<br />
<br />
== State of the Art ==<br />
<br />
The use of distributed computing services in NLE is still at an early stage compared with what has been achieved in areas like high-energy physics and biology. In our view, this is due to the lack of standardization and interoperability of most NLE tools. <br />
<br />
Research laboratories like ours, which use a considerable number of NLE tools and modules, often face the problem of reusing these resources. These may have been produced in-house or they may be third-party modules. In either case, managing them is not simple: for instance, a tool may be available but deemed too hard to reuse for a particular task, leading to the redevelopment of a similar tool. <br />
<br />
If reuse is a problem, the contact between old tools and new users is also a critical issue. The problem here is often the time required to acquire the expertise needed to use a resource fully and productively. <br />
<br />
To address the above issues, Matos (2003) proposed the Galinha system, a web-based user interface for building modular applications. The interface allows new users and non-specialists to assemble and test complex prototypes: the only requirement is a clear understanding of the meaning of the data used by each module - a requirement much less stringent than understanding the modules themselves. <br />
<br />
The infrastructure used to support the interface is a partial implementation of the theoretical interconnection model proposed in Matos (2002). In the first stage, the Galaxy Communicator system (MIT, 2001) was selected to provide messaging support for the infrastructure's message exchanges. <br />
<br />
A similar solution was proposed by Curran (2003), using a Generative Programming approach for the development of NLE applications by the composition of elementary components such as sentence boundary detectors, POS taggers, chunkers and named entity recognizers. These reusable components can be optimized for both performance and high runtime efficiency. They are encapsulated with standard interfaces for gluing them together into new tools. Curran also suggests the use of a web services interface to allow the composition of components developed by different researchers running in different locations. <br />
<br />
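The idea of gluing components together through a standard interface can be illustrated with a minimal sketch. It is not Curran's generative-programming implementation, nor part of the Galinha system: the shared document shape, the component names and the registry are all assumptions made for illustration only. <br />
<br />
```python
from typing import Any, Callable, Dict, List

# Assumption: every component takes and returns the same document shape,
# a dict of annotation layers. This stands in for the "standard interface".
Document = Dict[str, Any]
Component = Callable[[Document], Document]


def split_sentences(doc: Document) -> Document:
    """Toy sentence boundary detector: splits on '. ' only."""
    parts = [s.strip() for s in doc["text"].split(". ")]
    sentences = [s.rstrip(".") for s in parts if s]
    return {**doc, "sentences": sentences}


def tokenize(doc: Document) -> Document:
    """Toy tokenizer: whitespace split of each detected sentence."""
    tokens = [s.split() for s in doc["sentences"]]
    return {**doc, "tokens": tokens}


# Hypothetical registry: components are looked up by name, so an
# application is described by a list of names rather than by code.
REGISTRY: Dict[str, Component] = {
    "sentences": split_sentences,
    "tokens": tokenize,
}


def build(names: List[str]) -> Component:
    """Compose the named components into a single pipeline component."""
    components = [REGISTRY[name] for name in names]

    def pipeline(doc: Document) -> Document:
        for component in components:
            doc = component(doc)
        return doc

    return pipeline


if __name__ == "__main__":
    run = build(["sentences", "tokens"])
    result = run({"text": "Grids are large. NLE tools are many."})
    print(result["tokens"])
```
<br />
Because a composed pipeline has the same signature as a single component, pipelines can themselves be registered and reused as building blocks. <br />
<br />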
Hughes and Bird (2003) proposed the extension of the component-based architecture to integrate interfaces with computational GRID services. In this project we plan to build on that proposal and to integrate it into the Galinha system. <br />
<br />
A computational GRID allows for large-scale analysis, distributed resources and processing, in addition to engendering new models for collaboration and application development. Foster et al. (2001, 2002) provide a physiological and an anatomical overview of GRID computing services, together with foundational architectures for application development in the GRID space. <br />
<br />
To benefit from the use of a computational GRID, NLE applications need to subscribe to an architectural model that allows automated discovery of components and data, a flexible way to incorporate the different components in a working application, coordination of execution and storage of results. The goal is to allow NLE researchers to design their applications for a computational GRID without requiring expertise in GRID computing.</div>David