<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.hlt.inesc-id.pt/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Ldsm</id>
	<title>HLT@INESC-ID - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.hlt.inesc-id.pt/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Ldsm"/>
	<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/w/Special:Contributions/Ldsm"/>
	<updated>2026-05-28T18:49:01Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7980</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7980"/>
		<updated>2017-07-22T18:37:26Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: /* Further Reading */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''500M-KPCrowd''' is a corpus made of 500 news articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from online news sources.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7979</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7979"/>
		<updated>2017-07-22T18:37:00Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: /* Download */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''500M-KPCrowd''' is a corpus made of 500 news articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from online news sources.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7978</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7978"/>
		<updated>2017-07-22T18:33:24Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: /* Resources */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7977</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7977"/>
		<updated>2017-07-22T18:32:37Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''500M-KPCrowd''' is a corpus made of 500 news articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from online news sources.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
Contact author&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7197</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7197"/>
		<updated>2013-12-19T00:43:44Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''500M-KPCrowd''' is a corpus made of 500 news articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from online news sources.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/500N-KPCrowd.zip 500N-KPCrowd]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7196</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7196"/>
		<updated>2013-12-19T00:42:50Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Resources ==&lt;br /&gt;
* [[110-PT-BN-KP]] - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.&lt;br /&gt;
* [[500N-KPCrowd]] - 500 news articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=EVNE2013&amp;diff=7134</id>
		<title>EVNE2013</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=EVNE2013&amp;diff=7134"/>
		<updated>2013-10-09T16:55:22Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;EVNE2013 - Economic and Violent Named-Event corpus&lt;br /&gt;
&lt;br /&gt;
Version 1.0 2013&lt;br /&gt;
Created by Luis Marujo (lmarujo@cs.cmu.edu)&lt;br /&gt;
&lt;br /&gt;
EVNE2013 has 100 news articles covering 10 violent and economic event types: Armed Clashes, Bankruptcy, Change of CEO, Legal Trouble, Mergers, Sex abuse, Street protest, Strike, Suicide Bombing, and Terrorism Bombing. The news articles were collected from 63 sources.&lt;br /&gt;
&lt;br /&gt;
==  Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/EVNE2013.zip EVNE2013]&lt;br /&gt;
&lt;br /&gt;
== Data Format ==&lt;br /&gt;
&lt;br /&gt;
For each of the 100 news articles we provide the following information per line:&lt;br /&gt;
URL: &lt;br /&gt;
Source:&lt;br /&gt;
Labels: (each label matches one sentence)&lt;br /&gt;
&lt;br /&gt;
We have the following event types and corresponding labels:&lt;br /&gt;
&lt;br /&gt;
Armed Clashes: AC&lt;br /&gt;
Bankruptcy: B&lt;br /&gt;
&lt;br /&gt;
Change of CEO: CC &lt;br /&gt;
&lt;br /&gt;
Legal Trouble: L&lt;br /&gt;
&lt;br /&gt;
Mergers: M &lt;br /&gt;
&lt;br /&gt;
Sex abuse: S&lt;br /&gt;
&lt;br /&gt;
Street protest: P&lt;br /&gt;
&lt;br /&gt;
Strike (Work): SW&lt;br /&gt;
&lt;br /&gt;
Suicide Bombing: SB&lt;br /&gt;
&lt;br /&gt;
Terrorism Bombing: T&lt;br /&gt;
&lt;br /&gt;
No event or null event: N &lt;br /&gt;
&lt;br /&gt;
To obtain the text documents just run the following script in unix command line or cywgin(Windows):&lt;br /&gt;
./downloadFilesScript.sh&lt;br /&gt;
&lt;br /&gt;
Then, we used boilerpipe extractor of text from html e.g.: https://code.google.com/p/boilerpipe/&lt;br /&gt;
and Stanford Parser to identify the sentence boundaries.&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please note that we are not including the news text document due to copyright issues.&lt;br /&gt;
&lt;br /&gt;
Contact the author if you find any problem retrieving and/or aligning the labels with the sentences.&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=EVNE2013&amp;diff=7133</id>
		<title>EVNE2013</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=EVNE2013&amp;diff=7133"/>
		<updated>2013-10-09T16:49:42Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;EVNE2013 - Economic and Violent Named-Event corpus&lt;br /&gt;
&lt;br /&gt;
Version 1.0 2013&lt;br /&gt;
Created by Luis Marujo (lmarujo@cs.cmu.edu)&lt;br /&gt;
&lt;br /&gt;
EVNE2013 has 100 news articles covering 10 violent and economic event types: Armed Clashes, Bankruptcy, Change of CEO, Legal Trouble, Mergers, Sex abuse, Street protest, Strike, Suicide Bombing, and Terrorism Bombing. The news articles were collected from 63 sources.&lt;br /&gt;
&lt;br /&gt;
== Data Format ==&lt;br /&gt;
&lt;br /&gt;
For each of the 100 news articles we provide the following information per line:&lt;br /&gt;
URL: &lt;br /&gt;
Source:&lt;br /&gt;
Labels: (each label matches one sentence)&lt;br /&gt;
&lt;br /&gt;
We have the following event types and corresponding labels:&lt;br /&gt;
Armed Clashes: AC&lt;br /&gt;
Bankruptcy: B&lt;br /&gt;
Change of CEO: CC &lt;br /&gt;
Legal Trouble: L&lt;br /&gt;
Mergers: M &lt;br /&gt;
Sex abuse: S&lt;br /&gt;
Street protest: P&lt;br /&gt;
Strike (Work): SW&lt;br /&gt;
Suicide Bombing: SB&lt;br /&gt;
Terrorism Bombing: T&lt;br /&gt;
&lt;br /&gt;
No event or null event: N &lt;br /&gt;
&lt;br /&gt;
To obtain the text documents just run the following script in unix command line or cywgin(Windows):&lt;br /&gt;
./downloadFilesScript.sh&lt;br /&gt;
&lt;br /&gt;
Then, we used boilerpipe extractor of text from html e.g.: https://code.google.com/p/boilerpipe/&lt;br /&gt;
and Stanford Parser to identify the sentence boundaries.&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please note that we are not including the news text document due to copyright issues.&lt;br /&gt;
&lt;br /&gt;
Contact the author if you find any problem retrieving and/or aligning the labels with the sentences.&lt;br /&gt;
&lt;br /&gt;
==  Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/EVNE2013.zip EVNE2013]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=EVNE2013&amp;diff=7132</id>
		<title>EVNE2013</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=EVNE2013&amp;diff=7132"/>
		<updated>2013-10-09T16:48:58Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: Created page with &amp;quot;EVNE2013 - Economic and Violent Named-Event corpus  Version 1.0 2013 Created by Luis Marujo (lmarujo@cs.cmu.edu)  EVNE2013 has 100 news articles covering 10 violent and economic ...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;EVNE2013 - Economic and Violent Named-Event corpus&lt;br /&gt;
&lt;br /&gt;
Version 1.0 2013&lt;br /&gt;
Created by Luis Marujo (lmarujo@cs.cmu.edu)&lt;br /&gt;
&lt;br /&gt;
EVNE2013 has 100 news articles covering 10 violent and economic event types: Armed Clashes, Bankruptcy, Change of CEO, Legal Trouble, Mergers, Sex abuse, Street protest, Strike, Suicide Bombing, and Terrorism Bombing. The news articles were collected from 63 sources.&lt;br /&gt;
&lt;br /&gt;
== Data Format ==&lt;br /&gt;
&lt;br /&gt;
For each of the 100 news articles we provide the following information per line:&lt;br /&gt;
URL: &lt;br /&gt;
Source:&lt;br /&gt;
Labels: (each label matches one sentence)&lt;br /&gt;
&lt;br /&gt;
We have the following event types and corresponding labels:&lt;br /&gt;
Armed Clashes: AC&lt;br /&gt;
Bankruptcy: B&lt;br /&gt;
Change of CEO: CC &lt;br /&gt;
Legal Trouble: L&lt;br /&gt;
Mergers: M &lt;br /&gt;
Sex abuse: S&lt;br /&gt;
Street protest: P&lt;br /&gt;
Strike (Work): SW&lt;br /&gt;
Suicide Bombing: SB&lt;br /&gt;
Terrorism Bombing: T&lt;br /&gt;
&lt;br /&gt;
No event or null event: N &lt;br /&gt;
&lt;br /&gt;
To obtain the text documents just run the following script in unix command line or cywgin(Windows):&lt;br /&gt;
./downloadFilesScript.sh&lt;br /&gt;
&lt;br /&gt;
Then, we used boilerpipe extractor of text from html e.g.: https://code.google.com/p/boilerpipe/&lt;br /&gt;
and Stanford Parser to identify the sentence boundaries.&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please note that we are not including the news text document due to copyright issues.&lt;br /&gt;
Contact the author if you find problems retrieving and/or aligning the labels with the sentences.&lt;br /&gt;
&lt;br /&gt;
==  Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/EVNE2013.zip EVNE2013]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7121</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7121"/>
		<updated>2013-09-24T18:03:56Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Resources ==&lt;br /&gt;
* [[110-PT-BN-KP]] - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7120</id>
		<title>Downloads</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7120"/>
		<updated>2013-09-24T18:03:24Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
These are tools and resources made available by the L²F.&lt;br /&gt;
&lt;br /&gt;
== Tools ==&lt;br /&gt;
&lt;br /&gt;
* [http://qa.l2f.inesc-id.pt/wiki/index.php/Systems#Just.Ask Just.Ask] is a Question-Answering system for English&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese&lt;br /&gt;
&lt;br /&gt;
== Corpora Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Automatic Key Phrase Extraction ===&lt;br /&gt;
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.&lt;br /&gt;
&lt;br /&gt;
=== Translation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set &amp;lt;br&amp;gt;(more information on the [[Speech-to-speech Translation]] information page)&lt;br /&gt;
&lt;br /&gt;
=== Recommendation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is  divided in semantically related clusters.&lt;br /&gt;
&lt;br /&gt;
== Lexical Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Other ===&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh.&amp;lt;br/&amp;gt;Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7117</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7117"/>
		<updated>2013-09-11T10:00:47Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Resources ==&lt;br /&gt;
* [[110-PT-BN-KP]] - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.&lt;br /&gt;
* [[500N-KPCrowd]] -Corpus of English News articles annotated with Key Phrases using Amazon's Mechanical Turk.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7116</id>
		<title>Downloads</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7116"/>
		<updated>2013-09-11T10:00:25Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
These are tools and resources made available by the L²F.&lt;br /&gt;
&lt;br /&gt;
== Tools ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese&lt;br /&gt;
&lt;br /&gt;
== Corpora Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Automatic Key Phrase Extraction ===&lt;br /&gt;
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases.&lt;br /&gt;
* '''[[500N-KPCrowd]]''' - Corpus of English News articles annotated with Key Phrases using Amazon's Mechanical Turk.&lt;br /&gt;
&lt;br /&gt;
=== Translation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set &amp;lt;br&amp;gt;(more information on the [[Speech-to-speech Translation]] information page)&lt;br /&gt;
&lt;br /&gt;
=== Recommendation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is  divided in semantically related clusters.&lt;br /&gt;
&lt;br /&gt;
== Lexical Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Other ===&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh.&amp;lt;br/&amp;gt;Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7115</id>
		<title>Downloads</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7115"/>
		<updated>2013-09-10T18:35:11Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
These are tools and resources made available by the L²F.&lt;br /&gt;
&lt;br /&gt;
== Tools ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese&lt;br /&gt;
&lt;br /&gt;
== Corpora Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Automatic Key Phrase Extraction ===&lt;br /&gt;
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases&lt;br /&gt;
* '''[[500N-KPCrowd]]''' - Crowdsourced annotated corpus of English News articles with Key Phrases.&lt;br /&gt;
&lt;br /&gt;
=== Translation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set &amp;lt;br&amp;gt;(more information on the [[Speech-to-speech Translation]] information page)&lt;br /&gt;
&lt;br /&gt;
=== Recommendation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is  divided in semantically related clusters.&lt;br /&gt;
&lt;br /&gt;
== Lexical Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Other ===&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh.&amp;lt;br/&amp;gt;Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7114</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7114"/>
		<updated>2013-09-10T18:34:40Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Resources ==&lt;br /&gt;
* [[110-PT-BN-KP]] - Manually annotated corpus of Portuguese Broadcast News with Key Phrases&lt;br /&gt;
* [[500N-KPCrowd]] - Crowdsourced annotated corpus of English News articles with Key Phrases.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7113</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7113"/>
		<updated>2013-09-10T18:32:36Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''500M-KPCrowd''' corpus is made of 500 News articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from the online news sources.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/500N-KPCrowd.zip 500N-KPCrowd]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7112</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7112"/>
		<updated>2013-09-10T18:32:17Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;500M-KPCrowd corpus is made of 500 News articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from the online news sources.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/500N-KPCrowd.zip 500N-KPCrowd]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7111</id>
		<title>500N-KPCrowd</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=500N-KPCrowd&amp;diff=7111"/>
		<updated>2013-09-10T18:30:39Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: Created page with &amp;quot;500M-KPCrowd corpus is made of 500 News articles (50 stories for each of the 10 categories) manually annotated with Key Phrases by 20 Amazon Mechanical Turk workers.   The news a...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;500M-KPCrowd corpus is made of 500 News articles (50 stories for each of the 10 categories) manually annotated with Key Phrases by 20 Amazon Mechanical Turk workers. &lt;br /&gt;
&lt;br /&gt;
The news articles were retrieved from the WWW and are e&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
&lt;br /&gt;
* Number of stories: 450 / 50 (Train / Test)&lt;br /&gt;
* Average number of Amazon Mechanical Turk workers per news: 20&lt;br /&gt;
* Number of Topics: 10&lt;br /&gt;
* Average Number of Key Phrases per news story: 40&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo and Anatole Gershman and Jaime Carbonell and Robert Frederking and João Paulo da Silva Neto, '''Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization''', 8th International Conference on Language Resources and Evaluation (LREC 2012), May. 2012 , ELRA. [http://www.inesc-id.pt/ficheiros/publicacoes/8262.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=8262 bibTeX]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/500N-KPCrowd.zip 500N-KPCrowd]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7110</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7110"/>
		<updated>2013-09-10T18:11:52Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Resources ==&lt;br /&gt;
* [[110-PT-BN-KP]] - Manually annotated corpus of Portuguese Broadcast News with Key Phrases&lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7109</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=7109"/>
		<updated>2013-09-10T18:10:20Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Corpora ==&lt;br /&gt;
[[110-PT-BN-KP]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915,6827'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7108</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7108"/>
		<updated>2013-09-10T18:08:15Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. &lt;br /&gt;
&lt;br /&gt;
The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese '''[[ALERT Corpus|ALERT]]'''&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact '''Luis Marujo''' for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, '''Keyphrase Cloud Generation of Broadcast News''', In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7107</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7107"/>
		<updated>2013-09-10T18:07:40Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. &lt;br /&gt;
&lt;br /&gt;
The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese '''[[ALERT Corpus|ALERT]]'''&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact Luis Marujo for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, '''Keyphrase Cloud Generation of Broadcast News''', In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7106</id>
		<title>Downloads</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7106"/>
		<updated>2013-09-10T18:07:03Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
These are tools and resources made available by the L²F.&lt;br /&gt;
&lt;br /&gt;
== Tools ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese&lt;br /&gt;
&lt;br /&gt;
== Corpora Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Automatic Key Phrase Extraction ===&lt;br /&gt;
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases&lt;br /&gt;
&lt;br /&gt;
=== Translation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set &amp;lt;br&amp;gt;(more information on the [[Speech-to-speech Translation]] information page)&lt;br /&gt;
&lt;br /&gt;
=== Recommendation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is  divided in semantically related clusters.&lt;br /&gt;
&lt;br /&gt;
== Lexical Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Other ===&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh.&amp;lt;br/&amp;gt;Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7105</id>
		<title>Downloads</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Downloads&amp;diff=7105"/>
		<updated>2013-09-10T18:06:41Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{TOCright}}&lt;br /&gt;
These are tools and resources made available by the L²F.&lt;br /&gt;
&lt;br /&gt;
== Tools ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/~lco/eugenio/index.html Eugenio] is a word predictor for European Portuguese&lt;br /&gt;
&lt;br /&gt;
== Corpora Resources ==&lt;br /&gt;
&lt;br /&gt;
== Automatic Key Phrase Extraction ==&lt;br /&gt;
* '''[[110-PT-BN-KP]]''' - Manually annotated corpus of Portuguese Broadcast News with Key Phrases&lt;br /&gt;
&lt;br /&gt;
=== Translation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Word_Alignments|Golden collection of parallel multi-language word alignments]]''' - Manually annotated word alignments between six european languages taken from the Europarl common test set &amp;lt;br&amp;gt;(more information on the [[Speech-to-speech Translation]] information page)&lt;br /&gt;
&lt;br /&gt;
=== Recommendation ===&lt;br /&gt;
&lt;br /&gt;
* '''[[Fairy tale corpus]]''' - Corpus of fairy tales: the corpus is  divided in semantically related clusters.&lt;br /&gt;
&lt;br /&gt;
== Lexical Resources ==&lt;br /&gt;
&lt;br /&gt;
=== Other ===&lt;br /&gt;
&lt;br /&gt;
* [http://www.l2f.inesc-id.pt/resources/Portug.Dict.sit Portuguese Dictionary] for [http://www.eg.bucknell.edu/~excalibr/excalibur.html Excalibur], a spell checker for the Macintosh.&amp;lt;br/&amp;gt;Assembled by [[Nuno Mamede]] in association with [http://label2.ist.utl.pt/label/ LabEL]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7104</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7104"/>
		<updated>2013-09-10T18:04:58Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. &lt;br /&gt;
&lt;br /&gt;
The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese '''[[ALERT Corpus|ALERT]]'''&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. &lt;br /&gt;
&lt;br /&gt;
Please contact Luis Marujo for other uses.&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7103</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7103"/>
		<updated>2013-09-10T18:03:17Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. &lt;br /&gt;
&lt;br /&gt;
The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese '''[[ALERT Corpus|ALERT]]'''&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7102</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7102"/>
		<updated>2013-09-10T18:02:59Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. &lt;br /&gt;
&lt;br /&gt;
The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese [[ALERT Corpus|ALERT]]&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7101</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7101"/>
		<updated>2013-09-10T18:01:46Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. &lt;br /&gt;
The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese [[ALERT Corpus|ALERT]]&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7100</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7100"/>
		<updated>2013-09-10T18:01:23Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. The Broadcast News (BN) were extracted from 8 BN programs from the European Portuguese [[ALERT Corpus|ALERT]]&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7099</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7099"/>
		<updated>2013-09-10T17:59:59Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. The 110 Portuguese Broadcast News (BN) were extracted from 8 BN programs, containing from the European Portuguese [[ALERT Corpus|ALERT]]&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7098</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7098"/>
		<updated>2013-09-10T17:59:12Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110-PT-BN-KP corpus is made of 110 Portuguese Broadcast News annotated with Key Phrases by an expert. The 110 Portuguese Broadcast News (BN) were extracted from 8 BN programs, containing from the European Portuguese ALERT&lt;br /&gt;
BN [[ALERT Corpus|ALERT]]&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7097</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7097"/>
		<updated>2013-09-10T17:55:25Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110 Portuguese Broadcast News annotated with Key Phrases by an expert.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data below:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7096</id>
		<title>110-PT-BN-KP</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN-KP&amp;diff=7096"/>
		<updated>2013-09-10T17:54:18Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: Created page with &amp;quot;110 Portuguese Broadcast News annotated with Key Phrases by an expert.  == Statistics == Train / Test * Number of stories: 100 / 10 * Number of words: 29,225 /3,896 * Average Num...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110 Portuguese Broadcast News annotated with Key Phrases by an expert.&lt;br /&gt;
&lt;br /&gt;
== Statistics ==&lt;br /&gt;
Train / Test&lt;br /&gt;
* Number of stories: 100 / 10&lt;br /&gt;
* Number of words: 29,225 /3,896&lt;br /&gt;
* Average Number of Key Phrases: 24 / 29&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
The corpus is free for non-commercial use. Please contact Luis Marujo for other uses.&lt;br /&gt;
Please cite this paper if you write any paper using the data above:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN-KP.zip 110-PT-BN-KP]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7095</id>
		<title>110-PT-BN</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7095"/>
		<updated>2013-09-10T17:44:02Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: Blanked the page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7094</id>
		<title>110-PT-BN</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7094"/>
		<updated>2013-09-10T17:28:34Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110 Portuguese Broadcast News annotated with Key Phrases by an expert.&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN.zip link 110-PT-BN]&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper or patent using the data above:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf pdf] [http://www.inesc-id.pt/intranet/publicacoes/bibtex.php?bibtex=7588 bibtex]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7093</id>
		<title>110-PT-BN</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7093"/>
		<updated>2013-09-10T17:25:22Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;110 Portuguese Broadcast News annotated with Key Phrases by an expert.&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;br /&gt;
[http://www.l2f.inesc-id.pt/~ldsm/110-PT-BN.zip link 110-PT-BN]&lt;br /&gt;
&lt;br /&gt;
== Further Reading ==&lt;br /&gt;
&lt;br /&gt;
Please cite this paper if you write any paper or patent using the data above:&lt;br /&gt;
&lt;br /&gt;
Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto, Keyphrase Cloud Generation of Broadcast News, In proceeding of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association, ISCA, Florence, Italy, August 2011 [http://www.inesc-id.pt/pt/indicadores/Ficheiros/7588.pdf link pdf]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7092</id>
		<title>110-PT-BN</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=110-PT-BN&amp;diff=7092"/>
		<updated>2013-09-10T17:06:22Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: Created page with &amp;quot; 110 Portuguese Broadcast News annotated by an expert.  == Download ==&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
110 Portuguese Broadcast News annotated by an expert.&lt;br /&gt;
&lt;br /&gt;
== Download ==&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=5147</id>
		<title>Luís Marujo</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Lu%C3%ADs_Marujo&amp;diff=5147"/>
		<updated>2008-12-15T15:56:59Z</updated>

		<summary type="html">&lt;p&gt;Ldsm: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Luís Marujo&lt;br /&gt;
|username=ldsm&lt;br /&gt;
|contact=ldsm&lt;br /&gt;
|phone=+351-213-100-300 ext. 2514&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&amp;lt;inesc-id what='person' id='915'&amp;gt;&amp;lt;/inesc-id&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
&lt;br /&gt;
* Galinha infrastructure (till 2008)&lt;br /&gt;
* REAP.PT&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Masters Students]]&lt;/div&gt;</summary>
		<author><name>Ldsm</name></author>
	</entry>
</feed>