500N-KPCrowd: Difference between revisions

Revision as of 18:32, 10 September 2013

500M-KPCrowd corpus is made of 500 News articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers.

The news articles were retrieved from the online news sources.

Statistics

Number of stories: 450 / 50 (Train / Test)
Average number of Amazon Mechanical Turk workers per news: 20
Number of Topics: 10
Average Number of Key Phrases per news story: 40

Download

500N-KPCrowd

@@ Line 1: / Line 1: @@
-M-KPCrowd corpus is made of 500 News articles (50 stories for each of the 10 categories) manually annotated with Key Phrases by 20 Amazon Mechanical Turk workers.
+M-KPCrowd corpus is made of 500 News articles (50 stories for each of the 10 categories selected) manually annotated with Key Phrases by 20 Amazon's Mechanical Turk workers.
-The news articles were retrieved from the WWW and are e
+The news articles were retrieved from the online news sources.
 == Statistics ==

500N-KPCrowd: Difference between revisions

From HLT@INESC-ID

Revision as of 18:32, 10 September 2013

Statistics

Further Reading

Download