https://www.hlt.inesc-id.pt/wiki/index.php?title=Language_Dynamics_and_Capitalization_using_Maximum_Entropy&feed=atom&action=historyLanguage Dynamics and Capitalization using Maximum Entropy - Revision history2024-03-29T07:23:33ZRevision history for this page on the wikiMediaWiki 1.41.0https://www.hlt.inesc-id.pt/wiki/index.php?title=Language_Dynamics_and_Capitalization_using_Maximum_Entropy&diff=4716&oldid=prevJoana at 15:41, 30 May 20082008-05-30T15:41:47Z<p></p>
<p><b>New page</b></p><div>__NOTOC__<br />
{{infobox|name=Fernando Batista<br />
|username=fmmb<br />
|contact=fernando.batista<br />
|phone=+351-213-100-390<br />
|fax=+351-213-145-843<br />
}}<br />
== Date ==<br />
<br />
* 15:00, June 6, 2008<br />
* Room 336<br />
<br />
== Speaker ==<br />
<br />
* [[Fernando Batista]]<br />
<br />
== Abstract ==<br />
<br />
This paper studies the impact of written language variations and the way it affects the capitalization task over time. A discriminative approach, based on maximum entropy models, is proposed to perform capitalization, taking the language changes into consideration. The proposed method makes it possible to use large corpora for training. The evaluation is performed over newspaper corpora using different testing periods. The achieved results reveal a strong relation between the capitalization performance and the elapsed time between the training and testing data periods.<br />
<br />
<br />
[[category:Seminars]]<br />
[[category:Seminars 2008]]<br />
[[category:Conference Practice]]</div>Joana