<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.hlt.inesc-id.pt/wiki/index.php?action=history&amp;feed=atom&amp;title=Extracting_Parallel_Data_from_Microblog_Messages</id>
	<title>Extracting Parallel Data from Microblog Messages - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.hlt.inesc-id.pt/wiki/index.php?action=history&amp;feed=atom&amp;title=Extracting_Parallel_Data_from_Microblog_Messages"/>
	<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Extracting_Parallel_Data_from_Microblog_Messages&amp;action=history"/>
	<updated>2026-05-16T15:47:49Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Extracting_Parallel_Data_from_Microblog_Messages&amp;diff=6830&amp;oldid=prev</id>
		<title>Rdmr: Created page with &quot;__NOTOC__ {{infobox|name=Wang Ling |username=wlin |contact=wlin |phone=+351-213-100-300 |fax=+351-213-145-843 }}  == Date ==  * 15:00, Friday, January 4&lt;sup&gt;th&lt;/sup&gt;, 2013 * Room...&quot;</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Extracting_Parallel_Data_from_Microblog_Messages&amp;diff=6830&amp;oldid=prev"/>
		<updated>2012-12-29T00:35:28Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;__NOTOC__ {{infobox|name=Wang Ling |username=wlin |contact=wlin |phone=+351-213-100-300 |fax=+351-213-145-843 }}  == Date ==  * 15:00, Friday, January 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt;, 2013 * Room...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;__NOTOC__&lt;br /&gt;
{{infobox|name=Wang Ling&lt;br /&gt;
|username=wlin&lt;br /&gt;
|contact=wlin&lt;br /&gt;
|phone=+351-213-100-300&lt;br /&gt;
|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Date ==&lt;br /&gt;
&lt;br /&gt;
* 15:00, Friday, January 4&amp;lt;sup&amp;gt;th&amp;lt;/sup&amp;gt;, 2013&lt;br /&gt;
* Room 336&lt;br /&gt;
&lt;br /&gt;
== Speaker ==&lt;br /&gt;
&lt;br /&gt;
* [[Wang Ling]]&lt;br /&gt;
&lt;br /&gt;
== Abstract ==&lt;br /&gt;
&lt;br /&gt;
We present a novel method for extracting parallel data from microblog messages. In contrast with previously described methods that detect parallel documents, our approach finds parallel segments within the same document. We demonstrate our technique’s applicability by extracting a large number of parallel Chinese-English sentence pairs from Sina Weibo, the Chinese counterpart of Twitter. We evaluate the quality of our automatic method using a corpus of hand-labeled examples. Used in a Chinese-English machine translation system, the automatically extracted parallel text yields substantial improvements on microblog message translation, more than doubling the baseline BLEU score relative to a system that uses existing parallel data resources.&lt;br /&gt;
&lt;br /&gt;
[[category:Seminars]]&lt;br /&gt;
[[category:Seminars_2013]]&lt;/div&gt;</summary>
		<author><name>Rdmr</name></author>
	</entry>
</feed>