<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.hlt.inesc-id.pt/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Filcab</id>
	<title>HLT@INESC-ID - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.hlt.inesc-id.pt/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Filcab"/>
	<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/w/Special:Contributions/Filcab"/>
	<updated>2026-05-05T22:27:46Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6506</id>
		<title>TAP Corpus</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6506"/>
		<updated>2011-11-29T15:28:08Z</updated>

		<summary type="html">&lt;p&gt;Filcab: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting the corpus  ==&lt;br /&gt;
&amp;lt;pre&amp;gt;git clone ssh://ssh.l2f.inesc-id.pt/afs/l2f/home/filcab/git/tap.git&amp;lt;/pre&amp;gt;&lt;br /&gt;
Assuming the pdfs are in &amp;quot;&amp;lt;tt&amp;gt;originals/UP*&amp;lt;/tt&amp;gt;…&amp;quot; &lt;br /&gt;
&lt;br /&gt;
(You can do the following commands to get the pdfs there.)&lt;br /&gt;
&amp;lt;pre&amp;gt;cd tap&lt;br /&gt;
mkdir originals&lt;br /&gt;
cd originals&lt;br /&gt;
lndir /afs/l2f/corpora/up-magazine/originals&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Getting the corpus aligned ==&lt;br /&gt;
Prerequisite:&lt;br /&gt;
stanford coreNLP at &amp;lt;tt&amp;gt;tap/stanford-corenlp&amp;lt;/tt&amp;gt;;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;tt&amp;gt;l2fstring&amp;lt;/tt&amp;gt; available.&lt;br /&gt;
&lt;br /&gt;
=== Running ===&lt;br /&gt;
&amp;lt;pre&amp;gt;./everything&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the pdfs are not at &amp;lt;tt&amp;gt;originals/&amp;lt;/tt&amp;gt;, run: &amp;lt;tt&amp;gt;./everything &amp;amp;lt;directory&amp;amp;gt;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Aligned sentences are stored in &amp;lt;tt&amp;gt;aligned/&amp;lt;/tt&amp;gt; Tagged corpus is at &amp;lt;tt&amp;gt;tagged/&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The aligned and tagged corpora are available at: &amp;lt;tt&amp;gt;/afs/l2f/home/filcab/tap-corpus&amp;lt;/tt&amp;gt;&lt;/div&gt;</summary>
		<author><name>Filcab</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6505</id>
		<title>TAP Corpus</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6505"/>
		<updated>2011-11-29T15:27:52Z</updated>

		<summary type="html">&lt;p&gt;Filcab: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting the corpus  ==&lt;br /&gt;
&amp;lt;pre&amp;gt;git clone ssh://ssh.l2f.inesc-id.pt/afs/l2f/home/filcab/git/tap.git&amp;lt;/pre&amp;gt;&lt;br /&gt;
Assuming the pdfs are in &amp;quot;&amp;lt;tt&amp;gt;originals/UP*&amp;lt;/tt&amp;gt;…&amp;quot; &lt;br /&gt;
&lt;br /&gt;
(You can do the following commands to get the pdfs there.)&lt;br /&gt;
&amp;lt;pre&amp;gt;cd tap&lt;br /&gt;
mkdir originals&lt;br /&gt;
cd originals&lt;br /&gt;
lndir /afs/l2f/corpora/up-magazine/originals&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Getting the corpus aligned ==&lt;br /&gt;
Prerequisite:&lt;br /&gt;
stanford coreNLP at &amp;lt;tt&amp;gt;tap/stanford-corenlp&amp;lt;/tt&amp;gt;;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;tt&amp;gt;l2fstring&amp;lt;/tt&amp;gt; available.&lt;br /&gt;
&lt;br /&gt;
Run:&lt;br /&gt;
&amp;lt;pre&amp;gt;./everything&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the pdfs are not at &amp;lt;tt&amp;gt;originals/&amp;lt;/tt&amp;gt;, run: &amp;lt;tt&amp;gt;./everything &amp;amp;lt;directory&amp;amp;gt;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Aligned sentences are stored in &amp;lt;tt&amp;gt;aligned/&amp;lt;/tt&amp;gt; Tagged corpus is at &amp;lt;tt&amp;gt;tagged/&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The aligned and tagged corpora are available at: &amp;lt;tt&amp;gt;/afs/l2f/home/filcab/tap-corpus&amp;lt;/tt&amp;gt;&lt;/div&gt;</summary>
		<author><name>Filcab</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6504</id>
		<title>TAP Corpus</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6504"/>
		<updated>2011-11-29T14:44:35Z</updated>

		<summary type="html">&lt;p&gt;Filcab: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting the corpus  ==&lt;br /&gt;
&amp;lt;pre&amp;gt;git clone ssh://ssh.l2f.inesc-id.pt/afs/l2f/home/filcab/git/tap.git&amp;lt;/pre&amp;gt;&lt;br /&gt;
Assuming the pdfs are in &amp;quot;&amp;lt;tt&amp;gt;originals/UP*&amp;lt;/tt&amp;gt;…&amp;quot; &lt;br /&gt;
&lt;br /&gt;
(You can do the following commands to get the pdfs there.)&lt;br /&gt;
&amp;lt;pre&amp;gt;cd tap&lt;br /&gt;
mkdir originals&lt;br /&gt;
cd originals&lt;br /&gt;
lndir /afs/l2f/corpora/up-magazine/originals&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Getting the corpus aligned ==&lt;br /&gt;
Prerequisite:&lt;br /&gt;
stanford coreNLP at &amp;lt;tt&amp;gt;tap/stanford-corenlp&amp;lt;/tt&amp;gt;;&lt;br /&gt;
&amp;lt;tt&amp;gt;l2fstring&amp;lt;/tt&amp;gt; available.&lt;br /&gt;
&lt;br /&gt;
Run:&lt;br /&gt;
&amp;lt;pre&amp;gt;./everything&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the pdfs are not at &amp;lt;tt&amp;gt;originals/&amp;lt;/tt&amp;gt;, run: &amp;lt;tt&amp;gt;./everything &amp;amp;lt;directory&amp;amp;gt;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Aligned sentences are stored in &amp;lt;tt&amp;gt;aligned/&amp;lt;/tt&amp;gt; Tagged corpus is at &amp;lt;tt&amp;gt;tagged/&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The aligned and tagged corpora are available at: &amp;lt;tt&amp;gt;/afs/l2f/home/filcab/tap-corpus&amp;lt;/tt&amp;gt;&lt;/div&gt;</summary>
		<author><name>Filcab</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6503</id>
		<title>TAP Corpus</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6503"/>
		<updated>2011-11-29T14:37:50Z</updated>

		<summary type="html">&lt;p&gt;Filcab: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting the corpus  ==&lt;br /&gt;
&amp;lt;pre&amp;gt;git clone ssh://ssh.l2f.inesc-id.pt/afs/l2f/home/filcab/git/tap.git&amp;lt;/pre&amp;gt;&lt;br /&gt;
Assuming the pdfs are in &amp;quot;&amp;lt;tt&amp;gt;originals/UP*&amp;lt;/tt&amp;gt;…&amp;quot; &lt;br /&gt;
&lt;br /&gt;
(You can do the following commands to get the pdfs there.)&lt;br /&gt;
&amp;lt;pre&amp;gt;cd tap&lt;br /&gt;
mkdir originals&lt;br /&gt;
cd originals&lt;br /&gt;
lndir /afs/l2f/corpora/up-magazine/originals&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Getting the corpus aligned ==&lt;br /&gt;
Prerequisite:&lt;br /&gt;
stanford coreNLP at &amp;lt;tt&amp;gt;tap/stanford-corenlp&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Run:&lt;br /&gt;
&amp;lt;pre&amp;gt;./everything&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the pdfs are not at &amp;lt;tt&amp;gt;originals/&amp;lt;/tt&amp;gt;, run: &amp;lt;tt&amp;gt;./everything &amp;amp;lt;directory&amp;amp;gt;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Aligned sentences are stored in &amp;lt;tt&amp;gt;aligned/&amp;lt;/tt&amp;gt; Tagged corpus is at &amp;lt;tt&amp;gt;tagged/&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The aligned and tagged corpora are available at: &amp;lt;tt&amp;gt;/afs/l2f/home/filcab/tap-corpus&amp;lt;/tt&amp;gt;&lt;/div&gt;</summary>
		<author><name>Filcab</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6502</id>
		<title>TAP Corpus</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=TAP_Corpus&amp;diff=6502"/>
		<updated>2011-11-29T14:31:26Z</updated>

		<summary type="html">&lt;p&gt;Filcab: Instructions&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Getting the corpus  ==&lt;br /&gt;
&amp;lt;pre&amp;gt;git clone ssh://ssh.l2f.inesc-id.pt/afs/l2f/home/filcab/git/tap.git&amp;lt;/pre&amp;gt;&lt;br /&gt;
Assuming the pdfs are in &amp;quot;&amp;lt;tt&amp;gt;originals/UP*&amp;lt;/tt&amp;gt;…&amp;quot; &lt;br /&gt;
&lt;br /&gt;
(You can do the following commands to get the pdfs there.)&lt;br /&gt;
&amp;lt;pre&amp;gt;cd tap&lt;br /&gt;
mkdir originals&lt;br /&gt;
cd originals&lt;br /&gt;
lndir /afs/l2f/corpora/up-magazine/originals&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Getting the corpus aligned ==&lt;br /&gt;
Prerequisite:&lt;br /&gt;
stanford coreNLP at &amp;lt;tt&amp;gt;tap/stanford-corenlp&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Run:&lt;br /&gt;
&amp;lt;pre&amp;gt;./everything&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the pdfs are not at &amp;lt;tt&amp;gt;originals/&amp;lt;/tt&amp;gt;, run: &amp;lt;tt&amp;gt;./everything &amp;amp;lt;directory&amp;amp;gt;&amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Aligned sentences are stored in &amp;lt;tt&amp;gt;aligned/&amp;lt;/tt&amp;gt; Tagged corpus is at &amp;lt;tt&amp;gt;tagged/&amp;lt;/tt&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Filcab</name></author>
	</entry>
	<entry>
		<id>https://www.hlt.inesc-id.pt/wiki/index.php?title=Filipe_Cabecinhas&amp;diff=6501</id>
		<title>Filipe Cabecinhas</title>
		<link rel="alternate" type="text/html" href="https://www.hlt.inesc-id.pt/wiki/index.php?title=Filipe_Cabecinhas&amp;diff=6501"/>
		<updated>2011-11-29T14:22:31Z</updated>

		<summary type="html">&lt;p&gt;Filcab: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{infobox|name=Filipe Cabecinhas|username=filcab|contact=filcab&lt;br /&gt;
|phone=+351-213-100-226 ext. 2514|fax=+351-213-145-843&lt;br /&gt;
}}&lt;br /&gt;
&amp;lt;!-- &amp;lt;inesc-id what='person' id=''&amp;gt;&amp;lt;/inesc-id&amp;gt; --&amp;gt;&lt;br /&gt;
&amp;lt;!-- EDIT AFTER THIS LINE --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Research Interests ==&lt;br /&gt;
&lt;br /&gt;
== Ongoing Projects ==&lt;br /&gt;
[[TAP Corpus]]&lt;br /&gt;
&lt;br /&gt;
[[category:People]]&lt;br /&gt;
[[category:Graduate Students]]&lt;/div&gt;</summary>
		<author><name>Filcab</name></author>
	</entry>
</feed>