|Line 21:||Line 21:|
This work explores the use of subject lists extracted from an annotated corpus to ﬁnd subject-verb pairs in untagged corpora. Our goal is to identify verb syntactic functions (subjects and direct objects) to characterize verb arguments. Identifying syntactic functions on corpora using parsers is time-consuming. It is desirable to automate the annotation process of the syntactic functions without parsing the corpus. We present a method that uses an annotated corpus, and SenseClusters, an unsupervised clustering tool for word sense disambiguation. Sentences with synonymous verbs were clustered. We observe that verbs in the same cluster have the same list of nouns as subject in the test corpus, even though the speciﬁc pair subject/verb does not appear in the annotated corpus. The result shows that annotating the subject/verb pair using the subject lists extracted from the clusters is quicker than syntactically parsing the corpus.