Structured Prediction Cascades


Revision as of 10:27, 14 July 2010 by Acbm (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Ben Taskar
Ben Taskar
Ben Taskar received his bachelor's and doctoral degree in Computer Science from Stanford University. After a postdoc at the University of California at Berkeley, he joined the faculty at the University of Pennsylvania Computer and Information Science Department in 2007, where he currently co-directs PRiML: Penn Research in Machine Learning. His research interests include machine learning, natural language processing and computer vision. He has been awarded the Sloan Research Fellowship and selected for the Young Investigator Program by the Office of Naval Research and the DARPA Computer Science Study Group. His work on structured prediction has received best paper awards at NIPS and EMNLP conferences.
Addresses: www mail


  • 14:10, Monday, July 19th, 2010
  • Room FA1 - Pavilhão de Informática I (IST), 14:10


  • Ben Taskar, Computer and Information Science Department, University of Pennsylvania


Structured prediction tasks pose a fundamental bias-computation trade- off: The need for complex models to increase predictive power on the one hand and the limited computational resources for inference in the exponentially-sized output spaces on the other.

We formulate and develop structured prediction cascades to address this trade-off: a sequence of increasingly complex models that progressively filter the space of possible outputs. We represent an exponentially large set of filtered outputs using max marginals and propose a novel convex loss for learning cascades that balances filtering error with filtering efficiency. We provide generalization bounds for error and efficiency losses and evaluate our approach on several natural language and vision problems.

We find that the learned cascades are capable of reducing the complexity of inference by up to several orders of magnitude, enabling the use of models which incorporate higher order dependencies and features and yield significantly higher accuracy.