17 Aug 2012

Marzieh Razavi successfully defended her MSc thesis [on Thursday August 16 2012](posts/2012-08-16-marzieh-masters-defence. Slides from the defence [are available](images/marzieh-masters-defence-2012.pdf. Anoop Sarkar was the senior supervisor, the external examiner was Giuseppe Carenini from UBC and Fred Popowich was on her supervisory committee (all pictured below with the candidate from left to right).

Marzieh Razavi defends her Masters thesis:images/marzieh-masters-defence-2012.jpg

Razavi, M. Ensembles of Diverse Clustering-based Discriminative Dependency Parsers.

abstract: Syntactic parsing and dependency parsing in particular are a core component of many Natural Language Processing (NLP) tasks and applications. Improvements in dependency parsing can help improve machine translation and information extraction applications among many others. In this thesis, we extend the framework of (Koo, Carreras, and Collins, 2008) for dependency parsing which uses a single clustering method for semi-supervised learning. We make use of multiple diverse clustering methods to build multiple discriminative dependency parsing models in the Maximum Spanning Tree (MST) parsing framework (Mc-Donald, Crammer, and Pereira, 2005). All of these diverse clustering-based parsers are then combined together using a novel ensemble model, which performs exact inference on the shared hypothesis space of all the parser models. We show that diverse clustering-based parser models and the ensemble method together significantly improves unlabeled dependency accuracy from 90.82% to 92.46% on Section 23 of the Penn Treebank. We also show significant improvements in domain adaptation to the Switchboard and Brown corpora.