The Berkeley NLP Group

Publications

2024

Ghostbuster: Detecting Text Ghostwritten by Large Language ModelsVivek Verma, Eve Fleisig, Nicholas Tomlin and Dan KleinProceedings of NAACL2024. [paper] [bib]

Autonomous Evaluation and Refinement of Digital AgentsJiayi Pan, Yichi Zhang, Nicholas Tomlin, Yifei Zhou, Sergey Levine and Alane SuhrProceedings of COLM2024. [paper] [bib]

Decision-Oriented Dialogue for Human-AI CollaborationJessy Lin, Nicholas Tomlin, Jacob Andreas and Jason EisnerTransactions of the ACL2024. [paper] [bib]

Re-evaluating the Need for Visual Signals in Unsupervised Grammar InductionBoyi Li, Rudy Corona, Karttik Mangalam, Catherine Chen, Daniel Flaherty, Serge Belongie, Kilian Weinberger, Jitendra Malik, Trevor Darrell and Dan KleinProceedings of NAACL, Findings2024. [paper] [bib]

The Cortical Representation of Language Timescales is Shared between Reading and ListeningCatherine Chen, Tom Dupré la Tour, Jack Gallant, Dan Klein and Fatma DenizCommunications Biology2024. [paper] [bib]

2023

DOC: Improving Long Story Coherence With Detailed Outline ControlKevin Yang, Dan Klein, Nanyun Peng and Yuandong TianProceedings of ACL2023. [paper] [bib]

Modular Visual Question Answering via Code GenerationSanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell and Dan KleinProceedings of ACL2023. [paper] [bib]

Neural Unsupervised Reconstruction of Protolanguage Word FormsAndre He, Nicholas Tomlin and Dan KleinProceedings of ACL2023. [paper] [bib]

InCoder: A Generative Model for Code Infilling and SynthesisDaniel Fried, Armen Aghajanyan, Jessy Lin, Sida Wang, Eric Wallace, Freda Shi, Ruiqi Zhong, Wen-tau Yih, Luke Zettlemoyer and Mike LewisProceedings of ICLR2023. [paper] [bib]

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific DocumentsCatherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey and Kyle LoProceedings of ACL, Findings2023. [paper] [bib]

Extracting Training Data from Diffusion ModelsNicholas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito and Eric WallaceUSENIX Security Symposium2023. [paper] [bib]

Large Language Models Struggle to Learn Long-Tail KnowledgeNikhil Kandpal, Haikang Deng, Adam Roberts, Eric Wallace and Colin RaffelInternational Conference on Machine Learning2023. [paper] [bib]

Poisoning Language Models During Instruction TuningAlexander Wan, Eric Wallace, Sheng Shen and Dan KleinInternational Conference on Machine Learning2023. [paper] [bib]

Revisiting Entropy Rate Constancy in TextVivek Verma, Nicholas Tomlin and Dan KleinProceedings of EMNLP, Findings2023. [paper] [bib]

2022

Re3: Generating Longer Stories With Recursive Reprompting and RevisionKevin Yang, Yuandong Tian, Nanyun Peng and Dan KleinProceedings of EMNLP2022. [paper] [bib]

Automated Crossword SolvingEric Wallace, Nicholas Tomlin, Albert Xu, Kevin Yang, Eshaan Pathak, Matthew Ginsberg and Dan KleinProceedings of ACL2022. [paper] [bib]

Inferring Rewards from Language in ContextJessy Lin, Daniel Fried, Dan Klein and Anca DraganProceedings of ACL2022. [paper] [bib]

Learned Incremental Representations for ParsingNikita Kitaev, Thomas Lu and Dan KleinProceedings of ACL2022. Best Paper Award. [paper] [bib]

Meta-learning via Language Model In-context TuningYanda Chen, Ruiqi Zhong, Sheng Zha, George Karypis and He HeProceedings of ACL2022. [paper] [bib]

ReCLIP: A Strong Zero-Shot Baseline for Referring Expression ComprehensionSanjay Subramanian, William Merrill, Trevor Darrell, Matt Gardner, Sameer Singh and Anna RohrbachProceedings of ACL2022. [paper] [bib]

Understanding Game-Playing Agents with Natural Language AnnotationsNicholas Tomlin, Andre He and Dan KleinProceedings of ACL2022. [paper] [bib]

Voxel-informed Language GroundingRodolfo Corona, Shizhan Zhu, Dan Klein and Trevor DarrellProceedings of ACL2022. [paper] [bib]

Addressing Resource and Privacy Constraints in Semantic Parsing Through Data AugmentationKevin Yang, Olivia Deng, Charles Chen, Richard Shin, Subhro Roy and Benjamin Van DurmeProceedings of ACL, Findings2022. [paper] [bib]

Attention weights accurately predict language representations in the brainMathis Lamarre, Catherine Chen and Fatma DenizProceedings of EMNLP, Findings2022. [paper] [bib]

Describing Differences between Text Distributions with Natural LanguageRuiqi Zhong, Charlie Snell, Dan Klein and Jacob SteinhardtInternational Conference on Machine Learning2022. [paper] [bib]

2021

Reference-Centric Models for Grounded Collaborative DialogueDaniel Fried, Justin Chiu and Dan KleinProceedings of EMNLP2021. [paper] [bib]

An Improved Model for Voicing Silent SpeechDavid Gaddy and Dan KleinProceedings of ACL2021. [paper] [bib]

Concealed Data Poisoning Attacks on NLP ModelsEric Wallace, Tony Z. Zhao, Shi Feng and Sameer SinghProceedings of NAACL2021. [paper] [bib]

Constructing Taxonomies from Pretrained Language ModelsCatherine Chen, Kevin Lin and Dan KleinProceedings of NAACL2021. [paper] [bib]

Detoxifying Language Models Risks Marginalizing Minority VoicesAlbert Xu, Eshaan Pathak, Eric Wallace, Suchin Gururangan, Maarten Sap and Dan KleinProceedings of NAACL2021. [paper] [bib]

FUDGE: Controlled Text Generation With Future DiscriminatorsKevin Yang and Dan KleinProceedings of NAACL2021. [paper] [bib]

Low-Complexity Probing via Finding SubnetworksSteven Cao, Victor Sanh and Alexander M. RushProceedings of NAACL2021. [paper] [bib]

Modular Networks for Compositional Instruction FollowingRodolfo Corona, Daniel Fried, Coline Devin, Dan Klein and Trevor DarrellProceedings of NAACL2021. [paper] [bib]

Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt CollectionsRuiqi Zhong, Kristy Lee, Zheng Zhang and Dan KleinProceedings of EMNLP, Findings2021. [paper] [bib]

Are Larger Pretrained Language Models Uniformly Better? Comparing Performance at the Instance LevelRuiqi Zhong, Dhruba Ghosh, Dan Klein and Jacob SteinhardtProceedings of ACL, Findings2021. [paper] [bib]

Interactive Assignments for Teaching Structured Neural NLPDavid Gaddy, Daniel Fried, Nikita Kitaev, Mitchell Stern, Rodolfo Corona, John DeNero and Dan KleinProceedings of the Teaching NLP Workshop at NAACL2021. [paper] [bib]

2020

A Streaming Approach For Efficient Batched Beam SearchKevin Yang, Violet Yao, John DeNero and Dan KleinProceedings of EMNLP2020. [paper] [bib]

Digital Voicing of Silent SpeechDavid Gaddy and Dan KleinProceedings of EMNLP2020. Best Paper Award. [paper] [bib]

Imitation Attacks and Defenses for Black-box Machine Translation SystemsEric Wallace, Mitchell Stern and Dawn SongProceedings of EMNLP2020. [paper] [software] [bib]

Semantic Evaluation for Text-to-SQL with Distilled Test SuitesRuiqi Zhong, Yu Tao and Dan KleinProceedings of EMNLP2020. [paper] [software] [bib]

Unsupervised Parsing via Constituency TestsSteven Cao, Nikita Kitaev and Dan KleinProceedings of EMNLP2020. [paper] [bib]

Pretrained Transformers Improve Out-of-Distribution RobustnessDan Hendrycks, Xiaoyuan Liu, Eric Wallace, Adam Dziedzic, Rishabh Krishnan and Dawn SongProceedings of ACL2020. [paper] [bib]

Semantic Scaffolds for Pseudocode-to-Code GenerationRuiqi Zhong, Mitchell Stern and Dan KleinProceedings of ACL2020. [paper] [bib]

Tetra-Tagging: Word-Synchronous Parsing with Linear-Time InferenceNikita Kitaev and Dan KleinProceedings of ACL2020. [paper] [software] [bib]

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of TransformersZhuohan Li, Eric Wallace, Sheng Shen, Kevin Lin, Kurt Keutzer, Dan Klein and Joseph E. GonzalezProceedings of ICML2020. [paper] [bib]

Multilingual Alignment of Contextual Word RepresentationsSteven Cao, Nikita Kitaev and Dan KleinProceedings of ICLR2020. [paper] [bib]

2019

A Deep Factorization of Style and Structure in FontsAkshay Srivatsan, Jonathan Barron, Dan Klein and Taylor Berg-KirkpatrickProceedings of EMNLP2019. [paper] [bib]

Are You Looking? Grounding to Multiple Modalities in Vision-and-Language NavigationRonghang Hu, Daniel Fried, Anna Rohrbach, Dan Klein, Trevor Darrell and Kate SaenkoProceedings of ACL2019. [paper] [bib]

Cross-Domain Generalization of Neural Constituency ParsersDaniel Fried, Nikita Kitaev and Dan KleinProceedings of ACL2019. [paper] [software] [bib]

Multilingual Constituency Parsing with Self-Attention and Pre-TrainingNikita Kitaev, Steven Cao and Dan KleinProceedings of ACL2019. [paper] [software] [bib]

Pre-Learning Environment Representations for Data-Efficient Neural Instruction FollowingDavid Gaddy and Dan KleinProceedings of ACL2019. [paper] [bib]

Pragmatically Informative Text GenerationSheng Shen, Daniel Fried, Jacob Andreas and Dan KleinProceedings of NAACL2019. [paper] [bib]

2018

Constituency Parsing with a Self-Attentive EncoderNikita Kitaev and Dan KleinProceedings of ACL2018. [paper] [software] [bib]

Policy Gradient as a Proxy for Dynamic Oracles in Constituency ParsingDaniel Fried and Dan KleinProceedings of ACL2018. [paper] [software] [bib]

Learning with Latent LanguageJacob Andreas, Dan Klein and Sergey LevineProceedings of NAACL2018. [paper] [bib]

Unified Pragmatic Models for Generating and Following InstructionsDaniel Fried, Jacob Andreas and Dan KleinProceedings of NAACL2018. [paper] [software] [bib]

What's Going On in Neural Constituency Parsers? An AnalysisDavid Gaddy, Mitchell Stern and Dan KleinProceedings of NAACL2018. [paper] [bib]

Speaker-Follower Models for Vision-and-Language NavigationDaniel Fried, Ronghang Hu, Volkan Cirik, Anna Rohrbach, Jacob Andreas, Louis-Philippe Morency, Taylor Berg-Kirkpatrick, Kate Saenko, Dan Klein and Trevor DarrellProceedings of NIPS2018. [paper] [software] [bib]

2017

Analogs of Linguistic Structure in Deep RepresentationsJacob Andreas and Dan KleinProceedings of EMNLP2017. [paper] [bib]

Effective Inference for Generative Neural ParsingMitchell Stern, Daniel Fried and Dan KleinProceedings of EMNLP2017. [paper] [bib]

Where is Misty? Interpreting Spatial Descriptors by Modeling Regions in SpaceNikita Kitaev and Dan KleinProceedings of EMNLP2017. [paper] [bib]

A Minimal Span-Based Neural Constituency ParserMitchell Stern, Jacob Andreas and Dan KleinProceedings of ACL2017. [paper] [bib]

Abstract Syntax Networks for Code Generation and Semantic ParsingMaxim Rabinovich, Mitchell Stern and Dan KleinProceedings of ACL2017. Outstanding Paper. [paper] [bib]

Fine-Grained Entity Typing with High-Multiplicity AssignmentsMaxim Rabinovich and Dan KleinProceedings of ACL2017. [paper] [bib]

Improving Neural Parsing by Disentangling Model Combination and Reranking EffectsDaniel Fried, Mitchell Stern and Dan KleinProceedings of ACL2017. [paper] [software] [bib]

Translating NeuraleseJacob Andreas, Anca Dragan and Dan KleinProceedings of ACL2017. [paper] [bib]

Modular Multitask Reinforcement Learning with Policy SketchesJacob Andreas, Dan Klein and Sergey LevineProceedings of ICML2017. Best Paper Honorable Mention. [paper] [bib]

Parsing with Traces: An O(n^4) Algorithm and a Structural RepresentationJonathan K. Kummerfeld and Dan KleinTransactions of the ACL2017. [paper] [bib]

2016

Reasoning About Pragmatics with Neural Listeners and SpeakersJacob Andreas and Dan KleinProceedings of EMNLP2016. [paper] [bib]

Learning-Based Single-Document Summarization with Compression and Anaphoricity ConstraintsGreg Durrett, Taylor Berg-Kirkpatrick and Dan KleinProceedings of ACL2016. [paper] [bib]

Capturing Semantic Similarity for Entity Linking with Convolutional Neural NetworksMatthew Francis-Landau, Greg Durrett and Dan KleinProceedings of NAACL2016. [paper] [bib]

Learning to Compose Neural Networks for Question AnsweringJacob Andreas, Marcus Rohrbach, Trevor Darrell and Dan KleinProceedings of NAACL2016. Best Paper Award. [paper] [bib]

Neural Module NetworksJacob Andreas, Marcus Rohrbach, Trevor Darrell and Dan KleinProceedings of CVPR2016. [paper] [bib]

2015

Alignment-based Compositional Semantics for Instruction FollowingJacob Andreas and Dan KleinProceedings of EMNLP2015. [paper] [bib]

An Empirical Analysis of Optimization for Max-Margin NLPJonathan K. Kummerfeld , Taylor Berg-Kirkpatrick and Dan KleinProceedings of EMNLP2015. [paper] [software] [bib]

Neural CRF ParsingGreg Durrett and Dan KleinProceedings of ACL2015. [paper] [bib]

Disfluency Detection with a Semi-Markov Model and Prosodic Features James Ferguson, Greg Durrett and Dan KleinProceedings of NAACL2015. [paper] [bib]

When and Why are Log-Linear Models Self-Normalizing?Jacob Andreas and Dan KleinProceedings of NAACL2015. [paper] [bib]

On the Accuracy of Self-Normalized Log-Linear ModelsJacob Andreas, Maxim Rabinovich, Michael I. Jordan and Dan KleinProceedings of NIPS2015. [paper] [bib]

2014

How much do word embeddings encode about syntax?Jacob Andreas and Dan KleinProceedings of ACL2014. [paper] [bib]

Improved Typesetting Models for Historical OCRTaylor Berg-Kirkpatrick and Dan KleinProceedings of ACL2014. [paper] [bib]

Less Grammar, More FeaturesDavid Hall, Greg Durrett and Dan KleinProceedings of ACL2014. [paper] [bib]

Sparser, Better, Faster GPU ParsingDavid Hall, Taylor Berg-Kirkpatrick, John Canny and Dan KleinProceedings of ACL2014. [paper] [bib]

Structured Learning for Taxonomy Induction with Belief PropagationMohit Bansal, David Burkett, Gerard de Melo and Dan KleinProceedings of ACL2014. Best Paper Honorable Mention. [paper] [bib]

Grounding language with points and paths in continuous spacesJacob Andreas and Dan KleinProceedings of CoNLL2014. [paper] [bib]

Unsupervised Transcription of Piano MusicTaylor Berg-Kirkpatrick, Jacob Andreas and Dan KleinProceedings of NIPS2014. [paper] [bib]

A Joint Model for Entity Analysis: Coreference, Typing, and LinkingGreg Durrett and Dan KleinTransactions of the ACL2014. [paper] [bib]

2013

Easy Victories and Uphill Battles in Coreference ResolutionGreg Durrett and Dan KleinProceedings of EMNLP2013. Best Paper Finalist. [paper] [bib]

A Multi-Teraflop Constituency Parser using GPUsJohn Canny, David Hall and Dan KleinProceedings of EMNLP2013. [paper] [bib]

Decipherment with a Million Random RestartsTaylor Berg-Kirkpatrick and Dan KleinProceedings of EMNLP2013. [paper] [bib]

Error-Driven Analysis of Challenges in Coreference ResolutionJonathan K. Kummerfeld and Dan KleinProceedings of EMNLP2013. [paper] [software] [bib]

Decentralized Entity-Level Modeling for Coreference ResolutionGreg Durrett, David Hall and Dan KleinProceedings of ACL2013. [paper] [bib]

Unsupervised Transcription of Historical DocumentsTaylor Berg-Kirkpatrick, Greg Durrett and Dan KleinProceedings of ACL2013. [paper] [bib]

An Empirical Examination of Challenges in Chinese ParsingJonathan K. Kummerfeld, Daniel Tse, James R. Curran and Dan KleinProceedings of ACL (Short Papers)2013. [paper] [software] [bib]

Supervised Learning of Complete Morphological ParadigmsGreg Durrett and John DeNeroProceedings of NAACL2013. [paper] [bib]

Automated reconstruction of ancient languages using probabilistic models of sound changeAlexandre Bouchard-Cote, David Hall, Thomas L. Griffiths and Dan KleinProceedings of the National Academy of Sciences2013. [paper] [url] [eprint] [bib]

Faster Optimal Planning with Partial-Order PruningDavid Hall, Aloni Cohen, David Burkett and Dan KleinProceedings of the International Conference on Automated Planning Systems2013. [paper] [bib]

Good, Great, Excellent: Global Inference of Semantic IntensitiesGerard de Melo and Mohit BansalTransactions of the ACL2013. [paper] [bib]

Grounding Spatial Relations for Human-Robot InteractionSergio Guadarrama, Lorenzo Riano, Dave Golland, Daniel Gohring, Yangqing Jia, Dan Klein, Pieter Abbeel and Trevor DarrellIEEE/RSJ International Conference on Intelligent Robots and Systems2013. [paper] [bib]

2012

An Empirical Investigation of Statistical Significance in NLPTaylor Berg-Kirkpatrick, David Burkett and Dan KleinProceedings of EMNLP2012. [paper] [bib]

Parser Showdown at the Wall Street Corral: An Empirical Investigation of Error Types in Parser OutputJonathan K. Kummerfeld, David Hall, James R. Curran and Dan KleinProceedings of EMNLP2012. [slides] [keynote] [paper] [software] [bib]

Syntactic Transfer Using a Bilingual LexiconGreg Durrett, Adam Pauls and Dan KleinProceedings of EMNLP2012. [paper] [bib]

Training Factored PCFGs with Expectation PropagationDavid Hall and Dan KleinProceedings of EMNLP2012. Distinguished Paper. [paper] [bib]

Transforming Trees to Improve Syntactic ConvergenceDavid Burkett and Dan KleinProceedings of EMNLP2012. [paper] [bib]

Coreference Semantics from Web FeaturesMohit Bansal and Dan KleinProceedings of ACL2012. [paper] [bib]

Large-Scale Syntactic Language Modeling with TreeletsAdam Pauls and Dan KleinProceedings of ACL2012. [paper] [software] [bib]

A Feature-Rich Constituent Context Model for Grammar InductionDave Golland, John DeNero and Jakob UszkoreitProceedings of ACL (Short Papers)2012. [paper] [bib]

Robust Conversion of CCG Derivations to Phrase Structure TreesJonathan K. Kummerfeld, James R. Curran and Dan KleinProceedings of ACL (Short Papers)2012. [paper] [software] [bib]

Fast Inference in Phrase Extraction Models with Belief PropagationDavid Burkett and Dan KleinProceedings of NAACL2012. [paper] [bib]

Unsupervised Translation Sense ClusteringMohit Bansal, John DeNero and Dekang LinProceedings of NAACL2012. [paper] [bib]

2011

Large-Scale Cognate RecoveryDavid Hall and Dan KleinProceedings of EMNLP2011. [paper] [bib]

Simple Effective Decipherment via Combinatorial OptimizationTaylor Berg-Kirkpatrick and Dan KleinProceedings of EMNLP2011. [paper] [bib]

Faster and Smaller N-Gram Language ModelsAdam Pauls and Dan KleinProceedings of ACL2011. [paper] [bib]

Gappy Phrasal Alignment By AgreementMohit Bansal, Chris Quirk and Robert MooreProceedings of ACL2011. [paper] [bib]

Jointly Learning to Extract and CompressTaylor Berg-Kirkpatrick, Dan Gillick and Dan KleinProceedings of ACL2011. [paper] [bib]

Learning Dependency-Based Compositional SemanticsPercy Liang, Michael I. Jordan and Dan KleinProceedings of ACL2011. [paper] [bib]

Web-Scale Features for Full-Scale ParsingMohit Bansal and Dan KleinProceedings of ACL2011. [paper] [bib]

An Empirical Investigation of Discounting in Cross-Domain Language ModelsGreg Durrett and Dan KleinProceedings of ACL (Short Papers)2011. [paper] [bib]

The Surprising Variance in Shortest-Derivation ParsingMohit Bansal and Dan KleinProceedings of ACL (Short Papers)2011. [paper] [bib]

Mention Detection: Heuristics for the OntoNotes annotationsJonathan K. Kummerfeld, Mohit Bansal, David Burkett and Dan KleinProceedings of CoNLL2011. [paper] [bib]

Iterative Monotonically Bounded A*David Burkett, David Hall and Dan KleinProceedings of AAAI2011. [paper] [bib]

2010

A Game-Theoretic Approach to Generating Spatial DescriptionsDave Golland, Percy Liang and Dan KleinProceedings of EMNLP2010. [paper] [bib]

A Simple Domain-Independent Probabilistic Approach to GenerationGabor Angeli, Percy Liang and Dan KleinProceedings of EMNLP2010. [paper] [bib]

An Entity-Level Approach to Information ExtractionAria Haghighi and Dan KleinProceedings of ACL2010. [paper] [url] [bib]

Discriminative Modeling of Extraction Sets for Machine TranslationJohn DeNero and Dan KleinProceedings of ACL2010. [paper] [bib]

Finding Cognate Groups using PhylogeniesDavid LW Hall and Dan KleinProceedings of ACL2010. [paper] [bib]

Hierarchical A* Parsing with Bridge Outside ScoresAdam Pauls and Dan KleinProceedings of ACL2010. [paper] [bib]

Phylogenetic Grammar InductionTaylor Berg-Kirkpatrick and Dan KleinProceedings of ACL2010. [paper] [bib]

Simple, Accurate Parsing with an All-Fragments GrammarMohit Bansal and Dan KleinProceedings of ACL2010. [paper] [bib]

Top-Down K-Best A* ParsingAdam Pauls, Dan Klein and Chris QuirkProceedings of ACL2010. [paper] [bib]

Coreference Resolution in a Modular, Entity-Centered ModelAria Haghighi and Dan KleinProceedings of NAACL2010. Best Paper Award. [paper] [url] [bib]

Joint Parsing and Alignment with Weakly Synchronized GrammarsDavid Burkett, John Blitzer and Dan KleinProceedings of NAACL2010. [paper] [bib]

Painless Unsupervised Learning with FeaturesTaylor Berg-Kirkpatrick, Alexandre Bouchard-Côté, John DeNero and Dan KleinProceedings of NAACL2010. [slides] [paper] [bib]

Type-Based MCMCPercy Liang, Michael Jordan and Dan KleinProceedings of NAACL2010. [paper] [bib]

Unsupervised Syntactic Alignment with Inversion Transduction GrammarsAdam Pauls, Dan Klein, David Chiang and Kevin KnightProceedings of NAACL2010. [paper] [bib]

Learning Better Monolingual Models with Unannotated Bilingual TextDavid Burkett, Slav Petrov, John Blitzer and Dan KleinProceedings of CoNLL2010. [paper] [url] [bib]

Learning Programs: A Hierarchical Bayesian ApproachPercy Liang, Michael Jordan and Dan KleinProceedings of ICML2010. [paper] [bib]

Teaching Introductory Artificial Intelligence with Pac-ManJohn DeNero and Dan KleinProceedings of the Symposium on Educational Advances in Artificial Intelligence (EAAI)2010. [paper] [software] [bib]

The Pac-Man Projects Software Package for Introductory Artificial IntelligenceJohn DeNero and Dan KleinProceedings of the Symposium on Educational Advances in Artificial Intelligence, Model Assignments Track2010. [software] [bib]

Iterated Learning of Multiple Languages from Multiple TeachersDavid Burkett and Thomas L. GriffithsProceedings of Evolang2010. [paper] [bib]

2009

Consensus Training for Consensus Decoding in Machine TranslationAdam Pauls, John DeNero and Dan KleinProceedings of EMNLP2009. [paper] [bib]

Simple Coreference Resolution with Rich Syntactic and Semantic FeaturesAria Haghighi and Dan KleinProceedings of EMNLP2009. [paper] [url] [bib]

Better Word Alignments with Supervised ITG ModelsAria Haghighi, John Blitzer, John DeNero and Dan KleinProceedings of ACL2009. [slides] [paper] [url] [bib]

K-Best A* ParsingAdam Pauls and Dan KleinProceedings of ACL2009. Best Paper Award. [paper] [bib]

Learning Semantic Correspondences with Less SupervisionPercy Liang, Michael Jordan and Dan KleinProceedings of ACL2009. [slides] [paper] [bib]

Efficient Parsing for Transducer GrammarsJohn DeNero, Mohit Bansal, Adam Pauls and Dan KleinProceedings of NAACL2009. [paper] [bib]

Hierarchical Search for ParsingAdam Pauls and Dan KleinProceedings of NAACL2009. [slides] [paper] [bib]

Improved Reconstruction of Protolanguage Word FormsAlexandre Bouchard-Côté, Thomas Griffiths and Dan KleinProceedings of NAACL2009. [paper] [bib]

Online EM for Unsupervised ModelsPercy Liang and Dan KleinProceedings of NAACL2009. [slides] [paper] [bib]

Learning from Measurements in Exponential FamiliesPercy Liang, Michael Jordan and Dan KleinProceedings of ICML2009. [slides] [paper] [bib]

Efficient Inference in Phylogenetic InDel TreesAlexandre Bouchard-Côté, Michael I. Jordan and Dan KleinProceedings of NIPS2009. [paper] [bib]

Convergence Bounds for Language Evolution by Iterated LearningAnna N. Rafferty, Thomas L. Griffiths and Dan KleinProceedings of the 31st Annual Conference of the Cognitive Science Society2009. [paper] [bib]

Probabilistic grammars and hierarchical Dirichlet processesPercy Liang, Michael Jordan and Dan KleinThe Oxford Handbook of Applied Bayesian Analysis2009. [chapter] [bib]

Asynchronous Binarization for Synchronous GrammarsJohn DeNero, Adam Pauls and Dan KleinProceedings of ACL-IJCNLP (Short Papers)2009. [paper] [bib]

2008

Coarse-to-Fine Syntactic Machine Translation using Language ProjectionsSlav Petrov, Aria Haghighi and Dan KleinProceedings of EMNLP2008. [slides] [paper] [url] [bib]

Sampling Alignment Structure under a Bayesian Translation ModelJohn DeNero, Alex Bouchard-Côté and Dan KleinProceedings of EMNLP2008. [paper] [bib]

Sparse Multi-Scale Grammars for Discriminative Latent Variable ParsingSlav Petrov and Dan KleinProceedings of EMNLP2008. [paper] [url] [bib]

Two Languages are Better than One (for Syntactic Parsing)David Burkett and Dan KleinProceedings of EMNLP2008. [paper] [bib]

Analyzing the Errors of Unsupervised InductionPercy Liang and Dan KleinProceedings of ACL2008. [slides] [paper] [bib]

Learning Bilingual Lexicons from Monolingual CorporaAria Haghighi, Percy Liang, Taylor Berg-Kirkpatrick and Dan KleinProceedings of ACL2008. [paper] [url] [bib]

The Complexity of Phrase Alignment ModelsJohn DeNero and Dan KleinProceedings of ACL (Short Papers)2008. [slides] [paper] [bib]

Fully Distributed EM for Very Large DatasetsJason Wolfe, Aria Haghighi and Dan KleinProceedings of ICML2008. [slides] [paper] [bib]

Structured Compilation: Trading off Structure for FeaturesPercy Liang, Hal Daume and Dan KleinProceedings of ICML2008. [slides] [paper] [bib]

A Probabilistic Approach to Language ChangeAlexandre Bouchard-Côté, Percy Liang, Thomas Griffiths and Dan KleinProceedings of NIPS2008. [slides] [paper] [bib]

Agreement-Based LearningPercy Liang, Dan Klein and Michael JordanProceedings of NIPS2008. [poster] [paper] [bib]

Discriminative Log-Linear Grammars with Latent VariablesSlav Petrov and Dan KleinProceedings of NIPS2008. [slides] [paper] [url] [bib]

Efficient Sentence Segmentation using Syntactic FeaturesBenoit Favre, Dilek Hakkani-Tur, Slav Petrov and Dan KleinProceedings of Spoken Language Technologies (SLT)2008. [poster] [paper] [url] [bib]

2007

Learning Structured Models for Phone RecognitionSlav Petrov, Adam Pauls and Dan KleinProceedings of EMNLP-CoNLL2007. [slides] [paper] [url] [bib]

A Probabilistic Approach to Diachronic PhonologyAlexandre Bouchard-Côté, Percy Liang, Thomas Griffiths and Dan KleinProceedings of EMNLP2007. [slides] [paper] [bib]

The Infinite PCFG using Hierarchical Dirichlet ProcessesPercy Liang, Slav Petrov, Michael Jordan and Dan KleinProceedings of EMNLP2007. [slides] [paper] [url] [bib]

Tailoring Word Alignments to Syntactic Machine TranslationJohn DeNero and Dan KleinProceedings of ACL2007. [slides] [paper] [bib]

Unsupervised Coreference Resolution in a Nonparametric Bayesian ModelAria Haghighi and Dan KleinProceedings of ACL2007. [slides] [paper] [url] [bib]

Approximate Factoring for A* SearchAria Haghighi, John DeNero and Dan KleinProceedings of HLT-NAACL2007. [slides] [paper] [url] [bib]

Improved Inference for Unlexicalized ParsingSlav Petrov and Dan KleinProceedings of HLT-NAACL2007. [slides] [paper] [url] [software] [bib]

A* Search via Approximate FactoringAria Haghighi, John DeNero and Dan KleinProceedings of AAAI (Nectar Track)2007. [paper] [bib]

Learning and Inference for Hierarchically Split PCFGsSlav Petrov and Dan KleinProceedings of AAAI (Nectar Track)2007. [slides] [poster] [paper] [url] [software] [bib]

Mixture-of-Parents Maximum Entropy Markov ModelsDavid Rosenberg, Dan Klein and Ben TaskarProceedings of Uncertainty in Artificial Intelligence (UAI)2007. [paper] [bib]

2006

An End-to-End Discriminative Approach to Machine TranslationPercy Liang, Alexandre Bouchard-Côté, Dan Klein and Ben TaskarProceedings of COLING-ACL2006. [slides] [paper] [url] [bib]

Learning Accurate, Compact, and Interpretable Tree AnnotationSlav Petrov, Leon Barrett, Romain Thibaux and Dan KleinProceedings of COLING-ACL2006. [slides] [paper] [url] [software] [bib]

Prototype-Driven Grammar InductionAria Haghighi and Dan KleinProceedings of COLING-ACL2006. [slides] [paper] [url] [bib]

Alignment by AgreementPercy Liang, Ben Taskar and Dan KleinProceedings of HLT-NAACL2006. [slides] [paper] [url] [software] [bib]

Prototype-Driven Learning for Sequence ModelsAria Haghighi and Dan KleinProceedings of HLT-NAACL2006. Best Student Paper Award. [slides] [paper] [url] [bib]

Word Alignment via Quadratic AssignmentSimon Lacoste-Julien, Ben Taskar, Dan Klein and Michael I. JordanProceedings of HLT-NAACL2006. [paper] [url] [bib]

Non-Local Modeling with a Mixture of PCFGsSlav Petrov, Leon Barrett and Dan KleinProceedings of CoNLL2006. [slides] [paper] [url] [bib]

Why Generative Phrase Models Underperform Surface HeuristicsJohn DeNero, Dan Gillick, James Zhang and Dan KleinWorkshop on Statistical Machine Translation at HLT-NAACL2006. [slides] [paper] [url] [bib]

2005

A Discriminative Matching Approach to Word AlignmentBen Taskar, Simon Lacoste-Julien and Dan KleinProceedings of EMNLP2005. [paper] [url] [bib]

Robust Textual Inference via Graph MatchingAria Haghighi, Andrew Ng and Christopher ManningProceedings of HLT-EMNLP2005. [paper] [url] [bib]

Unsupervised Learning of Field Segmentation Models for Information ExtractionTrond Grenager, Dan Klein and Chris ManningProceedings of ACL2005. [paper] [bib]

The Unsupervised Learning of Natural Language StructureDan Klein2005. [thesis] [bib]

2004

Max-Margin ParsingBen Taskar, Dan Klein, Michael Collins, Daphne Koller and Chris ManningProceedings of EMNLP2004. Best Paper Award. [paper] [bib]

Corpus-Based Induction of Syntactic Structure: Models of Dependency and ConstituencyDan Klein and Chris ManningProceedings of ACL2004. [paper] [bib]

Review of Data-Oriented ParsingDan KleinComputational Linguistics2004. [bib]

2003

Accurate Unlexicalized ParsingDan Klein and Chris ManningProceedings of ACL2003. Best Paper Award. [paper] [bib]

A* Parsing: Fast Exact Viterbi Parse SelectionDan Klein and Chris ManningProceedings of NAACL2003. [paper] [bib]

Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency NetworkKristina Toutanova, Dan Klein, Chris Manning and Yoram SingerProceedings of NAACL2003. [paper] [bib]

Named Entity Recognition with Character-Level ModelsDan Klein, Joseph Smarr, Huy Nguyen and Chris ManningProceedings of CoNLL2003. [paper] [bib]

Factored A* Search for Models over Sequences and TreesDan Klein and Chris ManningProceedings of the International Joint Conference on Artificial Intelligence (IJCAI)2003. [paper] [bib]

Spectral LearningSepandar Kamvar, Dan Klein and Chris ManningProceedings of the International Joint Conference on Artificial Intelligence (IJCAI)2003. [paper] [bib]

2002

Conditional Structure versus Conditional Estimation in NLP ModelsDan Klein and Chris ManningProceedings of EMNLP2002. [paper] [bib]

A Generative Constituent-Context Model for Improved Grammar InductionDan Klein and Chris ManningProceedings of ACL2002. [paper] [bib]

From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data ClusteringDan Klein, Sepandar Kamvar and Chris ManningProceedings of ICML2002. [paper] [bib]

Interpreting and Extending Classical Agglomerative Clustering Algorithms using a Model-Based ApproachSepandar Kamvar, Dan Klein and Chris ManningProceedings of ICML2002. [paper] [bib]

Fast Exact Inference with a Factored Model for Natural Language ProcessingDan Klein and Chris ManningProceedings of NIPS2002. [paper] [bib]

Combining Heterogeneous Classifiers for Word-Sense DisambiguationDan Klein, Kristina Toutanova, Tolga Ilhan, Sepandar Kamvar and Chris ManningProceedings of the ACL Workshop on Word Sense Disambiguation2002. [paper] [bib]

Evaluating Strategies for Similarity Search on the WebTaher Haveliwala, Aristides Gionis, Dan Klein and Piotr IndykProceedings of the International World Wide Web Conference (WWW)2002. [paper] [bib]

Parsing and HypergraphsDan Klein and Chris ManningNew Developments in Parsing Technology2002. [bib]

2001

Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn TreebankDan Klein and Chris ManningProceedings of ACL2001. [paper] [bib]

Distributional Phrase Structure InductionDan Klein and Chris ManningProceedings of CoNLL2001. [paper] [bib]

Natural Language Grammar Induction Using a Constituent-Context ModelDan Klein and Chris ManningProceedings of NIPS2001. [paper] [bib]

Parsing and HypergraphsDan Klein and Chris ManningProceedings of the International Workshop on Parsing Technologies (IWPT)2001. [paper] [bib]

An O(n^3) Agenda-Based Chart Parser for Arbitrary Probabilistic Context-Free GrammarsDan Klein and Chris Manning2001. [paper] [bib]