Yao Fu

Project maintained by FranxYao Hosted on GitHub Pages — Theme by mattgraham

Blue Hours Seattle. 2022

Google Scholar / Semantic Scholar / Github / Twitter / LinkedIn / Instagram / CV / Blog

Yao Fu 符尧. yao.fu@ed.ac.uk

I am a Ph.D. student at the University of Edinburgh (2020-) with professor Mirella Lapata and currently a research intern at Allen Institute for AI. I finished my M.S. at Columbia University (2018-2020) with professor John Cunningham and my B.S. at Peking University (2013-2018) with professor Yansong Feng. Before Ph.D., I spent great time visiting professor Alexander Rush at Cornell Tech (2019-2020).

I study large-scale probabilistic generative models for human language.

In the era of large language models, my research focuses on specialized language models, complex reasoning, emergent abilities, and how to inject strong abilities to language models from first principles. My article on tracing emergent abilities to their sources is now an important roadmap about large language model evolution.

Before the LLM era, I studied latent variable models for language generation and structure prediction.

Selected Work

Preprints and Conference Publications

Workshop Publications

Blog and Open Source