I'm a senior studying computer science at the University of Pennsylvania, currently working with Tomek Korbak through ML Alignment & Theory Scholars on reducing risks from AI.

Previously, I spent two years leading NLP research at an EdTech startup. I've also done research at IBM and NAVER Cloud, where I proposed conditional activation steering and curriculum instruction tuning.

I've been fortunate to learn from diverse mentors throughout my journey: Alex Cloud, Alex Turner, Mantas Mazeika, Inkit Padhi, Karthikeyan N. Ramamurthy, Hyunsoo Cho, and Kang Min Yoo.

Besides research, I spend a lot of time building and improving research tooling. Some of my open source projects include Activation-Steering (2024), LFTK (2023), and LingFeat (2021).

I'm working to think more in public. When papers significantly shape my thinking, I try to document them in my Reading Notes. I also write about ideas that interest me in my Blog Posts.

Outside of research, I served as cabin crew on a military utility helicopter in the Korean Marine Corps. Now, I compete on Penn's varsity rowing team. Before discovering ML, I competed in physics olympiads and tournaments throughout middle and high school.

Papers Browse all →

Distillation Robustifies Unlearning. Lee, B.W., Foote, A., Infanger, A., Shor, L., Kamath, H., Goldman-Wetzler, J., Woodworth, B., Cloud, A. and Turner, A.M. NeurIPS 2025 (Spotlight)
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs. Mazeika, M., Yin, X., Tamirisa, R., Lim, J., Lee, B.W., Ren, R., Phan, L., Mu, N., Khoja, A., Zhang, O. and Hendrycks, D. NeurIPS 2025 (Spotlight)
Programming Refusal with Conditional Activation Steering. Lee, B.W., Padhi, I., Ramamurthy, K.N., Miehling, E., Dognin, P., Nagireddy, M. and Dhurandhar, A. ICLR 2025 (Spotlight)

Writings Browse all →

Neural Networks, Strange Attractors, and Orderliness in Chaos. Why do neural networks appear chaotic at the neuron level yet produce high-level representations that can be probed linearly? Strange attractors might offer a useful intuition for this apparent paradox.
On Getting Started in Research. What does it mean to become a scientist? What makes science so attractive?
Mechanistically Programming a Language Model's Behavior. Can we do activation steering with fewer side effects? Can we program, rather than optimize, model behavior? This post introduces a technique for identifying and manipulating specific activation patterns to steer model outputs.
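For context on what that means in practice, here is a minimal, illustrative sketch of plain activation steering, not the conditional method from the post or the exact API of the Activation-Steering library: a steering vector is computed as the difference of hidden states on two contrasting prompts, then added into a middle layer at inference time. The model name, layer index, and steering strength below are assumptions chosen for demonstration.

```python
# Minimal sketch of activation steering under assumed choices of model,
# layer, and strength; not the conditional-steering method from the post.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumed small model, purely for illustration
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

LAYER = 6       # assumed block index to steer
STRENGTH = 4.0  # assumed steering strength

def mean_hidden(text: str) -> torch.Tensor:
    """Mean hidden state after block LAYER for a single prompt."""
    ids = tok(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**ids, output_hidden_states=True)
    # hidden_states[0] is the embedding output, so block LAYER is index LAYER + 1.
    return out.hidden_states[LAYER + 1].mean(dim=1)  # shape: (1, d_model)

# Steering vector: difference of activations on a contrasting prompt pair.
steer = mean_hidden("I am warm, polite, and helpful.") \
      - mean_hidden("I am cold, rude, and dismissive.")

def add_steering(module, inputs, output):
    # GPT-2 blocks return a tuple whose first element is the hidden states;
    # shift them by the steering vector and pass the rest through unchanged.
    return (output[0] + STRENGTH * steer,) + output[1:]

handle = model.transformer.h[LAYER].register_forward_hook(add_steering)
prompt = tok("Tell me about your day.", return_tensors="pt")
print(tok.decode(model.generate(**prompt, max_new_tokens=30)[0]))
handle.remove()  # detach the hook so later calls run unsteered
```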