Research
Publications
-
- Date
- Title
- Demonstration-Regularized RL
- Authors
- Venue
-
- Date
- Title
- AtP*: Efficient and scalable methods for localizing LLM behaviour to components
- Authors
- Venue
-
- Date
- Title
- How aligned are different alignment metrics?
- Authors
- Venue
-
- Date
- Title
- Towards Practical Reinforcement Learning for Tokamak Magnetic Control
- Authors
- Venue
-
- Date
- Title
- Approximating the Core of Cooperative Games
- Authors
- Venue
-
- Date
- Title
- Bad Students Make Great Teachers: Active Learning Accelerates Large Scale Visual Understanding
- Authors
- Venue
-
- Date
- Title
- Self-supervised video pretraining yields strong image representations
- Authors
- Venue
-
- Date
- Title
- Set Learning for Accurate and Calibrated Models
- Authors
- Venue
-
- Date
- Title
- Intriguing Properties of Generative Classifers
- Authors
- Venue
-
- Date
- Title
- A density estimation perspective on learning from pairwise human preferences
- Authors
- Venue
-
- Date
- Title
- Frozen Feature Augmentation
- Authors
- Venue
-
- Date
- Title
- Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
- Authors
- Venue
-
- Date
- Title
- On Limitations of the Transformer Architecture
- Authors
- Venue
-
- Date
- Title
- AlphaTensor for Optimizing Quantum Computations
- Authors
- Venue
-
- Date
- Title
- Genie: Generative Interactive Environments
- Authors
- Venue
-
- Date
- Title
- When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
- Authors
- Venue
-
- Date
- Title
- OmniPred: Language Models as Universal Regressors
- Authors
- Venue
-
- Date
- Title
- The Next 700 ML-Enabled Compiler Optimizations
- Authors
- Venue
-
- Date
- Title
- Simulacra as Conscious Exotica
- Authors
- Venue
-
- Date
- Title
- Learning to Learn Faster from Human Feedback with Language Model Predictive Control
- Authors
- Venue