Predicting from Strings: Language Model Embeddings for Bayesian Optimization

Published: 14 October 2024

Abstract

Bayesian Optimization is ubiquitous in the field of experimental design and blackbox optimization for improving search efficiency, but has been traditionally restricted to regression models which are only applicable to fixed search spaces and tabular input features. We propose Embed-then-Regress, a paradigm for applying in-context regression over string inputs, through the use of string embedding capabilities of pretrained language models. By expressing all inputs as strings, we are able to perform general-purpose regression for Bayesian Optimization over various domains including synthetic, combinatorial, and hyperparameter optimization, obtaining comparable results to state-of-the-art Gaussian Process-based algorithms.

Authors

Tung Nguyen, Qiuyi Zhang, Bangding Yang, Chansoo Lee, Jorg Bornschein, Yingjie Miao, Sagi Perel, Yutian Chen, Xingyou Song

Venue

arXiv

Gemini

Gemma

Generative models

Gemini model ecosystem

Projects

Publications

News

AI for biology

AI for climate and sustainability

AI for mathematics and computer science

AI for physics and chemistry

AI transparency

News

Careers

Milestones

Education

Responsibility

The Podcast

Predicting from Strings: Language Model Embeddings for Bayesian Optimization

Abstract

Authors

Venue

Predicting from Strings: Language Model Embeddings for Bayesian Optimization

Share

Abstract

Authors

Venue