This project will develop novel methods for removing sensitive information from genomic and protein language models without having to retrain them from scratch. The methods that will be developed here, will focus on ESM3 and EVO 2 but they will be transferable across genomic and protein language models. We will study how to identify and remove targeted biological knowledge from these models while keeping their broader capabilities intact, by measuring performance across different benchmarks.
The work will bring together three researchers with expertise in AI, genomics and biosafety, Dr. Georgakopoulos-Soares, Mr. Aris Karatzikos and Mr. Kimon Provatas. The team will design unlearning algorithms, test them on genomic and protein language models, and evaluate whether the models can successfully "forget" viral and bacterial virulence, transmissibility and toxicity capabilities without losing performance on useful downstream tasks.
The findings will include open-source code for unlearning across biological data, and a set of case studies showing how unlearning can improve the safety of different genomic and protein language models.