The 2024 Nobel Prize in Chemistry recognized Dr. David Baker for his pioneering work in computational protein design, along with Drs. Demis Hassabis and John Jumper for their contributions to protein structure prediction. Dr. Baker, a professor at the University of Washington and a Cancer Consortium member, has spent decades developing computational algorithms that allow scientists to design novel proteins—proteins not found in nature. His work focuses on how proteins acquire their 3D shapes (and, by extension, their functions). This ability to design customized proteins ‘from scratch’ unlocks the potential to create highly targeted pharmaceuticals, vaccines, nanomaterials, and more.
To design novel proteins, Baker developed the Rosetta software starting in 1998. This tool assembles fragments from diverse protein structures with similar local sequences in the Protein Data Bank, simultaneously optimizing the protein’s sequence and structure. Without Rosetta, determining a protein’s structure could take years, but with this computational approach, researchers can achieve results far faster and with remarkable precision. It is worth noting that when Rosetta was first developed, its accuracy was fairly poor -it is only recently that these algorithms were accurate enough to begin taking the place of experimental structure determination.
Now, let’s look at the real-world applications of Baker’s work, starting with Endocytosis-targeting proteins, or EndoTags. These proteins are engineered to enhance endocytosis, a cellular process where cells internalize extracellular molecules, which are then broken down via the lysosome pathway. EndoTags are designed to bind receptor sites without interfering with the receptor’s natural ligands, “hijacking” the cell’s internal pathways to promote degradation of targeted proteins. The results were recently published in Nature.
To test EndoTags, the team explored endocytic receptors with different tissue expression and downstream mechanisms. These targets included (1) Insulin-like Growth Factor 2 Receptor (IGF2R), which is found in most tissues and triggers endocytosis through receptor dimerization, (2) Asialoglycoprotein Receptor (ASGPR), which is primarily expressed in the liver and requires clustering for endocytosis, (3) Transferrin Receptor (TfR), which is expressed in the brain, liver, and muscles, and cycles between the cell surface and internal compartments, and (4) Sortilin, which is found in the central nervous system and similarly cycles through the cell’s internal structures.