
AlphaFold at Five Years: Boltz-2 and the Push Toward Binding Affinity

Five years after AlphaFold2 largely solved single-chain protein structure prediction, the field's frontier has shifted to biomolecular complexes and binding affinity. AlphaFold3, the open-source Boltz-1, and Boltz-2 mark successive steps toward the drug discovery application that structural biology has long promised.

By ORAA Research

This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

When AlphaFold2 debuted at CASP14 in 2020, it effectively solved the single-chain protein structure prediction problem. Five years on, the research community has absorbed the implications and shifted attention to the next bottleneck: predicting not just what proteins look like, but how they interact with other molecules — and how tightly.

This progression matters because structure prediction, remarkable as it is, was always a waypoint. The application that pharma and biotech actually need is binding affinity prediction: given a drug candidate and a protein target, how strongly do they bind? That question remains only partially answered, but the trajectory from AlphaFold2 through AlphaFold3 to Boltz-1 and Boltz-2 shows a field converging on it.

The Research Landscape

AlphaFold3: From Single Chains to Complexes

Abramson et al. (2024) introduced AlphaFold3 with a substantially updated architecture that replaces AlphaFold2's structure module with a diffusion-based one. The key capability extension is that AF3 predicts complexes involving proteins, nucleic acids, small molecules, and ions, moving from single-chain folding to all-atom biomolecular structure prediction. In the benchmarks reported in the paper, AF3 showed notable improvements in protein-ligand and protein-nucleic acid complex prediction over earlier specialized tools and over its predecessor. CASP16, held later in 2024, then offered an independent blind assessment.

However, Abriata and Dal Peraro (2025), analyzing practical outcomes from CASP16, note important nuances. Protein monomer and domain prediction is "largely solved, with barely any space for further improvements at the backbone level." But complex prediction — especially for protein-ligand interfaces — shows substantial room for improvement. The accuracy for ligand pose prediction, while better than previous methods, does not approach the accuracy achieved for protein backbone prediction.
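Ligand pose accuracy of the kind discussed above is commonly summarized as a root-mean-square deviation (RMSD) between predicted and experimental ligand atom positions. The following is a minimal sketch, not the scoring code used by CASP or by any of the cited papers. It assumes the two structures are already superposed on the protein, that atoms are listed in the same order, and that no symmetry correction is needed. The function names, the toy coordinates, and the 2 Å cutoff (a widely used convention) are illustrative assumptions.

```python
import math


def pose_rmsd(predicted, reference):
    """RMSD in angstroms between two equally ordered lists of (x, y, z) atoms."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (px - rx) ** 2 + (py - ry) ** 2 + (pz - rz) ** 2
        for (px, py, pz), (rx, ry, rz) in zip(predicted, reference)
    )
    return math.sqrt(squared / len(predicted))


def is_acceptable_pose(predicted, reference, cutoff=2.0):
    """True when the pose RMSD is within the cutoff (assumed 2 angstrom convention)."""
    return pose_rmsd(predicted, reference) <= cutoff


# Hypothetical three-atom ligand; the coordinates are made up for illustration.
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
near_pose = [(x + 1.0, y, z) for x, y, z in reference]  # shifted 1 angstrom along x
far_pose = [(x, y, z + 5.0) for x, y, z in reference]   # shifted 5 angstroms along z

for name, pose in [("near", near_pose), ("far", far_pose)]:
    print(name, round(pose_rmsd(pose, reference), 2), is_acceptable_pose(pose, reference))
```

A single cutoff hides a lot: a pose can pass at 2 Å and still miss the specific contacts that matter for affinity, which is part of why backbone-level accuracy does not carry over directly to ligands.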

Boltz-1: Open-Source Democratization

A significant development was the release of Boltz-1 (Wohlwend et al., 2024), an open-source model that achieves AlphaFold3-level accuracy on biomolecular complex structure prediction. Boltz-1 introduced innovations in model architecture and speed optimization while making the technology freely available — addressing a major community concern about AlphaFold3's restricted access.

Boltz-1's importance is as much political as technical. By providing open-source access to comparable capabilities, it enabled the research community to experiment with, extend, and build upon the technology in ways that proprietary access alone could not support.

Boltz-2: Binding Affinity as a First-Class Objective

Passaro et al. (2025) introduce Boltz-2 with an explicit focus on the gap between structure prediction and binding affinity prediction. Their core argument is that accurately predicting the 3D structure of a protein-ligand complex is necessary but not sufficient for predicting how tightly the drug binds, which is the thermodynamic quantity that determines pharmacological activity.

Boltz-2 extends the Boltz-1 architecture with modules designed to predict binding affinity alongside structure. Early results reported by the authors show improved correlation between predicted and experimental binding affinities compared with approaches that rely on structure prediction alone. The paper has already been widely cited, reflecting the field's interest in this specific capability.
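To make "how tightly" concrete: for a simple 1:1 binding equilibrium, the dissociation constant Kd and the standard binding free energy are linked by the textbook relation ΔG° = RT ln(Kd / 1 M). The sketch below only illustrates that relation and is not part of Boltz-2. The function names and the choice of 298.15 K are assumptions made for the example.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def binding_free_energy(kd_molar, temperature_k=298.15):
    """Standard binding free energy in kcal/mol from Kd in mol/L (1 M standard state)."""
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    return R_KCAL * temperature_k * math.log(kd_molar)


def kd_fold_error(free_energy_error, temperature_k=298.15):
    """Fold error in Kd implied by a free energy error given in kcal/mol."""
    return math.exp(abs(free_energy_error) / (R_KCAL * temperature_k))


nanomolar = binding_free_energy(1e-9)    # a 1 nM binder
micromolar = binding_free_energy(1e-6)   # a 1 uM binder
print(round(nanomolar, 2), round(micromolar, 2))
print(round(kd_fold_error(1.0), 1))      # Kd fold error from a 1 kcal/mol miss
```

Because the relation is logarithmic, each tenfold change in Kd corresponds to only about 1.4 kcal/mol at room temperature, so small free energy errors translate into large errors in predicted potency.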

The Broader Affinity Prediction Landscape

Wang, Wu, and Wang (2024) provide context through a comprehensive review of structure-based deep learning models for protein-ligand binding affinity prediction. Their analysis identifies a critical gap: most existing methods achieve moderate correlation with experimental affinities on standard benchmarks (PDBbind) but struggle with generalization to novel protein families and out-of-distribution targets. The "benchmark success versus real-world failure" problem is a recurring theme.
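The generalization gap described above is usually exposed by holding out whole protein families rather than splitting complexes at random. The sketch below shows that idea with a Pearson correlation computed separately on seen and held-out families. The record fields, the family labels, and all affinity values are invented for illustration and do not come from PDBbind or any cited paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def split_by_family(records, held_out_families):
    """Split records so that held-out families never appear in the seen set."""
    seen = [r for r in records if r["family"] not in held_out_families]
    held_out = [r for r in records if r["family"] in held_out_families]
    return seen, held_out


# Toy records: measured vs. predicted affinities (pKd-like units), all made up.
records = [
    {"family": "kinase", "measured": 6.2, "predicted": 6.0},
    {"family": "kinase", "measured": 7.8, "predicted": 7.5},
    {"family": "kinase", "measured": 8.9, "predicted": 8.6},
    {"family": "protease", "measured": 5.1, "predicted": 5.4},
    {"family": "protease", "measured": 7.0, "predicted": 6.7},
    {"family": "protease", "measured": 9.1, "predicted": 8.8},
    {"family": "gpcr", "measured": 6.0, "predicted": 7.1},
    {"family": "gpcr", "measured": 7.5, "predicted": 6.4},
    {"family": "gpcr", "measured": 8.8, "predicted": 7.0},
]

seen, held_out = split_by_family(records, {"gpcr"})
for name, subset in [("seen families", seen), ("held-out family", held_out)]:
    r = pearson_r([x["measured"] for x in subset], [x["predicted"] for x in subset])
    print(name, round(r, 2))
```

Reporting both numbers side by side, rather than a single benchmark-wide correlation, is one simple way to make the "benchmark success versus real-world failure" pattern visible.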

Critical Analysis

Claim	Evidence	Verdict
AlphaFold3 extends accurate prediction to biomolecular complexes	CASP16 results show clear improvements over AF2 for complexes	✅ Supported — with the caveat that ligand pose accuracy lags behind backbone accuracy
Boltz-1 matches AlphaFold3 accuracy while being open-source	Benchmarks on CASP targets and PDB test sets show comparable performance	✅ Supported — an important democratization milestone
Boltz-2 advances binding affinity prediction	Early results show improved affinity correlation; community adoption is rapid	⚠️ Promising — independent benchmarking on diverse targets needed
Structure prediction is now sufficient for drug discovery	Predicted structures often lack the sub-angstrom accuracy needed for affinity estimation	❌ Not yet — structure prediction is necessary but not sufficient
The field is converging on a complete computational drug design pipeline	Each generation addresses a specific bottleneck (structure → complex → affinity)	⚠️ Directionally correct — but significant gaps remain (dynamics, entropy, solvation)

What Structure Prediction Cannot Yet Capture

Even perfect static structure prediction would not solve the binding affinity problem completely. Binding is a thermodynamic process involving:

Conformational dynamics: Proteins are not rigid. Binding often involves conformational changes (induced fit) that alter the binding interface. Current models predict a single (or few) static structures.

Entropic contributions: Binding entropy — the loss of rotational and translational freedom upon complex formation, the reorganization of solvent molecules — contributes substantially to binding free energy and is not captured by static structure prediction.

Water-mediated interactions: Bridging water molecules at the binding interface contribute to binding affinity in ways that are difficult to predict from structure alone.

Protonation states: The protonation state of amino acids at the binding interface affects electrostatic interactions and can change upon binding — a subtlety that structure prediction models do not explicitly model.
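As one concrete instance of the effects listed above, the protonation point can be made quantitative with the Henderson-Hasselbalch relation, which gives the protonated fraction of an isolated titratable group as 1 / (1 + 10^(pH − pKa)). The sketch below is a textbook approximation rather than something the reviewed models compute. The function name and the pKa values (about 6.0 for a solvent-exposed histidine side chain, and an assumed upward shift in a binding pocket) are illustrative.

```python
def fraction_protonated(pka, ph):
    """Protonated fraction of an isolated titratable group (Henderson-Hasselbalch)."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


PHYSIOLOGICAL_PH = 7.4

# Illustrative pKa values: a typical solvent-exposed histidine side chain,
# and a hypothetical histidine whose pKa is shifted upward inside a pocket.
exposed_histidine = fraction_protonated(6.0, PHYSIOLOGICAL_PH)
shifted_histidine = fraction_protonated(7.4, PHYSIOLOGICAL_PH)

print(round(exposed_histidine, 3))  # mostly neutral at pH 7.4
print(round(shifted_histidine, 3))  # half protonated when pKa equals pH
```

A pKa shift of little more than one unit moves the residue from mostly neutral to half charged, which changes the electrostatics at the interface without any visible change in heavy-atom coordinates.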

The CASP16 Reality Check

Abriata and Dal Peraro (2025) provide a measured assessment from CASP16. The competition confirmed that for protein monomers and domains, prediction is effectively solved. For protein-protein interfaces, accuracy has improved but remains below the threshold needed for confident binding energy estimation. For protein-ligand complexes, accuracy varies substantially by target class, with some predictions achieving excellent poses and others placing the ligand in fundamentally wrong orientations.

Open Questions

Dynamics integration: Can molecular dynamics simulations be efficiently coupled with deep learning structure prediction to capture conformational ensembles?

Prospective drug discovery validation: Most benchmarks are retrospective. Can Boltz-2 or similar models prospectively identify novel drug candidates that are validated experimentally?

Covalent and allosteric binding: Current models focus mainly on orthosteric binding. Covalent inhibitors and allosteric modulators involve distinct mechanisms that may require specialized architectures.

Training data limitations: Experimental binding affinity data (PDBbind, BindingDB) is biased toward druggable targets and high-affinity binders. Models trained on these data may underperform on novel target classes.

Integration with medicinal chemistry: Structure prediction must interface with the practical constraints of drug design — synthetic accessibility, selectivity, pharmacokinetics — which are not captured by binding affinity alone.

Closing

The five-year trajectory from AlphaFold2 to Boltz-2 traces a clear research agenda: from single-chain structure to biomolecular complexes to binding affinity. Each step addresses a specific limitation of the previous generation, and each step moves closer to the drug discovery application that motivates the field. Boltz-2's explicit focus on binding affinity, combined with Boltz-1's open-source accessibility, represents the current frontier. But the gap between predicting a plausible structure and predicting a reliable binding affinity remains substantial, limited by dynamics, entropy, and solvation effects that static deep learning models do not yet capture.

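Disclaimer: This post is an overview of research trends provided for informational purposes. Before citing it in scholarly work, verify specific findings, statistics, and claims against the original papers.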

References

Abramson, J., Adler, J., Dunger, J., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature, 630, 493–500.

Wohlwend, J., Corso, G., Passaro, S., et al. (2024). Boltz-1: Democratizing biomolecular interaction modeling. bioRxiv.

Passaro, S., Corso, G., Wohlwend, J., et al. (2025). Boltz-2: Towards accurate and efficient binding affinity prediction. bioRxiv.

Abriata, L., & Dal Peraro, M. (2025). Practical outcomes from CASP16 for users in need of biomolecular structure prediction. Proteins: Structure, Function, and Bioinformatics.

Wang, D. D., Wu, W., & Wang, R. (2024). Structure-based, deep-learning models for protein-ligand binding affinity prediction. Journal of Cheminformatics, 16, Article 2.
