New AlphaFold DB release with 200+ million predicted structures

On the 22nd of July last year, DeepMind and EMBL released the most complete database of predicted 3D structures of human proteins¹.

The initial release provided ~350,000 structure models, including those for 20 biologically significant organisms such as E. coli, fruit fly, mouse, zebrafish, the malaria parasite and tuberculosis bacteria. The second release of the AlphaFold DB comprised 992,316 structure models. That is >400,000 new protein structure predictions, covering most of the manually curated UniProt entries in UniProtKB/Swiss-Prot.

A year after the initial release, the latest AlphaFold DB release² provides an astronomical increase in the number of predicted 3D structures available: from fewer than a million to a whopping 214,684,311 structures. Deputy Director General and Joint Director of EMBL-EBI, Ewan Birney, said:

From today, the database has been expanded from one million protein structure predictions to more than 200 million, covering almost the whole of UniProt.

Today’s update includes structures for the widest possible range of species - including plants, bacteria, and additional animals and organisms. This expansion will help countless more scientists, including EMBL colleagues, in their work, and improve the quality and quantity of our research outputs.

This dataset release is available for sequence searching in our Sequence Similarity Search bioinformatics applications and for entry retrieval in Dbfetch.
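As a rough illustration of retrieving an entry through Dbfetch, the sketch below only builds a request URL. It rests on several assumptions that are not stated in this announcement: that the AlphaFold DB is exposed under the database name `afdb`, that a `fasta` format is offered for it, and that a UniProt accession such as `P69905` is an accepted identifier. Please check the Dbfetch database list before relying on any of these values. The helper name `dbfetch_url` is hypothetical.

```python
from urllib.parse import urlencode

# Base URL of the EMBL-EBI Dbfetch service.
DBFETCH_BASE = "https://www.ebi.ac.uk/Tools/dbfetch/dbfetch"


def dbfetch_url(entry_id, db="afdb", fmt="fasta", style="raw"):
    """Build a Dbfetch request URL for a single entry.

    Assumptions (not confirmed by the announcement): "afdb" is the Dbfetch
    name for the AlphaFold DB and "fasta" is one of its available formats.
    """
    query = urlencode({"db": db, "id": entry_id, "format": fmt, "style": style})
    return f"{DBFETCH_BASE}?{query}"


# "P69905" is used purely as an illustrative accession.
url = dbfetch_url("P69905")
print(url)
```

The resulting URL can be opened in a browser or passed to an HTTP client such as `urllib.request.urlopen` to download the entry.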


1 DeepMind and EMBL release the most complete database of predicted 3D structures of human proteins

2 AlphaFold predicts structure of almost every catalogued protein known to science