New AlphaFold DB release with 200+ million predicted structures
On the 22th of July of last year, DeepMind and EMBL released the most complete database of predicted 3D structures of human proteins1.
Today with @emblebi, we're launching the #AlphaFold Protein Structure Database, which offers the most complete and accurate picture of the human proteome, doubling humanity’s accumulated knowledge of high-accuracy human protein structures - for free: https://t.co/vtBGmTkKhy 1/ pic.twitter.com/XgBQTn2fuC
— DeepMind (@DeepMind) July 22, 2021
EMBL and @DeepMind have partnered – a breakthrough for science.
— EMBL (@embl) July 22, 2021
Together, we're providing a treasure trove of protein structure predictions powered by #AlphaFold to herald a new era for #AI-enabled biology. https://t.co/QSM55eB2Fk pic.twitter.com/5FnHQjzusb
The initial release provided ~350,000 structure models including 20 biologically-significant organisms such as E.coli, fruit fly, mouse, zebrafish, malaria parasite and tuberculosis bacteria. The second release of the AlphaFold DB was composed by 992,316 structure models. That is >400,000 new protein structure predictions for most of the manually-curated UniProt entries in UniProtKB/SwissProt.
🎉New #AlphaFold data! With @DeepMind, we’ve more than doubled the size of the database & added predictions for most of the manually-curated @uniprot entries in UniProtKB/SwissProt.
— EMBL-EBI (@emblebi) December 9, 2021
That's >400,000 new protein structure predictions for you to explore!https://t.co/Mvq4cC2ilI pic.twitter.com/cV7dGf7Z8u
A year since the initial release, AlphaFold latest release2 provides an astronomical increase in the number of predicted 3D structures available. From less than a million to a whopping 214,684,311 structures. Deputy Director General and Joint Director of EMBL-EBI, Ewan Birney, said:
From today, the database has been expanded from one million protein structure predictions to more than 200 million, covering almost the whole of UniProt.
Today’s update includes structures for the widest possible range of species - including plants, bacteria, and additional animals and organisms. This expansion will help countless more scientists, including EMBL colleagues, in their work, and improve the quality and quantity of our research outputs.
This dataset release is available for sequence searching in our Sequence Similarity Search bioinformatics applications and Dbfetch.
1 DeepMind and EMBL release the most complete database of predicted 3D structures of human proteins
2 AlphaFold predicts structure of almost every catalogued protein known to science