Dataset updates for October 2024
The latest dataset releases are available for searching in our Sequence Similarity Search bioinformatics applications and for entry retrieval through Dbfetch.
Key updates
During October 2024, we released UniProtKB version 2024_05 and Ensembl Genomes version 60.
Overview of the current biological databases
The dataset composition as of 31 October 2024 is shown below. It can also be browsed on the JD Data Statistics page.
Name | Seq. Type | No. Datasets | No. Entries | Last Updated |
---|---|---|---|---|
AFDB | protein | 1 | 214,684,312 | 28/07/2022 12:12:03 |
CDP | nucleotide | 1 | 200,030 | 27/06/2022 16:42:58 |
ChEMBL | protein | 1 | 14,321 | 05/09/2024 14:34:46 |
EMVec | nucleotide | 1 | 7,561 | 17/07/2024 04:59:53 |
ENA | nucleotide | 115 | 78,584,200 | 18/07/2024 19:50:37 |
ENA cds | nucleotide | 88 | 394,866,750 | 19/07/2024 17:34:29 |
ENA expcon | nucleotide | 1 | 43,043 | 17/07/2024 04:49:43 |
ENA ncr | nucleotide | 64 | 49,305,221 | 28/10/2024 01:01:15 |
ENA rrna | nucleotide | 38 | 4,340,725 | 23/08/2023 00:37:29 |
Ens | mixed | 13,796 | 383,195,110 | 25/10/2024 08:31:58 |
EnsCovid | mixed | 4 | 26 | 31/12/2023 00:24:02 |
EnsGenomes | mixed | 167,321 | 367,524,186 | 17/10/2024 10:51:38 |
EPO | protein | 1 | 5,108,408 | 29/10/2024 00:34:41 |
HMMER3 | protein | 7 | 207,034 | 16/06/2022 10:42:07 |
IMGTHLAcds | nucleotide | 1 | 41,428 | 11/10/2024 00:25:10 |
IMGTHLAgen | nucleotide | 1 | 23,373 | 11/10/2024 00:26:29 |
IMGTHLApro | protein | 1 | 41,215 | 11/10/2024 00:24:50 |
IMGTLIGM | nucleotide | 1 | 246,842 | 22/07/2024 11:23:42 |
IntAct | protein | 1 | 124,556 | 19/09/2024 00:38:24 |
InterPro | protein | 1 | 46,035 | 04/10/2024 01:02:06 |
IPDKIRcds | nucleotide | 1 | 1,534 | 22/07/2024 11:21:13 |
IPDKIRgen | nucleotide | 1 | 880 | 22/07/2024 11:22:11 |
IPDKIRpro | protein | 1 | 1,387 | 22/07/2024 11:22:15 |
IPDMHCcds | nucleotide | 1 | 11,506 | 22/07/2024 11:20:11 |
IPDMHCgen | nucleotide | 1 | 3,008 | 22/07/2024 11:21:43 |
IPDMHCpro | protein | 1 | 11,506 | 22/07/2024 11:22:42 |
IPDNHKIRcds | nucleotide | 1 | 1,072 | 22/07/2024 11:22:11 |
IPDNHKIRgen | nucleotide | 1 | 13 | 22/07/2024 11:22:11 |
IPDNHKIRpro | protein | 1 | 1,072 | 22/07/2024 11:23:12 |
IPRMC | protein | 1 | 248,906,029 | 04/10/2024 11:13:30 |
IPRMC_UNIPARC | protein | 1 | 1 | 10/10/2024 09:49:39 |
JPO | protein | 1 | 6,565,074 | 27/09/2024 00:52:22 |
KIPO | protein | 1 | 2,087,869 | 27/09/2024 00:30:16 |
MP | protein | 1 | 1,228,767 | 22/07/2024 11:18:39 |
MPEP | protein | 1 | 1,228,278 | 22/07/2024 11:18:07 |
MPRO | protein | 1 | 5,098 | 22/07/2024 11:17:40 |
PANTHER | protein | 1 | 123,151 | 10/11/2023 01:10:31 |
Patent Equivalents | protein | 1 | 119,710 | 03/04/2023 14:31:12 |
PDB | protein | 1 | 820,157 | 31/10/2024 00:35:26 |
PDBaa | protein | 1 | 820,157 | 31/10/2024 00:33:56 |
PDBna | nucleotide | 1 | 52,124 | 31/10/2024 00:32:46 |
Pfam | protein | 1 | 21,979 | 26/06/2024 01:22:33 |
Rfam | nucleotide | 1 | 4,178 | 19/09/2024 01:03:21 |
TAXONOMY | other | 1 | 1 | 31/10/2024 00:35:48 |
TreeFam | protein | 1 | 15,736 | 10/11/2023 00:48:42 |
UniParc | protein | 401 | 1,666,992,840 | 02/10/2024 18:22:15 |
UniProtKB | protein | 3 | 14,695,808 | 02/10/2024 18:21:49 |
UniProtKB Divisions | protein | 18 | 141,616,350 | 02/10/2024 18:22:03 |
UniRef | protein | 3 | 36,000,000 | 02/10/2024 18:22:11 |
UniVec | nucleotide | 1 | 6,111 | 24/08/2024 00:16:42 |
USPTO | protein | 1 | 9,733,759 | 12/07/2024 01:39:53 |
WormBase | mixed | 1,120 | 22,358,127 | 24/04/2024 15:23:51 |
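For readers who want to work with these figures programmatically, the following is a minimal sketch of how rows in the table above could be parsed. It assumes the rows are copied as pipe-delimited text exactly as shown, that entry counts use commas as thousands separators, and that timestamps are in DD/MM/YYYY HH:MM:SS order. The function names `parse_row` and `updated_in` are illustrative only and are not part of any EMBL-EBI tool.

```python
from datetime import datetime


def parse_row(line):
    """Parse one pipe-delimited table row into a dict (hypothetical helper)."""
    # Drop the trailing pipe, then split into the five columns.
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    name, seq_type, n_datasets, n_entries, updated = cells
    return {
        "name": name,
        "seq_type": seq_type,
        # Counts use commas as thousands separators, e.g. "5,108,408".
        "datasets": int(n_datasets.replace(",", "")),
        "entries": int(n_entries.replace(",", "")),
        # Timestamps are assumed to be day-first: DD/MM/YYYY HH:MM:SS.
        "updated": datetime.strptime(updated, "%d/%m/%Y %H:%M:%S"),
    }


def updated_in(rows, year, month):
    """Return the names of datasets last updated in the given month."""
    return [
        r["name"]
        for r in rows
        if r["updated"].year == year and r["updated"].month == month
    ]


# Three rows copied from the table above.
sample = [
    "EPO | protein | 1 | 5,108,408 | 29/10/2024 00:34:41 |",
    "Pfam | protein | 1 | 21,979 | 26/06/2024 01:22:33 |",
    "PDBna | nucleotide | 1 | 52,124 | 31/10/2024 00:32:46 |",
]
rows = [parse_row(s) for s in sample]
print(updated_in(rows, 2024, 10))
```

Parsing the timestamp with an explicit day-first format avoids misreading dates such as 11/10/2024 as 10 November.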