simap

Dec. 29th, 2013 04:01 pm
[personal profile] paroh
"2013, Dec 14: Removal of metagenomes and re-calculation of SIMAP
We going to migrate the entire SIMAP database in 2014 to a new algorithm for sequence similarity calculation. We will not longer use FASTA heuristics but switch to the exact Smith-Waterman algorithm. The scoring by BLOSUM50 will be continued, but will incorporate composition-based score adjustment, such as in BLAST. During the migration we will maintain the current, FASTA based SIMAP, thus keeping SIMAP up-to-date and online. However, in order to minimize the computational and storage requirements, we will temporary remove those metagenomes from SIMAP that were imported from IMG/M, Camera, HMP and ENA/WGS. Only environmental sequences in ENA (from Uniprot/TrEMBL) will remain. Metagenomes will be integrated again after the migration of the non-metagenomic SIMAP to the new algorithm has been finished."
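For anyone who has not met it: Smith-Waterman finds the best-scoring local alignment between two sequences by filling in a dynamic-programming table, which is why it is exact but more expensive than the FASTA heuristics. Below is only a rough sketch of the idea, not SIMAP's implementation. The function name `smith_waterman_score` is mine, and I am assuming a simple match/mismatch score with a linear gap penalty in place of BLOSUM50 and composition-based adjustment.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Assumption: toy match/mismatch scoring and a linear gap penalty,
    not the BLOSUM50 scoring described in the announcement.
    """
    prev = [0] * (len(b) + 1)  # previous row of the DP table
    best = 0
    for ca in a:
        curr = [0]  # first column is always 0
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            # Local alignment: a cell never drops below 0.
            cell = max(0, diag, up, left)
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best
```

The table has one cell per pair of positions, so the cost grows with the product of the two sequence lengths. That gives some feeling for why recalculating a database of this size is a big job.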

On 2013-08-22 there were 159,791,039 proteins.
On 2013-12-15 there were 135,381,744.

I'm no specialist, of course, but apparently about 24 million entries have only been removed from the database temporarily, to make the recalculation easier.
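The difference is easy to check from the two counts quoted above. A tiny sketch (the variable names are just mine):

```python
before = 159_791_039  # protein count on 2013-08-22
after = 135_381_744   # protein count on 2013-12-15

removed = before - after
share = removed / before  # fraction of the August total that is gone

print(f"{removed:,} entries removed ({share:.1%} of the August total)")
```

So it is a little over 24 million entries, roughly 15% of the database as it stood in August.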
