simap

Dec. 29th, 2013 04:01 pm
[personal profile] paroh
"2013, Dec 14: Removal of metagenomes and re-calculation of SIMAP
We going to migrate the entire SIMAP database in 2014 to a new algorithm for sequence similarity calculation. We will not longer use FASTA heuristics but switch to the exact Smith-Waterman algorithm. The scoring by BLOSUM50 will be continued, but will incorporate composition-based score adjustment, such as in BLAST. During the migration we will maintain the current, FASTA based SIMAP, thus keeping SIMAP up-to-date and online. However, in order to minimize the computational and storage requirements, we will temporary remove those metagenomes from SIMAP that were imported from IMG/M, Camera, HMP and ENA/WGS. Only environmental sequences in ENA (from Uniprot/TrEMBL) will remain. Metagenomes will be integrated again after the migration of the non-metagenomic SIMAP to the new algorithm has been finished."

On 2013-08-22 there were 159,791,039 proteins.
On 2013-12-15 there were 135,381,744.

I'm no specialist, of course, but apparently the roughly 24 million entries have only been removed from the database temporarily, to make the recalculation easier.
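
For anyone who wants to check the arithmetic behind that "24 million", here is a small sketch in Python. The two counts are the ones quoted above; the variable names are just mine, and the percentage is my own addition rather than anything the SIMAP announcement states.

```python
# Protein counts in SIMAP on the two dates mentioned above.
before = 159_791_039  # 2013-08-22
after = 135_381_744   # 2013-12-15

# How many entries disappeared between the two dates.
removed = before - after  # 24,409,295

# The same drop as a share of the earlier total (roughly 15%).
share = removed / before * 100

print(f"removed: {removed:,} entries ({share:.1f}% of the earlier total)")
```

So the drop is about 24.4 million entries, which fits the announcement's statement that the metagenomes were taken out for the duration of the migration.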
