simap

Dec. 29th, 2013 04:01 pm
paroh: (Default)
[personal profile] paroh
"2013, Dec 14: Removal of metagenomes and re-calculation of SIMAP
We going to migrate the entire SIMAP database in 2014 to a new algorithm for sequence similarity calculation. We will not longer use FASTA heuristics but switch to the exact Smith-Waterman algorithm. The scoring by BLOSUM50 will be continued, but will incorporate composition-based score adjustment, such as in BLAST. During the migration we will maintain the current, FASTA based SIMAP, thus keeping SIMAP up-to-date and online. However, in order to minimize the computational and storage requirements, we will temporary remove those metagenomes from SIMAP that were imported from IMG/M, Camera, HMP and ENA/WGS. Only environmental sequences in ENA (from Uniprot/TrEMBL) will remain. Metagenomes will be integrated again after the migration of the non-metagenomic SIMAP to the new algorithm has been finished."

2013-08-22 было 159,791,039 протеинов.
2013-12-15 - 135,381,744.

Я конечно не специалист, но видимо, 24 миллиона юнитов только временно убраны из базы, чтобы было проще считать.
This account has disabled anonymous posting.
If you don't have an account you can create one now.
HTML doesn't work in the subject.
More info about formatting

January 2026

S M T W T F S
    123
45678910
111213141516 17
18192021222324
25262728293031

Style Credit

Expand Cut Tags

No cut tags
Page generated Jan. 22nd, 2026 01:45 pm
Powered by Dreamwidth Studios