simap

Dec. 29th, 2013 04:01 pm
paroh: (Default)
[personal profile] paroh
"2013, Dec 14: Removal of metagenomes and re-calculation of SIMAP
We going to migrate the entire SIMAP database in 2014 to a new algorithm for sequence similarity calculation. We will not longer use FASTA heuristics but switch to the exact Smith-Waterman algorithm. The scoring by BLOSUM50 will be continued, but will incorporate composition-based score adjustment, such as in BLAST. During the migration we will maintain the current, FASTA based SIMAP, thus keeping SIMAP up-to-date and online. However, in order to minimize the computational and storage requirements, we will temporary remove those metagenomes from SIMAP that were imported from IMG/M, Camera, HMP and ENA/WGS. Only environmental sequences in ENA (from Uniprot/TrEMBL) will remain. Metagenomes will be integrated again after the migration of the non-metagenomic SIMAP to the new algorithm has been finished."

2013-08-22 было 159,791,039 протеинов.
2013-12-15 - 135,381,744.

Я конечно не специалист, но видимо, 24 миллиона юнитов только временно убраны из базы, чтобы было проще считать.

March 2026

S M T W T F S
1234 5 67
89 10 1112 13 14
15161718 192021
22232425262728
293031    

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Mar. 23rd, 2026 05:03 pm
Powered by Dreamwidth Studios