Reader Comments

Post a new comment on this article

Number of sequences vs. diversity of sequences

Posted by Becky on 17 Dec 2011 at 01:40 GMT

Like aarvey, I think the plot of number of sequences vs. time is interesting. But is number the right metric? Obtaining more sequences that are very similar doesn't help the analysis. Is there a measure of how the diversity of sequences obtained (w.r.t. this analysis) changes over time? I'd be fascinated to know how many covarying residues you need to predict a novel fold.

No competing interests declared.

RE: Number of sequences vs. diversity of sequences

sbliven replied to Becky on 10 Jan 2012 at 19:10 GMT

They do mention that only 12-40% of the sequences are effectively used after downweighting pairs with 70% identity.

No competing interests declared.