Monday, December 28, 2009

Protein sequences are still badly annotated in major databases

A new paper in PLoS Comp Bio reminds us that sequences are still often misannotated!

Schnoes AM, Brown SD, Dodevski I, Babbitt PC (2009) Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies. PLoS Comput Biol 5(12): e1000605. doi:10.1371/journal.pcbi.1000605

http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000605
I had expected the problem to go away as more data became available and database managers gained experience, but this does not seem to be the case. SwissProt annotators, however, can pride themselves on having few errors.
