File:Annotation-Error-in-Public-Databases-Misannotation-of-Molecular-Function-in-Enzyme-Superfamilies-pcbi.1000605.s006.ogv
Size of this JPG preview of this OGG file: 606 × 600 pixels. Other resolutions: 243 × 240 pixels | 485 × 480 pixels | 748 × 740 pixels.
Original file (Ogg Theora video file, length 42 s, 748 × 740 pixels, 283 kbps, file size: 1.43 MB)
File information
Structured data
Captions
Summary edit
DescriptionAnnotation-Error-in-Public-Databases-Misannotation-of-Molecular-Function-in-Enzyme-Superfamilies-pcbi.1000605.s006.ogv |
English: Movie of the annotations from the NR database displayed by year (1993–2005). The movie tracks correctly annotated and misannotated sequences in the test set over the years 1993–2005. The similarity network is arranged by superfamily and colored as in figure 1 , i.e. all nodes of the same color were annotated to the same function. The network was generated from an all-by-all BLAST analysis of the test sequences with results that had BLAST E-value scores of 1×10−30 or lower retained. Nodes represent sequences deposited into the NR database during the years 1993–2005. Any two nodes are connected by an edge if at least one node found the other with a BLAST E-value less than or equal to 1×10−30. The network is visualized using Cytoscape v2.6.0-beta. The distance between any two connected nodes is roughly inversely proportional to the strength of the E-value between them (force-directed layout). The shapes of the nodes indicate annotation status: circles depict correctly annotated sequences and triangles depict incorrectly annotated sequences. Black arrows indicate examples in the haloacid dehalogenase family (HAD) and glyoxalase I family (VOC) that display potential evidence of error propagation. As these BLAST analyses were performed using a custom sequence database the resulting E-values are not necessarily comparable to the E-vaules determined by BLASTing against databases with large background models such as GenBank NR [60] . |
||
Date | |||
Source | Video S1 from Schnoes A, Brown S, Dodevski I, Babbitt P. "Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies". PLOS Computational Biology. DOI:10.1371/journal.pcbi.1000605. PMID 20011109. PMC: 2781113. | ||
Author | Schnoes A, Brown S, Dodevski I, Babbitt P | ||
Permission (Reusing this file) |
|
||
Provenance InfoField |
|
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 00:23, 4 May 2013 | 42 s, 748 × 740 (1.43 MB) | Pristurus (talk | contribs) | converted via Avisynth script: LoadVFAPIPlugin("QTReader.vfp", "QTReader") QTReader("pcbi.1000605.s006.mov") flipvertical() Crop(0, 6, -2, -4) ConvertToYUY2() ConvertFPS(60,zone=1) killaudio() | |
22:34, 3 May 2013 | 39 s, 752 × 752 (497 KB) | Pristurus (talk | contribs) | vlc transcoded | ||
20:44, 23 September 2012 | 42 s, 750 × 750 (361 KB) | Open Access Media Importer Bot (talk | contribs) | Uploaded with the Open Access Media Importer. (test edit) botrequest |
You cannot overwrite this file.
File usage on Commons
There are no pages that use this file.
Transcode status
Update transcode statusMetadata
This file contains additional information such as Exif metadata which may have been added by the digital camera, scanner, or software program used to create or digitize it. If the file has been modified from its original state, some details such as the timestamp may not fully reflect those of the original file. The timestamp is only as accurate as the clock in the camera, and it may be completely wrong.
Software used |
---|