This server runs the LabelHash algorithm as described in our CSB 2008 paper. Given a point-based motif, it will compute all matches in either the full PDB or a non-redundant version of the PDB (with less than 95% sequence identity), both from November 11, 2008. The points in this case are the C-alpha coordinates of a number of residues. It is possible to specify alternate residue labels for each motif point. Computing matches in the full PDB is about 5 times slower than computing matches in the non-redundant PDB.
On the main page, fill out the form to specify a motif. If you click submit, your job will be submitted to a queue. You will receive an email message once your job starts. Once your job is finished, you will receive another email message with a file of matches attached. You can visualize the file with matches using our ViewMatch plugin for Chimera. You can also look at them as plain text using the command: xmllint -format mymatches.xml | less
Xmllint is part of the libxml2 package and is often already installed on UNIX-like operating systems such as Linux and OS X. If you would like to run LabelHash on your own machines, download the command line version of LabelHash.
Below are some example motifs that were used to test the LabelHash algorithm. For each motif we have pre-computed the 100 best matches in the entire PDB from November 11, 2008. For each motif we list the EC class it represents, the PDB ID of the protein that was used to get the 3D coordinates of the motif, and the motif residues. The residues are specified by residue ID in the corresponding PDB file. For each residue the allowed residue types are denoted in superscript by their one-letter residue abbreviations. So if only one residue type is listed, no substitutions are allowed. The last column has a link to the corresponding file of matches. Right-click on the link to save these match files on your computer. These files can be opened with the ViewMatch plugin for Chimera. You can also look at them as plain text using your browser or an editor.
| EC class | PDB ID | Motif | Matches |
|---|---|---|---|
| 1.14.13.39 | 1dww | 194C, 346V, 366W, 367FY, 371E, 376DN | 1dww-matches.xml |
| 2.1.1.73 | 7mht | 80P, 81C, 85ST, 119EL, 163R, 165R | 7mht-matches.xml |
| 2.1.1.77 | 1jg1 | 97DENQ, 99G, 101AGL, 160DNS, 179ILV, 183EGN | 1jg1-matches.xml |
| 2.1.1.79 | 1kpg | 17D, 72G, 74G, 75W, 76G, 200F | 1kpg-matches.xml |
| 2.7.4.6 | 1ucn | 12K, 13P, 92G, 105R, 115N, 118G | 1ucn-matches.xml |
| 3.1.3.1 | 1ani | 51AD, 101DE, 102S, 166CRS, 331GH, 412HNQ | 1ani-matches.xml |
| 3.2.1.15 | 1czf | 178N, 180D, 201D, 256HR, 258K, 291Y | 1czf-matches.xml |
| 3.4.24.27 | 8tln | 120LMW, 143AE, 144ILV, 157LSY, 231HL | 8tln-matches.xml |
| 4.1.1.48 | 1lbf | 51E, 56S, 57P, 89F, 91G, 112F, 159E, 180N, 211S, 233G | 1lbf-matches.xml |
| 4.1.1.49 | 1ayl | 249L, 250S, 251G, 253G, 254K, 255T | 1ayl-matches.xml |
| 4.2.1.84 | 2ahj | 53P, 120L, 127Y, 190V, 193D, 196I | 2ahj-matches.xml |
| 5.1.3.13 | 1ep0 | 53AST, 61AR, 64H, 73K, 90R, 172D | 1ep0-matches.xml |
| 5.3.1.5 | 1did | 25F, 53H, 56D, 93F, 136W, 182K | 1did-matches.xml |
| 6.1.1.14 | 1ggm | 188ET, 311LR, 239ET, 241E, 359ES, 361AS | 1ggm-matches.xml |
| 6.1.1.20 | 1b7y | 149AGW, 178HQ, 180ST, 206DER, 218Q, 258FNY, 260FY | 1b7y-matches.xml |
| 6.1.1.21 | 1ady | 81DE, 83T, 112RS, 130DE, 264LY, 311KNQR | 1ady-matches.xml |
| 6.3.4.5 | 1kp3 | 106R, 139F, 202ES, 286K, 288R, 331Y | 1kp3-matches.xml |
An XML match file is structured as follows: