The LabelHash Server

Help

This server runs the LabelHash algorithm as described in our CSB 2008 paper. Given a point-based motif, it will compute all matches in either the full PDB or a non-redundant version of the PDB (with less than 95% sequence identity), both from November 11, 2008. The points in this case are the C-alpha coordinates of a number of residues. It is possible to specify alternate residue labels for each motif point. Computing matches in the full PDB is about 5 times slower than computing matches in the non-redundant PDB.

On the main page, fill out the form to specify a motif. If you click submit, your job will be submitted to a queue. You will receive an email message once your job starts. Once your job is finished, you will receive another email message with a file of matches attached. You can visualize the file with matches using our ViewMatch plugin for Chimera. You can also look at them as plain text using the command:
xmllint -format mymatches.xml | less
Xmllint is part of the libxml2 package and is often already installed on UNIX-like operating systems such as Linux and OS X. If you would like to run LabelHash on your own machines, download the command line version of LabelHash.

Sample Output

Below are some example motifs that were used to test the LabelHash algorithm. For each motif we have pre-computed the 100 best matches in the entire PDB from November 11, 2008. For each motif we list the EC class it represents, the PDB ID of the protein that was used to get the 3D coordinates of the motif, and the motif residues. The residues are specified by residue ID in the corresponding PDB file. For each residue the allowed residue types are denoted in superscript by their one-letter residue abbreviations. So if only one residue type is listed, no substitutions are allowed. The last column has a link to the corresponding file of matches. Right-click on the link to save these match files on your computer. These files can be opened with the ViewMatch plugin for Chimera. You can also look at them as plain text using your browser or an editor.

EC classPDB IDMotifMatches
1.14.13.39 1dww 194C, 346V, 366W, 367FY, 371E, 376DN 1dww-matches.xml
2.1.1.73 7mht 80P, 81C, 85ST, 119EL, 163R, 165R 7mht-matches.xml
2.1.1.77 1jg1 97DENQ, 99G, 101AGL, 160DNS, 179ILV, 183EGN 1jg1-matches.xml
2.1.1.79 1kpg 17D, 72G, 74G, 75W, 76G, 200F 1kpg-matches.xml
2.7.4.6 1ucn 12K, 13P, 92G, 105R, 115N, 118G 1ucn-matches.xml
3.1.3.1 1ani 51AD, 101DE, 102S, 166CRS, 331GH, 412HNQ 1ani-matches.xml
3.2.1.15 1czf 178N, 180D, 201D, 256HR, 258K, 291Y 1czf-matches.xml
3.4.24.27 8tln 120LMW, 143AE, 144ILV, 157LSY, 231HL 8tln-matches.xml
4.1.1.48 1lbf 51E, 56S, 57P, 89F, 91G, 112F, 159E, 180N, 211S, 233G 1lbf-matches.xml
4.1.1.49 1ayl 249L, 250S, 251G, 253G, 254K, 255T 1ayl-matches.xml
4.2.1.84 2ahj 53P, 120L, 127Y, 190V, 193D, 196I 2ahj-matches.xml
5.1.3.13 1ep0 53AST, 61AR, 64H, 73K, 90R, 172D 1ep0-matches.xml
5.3.1.5 1did 25F, 53H, 56D, 93F, 136W, 182K 1did-matches.xml
6.1.1.14 1ggm 188ET, 311LR, 239ET, 241E, 359ES, 361AS 1ggm-matches.xml
6.1.1.20 1b7y 149AGW, 178HQ, 180ST, 206DER, 218Q, 258FNY, 260FY 1b7y-matches.xml
6.1.1.21 1ady 81DE, 83T, 112RS, 130DE, 264LY, 311KNQR 1ady-matches.xml
6.3.4.5 1kp3 106R, 139F, 202ES, 286K, 288R, 331Y 1kp3-matches.xml

An XML match file is structured as follows: