SPratt2 documentation
Inge Jonassen, Dept. of Informatics, University of Bergen, Norway
Installation
You receive a tar file containing the executable program(s) and this documentation.
In order to use the program, you need to unpack the tar file by writing for example
$ tar xvf Spratt22.tar
The following files will be produced:
spratt2
postp
help.html
The first two are executable programs, the third is this file.
You also need the dssp program dsspcmbi that can be obtained separately.
Running SPratt2
SPratt2 is run from the command line. The format of the command line is as follows:
spratt2 <filename> <maxdist> <minangle> <min_support> <max_rmsd>
where filename is the name of a file containing PDB file names
maxdist is in angstrom max dist between
two neighbours
minangle is the minimum cosine value
min_support is the minimum number of structures to match a pattern
max_rmsd is the maximum rms deviation between a pattern and a match
Additional parameters can be given.
- List of structures:
- SPratt2 takes as input a file containing a list of filenames/PDB identities. It is
assumed that the structures to be analysed are present in the working directory each
with a filename identical to its PDB id. PDB ids are by convention four characters
long. If only one chain from a PDB entry is to be read, the identity of the chain
should be given as a fifth letter, e.g., 1flmA identifies the file 1flm, chain A.
An example of an input file to SPratt2:
1hud
1gcb
1fieA
1aim
1ppn
SPratt2 also reads a dssp output file for each structure. In the example, SPratt2
will look for 1hud.dssp, 1gcb.dssp, 1fie.dssp, 1iam.dssp, 1ppn.dssp.
This can be produced by dsspcmbi that must be obtained separately and run prior to SPratt2.
In order to avoid the time consuming parsing of PDB and dssp files each time the program is
run, SPratt2 will produce binary files containing the necassary information about each
structure. In the example, the first time SPratt2 is run, it will read the pdb and dssp files
and produce binary files 1hud.bin, 1gcb.bin, 1fieA.bin, ... The next time the same structures
are analysed, the binary files will be read.
- maxdist
- This is the maximum distance (in angstroms) between two residues for
them to be considered as neighbours.
Typical values are 8, 10, 12.
- minangle
- To reduce the lengths of neighbour strings and still keep the residues most
likely to be interacting, i.e., residues facing eachother. Residues i and j are
facing each other if angle i and angle j both have cosine values larger than or
equal to minangle.
Angle i is the angle between the vector from c alpha i to
c alpha j and the vector from c alpha i to mean side chain atom i.
Angle j is the angle between the vector from c alpha j to
c alpha i and the vector from c alpha j to mean side chain atom j.
If minangle=-1, all residues within maxdist from residue i will be included in
residue i's neighbour string. If minangle=0, all residues within distance maxdist
that are facing roughly towards i (and i towards them) will be included in residue
i's neighbour string.
- minsupport
- This is the minimum number of structures to match a structure pattern.
- max_rmsd
- This is the maximum root mean square deviation to be accepted when superposing
a possible pattern match onto the pattern.
Running postp
The postp program takes as input an output file from SPratt2 and makes a summary
output.
postp is run from the command line. The format of the command line is as follows:
postp <pattern file> <pdbids file> <scop file> [options]
where pattern file is a spratt2 output file
pdbids file is a file with the IDs of the structures analysed by spratt2
scop file is a file with scop IDs for the pdbids
subset options: -nr - patterns ranked by support
-sc - patterns ranked by score
-cover - gives subset of patterns covering structure set
-all - include all patterns
output format: -html - produce html output
-latex - produce latex output (not yet supported)
Example
First running SPratt2:
spratt2 t 10 -1 3 1 > t.out
This produces output file t.out, then run the post-processing program:
postp t.out t dir.dom.scop.txt_1.50 -cover -sc -html > t.html
br>which produces the output file t.html which can be viewed using
any web browser, e.g., Netscape or Internet Explorer.