This is the manual for the program suite DAMAVER, a set of programs to align
ab initio models, select the most typical one and build an averaged model.
The following sections briefly describe the different programs that are part
of DAMAVER, how to run them and what the input and output files are.
If you use results from DAMAVER in your own publication, please cite:
DAMAVER is a set of programs to align ab initio low resolution models
(e.g. provided by DAMMIN and/or
GASBOR), select the most typical ("probable")
one and build an averaged model.The program package requires
SUPCOMB. In the following DAMAVER means the
program suite, whereas damaver refers to the actual program that is part of it.
This suite contains the following programs:
damsel: compare all models, find most
probable one and outliers (uses SUPCOMB)
damsup: align all models with the most probable one
(uses SUPCOMB)
damaver: average aligned models and compute
probability map
damfilt: filter the averaged model at a given
cut-off volume
damstart: generates from the averaged model
an input file with fixed core for DAMMIN
(for those who want to refine the averaged model)
Note that the most typical usage of this suite is to "let DAMAVER do all the work". Thus,
in most cases you only need DAMAVER and therefore one can go to the DAMAVER manual straight
away from here. Alternatively, you can use the different programs separately and the order
of the programs as listed above reflects the typical order
in which they are normally run in order to obtain an averaged model.
The examples shown are all based on the same models and therefore
essentially describe a full session of DAMAVER. Please refer to the paper
cited above for further details about the implemented algorithm.
Please note that besides DAMAVER there is also DAMCLUST
that allows for clustering of models instead of simply rejecting outliers from the
largest group.
DAMSEL compares sets of models, finds the most probable one and defines outliers
by using the NSD values computed by SUPCOMB.
Given several structures in PDB format, the program superimposes all possible
pairs by applying SUPCOMB. The output file
provides a cross correlation table according to
NSD and the list of input files with the respective recommondations of inclusion
or exclusion for each file. See the description of the output
file for details.
NOTE: For DAMMIN or GASBOR averaging, 10-20 models are recommended. Models whose
NSD exceeds 2 standard deviations from the mean are considered outliers.
Here, the FILE(s) are either a list of PDBs or a single file listing the file names
to read (see the description of the input files for details).
OPTIONS for DAMSEL are described in the following section.
Specify the symmetry to consider. Any P-n-m symmetry known by other programs
is supported (Pn, n=1, ..., 19 and Pn2, n=2, ..., 12). By default, no symmetry
is assumed (P1).
Enable/disable the search of enantiomorphs, i.e. either one of a pair of molecules that are mirror images of each other but are not identical. By default this is enabled.
As described in the introduction,
SUPCOMB is called for all pairs of models. Consequently,
the output of DAMSEL corresponds to N*(N-1)/2 times the output of SUPCOMB. Please
refer to the runtime output of SUPCOMB for details.
DAMSEL output files are built up in sections. After the header, a symmetric
cross correlation table which lists the NSD values of each model to each other.
For example:
Further, the mean value of the differences between the files and the respective standard
deviation are listed. The values are used to determine which files to include and exclude:
It is recommend to discard files with nsd > Mean + 2*Standard deviation
Recommendation NSD File
Include 1.131 t02.pdb
Include 1.133 t03.pdb
Include 1.148 t01.pdb
Include 1.175 t04.pdb
The first file listed shall be used as a reference frame in DAMSUP.
Here, FILE shall be a log file as written by DAMSEL. If no file
name is specfied, DAMSUP assumes damsel.log as input.
$ damsup [OPTIONS] <FILE> <FILE(s)>
Alternatively, a list of PDB files may be provided. The first FILE is the reference model.
The second to the N-th FILE(s) will be superimposed onto the reference.
Specify the symmetry to consider. Any P-n-m symmetry known by other programs
is supported (Pn, n=1, ..., 19 and Pn2, n=2, ..., 12). By default, no symmetry
is assumed (P1).
Enable/disable the search of enantiomorphs, i.e. either one of a pair of molecules that are mirror images of each other but are not identical. By default this is enabled.
As described in the introduction, SUPCOMB is called for all
superpositions of input files onto the reference. Consequently, the output of DAMSUP
corresponds to N-1 times the output of SUPCOMB. Please refer to the runtime output of
SUPCOMB for details.
DAMSUP either reads an output log of DAMSEL, or input models directly. If a DAMSEL
log file is given as input, the first included file is assumed to be the reference, all
other included files are superimposed. Any files marked as discarded by DAMSEL are ignored.
Each superimposed model is written to a file that gets r appended
to its basename, i.e.
$ damsup foo.pdb bar1.pdb bar2.pdb
will write bar1r.pdb and bar2r.pdb.
Further, DAMSUP also provides a log file which provides a list of the superimposed
PDB files plus their respective NSD compared to the reference. For example:
DAMAVER averages models aligned by SUPCOMB or
DAMSUP and computes a probability map. More specifically, given
a list of models in PDB format which are aligned to yield the best overlap, DAMAVER
remaps those onto a grid of densely packed beads in order to compute a frequency map.
For each bead, the cross volume with proximal dummy atoms in the input models (occupancy)
is computed and saved into the output file. This output
file can be processed further by DAMFILT for a filtered model
which does generally not fit the data, and DAMSTART which
creates an input model suitable for DAMMIN which, after
refinement, would fit the data.
Here, FILE shall be a log file as written by DAMSUP. If no file name
is specfied, damsup.log is assumed.
$ damaver [OPTIONS] <FILE> <FILE(S)>
Alternatively, a list of PDB files may be provided for averaging.
The models are assumed to be aligned. If also the --automatic flag is used,
models are selected and superimposed first.
Specify the symmetry to consider. Any P-n-m symmetry known by other programs
is supported (Pn, n=1, ..., 19 and Pn2, n=2, ..., 12). By default, no symmetry
is assumed (P1).
Enable/disable the search of enantiomorphs, i.e. either one of a pair of molecules that are mirror images of each other but are not identical. By default this is enabled.
--nbeads=<N>
Overall number of beads within the resulting DAM (default: 5000)
In the case of automatic mode, i.e., when all of the different programs
of the suite are called, please refer to their respective runtime output
sections.
DAMAVER either reads an output log of DAMSUP, or input models directly.
If a DAMSUP log file is given as input, the listed filenames are read and averaged.
If no file name is given, damsup.log is read by default.
If model file names are specified at the command line, they are used as given.
DAMFILT filters an averaged model created by DAMAVER at a
given cut-off volume. Given the frequency map computed by DAMAVER and
the value of a cut-off volume, DAMFILT removes low occupancy and loosely connected atoms
and writes a compact - most probable - model to the output file.
The output volume is selected to be close to the cut-off volume (expected
excluded volume of the particle). This value is either read from the
input file or specified on the command line.
Please note: the filtered model does generally not fit the data.
Here, FILE is an input PDB file of the model to be filtered. If no file name is specified,
damaver.pdb is assumed. The default cut-off value for filtering is taken from the input file
if available, half of the volume of the input model otherwise. A user-defined cut-off can be
specified at the command-line has precedende.
When running damfilt, the program
prints on the screen information about the number of atoms read, the
center of the dummy atom model, maximum radius, number of phases, atomic
radius, excluded volume per atom, average excluded volume read, minimum
and maximum number of contacts, number of atoms written and final excluded
volume. For an example, see damfilt examples.
$ damfilt taver.pdb -o tfilt.pdb
Read file .............................................. : taver.pdb
Number of atoms ........................................ : 925
Number of phases ....................................... : 1
Minimum number of contacts ............................. : 2
Maximum number of contacts ............................. : 12
Selected contact threshold ............................. : 4
Atomic radius .......................................... : 2.750
Excluded volume per atom ............................... : 117.7
Maximum radius ......................................... : 64.07
Average excluded volume ................................ : 0.0
Selected cut-off volume ................................ : 5.445e+4
Final contact threshold ................................ : 4
Final cut-off volume ................................... : 5.445e+4
Final number of atoms .................................. : 463
Final volume ........................................... : 5.451e+4
Wrote file ............................................. : tfilt.pdb
Given the frequency map computed by
DAMAVER and the value of a cut-off volume,
DAMSTART generates a modification of the DAMAVER model with fixed core for further use in
DAMMIN as an initial approximation.
The core indices of high occupancy atoms with fair number of contacts are
set to 1 so that their phases will not change in a DAMMIN run. DAMAVER
writes this model to the file damstart.pdb by default. The core volume is selected to
be close to the cut-off volume (one half of expected excluded
volume of the particle). This value is either read from the input file
(default file name is damaver.pdb, see damaver)
or specified by a command line option. If the information on the expected particle
volume is missing in the input pdb file then the cuf-off value is taken
to be one quarter of the volume of the model.
When running damstart in batch mode (see
Command-Line Arguments and Options, the program
prints on the screen information about the number of atoms read, the number of phase,
minimum and maximum number of contacts, the selected and final contact threshold, the
selected and final cut-off volume the atomic radius, the excluded volume per atom,
maximum radius, average excluded volume read, number of atoms written and final excluded
volume of the created model. For an example, see damstart
examples.
The output file of DAMSTART is the filtered model with a fixed core written to
damstart.pdb by default. The header of the file includes the filter conditions applied.
$ damstart taver.pdb -o tstart.pdb
Read file .............................................. : taver.pdb
Number of atoms ........................................ : 925
Number of phases ....................................... : 1
Minimum number of contacts ............................. : 2
Maximum number of contacts ............................. : 12
Selected contact threshold ............................. : 4
Atomic radius .......................................... : 2.750
Excluded volume per atom ............................... : 117.7
Maximum radius ......................................... : 64.07
Average excluded volume ................................ : 0.0
Selected cut-off volume ................................ : 5.445e+4
Final contact threshold ................................ : 4
Final cut-off volume ................................... : 5.445e+4
Final number of atoms .................................. : 463
Final volume ........................................... : 5.451e+4
Wrote file ............................................. : tstart.pdb