|
Biological Small Angle Scattering Group |
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
GASBOR manual |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
ATSAS v. 2.3.2 Documentation written by M.Petoukhov, D.I.Svergun, and M.J.Gajda. © ATSAS Team, 2000-2009 ManualIntroductionGASBOR is program for ab initio reconstruction of protein structure by a chain-like ensemble of dummy residues. It has been published along with the paper Determination of domain structure of proteins from X-ray solution scattering of D.I. Svergun, M.V. Petoukhov & M.H.J. Koch. Works on following platforms: Windows (WinNT, Win95, Win98, Win2K, Win XP, Vista), Linux. Algorithm descriptionThe use of GASBOR is similar to that of DAMMIN or DAMMIF. Most of parameters have the same meaning. The most important difference is that the protein structure is represented not by dummy spheres on lattice (called dummy atoms in DAMMIN/DAMMIF, but not corresponding to real atoms), but rather by an ensemble of dummy residues (corresponding to average residue densities) placed anywhere in continuous space with a preferred number of close distance neighbours for each atom. The centers of these residues aim to approximate positions of the C-α atoms in the protein structure. The number of residues should be equal to that in the protein. Note, however, that these residues are anonymous, in the sense that their ordinal numbers in the model has nothing to do with the numbering primary sequence of the protein! Accordingly, the program does not subtract any Porod constant from the experimental data. In DAMMIN, it was recommended to discard high angle portions of the scattering patterns; in GASBOR, on the contrary, one should use them. The program is able to fit the data up to the resolution of 5 ångströms, i.e. momentum transfer s=4*π*sin(θ)/λ = 1.2 [1/Ångström]. Running GASBORUNIX/LINUX users: It may be necessary to first increase stack size limit, before running the program: bash$ ulimit -s unlimited
otherwise program crashes with Command-Line Arguments and Options
Major parameters can be specified from the command line (see below)
to run the program in batch mode. gasbor <gnom.out> <Num_DRs> [/lo <log file name>] [/sy <symmetry>] [/id <"project description">]
where Interactive ConfigurationGASBOR reads in output files of GNOM (the latter is an indirect transformation program available from ATSAS package server.) After starting GASBOR you may specify:
There are two versions of GASBOR, one performing the fit of
the intensity in reciprocal space ( Multiprocessor systems and OpenMP
LINUX version of GASBOR may use multiple processors using OpenMP multithreading library. Number
of threads used may be set with For further details and configuration variables, please refer to OpenMP documentation. Runtime Output
After printing program version number and querying or printing all
parameters, GASBOR will display a message that j: 1 T: 0.100E-02 Suc: 5500 Eva: 11544 CPU: 0.427E+02 SqF: 0.5172 Rf: 0.08396 His: 26.20 Bnd: 1.302 Dis:0.1593 Per :0.2196
After run is completed, final χ2 against data is printed to the output. GASBOR Input FilesThe only input file is GNOM output containing both regularized scattering curve and P(r) (for real-space GASBOR.) GASBOR Output FilesAfter the program is finished, you will get the files:
PDB outputPDB-alike output file from GASBOR contains:
Limitations
For large proteins (>2000 aminoacids), DAMMIF/DAMMIN is recommended -- it will run much faster and give similar results. The influence of the internal structure for large macromolecules is less important and the shape approximation would do a good job. ExamplesLysozyme
Lysozyme has no symmetry, and 129 residues: You may also use command line: bash$ gasbor22i gnlyzfu.out 129 Here is resulting output: *** Ab inito reconstruction of a protein structure *** *** by a chain-like ensemble of dummy residues *** *** Version 2.2i build 31.07.08 *** *** Last modified --- 31/07/08 20:00 *** *** Please reference: D.I.Svergun, M.V.Petoukhov & *** *** M.H.J.Koch (2001) Biophys. J. 80, 2946-2953 *** *** Copyright (c) ATSAS Team *** *** EMBL, Hamburg Outstation, 2000 - 2008 *** Type gasbor22i /help for batch mode use === GASBOR Version 2.2i build 31.07.08 started on 29-Sep-2009 13:37:34 Project identificator .................................. : gnlyzf Enter project description .............. : Random sequence initialized from ....................... : 133734 ** Information read from the GNOM file ** Data set title: Angular axis n01000.sax Datafile n10000.sub Raw data file name: lyzful.dat Maximum diameter of the particle ....................... : 50.00 Solution at Alpha = 0.500E+00 Rg : 0.144E+02 I(0) : 0.526E+03 Radius of gyration ..................................... : 14.40 Number of GNOM data points ............................. : 230 Maximum s value [1/angstrom] ........................... : 1.316 Number of Shannon channels ............................. : 20.95 Number of knots in the curve to fit .................... : 42 Symmetry: P1...19 or Pn2 (n=1,..,12) Number of equivalent positions ......................... : 1 Number of dummy waters ................................ : 90 Excluded volume per residue ............................ : 28.73 Radius of the search volume ............................ : 25.00 Histogram penalty weight ............................... : 1.000e-3 Bond length penalty weight ............................. : 1.000e-2 Discontiguity penalty weight ........................... : 1.000e-2 Peripheral penalty weight .............................. : 1.000 Expected particle shape: <P>rolate, <O>blate, Contrast of the hydration layer ........................ : 3.000e-2 Computation of the initial intensity ... Histogram penalty value ................................ : 36.62 Bond length penalty value .............................. : 1.930 Initial DRM # of graphs ................................ : 61 Discontiguity value .................................. : 1.196 Peripheral penalty value ............................... : 0.2645 Weight: 0-2 = s^2, 3-5 = s, 6 = log .................... : 2 *** Accounting for constant background *** Initial scale factor ................................... : 1.409e-4 Constant background subtracted ......................... : -1.095 Initial R^2 factor ..................................... : 5.796e-2 Initial R factor ..................................... : 0.2408 Initial penalty ........................................ : 0.3324 Initial fVal ........................................... : 0.3904 R-factor fixing threshold .............................. : 0.0 Fixing threshold PenCha ................................ : 0.0 Fixing threshold PenLen ................................ : 0.0 Initial annealing temperature .......................... : 1.000e-3 Annealing schedule factor .............................. : 0.9000 # of independent atoms to modify ....................... : 1 Max # of iterations at each T .......................... : 55000 Max # of successes at each T ........................... : 5500 Min # of successes to continue ......................... : 55 Max # of annealing steps ............................... : 100 ==== Simulated annealing procedure started ==== j: 1 T: 0.100E-02 Suc: 5500 Eva: 11556 CPU: 0.329E+01 SqF: 0.5116 Rf: 0.10745 His: 26.97 Bnd: 1.419 Dis:0.0807 Per :0.2082 ... j: 36 T: 0.250E-04 Suc: 55 Eva: 1425327 CPU: 0.378E+03 SqF: 0.0936 Rf: 0.02932 His: 6.93 Bnd: 0.067 Dis:0.0000 Per :0.4966 Final Chi against raw data ............................. : 0.9592 === GASBOR Version 2.2i build 31.07.08 finished on 29-Sep-2009 13:43:55 Use in the batch mode: gasbor22i <Inp_File> <Num_DRs> [/key1 <key1>]... [/keyN <keyN>] where the compulsory arguments Inp_File and Num_DRs are the name of a GNOM output file (extension .out) and the number of dummy residues in asymmetric part The following program options can be given as keys with their values (defaults are given in brackets) /lo Log file name (same as the GNOM file name) /sy Particle symmetry (P1) /id Project description (command line content) Transketolase
Transketolase is homodimer in solution, and each monomer has 680 residues,
giving a total of 1360 residues): *** Ab inito reconstruction of a protein structure *** *** by a chain-like ensemble of dummy residues *** *** Version 2.2i build 21.06.06 *** *** Last modified --- 21/06/06 12:00 *** *** Please reference: D.I.Svergun, M.V.Petoukhov & *** *** M.H.J.Koch (2001) Biophys. J. 80, 2946-2953 *** *** Copyright (c) ATSAS Team *** *** EMBL, Hamburg Outstation, 2000 - 2005 *** Type gasbor22i /help for batch mode use === GASBOR Version 2.2i build 21.06.06 started on 06-Oct-2009 16:42:28 Computation mode (User or Expert) ...... < User >: Log file name .......................... < .log >: log Input data, GNOM output file name ...... < .out >: 1trk.out Project identificator .................................. : log Enter project description .............. : project Random sequence initialized from ....................... : 164228 ** Information read from the GNOM file ** Data set title: Transketolase collated from n85, o14+o16 6-11-98 Raw data file name: trkexp.dat Maximum diameter of the particle ....................... : 12.00 Solution at Alpha = .164E+01 Rg : .336E+01 I(0) : .190E+03 Radius of gyration ..................................... : 3.360 Number of GNOM data points ............................. : 283 Angular units in the input file : 4*pi*sin(theta)/lambda [1/angstrom] (1) 4*pi*sin(theta)/lambda [1/nm ] (2) < 2 >: 2 Angular units multiplied by ............................ : 0.1000 Maximum diameter divided by ............................ : 0.1000 Maximum s value [1/angstrom] ........................... : 0.3418 Number of Shannon channels ............................. : 13.06 Portion of the curve to be fitted ...... < 1.000 >: Number of knots in the curve to fit .................... : 26 Initial DRM (CR for random) ............ < .pdb >: Symmetry: P1...19 or Pn2 (n=1,..,12) or P23 or P432 or PICO ................. < P1 >: P2 Number of equivalent positions ......................... : 2 Number of residues in asymmetric part .. < 517 >: 680 Fibonacci grid order ................... < 15 >: Number of dummy waters ................................ : 988 Excluded volume per residue ............................ : 28.73 Radius of the search volume ............................ : 60.00 Histogram penalty weight ............................... : 1.000e-3 Bond length penalty weight ............................. : 1.000e-2 Discontiguity penalty weight ........................... : 1.000e-2 Peripheral penalty weight .............................. : 1.000 Expected particle shape: <P>rolate, <O>blate, or <U>nknown .......................... < Unknown >: Contrast of the hydration layer ........................ : 3.000e-2 Computation of the initial intensity ... Histogram penalty value ................................ : 37.38 Bond length penalty value .............................. : 1.604 Initial DRM # of graphs ................................ : 708 Discontiguity value .................................. : 2.191 Peripheral penalty value ............................... : 0.2647 Weight: 0-2 = s^2, 3-5 = s, 6 = log .................... : 2 *** Accounting for constant background *** Initial scale factor ................................... : 5.042e-7 Constant background subtracted ......................... : 0.3339 Initial R^2 factor ..................................... : 3.837e-2 Initial R factor ..................................... : 0.1959 Initial penalty ........................................ : 0.3400 Initial fVal ........................................... : 0.3784 R-factor fixing threshold .............................. : 0.0 Fixing threshold PenCha ................................ : 0.0 Fixing threshold PenLen ................................ : 0.0 Initial annealing temperature .......................... : 1.000e-3 Annealing schedule factor .............................. : 0.9000 # of independent atoms to modify ....................... : 1 Max # of iterations at each T .......................... : 130000 Max # of successes at each T ........................... : 13000 Min # of successes to continue ......................... : 130 Max # of annealing steps ............................... : 100 ==== Simulated annealing procedure started ==== j: 1 T: 0.100E-02 Suc: 13000 Eva: 14987 CPU: 0.142E+03 SqF: 0.5510 Rf: 0.11559 His: 36.01 Bnd: 1.958 Dis:0.4537 Per :0.2301 ... j: 56 T: 0.304E-05 Suc: 63 Eva: 3730743 CPU: 0.386E+05 SqF: 0.0789 Rf: 0.01835 His: 5.43 Bnd: 0.046 Dis:0.0000 Per :0.3158 Final Chi against raw data ............................. : 1.211 === GASBOR Version 2.2i build 21.06.06 finished on 07-Oct-2009 03:25:52 Use in the batch mode: gasbor22i <Inp_File> <Num_DRs> [/key1 <key1>]... [/keyN <keyN>] where the compulsory arguments Inp_File and Num_DRs are the name of a GNOM output file (extension .out) and the number of dummy residues in asymmetric part The following program options can be given as keys with their values (defaults are given in brackets) /lo Log file name (same as the GNOM file name) /sy Particle symmetry (P1) /id Project description (command line content) References |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Last edited : |
July 7, 2010 |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© Copyright BioSAXS Group 2010