
Copyright © 2009, The ARP/wARP development team
This user guide covers the fundamentals of the ARP/wARP software suite and most of its applications. If you cannot find the answers to your questions here, please visit the ARP/wARP web page http://www.arp-warp.org
Chapter 1. General information
ARP/wARP is a software project for automated protein model building and structure refinement. It is based on a unified approach to the structure solution process by combining electron density interpretation using the concept of the hybrid model, pattern recognition in an electron density map and maximum likelihood model parameter refinement with REFMAC. The ARP/wARP software is under continuous development. Its present release, version 7.1, can be used in the following ways:
For the latest news and announcements please visit the ARP/wARP page (www.arp-warp.org). The developers will greatly appreciate all bug reports or suggested changes from the users.
The ARP/wARP package is freely available to academic users provided that they agree to the ARP/wARP license conditions and the applications of ARP/wARP are properly cited. Please consult the ARP/wARP log file for the most relevant citation.
Industrial users are requested to obtain a commercial license via the ARP/wARP web page.
Chapter 2. Installing ARP/wARP
The installation of ARP/wARP should be straightforward; please follow the procedure described below:
% gunzip arp_warp_7.1.tar.gz
% tar xvf arp_warp_7.1.tar
The distribution will unpack under the directory called arp_warp_7.1 that will contain all the required files and subdirectories. install.sh is an installation script to help you set the appropriate environmental variables. README will walk you through the installation process. ARP_wARP_CCP4I-v5.tar.gz includes everything necessary to run ARP/wARP from the CCP4i interface of version 5; ARP_wARP_CCP4I-v6.tar.gz includes everything to run tasks from the CCP4i interface of version 6.
3. Go to the directory arp_warp_7.1 and run there the install.sh script by simply typing
% ./install.sh
The script will check the following (please inspect its output, it should look like the one given in Appendix A):
Finally the install.sh script will output one line that must be added to your .cshrc, .bashrc or .zshrc file in order to setup the proper environment for ARP/wARP. The setup script will automatically recognise the machine type upon login and use the correct ARP/wARP binaries. The install.sh script should finish with no warnings and the statement of successful installation:
*** INSTALLATION OF ARP/wARP 7.1 HAS BEEN SUCCESSFUL ***
4. Modify your .cshrc / .bashrc as suggested in the output of install.sh. Important: source CCP4 first, ARP/wARP then, e.g.:
source /Users/testuser/ccp4/ccp4-6.0.2/include/ccp4.setup
source /Users/testuser/arp_warp_7.1/arpwarp_setup.csh
5. By default the install.sh script will attempt to uninstall previous ARP/wARP installation from the CCP4i GUI and install the 7.1 version there. Should this not be successful due to, e.g. ownership permissions (please inspect the output when running install.sh) or you explicitly tell the script to not
(re)install the GUI by typing ./install.sh no-gui, you may
need to setup CCP4i manually. To do this first start CCP4i by typing ccp4i and setup your project if you use CCP4i for the first time in your user account. Within CCP4i go to "System administration" and uninstall an earlier version of ARP/wARP (if there is any). Restart CCP4i. Go to "System administration" again and install the GUI by navigating to the file arp_warp_7.1/ARP_wARP_CCP4I-v5.tar.gz or arp_warp_7.1/ARP_wARP_CCP4I-v6.tar.gz depending on the CCP4 version that you use. The interface should gunzip, untar and install the content automatically. At this point, the installation is complete. Restart CCP4i and click on the desired ARP/wARP module’s button. The CCP4i Model Building GUI panel should show 9 ARP/wARP modules, see the figure to the right. If you observe additional ARP/wARP modules, the GUI may not function properly. Please un-install previous ARP/wARP versions.
We recommend that the installation of the CCP4 GUI be done by the

person who installed the CCP4 package, so that all users have an up-to-date interface and the correct permissions are set.
Unless you are already an experienced ARP/wARP user, you should try to get started with the test files provided in the directory arp_warp_7.1/examples. These include the data for protein chain tracing, helix/strands search, ligand and solvent building. A README file is included with the example data which gives more detailed information, which data are to be used for what. If things do not work as expected please consult your system manager first. If problems remain, please contact us at the ARP/wARP bulletin board (available from the ARP/wARP web page).
Chapter 3. Using ARP/wARP

This module of ARP/wARP provides the execution of the following model building tasks:
Applications (a) and (b) (the so-called warpNtrace protocol) start with input experimental / density modified phases or available (preliminary refined or partially autotraced) model and are aimed to deliver an essentially complete model and obviously an improved map. The warpNtrace protocol utilises the idea of the hybrid model in which protein and free atoms can co-exist. warpNtrace keeps whatever was recognised as protein (in a form of polypeptide fragments) and the rest as free atoms and refines this hybrid model during a 'big' cycle, consisting of several (default is 5) ARP/REFMAC update/refinement cycles. At the end of each ‘big’ cycle the map is interpreted anew -a completely new polypeptide model is constructed with hopefully more residues in less fragments. This whole procedure is iterated (default is 10 times).
The output of warpNtrace is a set of refined polypeptide fragments. If the sequence is available, the traced fragments will be docked in sequence and side chains will be built during the iterative refinement procedure. Loops will be built during the procedure, if possible. After the last building cycle the fragments will be arranged to form a globular structure (or, for a case of NCS, several NCS-related structures). The remainder of the structure (cis-prolines, poorly ordered loops and terminal residues for each fragment) will have to be completed by the user manually. Since the output model is refined, its accuracy is expected to be comparable to the one of the final refined structure. Mis-tracing (incorrect tracing of polypeptide fragments) is not impossible but should normally not exceed 1 % of the whole structure with X-ray data to about 2.5 Å resolution or higher. An estimate of the correctness of the model is printed after every model building cycle, e.g.:
% Chains 12, Residues 434, Estimated correctness of the model 99.1 %
Application (c) includes no model building but still may provide improvement of the density map. The map is first interpreted as a pseudo protein model, consisting of unconnected free atoms. This model is then refined and updated with iterative cycles of ARP/REFMAC.
Below is the application (a) is described in detail, input to applications (b) and (c) is very similar and should be straightforward to figure out.
The default is not to use Rfree, since the number of traced residues serves as excellent indicator of the success of the job. You can turn the use of Rfree on but the authors have seen marginal cases (low resolution and hence low observation-toparameter ratio) when this adversely affected the model building.
o Se-Methionine -If you have Se-methionine substituted protein, regardless of the use of the refinement function, you can check the box thus asking ARP/wARP to build and refine Se-Met residues and use these for better refinement results.
• Now you are ready to start the job: Click on Run and choose Run now
There is a number of additional parameters that you normally should not worry about. A brief description is given below.
o Checking this button will activate remote submission. This is described below in a separate chapter.
The script auto_tracing.sh in the $warpbin directory allows running the automated model building from the command line without the use of the GUI. The use of auto_tracing.sh is fairly simple. If invoked without arguments the script will print help information.
Usage:
$warpbin/auto_tracing.sh \ datafile {mtzfile} \ [residues {number_of_residues_in_AU}] \ [workdir {FULLPATH_WORKING_DIRECTORY}] \ [fp {fp_label}] [sigfp {sigfp_label}] [freelabin {freer_label}] \ [fbest {weighted_amplitude_label}] [phibest {phibest_label}] [fom {fom_label}] \ [modelin {input_PDB_file_to_use_as_initial_model}] \ [seqin {sequence_file_for_one_NCS_copy}] \ [cgr {number_of_NCS_copies (if seqin is provided, default is 1) }] \ [buildingcycles {the_number_of_autobuilding_cycles (default is 10) }] \ [resol {'rmin rmax' (default is the full resolution range) }] \ [albe {1 to_always_invoke_albe, default is 0 for resol < 2.7A, else 1) }] \ [restraints {1 to use conditional restraints, default is 1 }] \ [twin {1 to try de-twining and twin refinement, default is 0 }] \ [sad {1 to turn on the SAD function refinement, \
needs also 'wavelength' and 'heavyin' on input, default is 0 }] \ [compareto {PDB_file_for_comparison}] \ [parfile {parfilename_if_only_parfile_is_to_be_created}] \
For start from phases: fp/sigfp/phibest/fom or fbest/sigfp/phibest to build initial free-atoms model and fp/sigfp to refine the model If 'fbest' is given, 'fom' will be ignored
For start from a model: fp/sigfp to refine the model
Additional useful tips:
Required keyword is: datafile (followed by the mtz-file name with the full path).
Optional keywords include: residues (followed by the number of residues), workdir (followed by the absolute path to the working directory), fp (followed by the fp label), sigfp (followed by the sigfp label), freelabin (followed by the Rfree label), fbest (followed by the label for the fom-weighted structure factor amplitudes to be used for initial map calculation), phibest (followed by the best phi label), fom (followed by the figure of merit label), modelin (followed by a starting pdb-file with the full path), seqin (followed by a sequence-file name with the full path), cgr (followed by a number of NSC-related copies), buildingcycles (followed by the number of building cycles), resol (followed by the resolution limit), albe (followed by the flag to enable or not helix/strands building), similarly for restraints, twin and sad. There are additional parameters, which can be castomised, and an experienced user should have no problem in figuring out how to do this. Alternatively, please contact the ARP/wARP developers for advice.
If auto_tracing.sh is called with an option ‘parfile’, the script will create a parameter file and a directory in the workdir whose name will be printed. The job can subsequently be launched by:
% $warpbin/warp_tracing.sh NAME_OF_PARFILE
If auto_tracing.sh is called without an option ‘parfile’, it will also launch the job. The log files and additional output files as well as the building results can be found in the directory created.
This option offers you the following possibilities:
a) Your model building will run using external computational facilities, where the CPU performance may be superior to your local installation.
b) You can be assured that the most recent working executables will be used, should you have a problem with your local installation.
c) Should the task crash, an automatic notification will be forwarded to the ARP/wARP developers who can then promptly help you (unless you have declared your task to be confidential, see below).
d) Upon your wish you can share the results of the completed task with software developers.
• Submitting from the GUI.
Clicking on the button with "Submit the job for remote execution at the Hamburg cluster" within the main ARP/wARP Classic GUI panel allows one to execute an autotracing task remotely. The panel will expand and ask for an email address to be provided. Please also choose one of the options from the drop down menu to indicate how you would like your data to be handled.

The options are:
a) the data must be kept confidential and deleted after the job has finished b) the data can be made available to ARP/wARP-AutoRickshaw-Refmac developers c) the data can be archived and made available to any software developer that requests them (this is default)
Option (b) will only allow the data share to the ARP/wARP, Auto-Rickshaw and Refmac development teams. Option (c) will extend the share to anyone who requests the data. In case of option (a) only the short log file, Wilson log files and the parameter file will be kept by the ARP/wARP developers, all other data and log files will be automatically deleted after the job has finished.
Once the job has been submitted for remote execution, the GUI window will indicate that the job has finished. Please inspect the log file from the pull-down menu option "View files from job" for further instructions. An email will be sent to you at the email address that you entered in the GUI window. Please follow the instructions in the email (http link, login and password) to connect to the Hamburg cluster. You can then monitor the log file in your browser window. As soon as the job is finished, you will be provided with a link to the results that you can then download. Keep in mind that once the job is finished, your data will be kept for one week only. Make sure that you download your data within that time.
The remote job submission relies on the curl software installed at your site. Availability of curl is checked while installing ARP/wARP and a warning is given if curl is not available.
• Submitting from a web browser.

Keep in mind that once the job is finished, your data will be kept for one week only. Make sure that you download your data within that time.
expect a job for a structure of 500 residues to be completed within about 1 hour (subject to the power of the computer you are using).

This module remains experimental. It still has the same aims as ARP/wARP Classic: to automatically build protein structures, starting either from molecular replacement models or experimental electron density maps. The way that information is presented to the user, and some refinement choices are different than the 'Classic' module, but the functionality is similar.
A main difference is that this module, when a model is 'more or less complete', it will use the typically available second CPU core to start a new job to clean up the model, add waters, and refine it, and make it available to the user. In parallel, the old job will continue to see if it can find a better solution, with more residues, but the user does not need to wait for that to finish.
Another difference concerns the Sequence file. If you have hetero-multimers in the asymmetric unit of your crystals, you should add each sequence separately, by clicking the Add Input PIR file button. Then, you can define any stoichiometry for complicated hetero-multimers. For each defined sequence the user can select from a pull-down menu the number of copies in the asymmetric unit. Based on that and the contents of the PIR file the contents of the AU in residues will be calculated automatically.
The input files are identical to the ARP/wARP Classic module.
There is a dedicated option to select that the Methionines are Se-Met residues if the dataset comes from a SAD or MAD experiment on a selenium edge; the SAD and TWIN functions, new in
7.1 are implemented in this module as well.
The Decision parameters are where the innovative choices for controlling ARP/wARP are given. The number of refinement and building cycles are not fixed, but are defined on the fly based on the program’s progression. The Decision parameters are defining these limits. If you leave the mouse over one of the input fields, a help text will appear explaining the use of each decision parameter.
The parameter maximum number of processes in parallel is important and is briefly explained below. When Flex-wARP decides that it has reached a more-or-less useful model, it will spawn a 'cleaning up and completion' process. However it will continue the iterative building in parallel. If the iterative building results in a better model, a new 'cleaning up and completion' process will be requested, possibly before the previous 'cleaning up and completion' process has finished. If you have only two processors (typical these days in dual core systems) the new process will be 'queued'; when the previous one is finished the new one will start.
We emphasise that the Expert System module is still ‘experimental’. We hope that in the future we will be able to offer more tricks and tips.
Running flex-wARP from command line, CAutoPyWARP.pyc
Please type to get on-line help:
python $pywarpbin/CAutoPyWARP.pyc --help

The procedure for building secondary structural elements in ARP/wARP version 7.1 is based on the use of discriminant analysis in a successive filtering scheme taking into account the geometry of alpha-helical and beta-stranded main-chain fragments. The electron density map is first analysed and a suitable threshold is automatically selected. In the next step stereochemical information on the helix and strand geometry is used; sets of overlapping fragments are constructed and filtered based on their geometric likelihood. All fragments that overlap at a particular location of a helix or a strand undergo an ensemble averaging process to provide the best estimate of CA positions. The output fragments are then regularised and the chain direction is chosen on the basis of their fit to the density. Finally the fragments are refined in real space.
The accuracy of the resulting model depends on many parameters. The module should be able tobuild helices and strands at resolutions as low as 4.5 Å. However, it may not result in complete helical/stranded structure and it may also contain parts that are mis-interpreted. The expected top performance is the correct location of 90% of the helices and 50% of the strands. The procedure is relatively fast and takes only a few minutes for proteins of moderate size (up to 500 residues).
The secondary structure recognition module is optimised to address lower resolution data and hard cases where, e.g. the straight model building protocol (Classic or flex-wARP) has not been successful. For a resolution higher than 2.6 Å the module will automatically trim the resolution and Wilson B-factor of the data to approach its design conditions.
• Parameters:
• Now you are ready to start the job: Click on Run and choose Run now
There is a number of additional parameters that you normally should not worry about. A brief description is given below
QUITTING … ARP/wARP module stopped with an error message: name_of_the_program indicated that one of the modules of the task has terminated with an error message. Please refer to the specified log file.
The script auto_albe.sh (where ‘albe’ stands for alpha-beta) in the $warpbin directory allows you to run the secondary structure building as a single-line command without the use of the GUI. The use of auto_albe.sh is fairly simple. The script prints out help information if it is invoked without arguments.
Usage:
$warpbin/auto_albe.sh \ datafile {mtzfile} \ [residues {number_of_residues_in_AU}] \ [workdir {FULLPATH_WORKING_DIRECTORY}] \ [helixfileout {output_PDB_file}] \ [jobId {desired_job_id_used_for_subdirectory_naming}] \ [fp {fp label} sigfp {sigfp label} phib {phi label}] \ [fom {fom label}] (input 'fom none' if no fom is to be used) \ [compareto {PDB_file_for_comparison}] \ [nostrands {0 or 1, default=0}] \ [parfile {parfilename_if_only_parfile_is_to_be_created}]
Required keyword is: datafile (followed by the mtz-file name with the full path).
Optional keywords include: residues (the expected number of residues in the asymmetric unit), workdir (followed by the full path to the working directory), helixfileout (the name of the PDB file where the traced both helical and stranded fragments will be output to), jobId (if you wish that the working sub-directory has a particular name), fp (followed by the fp label), sigfp (followed by the sigfp label), phib (followed by phibest label) and fom (followed by the label to fom). The defaults are FP, SIGFP, PHI and FOM, respectively. Alternatively, if the mtz file contains only one column for structure factor amplitudes and only one column for their standard deviations, these will be taken. If you wish FOM not to be used, please input ‘fom none’. For test purposes, the constructed helices/strands can be compared to known reference models (hand-or pre-fitted). The required keyword is compareto (followed by the full-path name of a PDB file). You can also enable/disable the construction of strands using the keyword nostrands, the default is 0 (build the strands).
If auto_albe.sh is called with an option ‘parfile’, the script will create a parameter file and a directory in the workdir whose name will be printed. The job can subsequently be launched by:
% $warpbin/warp_albe.sh NAME_OF_PARFILE
If auto_albe.sh is called without an option ‘parfile’, it will also launch the job. The log files and additional output files as well as the building results can be found in the directory created.

This side-chain building module can be used either to try and improve the density fit of existing side chains, or in the absence of side chains to perform side-chain docking and build the side chains from scratch:
There are a number of options that can be added. A brief description is given below.

This module tries to find likely loops to connect fragments of a partial protein structure based on the sequence and the density map. It builds the loops in three phases. First a tree of possible CAs between the fragments is build, next the unlikely ones are removed and the rest of the main chain atoms determined, and finally the best loops are selected. The tree can be build either towards the C-terminus of the N-terminus of the protein, or both. The built loops are ordered (in descending order) according to the density correlation at the main chain atoms (including CB if present) or the correlation of the side chains, or a combination of both. If the number of loops exceeds the chosen number only the best are saved to file.
There are a number of options that can be added. A brief description is given below.
Automated Building of Poly-Nucleotides Running nucleotide building from the GUI, ARP/wARP DNA/RNA

This is a prototype module for building fragments of DNA or RNA. The input is an MTZ file containing the phases from which the map best describing the nucleotide region can be computed. Thus the map could be a difference map (e.g. after the protein model is completed) or a sigma-weighted map for the whole asymmetric unit. The nucleotide building procedure within ARP/wARP Version 7.1 proceeds in several steps: first it locates putative phosphates in the density map, then uses them in a manner analogous to the CA-candidates for protein chain tracing. After the nucleotide fragments are obtained, a likely base is built and refined in real space. The type of the base is currently limited to A (large) or C (small) and the nucleotide sequence is not yet used.
The produced poly-nucleotides are quite accurate, a typical rmsd for the built backbone atoms is
0.6 Å with X-ray data extending to around 3.0 Å resolution. The method is not sensitive to a particular DNA or RNA conformation. The module is not very CPU efficient and may take about 10 minutes for a 20-nucleotide structure.
There are a number of options that can be added. A brief description is given below.
• OUTPUT files, short Log File:
QUITTING … ARP/wARP module stopped with an error message: name_of_the_program indicated that one of the modules of the task has terminated with an error message. Please refer to the specified log file.
Running nucleotide building from the command line, auto_nuce.sh
The script auto_nuce.sh in the $warpbin directory allows you to run the secondary structure building as a single-line command without the use of the GUI. The use of auto_nuce.sh is fairly simple. The script prints out help information if it is invoked without arguments.
Usage:
$warpbin/auto_nuce.sh \ datafile {mtzfile} \ [residues {number_of_protein_residues_in_AU}] \ [nucleotides {number_of_nucleotides_in_AU}] \ [workdir {FULLPATH_WORKING_DIRECTORY}] \ [fp {fp_label}] [sigfp {sigfp_label}] [fbest {weighted_amplitude_label}] \ [phib {phib_label}] [fom {fom_label}] \ [resol {'rmin rmax' (default is the full resolution range) }] \ [compareto {PDB_file_for_comparison}] \ [parfile {parfilename_if_only_parfile_is_to_be_created}] \
Required keyword is: datafile (followed by the mtz-file name with the full path).
Optional keywords include: residues (the expected number of residues in the asymmetric unit), nucleotides (the expected number of nucleotides in the asymmetric unit), workdir (followed by the full path to the working directory), fp (followed by the fp label), sigfp (followed by the sigfp label), phib (followed by phibest label) and fom (followed by the label to fom). The defaults are FP, SIGFP, PHI and FOM, respectively. Alternatively, if the mtz file contains only one column for structure factor amplitudes and only one column for their standard deviations, these will be taken. If you wish FOM not to be used, please fbest. You can set resol (followed by the resolution limit). For test purposes, the constructed model can be compared to known reference model. The required keyword is compareto (followed by the full-path name of a PDB file).
If auto_nuce.sh is called with an option ‘parfile’, the script will create a parameter file and a directory in the workdir whose name will be printed. The job can subsequently be launched by:
% $warpbin/warp_nuce.sh NAME_OF_PARFILE
If auto_nuce.sh is called without an option ‘parfile’, it will also launch the job. The log files and additional output files as well as the building results can be found in the directory created.
Automated Ligand Building Running ligand building from the GUI, ARP/wARP Ligands

The ligand building procedure within ARP/wARP Version 7.1 proceeds in three steps: first it locates the binding site in the difference density map, then builds there a number of putative ligand models and, finally, selects the best model, which is geometrised and real-space fit into the density. The binding region is selected automatically by matching ligand’s shape-related properties to the regions of high density. The chosen region is parameterised by a sparse set of putative positions (grid nodes) for the ligand atoms. The stereochemical information and van der Waals repulsions in combination with the electron density allows one to obtain a suitable estimate of the position, orientation and conformation of the ligand. For the construction of the ligand into this sparse set two algorithms are used. One algorithm exploits the combinatorial assignment of the ligand atom identities to the grid nodes, ‘label swap’. Another algorithm maximises the overlap between the sparse set and the ligand model by a random search in conformational space. The output from both algorithms undergoes a last stage of real-space refinement before the final model is selected.
The accuracy of ligand building is mainly dependent on ligand size and the resolution of the X-ray data. As a rough guide, about 75% of well-ordered ligands of a size up to 20 non-hydrogen atomsshould be built within r.m.s.d. of 1.0 Å from their correct location. For ligands that are larger in size, such ‘success rate’ decreases to about 50%. With the r.m.s.d. of 1.0 Å or less the constructed models should be accurate enough for REFMAC5 to straightforwardly refine the protein-ligand complex. The procedure can be iterated to locate additional ligands, if any are present.
The ARP/wARP ligand building module requires the X-ray data (in MTZ format), the built protein without ligands (in PDB format) and a template model of the ligand to build (in PDB format). Options include the possibility to specify the binding site and the number of starting grids, the ability to compare the run result to some reference ligand(s), and the possibility to build a ligand taken from a list of candidates ('cocktail'). In the latter case the coordinates of the ligand candidates should be concatenated into a single PDB file. The different ligands must be distinguished by their residue name (columns 18-20), chain identifier (column 22) or residue sequence number (columns 23-26). ARP/wARP will automatically choose the best-matching ligand candidate and will attempt to build it at the binding site, either determined automatically or supplied by the user. However, since this feature is new, the specification of the binding site (see below) is recommended.
There are a number of options that can be added either in the main GUI panel (scrolling bar Build the ligand) or under the Parameters section. You normally should not need to worry about these (except you want the ligand to be build around the known location or you would like to screen a list of candidates, ‘ligand cocktail’). A brief description is given below.
• Optional parameters:
o Build the ligand (Binding site location) In the most likely place of the complete asymmetric unit (default) around the same approximate place as a previous ligand -The binding site is
defined by the position of a compound known to bind at the desired location. If you use this option, the region is required in form of a PDB file, (previous ligand coordinates).
around an approximate XYZ position -The binding site is defined by (X, Y, Z)Cartesian coordinates and a search radius in Å (option Search for the ligand around).
of cycles. If this matters for large ligands you can set the number of ligand building cycles to 1.
QUITTING … ARP/wARP module stopped with an error message: name_of_the_program indicated that one of the modules of the task has terminated with an error message. Please refer to the specified log file.
Running ligand building from the command line, auto_ligand.sh
The script auto_ligand.sh in the $warpbin directory allows you to run the ligand building as a single-line command without the use of the GUI. The use of auto_ligand.sh is fairly simple. The script prints out help information if it is invoked without arguments.
$warpbin/auto_ligand.sh \ datafile {either mtzfile or mapfile} \ protein {starting_PDB_file_without_ligand} \ ligand {PDB_file_with_ligand_to_fit} \
[workdir {FULLPATH_WORKING_DIRECTORY}] \ [ligandfileout {output_PDB_file}] \ [fp {fp_label}] [sigfp {sigfp_label}] [freer {freer_label}] \
[nligandcycles {number_of_ligandbuild_cycles (default is 2)}] \ [search_model {PDB_file_with_model_at_expected_ligand_site}] \ [search_position {X Y Z}] \ [search_radius {radius_in_angstroms}] \ [reflist {textfile_with_FULLPATHnames_of_fitted_ligands_for_comparison}] \ [extralibrary {user_defined_library_for_Refmac5}] \ [parfile {parfilename_if_only_parfile_is_to_be_created}]
Required keywords are: datafile (followed by the mtz-file name with the full path), protein (followed by the pdb-file name of the protein model without the ligand with the full path) and ligand (followed by the pdb-file containing the ligand(s) description with the full path).
Optional keywords include: workdir (followed by the full path to the working directory), fp (followed by the fp label), sigfp (followed by the sigfp label). The defaults are FP and SIGFP, respectively. Alternatively, if the mtz file contains only one column for structure factor amplitudes and only one column for their standard deviations, these will be taken. The number of ligand building cycles (default is 2) can be changed with keyword nligandcycles. The approximate location of the binding site can be supplied by the user either by providing the pdb-file(s) of a ligand (or a just a list of atoms) located at the binding site (search_model), or by specifying the (XYZ) coordinates of a point defining the binding region using search_position and search_radius (default value for the latter is 5 Å). For test purposes, the constructed ligand can be compared to known reference models (hand-or pre-fitted). The required keyword is reflist (followed by the full-path name of a text file, containing a list of pdb-files with the reference ligands and their absolute paths). A user-defined ligand library can be input using keyword extralibrary.
To build the ligand from a list of candidates ('cocktail'), the coordinates of the ligand candidates should be concatenated into one file specified by the above mentioned keyword ligand. The different ligands must be distinguished by their residue name (columns 18-20) in the concatenated pdb file (different chain identifier or residue sequence number will do as well, however we recommend to use different residue names). ARP/wARP will automatically choose the best-matching ligand candidate and will attempt to build it at the binding site, either determined automatically or supplied by the user. However, since this feature is new, supplying the binding site using search_model or search_position keywords is recommended.
If auto_ligand.sh is called with an option ‘parfile’, the script will create a parameter file and a directory in the workdir whose name will be printed. The job can subsequently be launched by:
% $warpbin/warp_ligand.sh NAME_OF_PARFILE
If auto_ligand.sh is called without an option ‘parfile’, it will also launch the job. The log files and additional output files as well as the building results can be found in the directory created.
Automated Solvent Building Running solvent building from the GUI, ARP/wARP Solvent

Within solvent building module restrained reciprocal space refinement is carried out with REFMAC while ARP/wARP is performing automatic adjustment of the solvent structure. Resolution of thedata should be 2.5 Å or higher. The output is the protein model with the solvent molecules transformed with symmetry operations to lie around the protein.
The ARP/wARP solvent building module requires the X-ray data (in MTZ format) and the protein model (in PDB format) without solvent or with a partial solvent model.
There are a number of options that can be added. A brief description is given below.
QUITTING … ARP/wARP module stopped with an error message: name_of_the_program indicated that one of the modules of the task has terminated with an error message. Please refer to the specified log file.
Running solvent building from command line (auto_solvent.sh)
The script auto_solvent.sh in the $warpbin directory allows you to run the solvent building as a single-line command without the use of the GUI. The use of auto_solvent.sh is fairly simple. The script prints out help information if it is invoked without arguments.
$warpbin/auto_solvent.sh \ datafile {mtzfile} \ protein {starting_PDB_file} \ [workdir {FULLPATH_WORKING_DIRECTORY}] \ [solventfileout {output_PDB_file}] \ [fp {fp_label}] [sigfp {sigfp_label}] [freer {freer_label}] \ [restrcyc {number_of_cycles (default is 20) }] \ [extralibrary {user_defined_library_for_Refmac5}] \ [tlsin {fixed pre-refined TLS tensors from Refmac5}] \ [parfile {parfilename_if_only_parfile_is_to_be_created}]
Required keywords are: datafile (followed by the mtz-file name with the full path) and protein (followed by the pdb-file name of the protein model with the full path).
Optional keywords include: workdir (followed by the full path to the working directory), solventfileout (followed by the name of the PDB file where the output will be written), fp (followed by the fp label), sigfp (followed by the sigfp label) and freer (followed by the Rfree label). The defaults for the first two are FP and SIGFP, respectively. Alternatively, if the mtz file contains only one column for structure factor amplitudes and only one column for their standard deviations, these will be taken. The number of cycles (default is 20) can be changed with keyword restrcyc. The user-defined library and the tls-tensor for Refmac can be supplied by using the keywords extralibrary and tlsin.
If auto_solvent.sh is called with an option ‘parfile’, the script will create a parameter file and a directory in the workdir whose name will be printed. The job can subsequently be launched by:
% $warpbin/warp_solvent.sh NAME_OF_PARFILE
If auto_solvent.sh is called without an option ‘parfile’, it will also launch the job. The log files and additional output files as well as the building results can be found in the directory created.
ARP/wARP molecular graphics: ARP Navigator
Running small ARP/wARP tasks and viewing models and density maps
The proto-version of the graphical front-end to ARP/wARP Version 7.1 is an OpenGL/X-window based graphics program that can be launched by pressing the GUI ‘ARP Navigator’ button. The program can also be started from the command line by typing ‘arpnavigator’.

Mouse and Keyboard functions Rotation
Translation
Scaling
Clip planes
Map contouring
Map extent
• Left mouse button + e-key pressed and mouse moved up-down/left-right (size increases right/up).
Menus
The keyboard alone can have the following functions:
The ArpNavigator Menu
The Files Menu
Note: When a file is loaded and put on display, there will be little buttons appearing in the bottom left corner representing each of the graphical objects. Only one object can be active at a time.
An object can be made active by clicking on the button with the left mouse button. A little eye symbol shows whether this object is currently on display or if it's hidden. Clicking with the right mouse button on this button will pull out the mini-menu with actions applied to this object only (see also Mini menu).
The Mini-Menus
Each object loaded has its own menu of selected actions:
The Tasks Menu
Note: In difference to the functionality offered from the CCP4 GUI, the above two tasks will also accept density maps as input.
• Model Solvent: This runs the solvent building module of ARP/wARP.
The Display Menu
The Options Menu
Note: Screenshots will only produce files in uncompressed bmp-format.
The Help Menu
• Help Screen: When clicking on this item, a text view window pops up that contains this help text.
The quick actions
When the right mouse button is pressed with no movement, then a green button box is displayed that contains functionalities to be applied 'ad-hoc' and with no input dialog.
Dialog windows
The goto-atom dialog expects that atoms are specified as e.g. CA/123/A for the CA atom of residue 123 in chain A. Just specifying CA/123 means the first occurrence of CA in residue 123. Specifying /123/ means the first atom in residue 123. Typing //Z will be interpreted as the first atom of chain Z. The program will centre on the atom if found. In case the atom cannot be found, the dialog gets coloured in pink.
Chapter 4. Additional Remarks
Quality of the X-ray Data
The X-ray data should be as complete as possible, especially in the low resolution range (5 Å and lower). Ideally the X-ray data should have no low resolution cutoff. If the low resolution strong data are systematically incomplete (e.g. missing or overloaded reflections), the density map, even in the case of a good model, may be discontinuous and inconsistent with the model. Because ARP/wARP involves updating on the basis of density maps, such discontinuity can lead partially to slow convergence or even non-interpretable maps.
ARP/wARP automatically checks the fit of your data to the expected Wilson plot and will report if necessary. If suggested to cut the data from the high resolution side -follow the suggestion. If suggested to cut the data from the low resolution side -do so but do not cut to a resolution below 8or 10 Å. If suggested to ignore all data or there are still other complaints after the cut -go and recollect/reprocess your data. The current version of the ARP/wARP Wilson plot check might be too stringent. Nevertheless the user is advised to visually inspect the Wilson plot and apply his/her critical judgment whether or not the data should be cut. It has sometimes proved beneficial to cut the data which were flagged as poor, though in some cases the presence of these data were crucial for the model building.
Chapter 5. References
The most recent overview of the ARP/wARP can be found in:
• Langer, G., Cohen, S.X., Lamzin, V.S. & Perrakis, A. (2008) Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nature Protocols. 3, 1171-1179.
Applications are presented in:
For other publications please refer to the references therein or to the ARP/wARP web page.
Chapter 6. Author-Abuse Information and Acknowledgements
The authors to abuse of...
The Hamburg team (European Molecular Biology Laboratory (EMBL) Hamburg Outstation, c/o DESY, Notkestrasse 85, 22603 Hamburg, Germany):
Victor S. Lamzin (tel +49-40-89902-121, fax +49-40-89902-149, email: victor@embl-hamburg.de)
Ciaran Carolan
Helene Dörksen
Philipp Heuser
Gerrit G. Langer
Tim Wiegels
The Amsterdam team (Molecular Carcinogenesis Programme, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands):
Anastassis Perrakis (tel. +31-20-512-1951, fax +31-20-512-1954, email: a.perrakis@nki.nl)
Krista Joosten
Robbie Joosten
Former members
Serge X. Cohen
Guillaume X. Evrard
Francisco Fernandez
Johan Hattne
Marouane Ben Jelloul
Matheos Kakaris
Olga V. Kirillova
Wijnand Mooij
Richard J. Morris
Parthasarathy Venkataraman
Tilo Strutz
Petrus H. Zwart
The authors are especially grateful to:
Appendix A. Example output from install.sh on a Mac
ARP/wARP installer is checking your c-shell... c-shell is installed on your machine at /bin/csh Your login shell is: /bin/tcshChecking permissions for /dev/null -OK Checking availability of sed command -OK Checking availability of tail command -OK Checking availability of awk command -OK Checking decimal separator -OK Checking ARP/wARP directory path -OK Checking ARP/wARP directory structure -OK Checking java installation - installed version is “1.5.0_19” Checking java version number -OK Checking python installation - installed version is 2.5.1 Checking python version number -OK Python executables are available in /Users/testuser/arp_warp_7.1/byte-code/python-2.5 Checking operating system name -Darwin Checking processor type -i386 ARP/wARP version 7.1 executables for this platform are available in /Users/testuser/arp_warp_7.1/bin/bin-i386-Darwin
Installing script and data files for: bin-athlon-Linux bin-i386-Darwin bin-i686-Linux bin-ia64-Linux bin-powerpc-Darwin bin-x86_64-Linux
Checking CCP4 & ARP/wARP installation -OK Checking refmac5 installation - installed version is 5.5.0101 Checking refmac5 version number -OK
Testing the possibility of remote job submission - please wait...
curl seems to be able to communicate with the cluster at EMBL-Hamburg The remote job submission of ARP/wARP 7.0.1 from CCP4i should work
=========================================================================== Remember that you have accepted the terms of the license agreement when you downloaded ARP/wARP 7.1. Proceeding with installation reinforces your acceptance of the license agreement A main obligation of this agreement is that any reference to the software for crystallographic computations will cite one or more ARP/wARP publications as set forth in the manual and on http://www.arp-warp.org ===========================================================================
*** IMPORTANT FOR SETUP *** Do not forget to add this line in your .cshrc file: source /Users/testuser/arp_warp_7.1/arpwarp_setup.csh
*** INSTALLATION OF ARP/wARP 7.1 HAS BEEN SUCCESSFUL ***
An attempt will be made to install the graphical user interface automatically. Checking availability of tclsh command -OK The $CCP4I_TOP dir has an ARP/wARP GUI, which is owned and is writeable by you (the current user) (version is 7.0 or higher). Uninstalling previous GUI Installing GUI for ARP/wARP 7.1 CCP4 interface version detected is : 2.0.4 Using tarball ARP_wARP_CCP4I6.tar.gz (for ccp4 version 6 and newer).
*** INSTALLATION OF ARP/wARP 7.1 GUI HAS BEEN SUCCESSFUL ***