Phaser 2.1 (CCP4: Supported Program)
Phaser is crystallographic software for phasing macromolecular crystal structures
with maximum likelihood techniques. It is available through the Phenix and CCP4 software
suites, and directly from the authors.
Most people will not need to read this documentation to solve their structure! To solve
a structure by molecular replacement go to Automated
Molecular Replacement and copy and edit
the command script. Similarly, for SAD phasing, go to Automated Experimental Phasing and copy and edit the
command script.
Other good sources of information are found in Frequently Asked Questions and Top Tips
Index to Documentation
1. Introduction
2. Molecular Replacement Modes
3. Experimental Phasing Modes
4. General Modes
5. Keyword Input
6. XML Output (for developers)
7. Python Scripting (for developers)
8. Version History
9. References
1. Introduction
This is the documentation for Phaser2.1. There
are some changes between this version and previous versions
so input scripts may need editing.
1.1 Modes
Phaser runs in different modes, which perform Phaser's different functionalities.
Modes can either be basic modes or modes that combine the functionality of basic modes.
The mode of operation is controlled with the MODE keyword
| Functionality |
Mode |
Description |
| Anisotropy Correction |
ANO |
Corrects data for anisotropic diffraction, and makes the intensity distribution isotropic |
| Cell Content Analysis |
CCA |
Calculates the expected number of molecular assemblies of given molecular weight in the unit cell using the Matthews coefficient |
| Normal Mode Analysis |
NMA |
Perturbs a structure in rms deviation steps along combinations of normal modes |
| Automated Molecular Replacement |
MR_AUTO |
Combines anisotropy correction, cell content analysis, fast rotation and translation functions, and refinement and phasing to automatically solve a structure by molecular replacement |
| Fast Rotation Function |
MR_FRF |
Anisotropy correction and likelihood-enhanced fast rotation function calculated with Fast Fourier Transform |
| Fast Translation Function |
MR_FTF |
Anisotropy correction and likelihood-enhanced fast translation function calculated with Fast Fourier Transform |
| Brute Rotation Function |
MR_BRF |
Anisotropy correction and full likelihood rotation function calculated in a brute-force search of angles |
| Brute Translation Function |
MR_BTF |
Anisotropy correction and full likelihood translation function calculated in a brute-force search of positions |
| Packing |
MR_PAK |
Tests molecular replacement solutions to see whether they pack into the unit cell without overlap |
| Log-Likelihood Gain |
MR_LLG |
Anisotropy correction and re-scoring of molecular replacement solutions with the full likelihood target function |
| Refinement and Phasing |
MR_RNP |
Anisotropy correction and optimization of the orientation and position of molecular replacement models with the full likelihood target function |
| Automated Experimental Phasing |
EP_AUTO |
Combines anisotropy correction, cell content analysis, and SAD phasing to automatically solve a structure by experimental phasing |
| SAD Phasing |
EP_SAD |
Refines atoms using the SAD likelihood function, and completes the structure with log-Likelihood gradient maps |
1.2 Keyword Index
1.3 Tutorials and Example Files
The example scripts all refer to the tutorial test cases. The pdb, sequence and
mtz files required to run the tutorials are distributed with Phaser.
- BETA-BLIP
- The crystal structure
of a hetero-dimer of beta-lactamase (BETA) and beta-lactamase inhibitor protein
(BLIP), both with molecular replacement models from crystal structures of
the individual BETA and BLIP components. We thank Mike James and Natalie Strynadka for the diffraction data. Reference:
Strynadka, N.C.J., Jensen, S.E., Alzari, P.M. & James. M.N.G. (1996) Nat. Struct. Biol. 3 290-297.
- Insulin
- The crystal structure of insulin phased on intrinsic sulphurs. We thank Paul Adams for the diffraction data.
Reference: Adams (2001) Acta Cryst D57. 990-995.
1.4 Bug Reports
2. Molecular Replacement Modes
Phaser should be able to solve most structures with the Automated
Molecular Replacement mode, and this is the first mode that you should try.
Give Phaser your data (How to Define Data) and your
models (How to Define Models), tell Phaser what to
search for (use SEARch
keyword), and a list of possible spacegroups (in the same point group - use the
SGALternative
keyword). The flow diagram for the automated molecular replacement mode is shown
below. If this doesn't work (see "Has Phaser Solved It?"),
you can try selecting peaks of lower significance
in the rotation function in case the real orientation was not within the selection
criteria. By default peaks above 75% of the top peak are selected (see "How
to Select Peaks"). See "What to do in difficult
cases" for more hints and tips. If the automated molecular replacement
mode doesn't work even with non-default input you need to run the modes of Phaser
separately. The possibilities are endless - you can even try exhaustive searches
(translations of all orientations) if you want - but experience has shown that
most structures that can be solved by Phaser can be solved by relatively simple
strategies.
2.1 Automated Molecular Replacement
MODE
MR_AUTO combines the anisotropy correction, likelihood enhanced
fast rotation function, likelihood enhanced fast translation function, packing
and refinement modes for multiple search models and a set of possible spacegroups
to automatically solve a structure by molecular replacement. Top solutions
are output to the files FILEROOT.sol,
FILEROOT.#.mtz and FILEROOT.#.pdb
(where "#" refers to the sorted solution number, 1 being the best, and only 1 is output by default). Many structures
can be solved by running an automated molecular replacement search with
defaults, giving the ensembles that you expect to be easiest to find first.
Example command script for finding BETA and BLIP. This is the minimum input,
using all defaults (except the ROOT filename).
beta_blip_auto.com
phaser
eof
TITLe beta blip automatic
MODE MR_AUTO
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
SEARch ENSEmble beta NUM 1
SEARch ENSEmble blip NUM 1
ROOT beta_blip_auto # not the default
eof
Example command script for finding BETA and BLIP. The spacegroup recorded
on the mtz file is P3221 but the other hand is also a possibility.
Both search orders (BETA first, BLIP second and BLIP first, BETA second)
are tried, using the PERMutations ON keyword. We would not normally recommend
using the PERMutations ON keyword for this case, as it is obvious that the
larger molecule should be easier to find first. To speed up the calculation
only the top peak after the translation function is taken into refinement.
beta_blip_auto_sg.com
phaser
eof
TITLe beta blip automatic
MODE MR_AUTO
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
SEARch ENSEmble beta NUM 1
SEARch ENSEmble blip NUM 1
PERMutations ON # not the default
SGALternative HAND # not the default
ROOT beta_blip_auto_sg # not the default
FINA TRA SELEct NUM 1 # not the default
eof
Compulsory Keywords
Optional Keywords
Flow Diagram for Automated Molecular Replacement in Phaser
2.2 Has Phaser Solved It?
Ideally, only the number of solutions you are expecting should be found.
However if the signal-to-noise of your search is low, there will be noise peaks in the final selection also.
A highly compact summary of the history of a solution is given in the annotation of
a solution in the .sol file. This is a good place to start your analysis of the output. The annotation gives the Z-score of the solution at each
rotation and translation function, the number of clashes in the packing, and the refined LLG. You should see the TFZ (the translation function Z-score) is high at least for the final components of the solution, and that the LLG (log-likelihood gain) increases as each component of the solution is added. For example, in the case of beta-blip the annotation for the single solution output in the .sol file shows these features
SOLU SET RFZ=11.0 TFZ=22.6 PAK=0 LLG=434 RFZ=6.2 TFZ=28.9 PAK=0 LLG=986 LLG=986
SOLU 6DIM ENSE beta EULER 200.920 41.240 183.776 FRAC -0.49641 -0.15752 -0.28125
SOLU 6DIM ENSE blip EULER 43.873 80.949 117.141 FRAC -0.12290 0.29306 -0.09193
For a rotation function, the correct
solution may be in the list with a Z-score under 4, and will not be found
until a translation function is performed and picks out the correct solution.
For a translation function the correct solution will generally have a Z-score
(number of standard deviations above the mean value) over 5 and be well
separated from the rest of the solutions.
Of course, there will always be exceptions!
| TF Z-score |
Have I solved it? |
| less than 5 |
no |
| 5 - 6 |
unlikely |
| 6 - 7 |
possibly |
| 7 - 8 |
probably |
| more than 8 |
definitely* |
*Note, in particular, that in
the presence of translational NCS, pairs of similarly-oriented molecules
separated by the correct translation vector will give large Z-scores, even
if they are incorrect, because they explain the systematic variation in
intensities caused by the translational NCS.
You should always at least glance through the summary of the logfile. One thing
to look for, in particular, is whether any translation solutions with a
high Z-score have been rejected by the packing step. By default up to 10 clashes are allowed. Such a solution may be correct, and the
clashes may arise only because of differences in small surface loops. If
this happens, repeat the run allowing a suitable number of clashes. Note that, unless there is specific evidence in the logfile that
a high TF-function Z-score solution is being rejected with a few clashes, it is much better
to edit the model to remove the loops than to increase the number of allowed clashes. Packing criteria
are a very powerful constraint on the translation function, and increasing the number of allowed clashes
beyond the default will increase the search time enormously without the possibility of generating any
correct solutions that would not have otherwise been found.
2.3 What to do in difficult cases
Not every structure can be solved by molecular replacement, but the right
strategy can push the limits. What to do when the default jobs fail depends
on why your structure is difficult.
2.3.1 Flexible Structure
The relative orientations of the domains may be different in your crystal
than in the model. If that may be the case, break the model into separate
PDB files containing rigid-body units, enter these as separate ensembles,
and search for them separately. If you find a convincing solution for one
domain, but fail to find a solution for the next domain, you can take advantage
of the knowledge that its orientation is likely to be similar to that of
the first domain. The ROTAte AROUnd
option of the brute rotation search can be used to restrict the search to
orientations within, say, 30 degrees of that of the known domain. Allow
for close approach of the domains by increasing the allowed clashes with
the PACK
keyword by, say, 1 for each domain break that you introduce.
Alternatively, you could try generating a series of models perturbed by
normal modes, with the NMAPdb keyword. One of these
may duplicate the hinge motion and provide a good single model.
2.3.2 Poor or Incomplete Model
Signal-to-noise is reduced by coordinate errors or incompleteness of the
model. Since the rotation search has lower signal to begin with than the
translation search, it is usually more severely affected. For this reason,
it can be very useful to use the subsequent translation search as a way
to choose among many (say 1000) orientations. Try increasing the number
of clustered orientations in an AUTO job using the keyword FINAL,
e.g. FINAL ROT SELEct PERCent 65.
If that fails, try turning off the clustering feature in the save step (FINAL ROT STEP 2 CLUSter OFF),
because the correct orientation may sit on the shoulder of a peak in the
rotation function.
As shown convincingly by Schwarzenbacher et al. (Schwarzenbacher,
Godzik, Grzechnik & Jaroszewski, Acta Cryst. D60, 1229-1236,
2004), judicious editing can make a significant difference in the quality
of a distant model. In a number of tests with their data on models below
30% sequence identity, we have found that Phaser works best with a "mixed
model" (non-identical sidechains longer than Ser replaced by Ser).
In agreement with their results, the best models are generally derived using
more sophisticated alignment protocols, such as their FFAS protocol.
2.3.3 High Degree of Non-crystallographic Symmetry
If there are clear peaks in the self-rotation function, you can expect orientations
to be related by this known NCS. Methods to automatically use such information
will be implemented in a future version of Phaser. In the meantime, you can
work out for yourself the orientations that would be consistent with NCS and
use the ROTAte AROUnd
option to sample similar orientations. Alternatively, you may have an oligomeric
model and expect similar NCS in the crystal. First search with the oligomeric
model; if this fails, search with a monomer. If that succeeds, you can again
use the ROTAte AROUnd
option to force a subsequent monomer to adopt an orientation similar to the
one you expect.
2.3.4 Pseudo-translational Non-crystallographic Symmetry
It is frequently the case that crystallographic and non-crystallographic rotational
symmetry axes are parallel. The combination generates translational NCS, in
which more than one unique copy of the molecule is found in the same orientation
in the crystal. This can be recognized by the presence of large non-origin
peaks in the native Patterson map. If one copy of the search model can be
found, then the translational NCS tells you where to place another copy. Unfortunately,
the presence of translational NCS can make it difficult to solve a structure
using Phaser, because the current likelihood targets do not account for the
statistical effects of NCS. If there is a small difference in the orientation
of the two molecules (which will show up as a reduction in the height of the
non-origin Patterson peak as the resolution is increased), it may help to
use data to higher resolution than the default, because the translational
NCS is partially broken.
2.3.5 What not to do
The automated mode of Phaser is fast when Phaser finds a high Z-score solution to your problem. When Phaser cannot find
a solution with a significant Z-score, it "thrashes", meaning it maintains a list of 100-1000's of low Z-score potential solutions
and tries to improve them. This can lead to exceptionally long Phaser runs (over a week of
CPU time). Such runs are possible because the highly automated script allows many consecutive MR jobs to be run without
you having to manually set 100-1000's of jobs running and keep track of the results. "Thrashing" generally does not produce a solution:
solutions generally appear relatively quickly or not at all. It is more useful to go back and analyse your models and your data to see
where improvements can be made. Your system manager will appreciate you terminating these jobs.
It is also not a good idea to effectively remove the packing test. Unless there is specific evidence in the logfile that
a high TF-function Z-score solution is being rejected with a few clashes, it is much better
to edit the model to remove the loops than to increase the number of allowed clashes. Packing criteria
are a very powerful constraint on the translation function, and increasing the number of allowed clashes
beyond a few (e.g. 1-5) will increase the search time enormously without the possibility of generating any
correct solutions that would not have otherwise been found.
2.3.6 Other suggestions
Phaser has powerful input, output and scripting facilities that allow a large number of possibilities for altering default
behaviour and forcing
Phaser to do what you think it should. However, you will need to read the information in the manual below to take
advantage of these facilities!
2.4 How to Define Data
You need to tell Phaser the name of the mtz file containing your data and
the columns in the mtz file to be used using the HKLIn
and LABIn
keywords. Additional keywords (BINS
CELL OUTLier RESOlution
SPACegroup) define how the data are used.
2.5 How to Define Models
Phaser must be given the models that it will use for molecular replacement.
A model in Phaser is referred to as an "ensemble", even when it is described by a single file. This is because it is possible to provide a set of aligned homologous structures as an ensemble, from which a statistically-weighted averaged model is calculated. A molecular replacement model is provided either
as one or more aligned pdb files, or as an electron density map, entered as structure factors
in an mtz file. Each ensemble is treated as a separate type of rigid body
to be placed in the molecular replacement solution. An ensemble should only
be defined once, even if there are several copies of the molecule in the
asymmetric unit.
Fundamental to the way in which Phaser uses MR models (either from coordinates or maps) is to estimate
how the accuracy of the model falls off as a function of resolution, represented by the Sigma(A) curve.
To generate the Sigma(A) curve, Phaser needs to know the
RMS coordinate error expected for the model and the fraction of the scattering power in the asymmetric unit
that this model contributes.
If fp is the fraction scattering and RMS is the rms coordinate
error, then
Sigma(A) = SQRT{fp*[1-fsol*exp(-Bsol*(sin(theta)/lambda)2)]}
* exp{-(8 Pi2/3)*RMS2*(sin(theta)/lambda)2}
where fsol(default=0.95) and Bsol(default=300Å2) account for the effects
of disordered solvent on the completeness of the model at low resolution.
Molecular replacement models are defined with the ENSEmble
keyword and the COMPosition
keyword. The ENSEmble
keyword gives (amongst other things) the RMS deviation for the Sigma(A) curve.
The COMPosition
keyword is used to deduce the fraction of the scattering power in the asymmetric unit
that each ensemble contributes. The composition of the asymmetric unit is defined either by entering the
molecular weights or sequences of the components in the asymmetric unit,
and giving the number of copies of each. Expert users can also enter the
fraction of the scattering of each component directly, although the composition
must still be entered for the absolute scale calculation. Please note that the composition supplied to Phaser has to include everything in the asymmetric unit, not just what is being looked for in the current search!
3.5.1 Building an Ensemble from Coordinates
- You have one structure as a model with 44% sequence identity to the
protein in the crystal.
- ENSEmble
mol1 PDB homology1.pdb IDENtity
.44
- You have three structures as models with 44%, 39% and 35% identity to
the protein in the crystal.
- ENSEmble
mol2 PDB
homology1.pdb IDENtity .44 PDB
homology2.pdb IDENtity .39 PDB
homology3.pdb IDENtity .35
- You have an NMR Ensemble as a model. There is no need to split the coordinates
in the pdb file provided that the models are separated by MODEL and ENDMDL
cards. In this case the homology is not a good indication of the similarity
of the structural coordinates to the target structure. You should use
the RMS option; several test cases have succeeded where the ID was close to 100% with an RMS value of
about 1.5Å (see table below).
- ENSEmble
mol3 PDB nmr.pdb RMS
1.5
The RMS deviation is determined directly from RMS
or indirectly from IDENtity
in the ENSEmble
keyword using the formula RMS = max(0.8,0.4*exp(1.87*(1.0-ID))) where ID is the fraction identity.
The RMS deviation estimated from ID may be an underestimate of the true value if there is a slight conformational
change between the model and target structures. To find a solution in these cases it may be necessary to increase the
RMS from the default value generated from the ID, by say 0.5 Ångstroms. On the other hand, when Phaser succeeds in solving a structure from a model with sequence identity much below 30%, it is often found that the fold is preserved better than the average for that level of sequence identity. So it may be worth submitting a run in which the RMS error is set at, say, 1.5, even if the sequence identity is low. The table below can be used as a
guide as to the default RMS value corresponding to ID.
| Sequence ID |
RMS deviation |
| 100% |
0.80Å |
| 64% |
0.80Å |
| 63% |
0.799Å |
| 50% |
1.02Å |
| 40% |
1.23Å |
| 30% |
1.48Å |
| 20% |
1.78Å |
| --> limit 0% |
2.60Å |
If you construct a model by homology modelling, remember that the RMS error
you expect is essentially the error you expect from the template structure (if not worse!).
So specify the sequence identity of the template, not of the homology model.
2.5.2 Building an Ensemble from a Map
- You have low resolution electron density of your model. This density
has been cut out and converted to structure factors in a large cell.
- ENSEmble
mol1 HKLIn mol1.mtz F
= Fmol1 P = Pmol1 EXTEnt
23 25 29 RMS 2.0 CENTre
4 3 30 PROTein MW 10241 NUCLeic
MW 0
When using density as a model, it is necessary to specify both the extent
(x,y,z limits) of the cut-out region of density, and the centre of this
region. With coordinates, Phaser can work this out by itself. This information
is needed, for instance, to decide how large rotational steps can be in
the rotation search and to carry out the molecular transform interpolation
correctly. In the case of electron density, the RMS value does not have
the same physical meaning that it has when the model is specified by atomic
coordinates, but it is used to judge how the accuracy of the calculated
structure factors drops off with resolution. A suitable value for RMS can
be obtained, in the case of density from an experimentally-phased map, by
choosing a value that makes the SigmaA curve fall
off with resolution similarly to the mean figures-of-merit. In the case of
density from an EM image reconstruction, the RMS value should make the SigmaA
curve fall off similarly to a Fourier correlation curve used to judge the
resolution of the EM image.
For detailed information, including a tutorial with example scripts, see
Using density as a model
2.5 How to Define Composition
The composition defines the total amount of protein and nucleic acid that you have in the asymmetric unit not the fraction of the asymmetric unit that you are searching for.
2.5.1 Default Composition
For convenience, the composition defaults to 50% protein scattering by volume (the average for protein crystals). It is better to enter it explicitly, even if only to check that you have correctly deduced the probable content of your crystal. If your crystal has higher or lower solvent content than this, or contains nucleic acid, then the composition should be entered explicitly.
2.5.2 Composition by Solvent Content
Scattering is determined from the solvent content of the crystal, assuming that the crystal contains protein only, and the average distribution of amino acids in protein. If your crystal contains nucleic acid or your protein has an unusual amino acid distribution then the composition should be entered explicitly using the MW or sequence options.
- COMPosition SOLVent 0.6
2.5.3 Composition by Number of Residues in ASU
Scattering is determined from the number of residues in the asymmetric unit, assuming that the crystal contains protein only or nucleic acid only, and assuming an average distribution of residues for either. If your crystal contains a mixture then the composition should be entered explicitly using the MW or sequence options.
If your crystal has an unusual residue distribution then the composition should be entered explicitly using the sequence options.
- COMPosition PROTein NRES 187
- COMPosition NUCLeic NRES 24
-
2.5.4 Composition by Molecular Weight
The composition is calculated from the molecular weight of the protein and nucleic acid assuming the protein and nucleic acid have the average distribution of amino acids and bases. If your protein or nucleic acid has an unusual amino acid or base distribution the composition should be entered by sequence.
You can mix compositions entered by molecular weight with those entered by sequence.
- You have one protein (with MW 21022) in the asymmetric unit
- COMPosition PROTein MW 21022
- You have three copies of a protein (with MW 21022) in the asymmetric unit
- COMPosition PROTein MW 21022
- COMPosition PROTein MW 21022
- COMPosition PROTein MW 21022
- Another way of entering the same thing is
- COMPosition PROTein MW 21022
NUMber 3
- Yet another way of entering the same thing is
- COMPosition PROTein MW 63066
- You have two copies of a protein (with MW 21022), two copies of a protein
(with MW 9843) and RNA with (MW 32004) in the asymmetric unit
- COMPosition PROTein MW 21022
NUMber 2
- COMPosition PROTein MW 9843
NUMber 2
- COMPosition NUCLeic MW 32004
2.5.5 Composition by Sequence
The composition is calculated from the amino acid sequence of the protein and the base sequence of the nucleic acid in fasta format.
You can mix compositions entered by molecular weight with those entered by sequence.
Individual atoms can be added to the composition with the COMPOSITION ATOM keyword. This allows the explicit addition of heavy atoms in the structure e.g. Fe atoms.
- You have one protein (with sequence in fasta format in the file prot1.seq) in the asymmetric unit
- COMPosition PROTein SEQuence prot1.seq
- You have three copies of a protein (with sequence in fasta format in the file prot1.seq) in the asymmetric unit
- COMPosition PROTein SEQuence prot1.seq
- COMPosition PROTein SEQuence prot1.seq
- COMPosition PROTein SEQuence prot1.seq
- Another way of entering the same thing is
- COMPosition PROTein SEQuence prot1.seq
NUMber 3
- Yet another way of entering the same thing is to make a sequence file
with all the amino acids concatenated together (prot1.seq3)
- COMPosition PROTein SEQuence prot1.seq3
- You have two copies of a protein (with sequence in fasta format in
the file prot1.seq), two copies of a protein (with sequence in fasta format
in the file prot2.seq) and RNA with (with sequence in fasta format in
the file nucl1.seq) in the asymmetric unit
- COMPosition PROTein SEQuence prot1.seq
NUMber 2
- COMPosition PROTein SEQuence prot2.seq
NUMber 2
- COMPosition NUCLeic SEQuence nucl1.seq
2.5.6 Composition by Percentage Scattering
The fraction scattering of each ensemble can be entered directly. The fraction scattering of each ensemble is normally automatically worked out from the average scattering from each ensemble (calculated from the pdb files if entered as coordinates, or from the protein and nucleic acid molecular weights if entered as a map) divided by the total scattering given by the composition, but entering the fraction scattering directly overrides this calculation. This option is for use when the pdb files of the models in the ensemble are unusual e.g. consist only of C-alpha atoms, or only of hydrogen atoms (as in the CLOUDS method for NMR).
- Each copy of Ensemble mol1 gives 22% of the scattering
- COMPosition
ENSEmble mol1 FRACtional
0.22
- Each copy of Ensemble mol2 gives 78% of the scattering
- COMPosition
ENSEmble mol2 FRACtional
0.78
2.6 How to Define Solutions
Phaser writes out files ending in ".sol"
and ".rlist" that
contain the solution information from the job. The root of the files is given
by the ROOT
keyword. By default, the root filename is PHASER. These files can be read
back into subsequent runs of Phaser to build up solutions containing more
than one molecule in the asymmetric unit.
"PHASER.sol" files
are generated by all modes (rotation function modes with VERBOSE output),
and contain the current idea of potential molecular
replacement solutions.
"PHASER.rlist"
files are generated by the rotation function modes, and are used as input for performing
translation functions. (They are also produced by degenerate (2D) translation
functions, for performing a translation function to find the third dimension)
To include the files you should use the preprocessor
command @
@ filename.sol
@ filename.rlist
For simple MR cases you don't really need to know how to define molecular replacement solutions.
However, for difficult cases you might need to edit the files "PHASER.sol"
and "PHASER.rlist"
files manually
2.6.1 "sol" Files
At different stages of molecular replacement, an Ensemble will be oriented
but not positioned (after the rotation search), or oriented and positioned
(after the translation search), or, rarely, oriented and the position in 2
of 3 dimensions known. These three states correspond to solutions defined
by the keywords SOLUtion
3DIM, SOLUtion
6DIM, and SOLUtion
5DIM. Each Ensemble in the asymmetric unit has its own SOLUtion
keyword. Solutions of the type
3DIM are given by the rotation function, solutions of the type
6DIM are given by the translation function, and solutions of the type
5DIM are given by the degenerate translation function. Examples are:
- One copy of mol1 with known orientation and position (fractional coordinates)
- SOLUtion 6DIM
ENSEmble mol1 EULEr
17 20 32 FRACtional 0.12 0.05 0.74
- One copy of mol1 with known orientation only
- SOLUtion 3DIM
ENSEmble mol1 EULEr
17 20 32
- One copy of mol1 with known orientation and only the coordinates in 2 dimensions is
known. The degenerate direction is defined as the direction perpendicular
to the plane in which the position is given.
- SOLUtion 5DIM ENSEmble mol1 EULEr 17 20 32 DEGEnerate X FRACtional 0.05 0.74
When more than one (potential) molecular replacement solution is present, the solutions
are separated with the SOLUTION
SET keywords. For example, if the rotation function and
translation function for mol1 were very clear, then there will only be one
type of 6DIM solution for mol1.
If the rotation and translation functions for mol2 were then not clear, there
will be a series of possible 6DIM
solutions for mol2.
- SOLUtion SET
- SOLUtion 6DIM
ENSEmble mol1 EULEr
17 20 32 FRACtional 0.12 0.05 0.74
- SOLUtion 6DIM
ENSEmble mol2 EULEr
5 183 230 FRACtional 0.71 0.54 0.81
- SOLUtion SET
- SOLUtion 6DIM
ENSEmble mol1 EULEr
17 20 32 FRACtional 0.12 0.05 0.74
- SOLUtion 6DIM
ENSEmble mol2 EULEr
51 93 75 FRACtional 0.08 0.57 0.25
Useful Tip
If you have the coordinates of a partial solution with the pdb coordinates of the known structure in the correct orientation and
position, then you can force Phaser to use these coordinates. Use this pdb file to define an ensemble (named "mol1" in this example). Then manually create a .sol file of the following form and include it
in the Phaser command script with the @filename preprocessor command (or include it directly in the script)
- SOLUtion SET
- SOLUtion 6DIM
ENSEmble mol1 EULEr
0 0 0 FRACtional 0 0 0
2.6.2 "rlist" Files
These files define a rotation function list. The peak list is given with a
series of SOLUtion
TRIAl keywords.
SOLUtion TRIAl ENSEmble
mol1 EULEr 17 20 32 SCORE 4.5
SOLUtion TRIAl ENSEmble
mol1 EULEr 67 65 51 SCORE 4.4
SOLUtion TRIAl ENSEmble
mol1 EULEr 67 112 81 SCORE 4.3
If a partial solution is already known, then the information for the currently
"known" parts of the asymmetric unit is given in the form used for
the PHASER.sol file, followed
by the list of trial orientations for which a translation function is to be
performed.
SOLUtion SET
SOLUtion 6DIM ENSEmble
mol1 EULEr 17 20 32 FRACtional
0.12 0.05 0.74
SOLUtion TRIAl ENSEmble
mol1 EULEr 44 20 32 SCORE 5.8
SOLUtion TRIAl ENSEmble
mol1 EULEr 67 65 51 SCORE 5.2
SOLUtion SET
SOLUtion 6DIM ENSEmble
mol1 EULEr 17 20 32 FRACtional 0.13 0.55 0.76
SOLUtion TRIAl ENSEmble
mol1 EULEr 83 9 180 SCORE 6.3
SOLUtion TRIAl ENSEmble
mol1 EULEr 8 36 92 SCORE 4.2
SOLUtion TRIAl ENSEmble
mol1 EULEr 48 87 10 SCORE 4.0
If a degenerate translation function is performed, then a SOLUtion
TRIAl line is produced with the degenerate translation information
present, ready for performing the translation function on the third dimension.
SOLUtion TRIAl ENSEmble
mol1 EULEr 17 20 32 DEGEnerate
X FRACtional 0.05 0.74
2.7 How to Control Output
The output of Phaser can be controlled with the following optional keywords.
The ROOT keyword
is not compulsory (the default root filename is "PHASER"),
but should always be given, so that your jobs have separate and meaningful
output filenames.
Optional Keywords
Where HKLOut ON
is given as an optional keyword, Phaser produces an mtz file with "SigmaA"
type weighted Fourier map coefficients for producing electron density maps
for rebuilding.
| MTZ Column Labels |
Description |
| FWT |
PHWT |
Amplitude and phase for 2m|Fobs|-D|Fcalc| exp(i alpha-calc) map |
| DELFWT |
PHDELWT |
Amplitude and phase for m|Fobs|-D|Fcalc| exp(i alpha-calc) map |
| FOM |
m, analogous to the "Sim" weight, to estimate the reliability of alpha-calc |
2.8 How to Select Peaks
2.8.1 Fast Searches With Rescoring
The selection of peaks for the fast rotation and fast translation function
with rescoring of the top peaks with the full likelihood
target (default RESCORE ON),
is done in three steps, controlled by the keyword
FINAL
For automated molecular replacement, specifying [ROT|TRA] after
the keyword determines which of the rotation or translation function the selection
criteria apply to. If neither is specified, the selection criteria apply to both.
| Keyword |
Applies |
| FINAL [ROT|TRA] STEP 1 |
Controls the selection of peaks from the fast
search that will be rescored with the full likelihood target
|
| FINAL [ROT|TRA] STEP 2 |
Controls the selection of peaks from the rescoring to be
combined with other searches (e.g translation functions of different rotations, or
rotation functions with different fixed components present)
|
| FINAL [ROT|TRA] STEP 3 |
Controls the selection of peaks from the merged list for final output
|
Diagram showing how peaks are selected in three stages

2.8.2 Fast Searches Without Rescoring and Brute Searches
If RESCORE OFF is
requested (no rescoring of the fast search peaks is performed), or if the
brute rotation or translation searches are carried out, then there
are only two stages to selection: selection of peaks from the individual searches,
and selection of peaks from the combined list of solutions.
Selection of peaks at each stage is controlled, respectively, by the keywords
FINAL [ROT|TRA] STEP 2 and
FINAL [ROT|TRA] STEP 3.
Diagram showing how peaks are selected in two stages
2.8.3 Criteria
The selection of peaks saved for output in the rotation and translation functions
can be done in four different ways.
| Sub-Keyword |
Description |
Use |
| FINAL [ROT|TRA] STEP [1|2|3] SELEct PERCent <CUTOFF> |
Percentage of the top peak, where the value of the top peak is defined as
100% and the value of the mean is defined as 0%. |
Default, cutoff=75%. This criteria has the advantage that at least one peak (the top peak) always survives the selection.
If the top solution is clear, then only the one solution will be output, but if the distribution of
peaks is rather flat, then many peaks will be output for testing in the next
part of the MR procedure (e.g. many peaks selected from
the rotation function for testing with a translation function). |
| FINAL [ROT|TRA] STEP [1|2|3] SELEct SIGma <CUTOFF> |
Number of standard deviations (sigmas) over the mean (the Z-score) |
Absolute significance test. Not all searches will produce output if the
cutoff value is too high (e.g. 5 sigma). |
| FINAL [ROT|TRA] STEP [1|2|3] SELEct NUMber <CUTOFF> |
Number of top peaks to select |
If the distribution is very flat then it might be better to select a fixed
large number (e.g. 1000) of top rotation peaks for testing in the translation function. |
| FINAL [ROT|TRA] STEP [1|2|3] SELEct ALL |
None: all peaks are selected |
Enables full 6 dimensional searches, where all the solutions from the rotation
function are output for testing in the translation function. This
should never be necessary; it would be much faster and probably just as likely
to work if the top 1000 peaks were used in this way. |
Diagram showing selection criteria

2.8.4 Clustering
Peaks can also be clustered or not clustered prior to selection in steps 1 and 2.
| Sub-Keyword |
Description |
Use |
| FINAL [ROT|TRA] STEP [1|2] CLUSTER OFF |
All high peaks on the search grid are selected |
Default for STEP 1, because the position of the maximum may be different when the fast score
and the full likelihood function are used. |
| FINAL [ROT|TRA] STEP [1|2] CLUSTER ON |
Points on the search grid with higher neighboring points are removed from the selection |
Default for STEP 2. |
Diagram showing clustering
2.9 Basic Modes for Molecular Replacement
2.9.1 Fast Rotation Function
MODE
MR_FRF combines the anisotropy correction and likelihood-enhanced
fast rotation function (2), optionally rescored
with the full rotation likelihood function (1),
to find the orientation of a model in molecular replacement. Top rotation
solutions are output to the file FILEROOT.rlist
for input to a translation function. Top rotation solutions are also output
to the file FILEROOT.sol.
Example command script for fast rotation function to find the orientation
of BETA.
beta_frf.com
phaser
eof
TITLe beta FRF
MODE MR_FRF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
SEARCH ENSEmble beta
ROOT beta_frf
eof
Example command script for fast rotation function to find the orientation
of BLIP knowing the position and orientation of BETA, with the position
and orientation of BETA input from the command line.
blip_frf_with_beta.com
phaser
eof
TITLe blip FRF with beta rotation and translation
MODE MR_FRF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 #beta
COMPosition PROTein MW 17522 #blip
SEARch ENSEmble blip
SOLUtion 6DIM ENSEmble beta EULEr 201 41 184 FRACtional -0.49408 -0.15571
-0.28148
ROOT blip_frf_with_beta
eof
Example command script for fast rotation function to find the orientation
of BLIP knowing only the orientation of BETA, with the orientation of BETA
input using the output solution file from the beta_frf.com
job above.
blip_frf_with_beta_rot.com
phaser
eof
TITLe blip FRF with beta R
MODE MR_FRF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
SEARch ENSEmble blip
@beta_frf.sol # solution file output by phaser
ROOT blip_frf_with_beta_rot
eof
Compulsory Keywords
Optional Keywords
2.9.2 Brute Rotation Function
MODE
MR_BRF combines the anisotropy correction and brute force likelihood
rotation function (1) to find the orientation
of a model in molecular replacement. Top rotation solutions are output to
the file FILEROOT.rlist for
input to a translation function. Top rotation solutions are also output
to the file FILEROOT.sol.
Example command script for brute rotation function to find the orientation
of BETA
beta_brf.com
phaser
eof
TITLe beta BRF
MODE MR_BRF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
SEARch ENSEmble beta
ROOT beta_brf
eof
Example command script for brute rotation function to find the optimal orientation
of BETA in a restricted search range and on a fine grid around the position
from the fast rotation search.
beta_brf_around.com
phaser
eof
TITLe beta BRF fine sampling
MODE MR_BRF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
SEARch ENSEmble beta
ROTAte AROUnd EULEr 201 41 184 RANGE 10
SAMPling ROTation 0.5
XYZOut ON # not the default
TOPFiles 1 # not the default
ROOT beta_brf_around
eof
Compulsory Keywords
Optional Keywords
2.9.3 Fast Translation Function
MODE
MR_FTF combines the anisotropy correction and likelihood-enhanced
fast translation function (3), optionally rescored
by the full likelihood translation function (1),
to find the position of a previously oriented model in molecular replacement.
Top translation solutions are output to the file FILEROOT.sol.
Example command script for finding the position of BETA after the rotation
function has been run and the results output to the file beta_frf.rlist
beta_ftf.com
phaser
eof
TITLe beta FTF
MODE MR_FTF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
@beta_frf.rlist
ROOT beta_ftf
eof
Example command script for finding the position of BLIP after the rotation
function has been run and the results output to the file blip_frf_with_beta.rlist,
which has the SOLUtion 6DIM keyword
input for BETA and the SOLUtion
TRIAL keyword input for the orientations to try for BLIP with the
translation function.
blip_ftf_with_beta.com
phaser
eof
TITLe beta FTF
MODE MR_FTF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
@blip_frf_with_beta.rlist
ROOT blip_ftf_with_beta
eof
Compulsory Keywords
Optional Keywords
2.9.4 Brute Translation Function
MODE
MR_BTF combines the anisotropy correction and brute force likelihood
translation function (1) to find the position
of a previously oriented model in molecular replacement. Top translation
solutions are output to the file FILEROOT.sol.
Example command script for brute Translation function to find the position
of BETA after the rotation function has been run
beta_btf.com
phaser
eof
TITLe beta BTF
MODE MR_BTF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
@beta_frf.rlist
TRANslate AROUnd FRACtional POINt -0.49408 -0.15571 -0.28148 RANGe 5
ROOT beta_btf
eof
Example command script for brute Translation function to find the position
of BETA degenerate in X after the rotation function has been run
beta_btf_degen_x.com
phaser
eof
TITLe beta degenerate X
MODE MR_BTF
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
@beta_frf.rlist
TRANslate DEGEnerate X
ROOT beta_btf_degen_x
eof
Compulsory Keywords
Optional Keywords
2.9.5 Refinement and Phasing
MODE
MR_RNP combines the anisotropy correction and refinement against
the likelihood function (1) to optimize full or
partial molecular replacement solutions and phase the data. At the end of
refinement, the list of solutions is checked for duplicates, which are pruned.
Refined solutions are output to the file FILEROOT.sol.
Example command script to refine a set of solutions
beta_blip_rnp.com
phaser
eof
TITLe beta blip rigid body refinement
MODE MR_RNP
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
ROOT beta_blip_rnp # not the default
HKLOut OFF # not the default
XYZOut OFF # not the default
@beta_blip_auto.sol
eof
Compulsory Keywords
Optional Keywords
2.9.6 Log-Likelihood Gain
MODE
MR_LLG combines the anisotropy correction and the likelihood function
(1) to calculate the log-likelihood gain for full
or partial molecular replacement solutions. Solutions are output to the
file FILEROOT.sol.
Example command script to rescore the solutions using a different resolution
range of data and a different spacegroup
beta_blip_llg.com
phaser
eof
TITLe beta blip solution 6A P3121
MODE MR_LLG
HKLIn beta_blip.mtz
LABIn F=F SIGF = SIGF
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
ROOT beta_blip_llg # not the default
RESOlution 6.0
SPACegroup P 31 2 1
@beta_blip_auto.sol
eof
Compulsory Keywords
Optional Keywords
2.9.7 Packing
MODE
MR_PAK determines whether molecular replacement solutions pack in
the unit cell. Solutions that pack are output to the file FILEROOT.sol.
Example command script for determining whether a set of molecular replacement
solutions pack in the unit cell.
beta_blip_pak.com
phaser
eof
TITLe beta blip packing check
MODE MR_PAK
HKLIn beta_blip.mtz
LABIn F=F SIGF=SIGF
ENSEmble beta PDB beta.pdb IDENtity 100
ENSEmble blip PDB blip.pdb IDENtity 100
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
ROOT beta_blip_pak # not the default
PACK 1 # not the default
@beta_blip_auto.sol
eof
Compulsory Keywords
Optional Keywords
3. Experimental Phasing Modes
Phaser performs SAD phasing in two modes. In the Automated Experimental Phasing mode, Phaser corrects for anisotropy, puts the data on absolute scale, does a cell content analysis, refines heavy atom sites to optimize phasing, and completes the model from log-likelihood gradient maps. Alternatively, the SAD Phasing mode can be used, which only refines heavy atom sites to optimize phasing, and completes the model from log-likelihood gradient maps. For this mode, the data should be pre-corrected for anisotropy and put on an absolute scale. This mode should only be used as part of automation pipelines, where the correct preparation of the data can be guaranteed and it saves cpu time.
3.1 Automated Experimental Phasing
MODE
EP_AUTO combines the anisotropy correction, cell content analysis, and SAD Phasing modes
to automatically solve a structure by experimental phasing. The final solution
is output to the files FILEROOT.sol,
FILEROOT.mtz and FILEROOT.pdb. Many structures can be solved by running an automated experimental phasing job with
defaults.
Do SAD phasing of insulin. This is the minimum input, using all defaults (except the ROOT filename).
insulin_auto.com
phaser
eof
MODE EP_AUTO
TITLe sad phasing of insulin with intrinsic sulphurs
HKLIn S-insulin.mtz
COMPosition NUCLeic SEQ S-insulin.seq
CRYStal insulin DATAset sad LABIn F+=F(+) SIG+=SIGF(+) F-=F(-) SIG-=SIGF(-)
LLGComplete CRYStal insulin COMPLETE ON SCATtering ELEMent S
ATOM CRYStal insulin PDB S-insulin_hyss.pdb
ROOT insulin_auto
eof
Compulsory Keywords
Optional Keywords
3.2 How to Define Data
You need to tell Phaser the name of the mtz file containing your data and the columns
in the mtz file to be used. For SAD phasing, a single CRYSTAL and DATASET with anomalous data (F(+), SIGF(+), F(-) and SIGF(-)) must be given. The columns must have the correct CCP4 column type: 'G' for F(+) and F(-) and 'L' for SIGF(+) and SIGF(-). If the columns on your mtz file have somehow acquired the incorrect column type, you should change the column type with an mtz editing programme (e.g. sftools).
CRYStal insulin DATAset sad &
LABIn F+ = F(+) SIG+ = SIGF(+)
F- = F(-) SIG- = SIGF(-)
Compulsory Keywords
3.3 How to Define Atoms
Atom sites are defined with the ATOM
keyword. Atoms sites may be entered one at a time specifying fractional or orthogonal coordinates, occupancy and B-factor,
or from a PDB file, or from a mlphare-style HA file. The crystal to which the atoms correspond must be specified in the input.
3.4 How to Control Output
3.5. Basic Modes for Experimental Phasing
3.5.1 SAD Phasing
MODE
EP_SAD phases SAD data and completes the structure from log-likelihood gradient maps. The final solution
is output to the files FILEROOT.sol,
FILEROOT.mtz and FILEROOT.pdb
.
Do SAD phasing of insulin. This is the minimum input, using all defaults (except the ROOT filename)
insulin_sad.com
phaser
eof
MODE EP_SAD
TITLe sad phasing insulin with intrinsic sulphurs
HKLIn S-insulin.mtz
CRYStal insulin DATAset sad LABIn F+=F(+) SIG+=SIGF(+) F-=F(-) SIG-=SIGF(-)
LLGComplete CRYStal insulin COMPLETE ON SCATtering ELEMent S
ATOM CRYStal insulin PDB S-insulin_hyss.pdb
ROOT insulin_sad
eof
Compulsory Keywords
Optional Keywords
3.5.2 Combined MR and SAD Phasing
MODE
EP_SAD when used with the PART keyword phases SAD data and completes the structure from log-likelihood gradient maps using a partial structure as (at least part of) the initial atomic substructure. Only the real (i.e. non-anomalous) signal from the partial structure is used. The final solution
is output to the files FILEROOT.sol,
FILEROOT.mtz and FILEROOT.pdb
.
Do SAD phasing of insulin starting from partial model of one helix only. This is the minimum input, using all defaults (except the ROOT filename)
insulin_sad.com
phaser
eof
MODE EP_SAD
TITLe sad phasing insulin with intrinsic sulphurs
HKLIn S-insulin.mtz
CRYStal insulin DATAset sad LABIn F+=F(+) SIG+=SIGF(+) F-=F(-) SIG-=SIGF(-)
LLGComplete CRYStal insulin COMPLETE ON SCATtering ELEMent S
PART PDB helix.pdb ID 100
ROOT insulin_mr_sad
eof
Compulsory Keywords
Optional Keywords
5. General Modes
5.1 Anisotropy Correction
MODE
ANO corrects the experimental data for anisotropy. Data (amplitude
and associated sigma) are corrected for anisotropy and output to FILEROOT.mtz
with column label set to the input column label with the addition of _ISO.
Example command script to correct BETA-BLIP data for anisotropy
beta_blip_ano.com
phaser
eof
MODE ANO
TITLe beta blip data correction
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
ROOT beta_blip_ano # not the default
eof
Compulsory Keywords
Optional Keywords
5.2 Cell Content Analysis
MODE
CCA determines the composition of the crystals using the "new"
Matthews coefficients of Kantardjieff & Rupp (2003) "Matthews coefficient
probabilities: Improved estimates for unit cell contents of proteins, DNA
and protein-nucleic acid complex crystals". Protein Science 12:1865-1871.
The molecular weight of ONE complex or assembly
to be packed into the asymmetric unit is given with the COMPosition
keyword, and the possible Z values (number of copies of the complex or assembly)
that will fit in the asymmetric unit and the relative frequency of their
corresponding VM values is reported. RESOlution
should be set to the maximum resolution that has been observed for the crystal.
Example script for cell content analysis for BETA-BLIP
beta_cca.com
phaser
eof
TITLe BETA-BLIP cell content analysis
MODE CCA
HKLIn beta_blip.mtz
LABIn F=Fobs SIGF=Sigma
COMPosition PROTein MW 28853 NUM 1 #beta
COMPosition PROTein MW 17522 NUM 1 #blip
RESO 3.0
ROOT beta_blip_cca # not the default
eof
Compulsory Keywords
Optional Keywords
5.3 Normal Mode Analysis
MODE
NMA writes out pdb files that have been perturbed along normal
modes, in a procedure similar to that described by Suhre & Sanejouand
(Acta Cryst. D60, 796-799, 2004). Each run of the program
writes out a matrix FILEROOT.mat
that contains the eigenvectors and eigenvalues of the atomic Hessian, and
can be read into subsequent runs of the same job, to speed up the analysis.
Do normal mode analysis only, write out eigenfile but not coordinates
beta_nma.com
phaser
eof
TITLe beta normal mode analysis
MODE NMA
ENSEmble beta PDB beta.pdb IDENtity 100
XYZOut OFF
ROOT beta_nma # not the default
eof
Write out pdb files perturbed in 0.5 Ångstrom rms intervals in "forward" (positive dq values) along modes
7 and 10 (and combinations of 7 and 10)
beta_nma_pdb.com
phaser
eof
TITLe beta normal mode analysis pdb file generation
MODE NMA
ENSEmble beta PDB beta.pdb IDENtity 100
ROOT beta_nma_pdb # not the default
EIGEn beta_nma.mat
NMAPdb MODE 7 MODE 10 RMS 0.5 FORWARD
eof
Compulsory Keywords
Optional Keywords
5. Keyword Input
Phaser can be controlled using keyword input. Not all keywords are relevant
for all modes of operation (the list of relevant keywords for each mode is given
with each mode above). Some keywords are only for single use, others have meaning
when used more than once. The input values of many parameters are constrained
to physically meaningful values. All non-compulsory parameters have defaults.
5.1 Preprocessor
Preprocessor commands may be used in the keyword
input to incorporate files, add comments or allow line continuation.
- @ filename
- To include a file in the input stream use the "@" (at) character
- Recursive application (can nest @ in files)
- # comment
- line of keyword input # comment
- All characters on a line after a "#" (hash) character are ignored
- line of keyword input &
line of keyword input continued
- Line continuation with the "&" (ampersand) character
- END GO RUN START STOP QUIT EXIT KILL
- End the input and start Phaser
- phaser HKLIN filename
- Simple keyword-value pairs may also be added to the command line. This enables the command
line to be used for input such as HKLIN, in traditional CCP4 style.
5.2 Syntax of Documentation
- KEYWord
- Courier font in blue means a keyword. Only the first four letters (sometimes
less) of any input keyword are required/recognized. The required characters
for each keyword are given in uppercase and those not required in lowercase.
Keywords are not case sensitive.
- <PARAMETER>
- Angle brackets mean a parameter value. Input strings are
case sensitive: the case of titles and filenames is preserved.
- { KEYWord
<X Y Z> }
- Curly brackets mean a group of keywords/parameters must come together.
- [ X | Y ]
- Square brackets and line separating options means X or Y.
- KEYWord <X Y Z>
- Italics mean the keyword/input is optional.
- *KEYWord
- Keywords marked with an asterisk are for "expert" use only, or use in development.
- Constraint: X=%
- If 1<X≤100, stored value = X, else if 0≤X≤1, stored value = X*100, else error
- Default:
- Some default values are constants, others are set by Phaser after it has analyzed the input data.
Most keywords only refer to a single parameter, and if used multiple times, the parameter will take the last value input.
Some keywords are meaningful when entered multiple times. The order may or may not be important.
-
-
ATOM CRYStal <XTALID>
ELEMent <TYPE>
[ORTH|FRAC] <X Y Z>
OCCupancy <OCC>
[{ISOB <ISOB>} |
{[ANOU|USTAR] <HH KK LL HK HL KL>}]
FIXX FIXO FIXB BSWAP
{LABEL <SITE_NAME>}
- Definition of atom position in crystal XTALID. B-factor defaults to isotropic and Wilson B-factor if not set and ATOM BFACTOR WILSON is used (the default). The B-factors can be set to another value with ATOM BFACTOR VALUE
-
ATOM CRYStal <XTALID>
PDB <FILENAME>
- Definition of atom positions in crystal XTALID using a pdb file.
-
ATOM CRYStal <XTALID>
HA <FILENAME>
- Definition of atom positions in crystal XTALID using a ha file (from RANTAN, MLPHARE etc.).
-
ATOM BFACtor [ WILSON | {VALUE <ISOB>} ]
- Reset all atomic B-factors to the Wilson B-factor or to ISOB
Constraint: ISOB>=0
-
- *BINS
{MINimum
<L>} {MAXimum <H>}
{NUMber <N>} {WIDTh
<W>} {CUBIc <A B C>}
- The binning of the data. L = minimum number of bins, H = maximum number
of bins, N = number of bins, W = width of the bins in number of reflections,
A B C are the coefficients for the binning function A(S*S*S)+B(S*S)+CS where
S = (1/resolution). If N is given then the values of L and H are ignored.
Constraint: CUBIC coefficients restricted to monotonically increasing function: A >0, B >0, C >0 and either
(a) A=B=0 or (b) A=0 or (c) B=0
Default: BINS MIN 6 MAX 50 WIDTH 1000 CUBIC 0 1 0
-
- *BFACtor {[WILSon|SPHEricity] [ON|OFF]} {SIGMA <SIGMA>}
- Toggle to set the Wilson restraint on the isotropic component of the atomic B-factors, and the Sphericity restraint on the anisotropic B-factors, and the sigma of the restraint, for experimental phasing.
Default: BFACTOR WILSON ON SIGMA 5
Default: BFACTOR SPHERICITY ON SIGMA 5
-
- *BOXScale <BS>
- Scale for box for calculating structure factors. The ensembles are put
in a box equal to (extent of molecule)*BS.
Constraint: BS > 2.4
Default: BOXSCALE 4
-
- *CELL <A B C ALPHA BETA GAMMA>
- Unit cell dimensions
- *CELL CRYStal <XTALID> <A B C ALPHA BETA GAMMA>
- Unit cell dimensions for crystal (experimental phasing)
Constraint: A>0,B>0,C>0,ALPHA>0,BETA>0,GAMMA>0
Default: Cell read from MTZ file
-
- *CLMN
{SPHEre <SPHERE>}
{LMINimum <LMIN>}
{LMAXimum <LMAX>}
- The radii or L values for the decomposition of the Patterson in Ångstroms. Values depend on the geometric mean radius (GMR)of the ENSEMBLE
Constraint: SPHERE>5, LMIN>0, LMAX>LMIN
Default: CLMN SPHERE <2*GMR> LMIN 2
-
- COMPosition SOLVent <SOLVENT>
- Composition entered by solvent content.
Default: SOLVENT=0.5
- COMPosition [PROTein|NUCLeic]
[{MW <MW>}|{SEQuence <FILE>}]
NUMber <NUM>
- Composition of the crystals. The number of copies NUM of molecular
weight MW or SEQ given in fasta format (in a file FILE) of protein or nucleic
acid in the asymmetric unit.
Constraint: MW>0
Default: NUM=1
- COMPosition ATOM <ATOMTYPE>
NUMber <NUM>
- NUM atoms of ATOMTYPE are added to the composition.
- COMPosition ENSEmble <MODLID>
FRACtional <FRAC>
- Alternative way of defining composition. Fraction scattering is entered
explicitly for each ENSEMBLE.
Constraint: 0<FRAC<=1
-
- CRYStal <XTALID>
DATAset <WAVEID>
LABIn F=<F> SIGF=<SIGF>
- Columns of MTZ file to read for this (non-anomalous) dataset
- CRYStal <XTALID>
DATAset <WAVEID>
LABIn F(+)=<F+> SIGF(+)=<SIG+> F(-)=<F-> SIGF(-)=<SIG->
- Columns of MTZ file to read for this (anomalous) dataset
-
CRYStal <XTALID> DATAset <WAVEID>
SCATtering [
CUKA |
{WAVElength <L>}
{MEASured FP=<FP> FDP=<FDP>}
{ELEMent <TYPE> FP=<FP> FDP=<FDP>} ]
- Set wavelength at which dataset was collected or set F-prime and F-double-prime explicitly.
Default: CRYSTAL <XTALID> SCATTERING CUKA
-
CRYStal <XTALID> DATAset <WAVEID>
FIXP FIXDP
- Fix Fp and Fdp for all scatterers in dataset
Default: Fp and Fdp refined
-
CRYStal <XTALID> DATAset <WAVEID>
RESOlution <HIRES> <LORES>
- Restrict resolution range of this dataset
- *CRYStal <XTALID> DATAset <WAVEID>
BINS {MINimum <L>} {MAXimum <H>} {NUMber <N>} {WIDTh <W>} {CUBIc <A B C>}
- Binning parameters for this dataset
Default: As for BINS keyword
-
- EIGEn [ {READ <EIGENFILE>} | {WRITe [ON|OFF]} ]
- Read or write a file
containing the eigenvectors and eigenvalues. If reading, the eigenvalues
and eigenvectors of the atomic Hessian are read from the file generated
by a previous run, rather than calculated. This option must be used with
the job that generated the eigenfile
and the job reading the eigenfile must have identical (or default) input
for keyword NMAMethod. Use WRITe to control whether or not the eigenfile
is written when not using the READ mode.
Default: EIGEN WRITE ON
-
- ENSEmble <MODLID>
PDB <PDBFILE> [RMS|IDENtity] <NUM> {PDB
<PDBFILE> [RMS|IDENtity]
<NUM>}
- The names of the PDB files used to build the ENSEMBLE, and either the expected RMS deviation of the coordinates to the
"real" structure or the percent sequence identity with the real sequence.
Constraint: NUM=% if IDENtity
- ENSEmble
<MODLID> HKLIn <MTZFILE>
F=<F> PHI=<P> EXTEnt
<EX EY EZ> RMS <RMS> CENTre
<CX CY CZ> PROTein MW <PMW> NUCLeic
MW <NMW>
- An ENSEmble defined from a map (via an mtz file). The molecular weight
of the object the map represents is required for scaling. The effective
RMS coordinate error is needed to judge how the map accuracy falls off with
resolution. The extent is needed to determine reasonable rotation steps,
and the centre is needed to carry out a proper interpolation of the molecular
transform. The extent and the centre are both given in Ångstroms.
- *ENSEmble <MODLID>
BOXScale <BS>
- Scale for box for calculating structure factors. The ensemble is put
in a box equal to (extent of molecule)*BS.
Constraint: BS >2.4
Default: ENSEmble <MODLID> BOXSCALE 4
- *ENSEmble
<MODLID> BINS {MINimum
<L>} {MAXimum <H>} {NUMber
<N>} {WIDTh <W>} {CUBIc
<A B C>}
- Bins for the MODLID
Constraint: As for keyword BINS
Default: ENSEmble <MODLID> BINS MIN 6 MAX 200 WIDTH 1000 CUBIC 0 1 0
-
- *ENSIn <MODLID>
HKLIn <MTZFILE>
SCATtering <SCAT>
EXTEnt <EX EY EZ>
PR <P1 P2 P3 P4 P5 P6 P7 P8 P9>
PT <TX TY TZ>
- This option can be used to read back a molecular transform computed in an earlier Phaser job run in the MR_ENS mode.
May be useful if the spherical harmonic decomposition is very long. This can only be used when repeating the search for a component
of the asymmetric unit with no (or the same) known fixed structure as part of the search.
-
- *FFTS MINimum [ATOMS_MIN] MAXimum [ATOMS_MAX]
- The minimum and maximum number of atoms of the range between which direct summation and fft methods are tested to see which is faster for structure factor and gradient calculation (for this unit cell and resolution). For a number of atoms below ATOMS_MIN direct structure factor calculation is always used, and for a number of atoms above ATOMS_MAX ffts are always used for the structure factor calculation and the gradient calculations. Direct summation is always used for the curvatures. Use FFTS MIN 0 MAX O to always use ffts.
Default: FFTS MIN 20 MAX 80
-
- FINAl [ROT|TRA] STEP [1|2|3] SELEct
[{SIGma <S>} |
{NUMber <N>} |
{PERCent <P>} |
ALL]
- Peaks satisfying selection criteria are saved. With neither ROT nor TRA given, the criteria apply to both. With STEP [1|2|3]
not given, the criteria apply to all three.
Constraint: S>0,P=%
Default: FINAL SELECT PERCENT 75
- FINAl [ROT|TRA] STEP [1|2] CLUSter [ON|OFF]
{DUMP <NDUMP>}
{LOG [ON|OFF]}
- Toggle for CLUSTER selects clustered
peaks for rescoring (STEP 1) or saving (STEP 2). If clustered peaks are used, then NDUMP raw peaks will
be dumped to the output. If unclustered peaks are used
then you may still perform the clustering and log the results to the log
file (this may be time consuming).
Constraint: NDUMP>0
Default: FINAL STEP 1 CLUSTER OFF
Default: FINAL STEP 2 CLUSTER ON
-
- HAND [ON|OFF]
- Toggle for whether or not to use other hand of space group in experimental phasing
Default: HAND OFF
-
- HKLIn <FILENAME>
- The mtz file containing the data
-
- HKLOut [ON|OFF]
- Flags for output of an mtz file containing the phasing information
Default: HKLOUT ON
-
- LABIn F = <F>
SIGF = <SIGF>
- Columns in mtz file. F must be given. SIGF should be given but is optional.
-
- LLGComplete NCYC <NMAX>
- Maximum number of cycles of log-likelihood gradient structure completion. By default, NMAX is 50, but this limit should never be reached, because all features in the log-likelihood gradient maps should be assigned well before 50 cycles are finished. This keyword should be used to reduce the number of cycles to 1 or 2.
- LLGComplete CRYStal <XTALID>
{COMPlete [ON|OFF]}
{CLASh [ DEFAULT | DISTANCE <clash>]}
{SIGma <sigma>}
{SCATtering [ ELEMent <atomtype>
|
REAL |
IMAG ]}
- Toggle for structure completion by log-likelihood gradient maps,
minimum distance between atoms in log-likelihood gradient maps (default determined by resolution),
Z-score (sigma) for accepting peaks as new atoms in log-likelihood gradient maps,
and
atom type(s) to be used for log-likelihood gradient completion. If more than one element is entered for log-likelihood gradient completion, the atom type that gives the highest Z-score for each peak is selected.
As an alternative to using an atom type(s) for the calculation of the log-likelihood gradient map, real or imaginary components of the map may be calculated.
Default: LLGCOMPLETE CRYSTAL <XTALID> COMPLETE OFF
-
- MODE
[ ANO | CCA | NMA | MR_AUTO | MR_FRF | MR_FTF | MR_BRF | MR_BTF | MR_RNP | MR_LLG | MR_PAK | EP_AUTO | EP_SAD]
- The mode of operation of Phaser
-
-
NMAMethod
[RTB|CA|ALL]
{NRESidues <NRES>}
{MAXBlocks <MAXBLOCKS>}
{RADIus <RADIUS>}
{FORCe <FORCE>}
- Input for the normal mode analysis. Writes out pdb files perturbed along
normal mode(s). {RTB|CA|ALL} define the atoms used for the analysis.
RTB uses the rotation-translation block method, CA uses C-alpha atoms only
to determine the modes, and ALL uses all atoms to determine the modes (only
for use on very small molecules, less than 250 atoms). For the RTB analysis,
by default NRES is calculated so that it is as high as it can be without
reaching MAXBlocks.
The interaction radius used for the calculations and the force constant
can also be altered.
Default: NMAMETHOD RTB MAXBLOCKS 250 RADIUS 5 FORCE 1
-
- NMAPdb
{MODE <M1> {MODE
<M2>
}} {RMS <RMS>} {CLASh
<CLASH>} {STREtch
<STRETCH>} {MAXRms
<MAXRMS>} [FORWard|BACKward|TOFRo]
{DQ <DQ1> {DQ <DQ2>
}}
{COMBination }
- MODE keyword gives the mode along which to perturb the structure. If multiple modes are
entered, the structure is perturbed along all the modes AND combinations
of the modes given. There is no limit on the number of modes that can be
entered, but the number of pdb files explodes combinatorially. RMS
is the increment in rms Ångstroms between pdb files to be written. The structure
will be perturbed along each mode until either the C-alpha atoms clash with
(come within CLASH Ångstroms
of) other C-alpha atoms, the distances between C-alpha atoms STRETCH
too far (note that normal modes do not preserve the geometry) or the MAXRMS
deviation has been reached. The structure is perturbed either forwards or
backwards or to-and-fro (FORWARD|BACKWARD|TOFRO)
along the eigenvectors of the modes specified. Alternatively, the DQ
factors (as used by the Elnemo server (K. Suhre & Y-H. Sanejouand, NAR
2004 vol 32) by which to perturb the atoms along the eigenvectors can
be entered directly. The keyword COMBINATION
controls how many modes are present in any combination.
Default: NMAPDB MODE 7 RMS 0.3 TOFRO STRETCH 10.0 CLASH 2.0 MAXRMS 0.5 COMBINATION 3
-
- *MACANO
ANISotropic [ON|OFF] BINS
[ON|OFF] SOLK [ON|OFF] SOLB
[ON|OFF]
{NCYCle <NCYC>}
{MINImizer [BFGS|NEWTON|DESCENT]}
- Macrocycles for the refinement of SigmaN in the anisotropy correction
Default: MACANO ANIS ON BINS ON SOLK OFF SOLB OFF NCYC 50 MINIMIZER BFGS
- *MACANO OFF
- Turns off the anisotropy correction
-
- *MACMR ROT [ON|OFF]
TRA [ON|OFF]
{NCYCle <NCYC>}
{MINImizer [BFGS|NEWTON|DESCENT]}
- Molecular replacement refinement macrocycle. The macrocycles are performed in the order that they are entered
Default:
MACMR ROT ON TRA ON NCYC 50 MINIMIZER BFGS
-
- *MACSAD
K [ON|OFF]
B [ON|OFF]
SIGMA [ON|OFF]
XYZ [ON|OFF]
OCCupancy [ON|OFF]
BFACtor [ON|OFF]
FDP [ON|OFF]
SA [ON|OFF]
SB [ON|OFF]
SP [ON|OFF]
SD [ON|OFF]
{NCYCle <NCYC>}
{MINImizer [BFGS|NEWTON|DESCENT]}
{TARGet[NOT_ANOM_ONLY| ANOM_ONLY]}
- SAD refinement macrocycle. The macrocycles are performed in the order that
they are entered
Default:
MACSAD K OFF B OFF SIGMA OFF XYZ OFF OCC ON BFAC OFF FDP OFF SA OFF SB OFF SP OFF SD OFF NCYC 50 MINIMIZER BFGS
MACSAD K OFF B OFF SIGMA OFF XYZ ON OCC ON BFAC ON FDP OFF SA OFF SB OFF SP ON SD ON NCYC 50 MINIMIZER BFGS
-
- *MUTE [ON|OFF]
- Toggle for running in "silent" mode, with no summary, logfile or verbose output written to "standard output". Output can be extracted from Results object in python, or from XML file.
Default: MUTE OFF
-
- *OUTLier [ON|OFF] <PROB>
- Control of the large unlikely E-value rejection. Outliers with a probability
less than PROB are rejected.
Default: OUTLIER ON 0.000001
-
- PACK BEST
MAX <MAX_CLASHES> DISTANCE <CLASH_DISTANCE>
- Allow the best packing solutions provided the total number of clashes does not exceed MAX_CLASHES. Distance within which C-alpha atoms clash given by CLASH_DISTANCE Ångstroms. If the model is RNA or DNA, phosphate and carbon atoms in the phosphate backbone, and nitrogen atoms in the bases are taken as the marker atoms for clashes.
- PACK ALLOW <ALLOWED_CLASHES> DISTANCE <CLASH_DISTANCE>
- Allow all solutions that pack with number of C-alpha clashes within Ångstroms less than or equal to ALLOWED_CLASHES.
- PACK ALL DISTANCE <CLASH_DISTANCE>
- Allow all solutions. Clashes within CLASH_DISTANCE Ångstroms reported to output.
Default: PACK BEST MAX 10 DISTANCE 3.0
-
- PARTial PDB
<PDBFILE> [RMS|IDENtity] <NUM>
- The partial structure for SAD refinement
Constraint: NUM=% if IDENtity
-
- PERMutations [ON|OFF]
- Toggle for whether the order of the search set is to be permuted.
Default:
PERMUTATIONS OFF
-
- PURGe [ON|OFF] PERCent <PERC>
- Automated Molecular Replacement only. Toggle for whether to purge the solution list from the translation function and the refinement and phasing steps (when searching multiple space groups) according to the best solution found so far.
Default:
PURGE ON PERCENT 75
-
- RESCore [ROT|TRA] [ON|OFF]
- Toggle for rescoring of fast search peaks. If neither ROT nor TRA are present, toggle applies to both the rotation and translation function rescoring (respectively), otherwise to the one specified.
Default: RESCore ON
-
- RESOlution <HIRES> <LORES>
- Resolution range in Ångstroms. If only one limit is given, it is the high
resolution limit; otherwise the limits can be in either order.
Constraint: HIRES>0,LORES>0
- RESOlution AUTO <HIRES> <LORES>
- Resolution range in Ångstroms for high resolution refinement in MR_AUTO mode. If only one limit is given, it is the high
resolution limit; otherwise the limits can be in either order.
Constraint: HIRES>0,LORES>0
-
- ROOT <FILEROOT>
- Root filename for output files (e.g. FILEROOT.log)
Default: ROOT PHASER
-
- ROTAte
[ FULL |
{ROTAte AROUnd EULEr <A B G> RANGe <RANGE>} ]
- Sample all unique angles (FULL) or restrict the search to the region of +/- RANGE degrees around orientation given by EULER
Constraint: RANGE>0
Default: ROTAte FULL
-
- SAMPling [ROT|TRA] <SAMP>
- Sampling of search given in degrees for a rotation search and Ångstroms
for a translation search. Sampling for rotation search depends on the geometric mean radius (GMR) of the Ensemble and the
high resolution limit (dmin) of the search.
Constraint: SAMP>0
Default: SAMP = 2*atan(dmin/(4*GMR)) (MODE = MR_BRF)
Default: SAMP = 2*atan(dmin/(4*GMR)) (MODE = MR_FRF)
Default: SAMP = dmin/5; (MODE = MR_BTF)
Default: SAMP = dmin/3; (MODE = MR_FTF)
-
- *SCRIpt [ON|OFF]
- Write Phaser script file
Default: SCRIPT ON
-
- SEARch
ENSEmble <MODLID> {OR
ENSEmble <MODLID>}
{NUMber
<NUM>}
-
The ENSEMBLE
to be searched for in a rotation search or an automatic search. When multiple
ensembles are given using the OR keyword, the search is performed for each
ENSEMBLE
in turn. The final results are the best of all the searches (controlled
with the FINAL keyword). When the keyword is entered multiple times in the
MR mode, each SEARCH keyword refers to a new component of the structure.
If the component is present multiple times the sub-keyword NUMber can be
used (rather than entering the same SEARCH keyword NUMber times). If the
MR mode is being used with a fixed partial solution, only enter SEARCH keywords
(or associated NUMbers) for the components that remain to be found.
- SEARch MATThews [ON|OFF]
- The number of copies of the search components to search for is determined
by the Matthews coefficient. The stoichiometry of the search is given using
the NUM keyword above.
Default: SEARCH MATTHEWS OFF
-
- SGALternative
[ALL
| HAND | {TEST <SG>}]
- Alternative space groups to test in the translation function. All tests
all possible space groups, hand tests the given space group and its enantiomorph
and <SG> tests the give space group.
Default: Only the SPACEGROUP is tested
-
- *SHANnon <SHARAT>
- Shannon sampling given by (2*SHARAT) for the Ensemble maps. Increase SHARAT to sharpen the sampling.
Constraint: SHARAT>1.1
Default: SHANNON 1.5
-
- SOLUtion
SET <ANNOTATION>
- Start new set of solutions
- SOLUtion
3DIM ENSEmble <MODLID> EULEr <A
B G> FIXR
- Rotation only solution. Use this keyword if only the orientation in 3
dimensions is known. This keyword is repeated for each case. A B G are the
Euler angles in degrees.
- SOLUtion
5DIM ENSEmble <MODLID> EULEr <A
B G> DEGEnerate [X|Y|Z] FRACtional
<U V> FIXR FIXT
- Use this keyword if the orientation in 3 dimensions and and the position
of the MODLID in only 2 dimensions is known. This keyword is repeated for
each case. A B G are the Euler angles in degrees. The keywords [X|Y|Z] specify
the degenerate direction and U V are the translation in the other two directions.
- SOLUtion
6DIM ENSEmble <MODLID> EULEr <A
B G> [ORTH|FRAC]
<X Y Z> FIXR FIXT
- This keyword is repeated for each known position and orientation of a
ENSEmble ID. A B G are the Euler angles and X Y Z are the translation.
- SOLUtion
TRIAl ENSEmble <MODLID> EULEr
<A B G> {DEGEnerate [X|Y|Z] FRACtional
<U V> } {SCORe <score>}
- Rotation List for translation function
-
- *SOLPARAMETERS <FSOL> <BSOL>
- Optionally change solvent parameters for Sigma(A) curves from the default
values. The results are not terribly sensitive to these parameters, which
affect only lower resolution data. FSOL and BSOL can be given in either
order, the lower number being taken as FSOL.
Constraint: 0<FSOL<1, BSOL>0
Default: SOLPARAMETERS FSOL 0.95 BSOL 300
-
- *SORT [ON|OFF]
- Sort the reflections into resolution order upon reading MTZ file, for performance gain in molecular replacement
Default: SORT ON
-
- SPACegroup
HALL <SG>
- Space group may be altered from the one on the MTZ file to a space group
in the same point group. The space group name or number can be given e.g.
P 21 21 21 or 19. If the optional keyword HALL
is present, then the space group is interpreted as a Hall symbol.
Default: Read from MTZ file
-
- *TARGet [LERF1|LERF2|CROWTHER]
- Target function for fast rotation searches (2)
Default: TARGet LERF1
- *TARGet [LETF1|LETF2|LETFL|LETFQ|CORRelation]
- Target function for fast translation searches (3)
Default: TARGet LETF1
-
- TITLe <TITLE>
- Title for job
Default: TITLE [no title given]
-
- TOPFiles <NUM>
- Number of top pdbfiles or mtzfiles to write to output.
Default: TOPFILES 1
-
- TRANslate
FULL
- Search volume for brute force translation function. Cheshire cell or
Primitive cell volume.
- TRANslate
LINE [ORTH|FRAC]
STARt <XS YS ZS> END
<XE YE ZE>
- Search volume for brute force translation function. Search along line.
- TRANslate
REGIon [ORTH|FRAC]
STARt <XS YS ZS> END
<XE YE ZE>
- Search volume for brute force translation function. Search region.
- TRANslate
AROUnd [ORTH|FRAC]
POINt <X Y Z> RANGe
<RANGE>
- Search volume for brute force translation function. Search within +/-
RANGE Ångstroms (not fractional coordinates, even if the search point
is given as fractional coordinates) of a point <X Y Z>.
- TRANslate
DEGEnerate [X|Y|Z]
- Search volume for brute force translation function. The search volume
is the plane perpendicular to the direction of the search.
Default: TRANSLATE FULL
-
- *VARSAD [
{K <N>}
{B <N>}
{SIGMA <N>}
{SA <B1 B2
>}
{SB <B1 B2
>}
{SP <B1 B2
>}
{SD <B1 B2
>}
- SAD variance parameters SA and SB (the real an imaginary components of Sigma Minus)
SP (Sigma Plus) and SD (Sigma Delta) by resolution bin and the overall scale (K) B-factor (B) and sigma-scale
(SIGMA) parameters.
-
- VERBose [ON|OFF] EXTRA
- Toggle to send verbose output to log file. If ON or OFF are not specified,
verbose is switched ON. If keyword EXTRA is present, then extra verbose information is logged.
Default: VERBOSE OFF
-
- XYZOut [ON|OFF]
- Toggle for output coordinate files.
Default: XYZOUT OFF (MODE MR_FRF, MODE MR_BRF)
Default: XYZOUT ON (other relevant modes)
6. XML Output (for developers)
7. Python Scripting (for developers)
8. Version History
- Phaser2.1
- Default packing criteria relaxed to allow a small number of clashes
- Default packing distance increased to identify intercalation of helices
- Composition can be specified via solvent content
- Default composition corresponds to 50% solvent
- If automated MR search finds some (but not all) components, then the partial solution is output
- Improved SAD refinement
- SAD refinement starting from partial model from MR
- Output of PDB hybrid-36 atom numbers and col(21-22) chainids for largest structures
- Phaser2.0
- First release of experimental phasing (SAD only) through phenix
- Phaser1.3.1
- Minor bug fixes after Phenix-1.1a release
- R-factor reported by MR_RNP and MR_LLG modes (verbose output)
- Phaser1.3
- Released as part of Phenix-1.1a
- New Normal Mode Analysis of structures
- New Cell Content Analysis mode
- Improved Automated MR
- Map coefficients appended to input mtz file
- Score stored in .sol file is Z-score rather than LLG
- Final phasing and refinement to full resolution on mtz file
- Packing of RNA/DNA
- Packing to form close packed oligomeric complexes
- Waters in pdb files excluded from packing
- Only the most homologous model in an ensemble is used for packing analysis
- Reduced memory requirements
- Composition/MW determined from sequence
- Automated MR now available as a python script
- Bug fixes for Phaser1.2
- Phaser1.2
- First "official" release of Phaser software
- Automated solution of structures
- New Likelihood Enhanced fast Translation Function (LETF)
- Ability to use electron density maps as molecular replacement models
- Rigid body solution refinement against maximum likelihood target
- Pruning of duplicate solutions from solution list
- Searching of multiple alternative space groups
- Searching of multiple alternative models
- Better minimizers
- Phaser1.1
- Bug in anisotropy refinement corrected
- Phaser1.0
9. References
Citation:
Other references:
- Read, R.J. (2001). Pushing
the boundaries of molecular replacement with maximum likelihood. Acta
Cryst. D57, 1373-1382
- Storoni, L.C., McCoy, A.J. & Read, R.J. (2004).
Likelihood-enhanced
fast rotation functions. Acta Cryst D60, 432-438
- McCoy, A.J., Grosse-Kunstleve, R.W., Storoni,
L.C. & Read, R.J. (2005). Likelihood-enhanced
fast translation functions. Acta Cryst D61, 458-464
- McCoy, A.J., Storoni, L.C. and Read, R.J. (2004) Simple
algorithm for a maximum-likelihood SAD function Acta Cryst. D60, 1220-1228.