Superposition of target (S-HPCDH) and template (R-HPCDH)
In general, proteins with similar amino acid sequences (primary structure) have similar protein structures. This forms the basis of comparative protein modeling. We discovered in Part 5 that the two HPCDH sequences share 41% sequence identity. This degree of similarity is such that an accurate sequence alignment between the two is likely to have only minor errors if any. Yet, despite an accurate sequence alignment, the structures will likely have diverged enough that small shifts in the main chain trace and side chain conformations would be expected (should the structure at some point be solved). Nevertheless, we may be able to construct a model of S-HPCDH based on the structure of R-HPCDH that is accurate enough to facilitate insight into the basis of stereospecificity. The technology of comparative modeling (sometimes called homology modeling or template-based modeling) is really just a sophisticated copying procedure.
MODELLER is a computer program that models three-dimensional structures of proteins and their assemblies by satisfaction of spatial restraints.
MODELLER is most frequently used for homology or comparative protein structure modeling: The user provides an alignment of a sequence to be modeled with known related structures and MODELLER will automatically calculate a model with all non-hydrogen atoms.
More generally, the input to the program are restraints on the spatial structure of the amino acid sequence(s) and ligands to be modeled. The output is a 3D structure that satisfies these restraints as well as possible. Restraints can in principle be derived from a number of different sources. These include related protein structures (comparative modeling), NMR experiments (NMR refinement), rules of secondary structure packing (combinatorial modeling), cross-linking experiments, fluorescence spectroscopy, image reconstruction in electron microscopy, site-directed mutagenesis, intuition, residue-residue and atom-atom potentials of mean force, etc. The restraints can operate on distances, angles, dihedral angles, pairs of dihedral angles and some other spatial features defined by atoms or pseudo atoms. Presently, MODELLER automatically derives the restraints only from the known related structures and their alignment with the target sequence.A 3D model is obtained by optimization of a molecular probability density function (pdf). The molecular pdf for comparative modeling is optimized with the variable target function procedure in Cartesian space that employs methods of conjugate gradients and molecular dynamics with simulated annealing.
The following are some general rules on how these programs work:
In this step, the authors of the tutorial simply show you the input to the popular comparative protein modeling program, MODELLER, for constructing a model of S-HPCDH. The program requires an external sequence alignment file (construction of this is shown in Step 4 for interested students), a PDB file of the template and an input file to the program. The PDB file originates from the Protein Data Bank and has the code 2cfc [PDB].
You have several options for building the model:
In Part 6, we will visualize the S-HPCDH model generated through comparative modeling. You do not actually have to run MODELLER as we have provided the results for you in the next step.
# File: model-hcds.py
# Homology modelling by the automodel class
from modeller.automodel import * # Load the automodel class
log.minimal() # request minimal output
env = environ() # create a new MODELLER environment to build this model in
# directories for input atom files
env.io.atom_files_directory = './:../atom_files'
a = automodel(env,
alnfile = 'HCDS_0.ali', # alignment filename
knowns = 'HPCDH', # codes of the templates
sequence = 'HCDS', # code of the target
assess_methods=(assess.DOPE))
a.starting_model= 1 # index of the first model
a.ending_model = 1 # index of the last model
# (determines how many models to calculate)
a.md_level = None
a.make() # do the actual homology modelling
$> mod9v4 model-hcds.py
This should generate your model HCDS.B99990001.pdb, as well as a MODELLER output log.
In the previous step, you created an alignment in MODELLER format. You will need to copy that alignment verbatim to correctly build the model.
If there were errors, check your alignment and resubmit. If everything looks OK, the server will build your model and display the results.