Locally Developed Software and Resources
Good computer programs have always been vitally important to the work of the LMB.
Many of the groups here write, or contribute to scientific software, and wherever possible this work is released to the open source software community.
MRC Image Processing Software
Members of Greg Jefferis' group have contributed the following:
CMTK, a software toolkit for computational morphometry of biomedical images, CMTK comprises a selection of command line tools and a general-purpose library for processing and I/O. Contributions to this package made by the group, include the development of a simple front end for CMTK.
NeuronAnatomy Toolbox, a suite of packages written in R for analysis of neuronal branching patterns, 3D density data describing neuronal projections and deformation based morphometry (shape analysis of groups of co-registered brains). Includes I/O functions to allow import (and in some cases export) for:
- SWC, Neurolucida and Amiramesh neuronal tracing formats
- Biorad PIC, Nrrd, Amiramesh 2D/3D image data
The main programs in producing a list of reflection intensities from diffraction images are:-
- MOSFLM (Andrew Leslie, Harry Powell): indexes the diffraction pattern and produces integrated reflection intensities. An interactive graphical interface iMosflm makes this easy for the user.
- POINTLESS (Phil Evans): determines the point group & space group symmetry from the list of intensities from Mosflm.
- AIMLESS (Phil Evans): scales symmetry-related intensities together to correct for differences in their measurement methods, averages them and produces a large variety of statistics indicating the data quality.
Semi-Automated Routines for Functional Image Analysis (SARFIA)
Key features are image-based detection of structures of interest using the Laplace operator, determining the positions of units in a layered network, clustering algorithms to classify units with similar functional responses, and a database to store, exchange and analyse results across experiments. GUI access to a wide range of analysis functions for image stacks.
The custom image processing functions include thresholding based on the Laplace operator, filtering of 3D waves using principal component analysis (PCA), rotating functions, images/image stacks without interpolation, line scan analysis; Automated baseline detection, hierarchical clustering and bleach subtraction from fluorescence traces.
The package includes a manual describing the control panels and a help file that describes all functions in detail.
An open-source software package for analysis of the orientation distribution of cryo-electron microscopy data, developed by Chris Russo and Katerina Naydenova.
The orientation distribution of a single-particle electron cryo-microscopy specimen can limit the resolution of the reconstructed density map if the particles are not randomly oriented on the support surface. This method describes the quality of an orientation distribution in terms of providing uniform resolution in all directions, by a single number – the efficiency. The cryoEF program will assist you in determining to what extent this affects the resolution of your 3D reconstruction.
More information and the software package are available here.
Computational Structural Biology Group Software
Garib Murshudov’s group’s research is centred on the development of efficient mathematical, statistical, computational algorithms for macromolecular X-ray crystallography (MX) and electron cryo-microscopy (cryo-EM) structure analysis. Software developed in the group includes the following:
REFMAC5 (Garib Murshudov) – REFinement of MACromolecular Structures. Use the Maximum Likelihood method and some elements of Bayesian Statistics to perform full model refinement and map calculation. Originally designed for use with data from MX, REFMAC5 has been adapted and extended to support data from other sources including cryo-EM.
Coot (Paul Emsley) – Crystallographic Object-Oriented Toolkit. For macromolecular model building, model completion and validation, particularly suitable for protein modelling using MX and cryo-EM data. Coot displays maps and models and allows model manipulations such as idealization, real space refinement, manual rotation/translation, rigid-body fitting, ligand search, solvation, mutations, rotamers, Ramachandran plots, skeletonization, non-crystallographic symmetry and more.
AceDRG (Fei Long) – stereo-chemical description generator for monomers/ligands. Encapsulates information about local chemical and topological environments derived from a small molecule database (the Crystallography Open Database), and uses this information to derive ideal bond lengths, angles, etc. for an unknown monomer/ligand. AceDRG can also generate link information to encapsulate a covalent bond between two monomers.
ProSMART (Rob Nicholls) – PROcrustes Structural Matching Alignment and Restraints Tool. Designed for the conformation-independent comparison of macromolecular structural models, and the generation of external restraints for use during macromolecular refinement at low resolution. Restraints to reference structures, hydrogen bonding patterns, and generic self-restraints may be generated. Supports models corresponding to protein and nucleic acids, from MX and cryo-EM.
LibG (Fei Long) – Generates restraints to stabilise refinement of DNA/RNA models during refinement. Restraints are generated for base-pairs, stacking planes, sugar puckers, and other torsion angles.
LORESTR (Oleg Kovalevskiy) – LOw RESolution STRucture refinement pipeline. Executes multiple model refinement instances using different parameters in order to find the best protocol. Generates restraints using ProSMART and LibG, and refines models using REFMAC5. The pipeline performs auto-detection of twinning and selects the optimal scaling method and solvent parameters. Can either use user-supplied homologous structures, or run an automated BLAST search and download homologues from the PDB
ProSHADE (Michal Tykac) – PROtein SHApe DEscription and symmetry detection. Library and associated tool providing functionalities for computing shape-wise structural distances between pairs of molecules, detecting symmetry over the centre of mass of a single structure, map re-sizing, as well as matching density maps and coordinate models.
BALBES (Fei Long) – automatic molecular replacement pipeline. A system for solving protein structures using x-ray crystallographic data, which aims to integrate all components necessary for finding a solution structure by molecular replacement. It comprises a database, scientific programs and a python pipeline. The system is automated so that it needs no user intervention when running a complicated combination of jobs such as model searching, molecular replacement and refinement.
More information about this software can be found on the Computational Structural Biology Group website.
Computational Genomics Group
Julian Gough’s group has produced a range of software and resources that span the scales of biology from DNA, proteins, interactions, networks, and cells, up to whole organism phenotypes. These include the following:
SCOP – Structural Classification of Proteins. Nearly all proteins have structural similarities with other proteins and, in some of these cases, share a common evolutionary origin. The SCOP database, created by manual inspection and abetted by a battery of automated methods, aims to provide a detailed and comprehensive description of the structural and evolutionary relationships between all proteins whose structure is known.
SUPERFAMILY – A database of structural and functional annotation for all proteins and genomes.
Mogrify – A directory of defined factors for direct cell reprogramming. The Mogrify resource predicts how to trans-differentiate from any human cell-type to any other human cell-type by perturbing the transcriptional network. Mogrify includes all cell-types from the FANTOM5 project and uses a network-based algorithm designed to find transcription factors that impart the most influence on changes in cellular state.
The full list of software and resources managed by the Computational Genomics Group can be found on their group page.
Protein Contacts Atlas
Protein Contacts Atlas is a tool that helps biologists analyse and visualise protein structures at atomic resolution using residue-residue contact networks, developed by Melis Kayikci, AJ Venkatakrishnan, and M Madan Babu.
Visualisations of biomolecular structures allow researchers to gain insights into biological functions and generate testable hypotheses. Typical visualisations primarily depict covalent bonds only and not other, non-covalent, contacts between atoms. However the structure also depends on these non-covalent contacts, thereby they effect physiological function, pathogenesis, and drug action. The Protein Contacts Atlas is an interactive resource of non-covalent contacts from over 100,000 PDB crystal structures that provides a range of representations and properties by which individual residues can be studied.