balbes logo BALBES - Automatic Molecular Replacement Pipeline

    Please cite the following paper in any publication arising from use of BALBES

    F.Long, A.Vagin, P.Young and G.N.Murshudov "BALBES: a Molecular Replacement Pipeline " Acta Cryst. D64 125-132(2008)

BALBES Manual

This manual is for BALBES, version 1.1.5, on Linux, Mac OS X. Contact Fei Long for more information. Please also email any comments, questions and suggestions. Thank you.

Contents:



Introduction

BALBES is an automated software suite for structure solutions using protein X-ray crystallographic data. Molecular Replacement(MR) is core scientific method. It is called a MR pipeline because it integrates all components, necessary for finding a solution structure by MR, into one system. The system is automated so that a user needs only to provide a sequence file and a X-ray data file. After that BALBES will perform jobs such as data checking, homologous searching using its internal database to generate a variety of template models, and selecting different algorithms to do MR and refinement on those models and their combinations.

BALBES contains three major parts:


Go Back to Contents

Download and Install BALBES

Download BALBES

Currently, BALBES releases two pre-compiled versions that work respectively on:
  1. Linux
  2. Mac OS X
The latest versions can be download at MRC-LMB site or YSBL site. It is also included in a CCP4 suite. But the version in CCP4 suite is sometime out-dated.

Install BALBES

Perequisites for BALBES: You need to install CCP4 suite before installing BALBES

Install BALBES under CCP4 Package (replace a old version with a new one)

  1. After you download one of gziped tar files, e.g. BALBES_1.1.1_DB_Oct_1_2010_Linux.tar.gz, move it under CCP4 package by, e.g.
     mv BALBES_1.1.5_DB_Nov_1_2011_Linux.tar.gz $CCP4/share/ 
  2.  cd $CCP4/share/ 
  3. Unpack the BALBES package e.g.
    tar zxvf BALBES_1.1.5_DB_Nov_1_2011_Linux.tar.gz 
    (or another *.tar.gz file you downloaded)
  4. Close the old xterms and open new ones. Re-start ccp4i and select BALBES. The version of BALBES you are using should be the new one.

Install standalone BALBES

If you want to install a version of BALBES under a directory different from where your ccp4 suite is installed
You need to do the following steps.

  1. After you download one of gziped tar files, e.g. BALBES_1.1.1_DB_Oct_1_2010_Linux.tar.gz, move it to the directory you want to install BALBES, e.g.
     mv BALBES_1.1.5_DB_Nov_1_2011_Linux.tar.gz your_dir 
  2.  cd your_dir 
  3. Unpack the BALBES package you just moved e.g.
    tar zxvf BALBES_1.1.1_DB_Oct_1_2010_Linux.tar.gz
  4. cd BALBES
  5. You will see files README.SETUP and setup.py.
    ./setup.py or python ./setup.py
The installation script will do everything for you

 


Go Back to Contents

Update BALBES Database

We update and release the internal database of BALBES for homologous searching every month to include entries related to new structures released by Protein Data Bank(PDB). Presume you have installed BALBES, and would like to use the newly released database of BALBES, you need to do the following steps.

  1. Download a version of BALBES database, e.g. BALBES__DB.tar.gz.
  2. Move the database you have downloaded to the directory where your environment variable $BALBES_ROOT defines ($BALBES_ROOT exists if you have installed CCP4 suite).
     mv BALBES_DB.tar.gz $BALBES_ROOT 
  3. Unpack the BALBES database you have just moved e.g.
    tar zxvf BALBES_DB.tar.gz 
After finishing the above steps, BALBES will use the new internal database.

 


Go Back to Contents

Run BALBES

Because BALBES is designed to minimize user's intervention, run BALBES is very simple. Three ways are provided to run BALBES, i.e.

  1. Run BALBES via BALBES webserver
  2. Run BALBES via CCP4i interface
  3. Run BBALBES from command-lines

In this section, we will also describe

Requirements for input files

The requirements depend on what you want to do with BALBES. If you would like to

  1. solve structures using Molecular Replacement, you need to provide two files, i.e.
    1. a structural factor file of MTZ or CIF format, such as mySF.mtz or mySF.cif
    2. a sequence file in form of FASTA format, e.g. mySeq.seq
  2. just search for homologous models, you need to provide one file, i.e.
    1. a sequence file in form of FASTA format, e.g. mySeq.seq

Go Back to "Run BALBES"

Run BALBES via BALBES webserver

Run BALBES for structural solutions via BALBES web server is shown is the following picture:

balbes-webserver-pict
  1. Upload a structural factor file and a sequence file
  2. Click submit button
Go Back to "Run BALBES"

Run BALBES via CCP4i

Steps in run BALBES for structural solutions via CCP4i interface are shown in the following pictures.

balbes-ccp4i-picture1
  1. Select "Molecular Replacement" Module in CCP4i main manu
  2. Select "Run BALBES" task in Molecular Replacement Module
balbes-ccp4i-picture2
  1. Upload a structural factor file and a sequence file to the input file rows
  2. Give names to the output PDB and MTZ files, otherwise default names will be applied
  3. Click "Run" button
  4. Other steps (such as view the results) are standard procedures in ccp4i. Please consult with ccp4i's help manual if you are not familiar with them.
Go Back to "Run BALBES"

Run BALBES from command lines

Typical command line for structural solution is :

balbes -o a_directory_name -f a_structure_factor_file_name -s a_sequene_file_name

Explain:

  1. program name: balbes
  2. options (include symbol - ):
    1. -o (optional) ------- following by the name of the directory into which you want to put all of the output generated by balbes.
        e.g. -o MyjobOne
        or -o ../output_abcd

        If the directory does not exist, the program will create the directory( MyjobOne, or ../output_abcd) for you.

      If -o is omitted, the program will create a output directory called output_username (e.g. output_fei)

  3. -f (required) ------- followed by the name of a structural factor file. The file could be a .mtz file or a .cif file.
      If the file is in the current directory, you can just type name, e.g,
        -f 1ab2.mtz
        or -f 3cd4.cif
      Otherwise you need to input the file name with the absolute path. e.g,
        -f /home/dir1/dir2/1ab2.mtz
        or -f ../3cd4.mtz
  4. -s (optional) -------- followed the name of a sequence file. You can find the format of a sequence files containing one sequence or multiple sequences

    Depending on where the file is,it can be,

      -s aaaaa.seq
      or -s ../../aaaa.seq
      or -s /Users/abcd/data/aaaa.seq

Go Back to "Run BALBES"

All BALBES command line arguments:

-o
Root directory for BALBES job output. It contains following subdirectories:
    results -- stores the final solutions in terms of pdb and mtz files, and a file recording over-all process
    process_details -- stores all *.log, *.ps files, generated by individual programs within BALBES during executing balbes.
    template_models--contains all homologous models found
-f
Input x-ray data file of MTZ or CIF format
-s
Input sequence file of FASTA format
-m

Input model file of PDB format. BALBES will no long search for homologous models, just use the provided one

It can be,

    -m aaaaa.pdb
    -m ../../aaaa.pdb
    -s /Users/abcd/data/aaaa.pdb

-l

user's own database. BALBES will search homologous models using that database instead of its internal database. The user's own database is actually a subdirectory contains a group of user's model files of PDB format.

For example, the user's database is called myLib, then

 a_user$ ls myLib/
 a_user$ 1.pdb 2.pdb 12.pdb 26.pdb aaaa.pdb xxxx.pdb yyyy.pdb 
    
 a_user$ balbes -o myJobOut -f myMTZ.mtz  -s mySeq.seq -l myLib/ 
    

 

Go Back to "Run BALBES"
Go Back to Contents

Basic Tutorials