Classification example

From Relion
Jump to navigation Jump to search

10,000 70S ribosomes wi/wo EFG

The test data set that will be used below was provided by Joachim Frank and is stored at the EBI-EMDB.

Download data and reference

Download the test data set from here, as well as the corresponding metadata that is stored in this PDF file. Save both files in your working directory.

EMDB entry 1056 will be used as initial reference. Save this file in the same working directory. Note this reference has the same pixel size (2.8A) and the same box size (130x130) as the data set above. Therefore, no re-scaling or windowing operations are necessary. Although this map is at a resolution of ~9 A, note that it will be low-pass filtered to 80 A in the RELION run below.

Unpack the data as follows:

tar -xf J-Frank_70s_real_data.tar
gunzip emd_1056.map.gz
mv emd_1056.map emd_1056.mrc

Prepare the input STAR file

From the PDF provided by Joachim Frank select the following (14) lines and save them in a text file called defocus.dat

1 3 1347.0 1347.0 21580.
2 3 505.00 1852.0 24833.
3 3 989.00 2841.0 26450.
4 3 857.00 3698.0 28320.
5 3 475.00 4173.0 30993.
6 3 349.00 4522.0 33150.
7 3 478.00 5000.0 34588.
8 3 1242.0 6242.0 21580.
9 3 713.00 6955.0 24833.
10 3 1255.0 8210.0 26450.
11 3 1022.0 9232.0 28320.
12 3 304.00 9536.0 30993.
13 3 232.00 9768.0 33150.
14 3 232.00 10000. 34588.

Then save the following lines as a file called make_star.csh

#!/usr/bin/env csh
ls -l win/*dat | awk '{print $NF}' >imagelist
#
relion_star_loopheader rlnImageName rlnMicrographName rlnDefocusU rlnVoltage rlnSphericalAberration rlnAmplitudeContrast > all_images.star
#
set ngr = 14
set gr = 0
while ($gr < $ngr)
 @ gr++
 set nn=`head -n $gr defocus.dat  | tail -1 | awk '{print int($3)}'`
 set tot=`head -n $gr defocus.dat | tail -1 | awk '{print int($4)}'`
 set def=`head -n $gr defocus.dat | tail -1 | awk '{print $5}'`
 head -n ${tot} imagelist | tail -n ${nn} |awk -v"def=$def" -v"gr=$gr" '{print $1, gr, def, 200, 2, 0.1}' >> all_images.star
end

And execute it to generate the input STAR file with all image names and CTF information, using the command:

csh make_star.csh

Run RELION

Save the following lines in a file called example_gui3d.settings:

is_continue == false
Output rootname: == run1/K4
Continue from here:  == 
Input images: == /lmb/home/scheres/work/relion/ribo_test_case/all_images.star
Reference map: == /lmb/home/scheres/work/relion/ribo_test_case/emd_1056.mrc
Ref. map is on absolute greyscale? == No
Initial low-pass filter (A): == 80
Particle diameter (A): == 340
Pixel size (A): == 2.8
Number of iterations: == 25
Regularisation parameter: == 4
Do solvent flattening? == Yes
Solvent mask: == 
Do normalisation correction? == Yes
Do intensity correction? == Yes
Do CTF-correction? == Yes
Only flip phases? == No
Have data been phase-flipped? == No
Ignore CTFs until first peak? == No
Has reference been CTF-corrected? == No
Symmetry group: == C
Symmetry number: == 1
Angular sampling interval (deg): == 7.5 degrees
Limit the tilt angle? == No
Limited tilt angle: == -91
Perform local angular searches? == No
Local angular search range: == 5
Number of classes: == 4
Oversampling order: == 1
Offset search range (pix): == 10
Offset search step (pix): == 1
Number of MPI procs: == 8
Number of threads: == 8
Submit to queue? == Yes
Queue name:  == openmpi_8
Queue submit command: == qsub
Standard submission script: == /lmb/home/scheres/app/relion/gui/qsub.csh
Additional arguments: ==  --random_seed 1

Start the RELION GUI by typing relion from the command line (inside the working directory). Adapt the settings for your particular cluster setup in the Running tab, and submit the job by clicking on the orange "Run!" button.

That's it! Just wait until your job is finished. Using 8 MPI nodes, each with 8 threads, this took XXX wall-clock hours on the LMB cluster.

Anticipated results