| Quick navigation: | Home | Site Map || References | Biography || Copyright | Other copyright | Contact us | Advert | | |
Re: [ccp4bb] Problems with phasing a protein (1300aa) |
||
- Protein crystallographyMain steps:- Protein purification- Crystallisation Special:- Programs for crystallography- X-ray detectors Basic tutorials:- Chemistry- Protein - Peptide - Amino Acids Xtal community:- CCP4BB |
CCP4bb navigationCCP4bb <-- 1999 <-- November 1999 <-- 30 November 1999Subject: Re: Problems with phasing a protein (1300aa) From: James Holton JMHolton {- at -} LBL {- dot -} GOV Date: 2009-03-21 Kumar wrote: > Hello CCP4bb members, > > I have been trying to obtain phases for a protein which contain > ~1300aa. We have obtained native data to a resolution of 3.3A (Space > group I222 or I212121). But we are having tough time phasing it. > > 'Se' labeled crystals diffracts maximally up to 3.5 to 4 A and dies > very quickly on most of the beamlines. Apart from Se, do you have any atoms heavier than sulphur in your crystal/solvent? This could make your crystals decay faster than normal. Other than that, I refer you to my table of how long you can expect a typical protein crystal to last at most of the world's beamlines: http://bl831.als.lbl.gov/damage_rates.pdf I invite corrections from any beamline scientist who thinks this table is in error! > We have scanned at Se wavelength and it gives very strong signal as it > contain ~45 Se in AU (1300 aa). It is difficult to collect a complete > dataset (maximally we get 50-60 % completion with Rmerge ~15) out of > one crystal on regular beamline. At microfocus beamline (APS), we were > able to collect data in 3-4 batches and merge them to get a complete > dataset (Rmerge ~18-20) out of one crystal. We used data collected on > microfocus beamline (at peak wavelength) for locating heavy atom > position using SHELXD, Solve and Phenix.hyss. SOlve and Phenix.hyss > find very few heavy atom sites 1-5 whereas SHELX-CDE lists many but > shows no difference in original and inverted (contrast and > connectivity). Our phasing attempts with datasets obtained after > merging two incomplete dataset from two different crystal has also > been disappointing. It is unwise to use burnt-out data for anomalous difference phasing. Back off on the total exposure to less than 2-5 MGy in total and average data from more crystals. > > My another worry is absolute value of average intensity, which seems > to be quite low in most of the datasets. Below I have pasted last > table of scale.log (HKL2000). > Shell Lower Upper Average Average Norm. Linear Square > limit Angstrom I error stat. Chi**2 R-fac R-fac > 50.00 7.53 45.4 1.6 1.3 1.295 0.055 0.047 > 7.53 5.98 11.4 1.3 1.3 0.672 0.135 0.114 > 5.98 5.23 11.2 1.6 1.6 0.643 0.171 0.152 > 5.23 4.75 16.8 2.0 1.9 0.736 0.148 0.118 > 4.75 4.41 18.8 2.2 2.2 0.739 0.143 0.132 > 4.41 4.15 14.6 2.4 2.4 0.653 0.190 0.175 > 4.15 3.94 11.3 2.5 2.5 0.582 0.247 0.226 > 3.94 3.77 10.1 2.8 2.8 0.511 0.280 0.191 > 3.77 3.63 8.0 3.1 3.1 0.450 0.315 0.285 > 3.63 3.50 7.6 3.3 3.2 0.483 0.311 0.270 > All reflections 15.5 2.3 2.2 0.694 0.153 0.106 > The absolute value of intensity is not important unless you are comparing it to a control experiment done in exactly the same way. Intensity relative to the error in the intensity, however, is very important. > Now, I want you to help me by answering some of my queries: > > 1. Is it possible to get MAD/SAD phasing done from a dataset having > more than 15% Rmerge and resolution in the range of 4 - 4.5 Ang? Yes, but only if your anomalous signal is greater than the noise. This does not appear to be so in your case. In fact, you seem to have a very good example of a marginal case that is below the "threshold of solvability". I like to think of things in terms of signal-to-noise, and one can use a rearrangement of the Crick-Magdoff equation to tell you what the I/sigma of your data set needs to be for delta-F to be greater than sigma(delta-F): I/sigma(I) > 1.3*sqrt(Daltons/sites)/f" where: I/sigma(I) is the signal-to-noise ratio of the data set required to solve it by MAD/SAD Daltons is the molecular weight of the protein in amu sites is the number of Se sites f" is the f" of those sites (in "electrons") In your case: I/sigma(I) > 1.3*sqrt(1300*120/45)/4 = 19 is required. You have this in your lowest-angle bin, but nowhere else. It might be possible to find some sites, but you are not going to get phases beyond 7A, and phase extension from this low a level is hard to do, even when the I/sigma is high. Remember, "thresholds" like this are not sharp but represent a level of data quality where the best crystallographers in the world working very carefully and with a significant amount of luck have managed to solve a structure. If you are not one of these highly skilled and experienced people, you will probably need better data. A lot better data. And having better data is not a bad thing. For the case Tommi Kajander put forward, I/sigma(I) > 1.3*sqrt(365*8*120/80)/4 = 22 was required. Tommi? Was your I/sig better or worse than this? > > 2. Will a complete data set obtained from merging various > batches(30-40 frames each) from one or more than one crystal will have > proper anomalous signal for phasing? I am worried as weak anomalous > signal may get lost while merging. If you average enough data, you can theoretically get any I/sigma you want. In your case you will probably have to average data from not less than 8 crystals to bring your I/sigma up to 20 (6.7 * sqrt(8) = 20). > > 3. Will such a low value of average Intensities (as shown above from > HKL scale log file) will be good enough for MAD/SAD phasing Probably not. > or I really need to improve crystal quality for stronger diffraction. It is always better to have better crystals. > > 4. For MAD/SAD phasing, till what resolution we need to have anomalous > signal ? Many of my datasets shows anomalous signal maximally up to > 6-8 A (calculated using Phenix.xtriage). You need to have anomalous signal out to the resolution to which you want to have phases. In general, if you have 2.5A phases, you can extend them easily to 2.0 A with solvent flattening, but extending 6 A phases to 4 A is more problematic, I think this could be because things like histogram matching don't work as well with 6A protein maps. > > 5. Since I have low resolution (3.5 to 4 A)data, relatively high > Rmerge (14-15%), lower value of average intensity, anomalous signal up > to 6 A or so..... which programs will be more useful for heavy atom > location and to prevent false positives from being selected? There are no programs that will make bad data good. It may be possible to find these 45 sites with the programs you are already using, but it will be a challenge to get phases even if you do. Good luck, -James Holton MAD Scientist CCP4bb navigationCCP4bb <-- 1999 <-- November 1999 <-- 30 November 1999 |
|
| ProteinCrystallography.org: Copyright 2006-2010 by Quid United Ltd |