[RASMB] Sedimentation Velocity Experiments that are hard to fit

David Hayes hayes at bbri.org
Fri Mar 24 12:40:59 PST 2006


Hi to all experienced Ultracentrifuge Scientists,

I have some data (repeated experiments) from sample A buffer 1 (which may 
or may not have been produced on the planet earth -- that is all I can say 
about it)

There are two puzzling aspects to this data.
First, I am not getting good repeatability in the amount of aggregate.
Second, by eye, the raw data looks like a single species (which it should 
be) with a small amount of higher weight aggregate (not surprising), but 
the entire data set never fits well no matter what I try.
Thinking that the repeatability problem might be an artifact of my 
inexperience using Sedfit I tried many different strategies:  including 
floating or not floating just about every parameter and also using the 
nifty "experimental initial distribution" option of Sedfit.
   Fitting to the whole data set, I got a fit with a single large peak and 
two very small aggregate bumps which all together integrate to about 4% of 
the total.  The rmsd was a bit large at 0.01 and the residuals were 
systematic.  Early scans were fit very poorly and later scans fit somewhat 
poorly with scans in the middle fit pretty well.  Later using Sedfit with 
the single species model and only the later half of the scans and floating 
a lot of things and turning off the hat function in Sedfit brought me down 
to a comparable fit with a rmsd of .006, but the residuals were still 
systematic and the molecular weight was off more than what I expected for 
having 4% aggregate.
Then I did something that Peter Shuck thinks is a bit strange, but I think 
it helped me understand the data a bit more.  I fit pairs of scans with 
c(s) taking scans at different times paired with the last 
scan.  Essentially then, I was fitting each scan individually:  using the 
last scan in ever pair was needed to help Sedfit know the baseline.  Peter 
reminded me that by fitting each scan, I lose the ability to distinguish 
heterogeneity from diffusion.  But in this case, all the evidence points to 
a single species, so I did not think heterogeneity (beyond the measured 4% 
aggregation) was significant.  The peak S value stays constant all the way 
down the cell.  Each individual scan fit this way fit very well without 
systematic residuals and a rmsd of about .005.  What changed was that early 
scans fit to an f/f0 of 1.06 while by the later scans f/f0 fit to 
1.5.  This means that the early scans are broader than they should be for a 
reasonable f/f0 and that the later scans seem to be narrower and diffusing 
less.
I repeated this with Sedanal dcdt based curve fitting (here I took scans by 
twos going down the cell and fit to a single species ignoring the 4% 
aggregate) and found a similar trend:  the fitted molecular weight went 
from 95,000 to 225,000 and then settled down to about 160,000.  If there 
were heterogeneity, the scans would fit the opposite way to a one species 
model, the f/f0 would go down, or the weight would go up later in the run 
as the species sedimented apart.

I had some interference data of Tropomyosin that I subjected to a similar 
type of analysis with Sedanal, fitting the whole data set and then fitting 
scans by sets of 10 for the different times in the experiment (because of 
TI, RI noise fitting, it is not possible to fit small sets of scans with 
Sedfit using interference data; however, a Sedfit fit to the whole Tm data 
set showed no systematic variation of the residuals dependent on scan 
number).  There was no scan number dependent trend in molecular weight with 
this molecule, though the variability in fitted molecular weight was 
surprising (weights from different sets of 10 varied from 60,000 to 80,000 
randomly).

Seeing this I tried a Sedfit single species and Sedanal single species fits 
with non-ideality parameters, but floating these parameters made no 
substantial difference.

I don't know if anyone has run into this type of problem, but I am out of 
ideas what to try next.  I am being a bit stubborn wanting a good fit to 
the data:  the S value of the main peak is reproducible and the data 
contains quite a bit of information, but I can't figure out exactly what is 
going on:

Is this just non-ideality that I am not fitting properly or maybe that is 
not modeled well?
Is the culprit convection?  Thoughts on convection in the next message.

Cheers



Dr. David B Hayes
Boston Biomedical Research Institute
64 Grove St.
Watertown, MA  02472
617-658-7738

Boston Biomedical Research Institute... Today's Research for Tomorrow's 
Health.
Please visit us at www.bbri.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://list.rasmb.org/pipermail/rasmb-rasmb.org/attachments/20060324/4e3e87ca/attachment.htm>


More information about the RASMB mailing list