Abstract
Secondary Ion Mass Spectrometry (SIMS) is a widely used technique for in-situ U–Pb geochronology of accessory minerals. Existing algorithms for SIMS data reduction and error propagation make a number of simplifying assumptions that degrade the precision and accuracy of the resulting U–Pb dates. This paper uses an entirely new approach to SIMS data processing that introduces the following improvements over previous algorithms. First, it treats SIMS measurements as compositional data, using logratio statistics. This means that, unlike existing algorithms: (a) its isotopic ratio estimates are guaranteed to be strictly positive numbers; (b) identical results are obtained regardless of whether data are processed as normal ratios (e.g. 206Pb/238U) or reciprocal ratios (e.g. 238U/206Pb); and (c) its uncertainty estimates account for the positive skewness of measured isotopic ratio distributions. Second, the new algorithm accounts for the Poisson noise that characterises Secondary Electron Multiplier (SEM) detectors. By fitting the SEM signals using the method of maximum likelihood, it naturally handles low intensity ion beams, in which zero-count signals are common. Third, the new algorithm casts the data reduction process in a matrix format, and thereby captures all sources of systematic uncertainty. These include significant inter-spot error correlations that arise from the Pb/U–UO(2)/U calibration curve. The new algorithm has been implemented in a new software package called simplex. simplex was written in R and can be used either online, offline or from the command line. The program can handle SIMS data from both Cameca and SHRIMP instruments.
Secondary Ion Mass Spectrometry (SIMS) combines high sensitivity with high mass resolution (Williams, 1998). This allows the technique to obtain precise U–Pb dates on ng-sized samples whilst resolving isobaric interferences on 204Pb to a degree that is currently unachievable by other techniques such as Laser Ablation Inductively Coupled Plasma Mass Spectrometry (LAICPMS). There are some other differences between LAICPMS and SIMS as well. LAICPMS instrumentation is built by numerous manufacturers, whose data files are compatible with a rich ecosystem of data reduction codes such as Iolite, Glitter and LADR. This facilitates the intercomparison of different laboratories, different instrument designs and so forth. In contrast, the SIMS U–Pb world is dominated by just two manufacturers, whose SHRIMP (Sensitive High Resolution Ion Micro-Probe) and Cameca instruments use completely separate data reduction protocols. Most SHRIMP laboratories use Squid (Ludwig, 2000; Bodorkos et al., 2020), which is incompatible with Cameca data. In contrast, Cameca data tends to be processed by in-house software such as M. J. Whitehouse’s NordSIM spreadsheet, which are incompatible with SHRIMP data.
This paper introduces a unified algorithm for SIMS U–Pb data reduction that aims to address five problems with existing data reduction methods. The first three of these problems are:
Sections 2 – 4 will provide further details about these problems, using synthetic examples. Section 5 will show that problems 1, 2 and 3 can be solved by treating the U–Pb system as a compositional data space, using logratio statistics. However logratio statistics does not solve the remaining two problems:
These problems are discussed in more detail in Section 6. Section 7 addresses them by incorporating multinomial counting statistics into the compositional data framework. Section 12 applies the new data reduction paradigm to two datasets produced by Cameca and SHRIMP instruments, and Section 13 introduces a computer code called simplex that generated these results. Finally, further details about the implementation of the new algorithm are reported in an appendix to this paper.
The common Pb corrected 206Pb/238U-age (t) depends on the relative abundances of three nuclides, 204Pb, 206Pb and 238U:
| (1) |
where the subscript c marks the common lead composition, and λ238 is the decay constant of 238U. Equation 1 does not depend on the absolute amounts of 204Pb, 206Pb and 238U, but only on their ratios. Unfortunately, the statistical analysis of the ratios of strictly positive numbers is full of potential pitfalls, as will be illustrated with an example that was inspired by McLean et al. (2016). Consider a simple dataset of ten synthetic U–Pb measurements:
238U | 215.9 | 208.9 | 212.4 | 186.3 | 217.8 | 196.7 | 216.4 | 171.8 | 216.0 | 200.1 |
206Pb | 18.45 | 12.40 | 21.35 | 62.22 | 21.35 | 45.08 | 26.65 | 75.88 | 29.02 | 11.40 |
204Pb | 0.1570 | 0.2870 | 0.1627 | 0.01425 | 0.1092 | 0.08175 | 0.06900 | 0.02250 | 0.04975 | 0.3850 |
Let us calculate the 206Pb/238U-ratios for these data, which are needed to solve Equation 1. Comparing these ratios with their reciprocals yields two new sets of ten numbers:
|
The elementary rules of mathematics dictate that 1∕(y∕x) = x∕y for any two real numbers x and y. In other words, the reciprocal of the reciprocal ratio ought to equal that ratio. Indeed, for our example it is easy to see that 1/0.085 = 11.70 and so forth. However, when we take the arithmetic means of the (reciprocal) ratios:
|
then we find that
= = 6.01≠9.14 = a |
So the reciprocal of the mean reciprocal ratio does not equal the mean of that ratio! This is a counter-intuitive and clearly wrong result. Unfortunately, current algorithms for SIMS data reduction average ratios using the arithmetic mean, or perform (linear) regression through ratio data, which causes similar problems (Ogliore et al., 2011). Inaccurate 206Pb/238U-ratios inevitably result in inaccurate U–Pb dates, with the degree of inaccuracy scaling with the relative standard deviation of the measured data. Therefore, the numerical example shown in this section is deeply troubling for isotope geochemistry in general and SIMS U–Pb geochronology in particular.
Traditionally, the precision of isotopic data used in U–Pb geochronology has been calculated as symmetric confidence intervals. Unfortunately, this is fraught with similar problems as those discussed in Section 2. For example, take the arithmetic mean ( ) and standard deviation (sx) of the 206Pb/204Pb ratios in Table 1, and construct a studentised 95% confidence interval for :
x | sx | t92.5 | +t997.5 | +|
206Pb/204Pb | 978 | 1554 | -134 | 2090 |
where tdfα is the α-percentile of a t-distribution with df degrees of freedom (e.g., t92.5 = -2.262). Then the lower limit of the confidence interval is negative. This nonsensical result is yet another indication that there are some fundamental problems with the application of ‘conventional’ statistical operations to isotopic data. Negative isotope ratios can also arise from background corrections, but this will be further discussed in Section 6.
The statistical uncertainty of analytical data can be classified into two components (Renne et al., 1998):
Great care must be taken in choosing which sources of uncertainty should or should not be included in the error propagation. In some cases, inter-sample comparisons of SIMS U–Pb data may legitimately ignore systematic uncertainties. However, when comparing data acquired by different methods, both random and some systematic uncertainties must be accounted for. For example, when comparing U–Pb and 40Ar/39Ar data, the decay constant uncertainty must be propagated; and when comparing SIMS and TIMS U–Pb data, SIMS primary standard age uncertainty and the TIMS tracer uncertainty must be propagated. The conventional way to tackle inter-sample comparisons is called ‘hierarchical’ error propagation (Renne et al., 1998; Min et al., 2000; Horstwood et al., 2016). Under this paradigm, the random uncertainties are processed first, and the systematic uncertainties afterwards.
Hierarchical error propagation is straightforward in principle but not always in practice. Some processing steps are of a hybrid nature, including both systematic and random uncertainties. 206Pb/238U calibration for SIMS is a good example of this. 206Pb/238U-ratios are sensitive to elemental fractionation in SIMS analysis (see Section 10 for further details). These fractionation effects are captured by the following power law (Williams, 1998; Jeon and Whitehouse, 2015):
| (2) |
where the subscript m stands for the measured signal ratio, which is generally different from the atomic ratio; and the subscript (2) refers to the fact that the calibration normally uses uranium oxide for SHRIMP and uranium dioxide for Cameca instruments. The atomic 206Pb/238U-logratio of a sample is determined by (1) determining the intercept (A) and slope (B) of a reference material (r) of known Pb/U-ratio1, and (2) using this calibration curve to estimate the equivalent Pb/U-logratio of the reference material corresponding to the UO(2)/U-logratio of the sample (s). Then:
| (3) |
where the subscript a stands for the estimated (for the sample) or known (for the reference material) atomic logratios. The analytical uncertainty of lnas depends on the analytical uncertainties of both the intercept (A) and slope (B) of the reference material. But it does not necessarily do so to the same degree for all aliquots. Analytical spots that have similar UO(2)/U-ratios as the reference material will be less affected by the uncertainty of the reference material fit than spots that have very different UO(2)/U-ratios (Figure 1). Hence it is not possible to make a clean separation between random and systematic uncertainties.
Section 2 pointed out that the U–Pb age equation (Equation 1) does not depend on the absolute amounts of 204Pb, 206Pb and 238U, but only on their relative abundances. Thus we could normalise the 204Pb, 206Pb and 238U measurements of Table 1 to unity and plot them on a ternary diagram. The same is true for other geochronometers such as U–Th–He and 40Ar/39Ar (Vermeesch, 2010, 2015). In mathematics, the ternary sample space is known as a two-dimensional simplex. Data that live within this type of space are called compositional data.
Ternary systems are common in igneous petrology (e.g., the A–F–M diagram) and sedimentary petrography (e.g., the Q–F–L diagram). Geologists have long been aware of the problems associated with averages, confidence regions, and linear regression in these closed dataspaces (Chayes, 1949, 1960). But a general solution to this conundrum was not found until the 1980s, when the Scottish statistician John Aitchison published a landmark paper and book on the subject (Aitchison, 1982, 1986).
In this work, Aitchison proved that all the problems associated with the statistical analysis of compositional data can be solved by mapping those data from the simplex to a Euclidean space by means of an additive logratio (ALR) transformation. For example, given the ternary system {x,y,z}, we can define two new variables {u,v} so that:
| (4) |
In this space, Aitchison showed, one can safely calculate averages and confidence limits. Once the statistical analysis of the transformed data has been completed, the results can then be mapped back to the simplex by means of an inverse logratio transformation:
| (5) |
For example, the 204,6Pb–238U system of Table 1 can be mapped from the ternary diagram to a bivariate ln(204Pb/238U)–ln(206U/238U)-space:
u = ln | -2.46 | -2.82 | -2.30 | -1.10 | -2.32 | -1.47 | -2.09 | -0.82 | -2.01 | -2.87 |
v = ln | -7.23 | -6.59 | -7.17 | -9.48 | -7.60 | -7.79 | -8.05 | -8.94 | -8.38 | -6.25 |
Alternatively, we could also use 206Pb as the denominator isotope:
u = ln | 2.46 | 2.82 | 2.30 | 1.10 | 2.32 | 1.47 | 2.09 | 0.82 | 2.01 | 2.87 |
v = ln | -4.77 | -3.77 | -4.88 | -8.38 | -5.28 | -6.31 | -5.96 | -8.12 | -6.37 | -3.39 |
Calculating the average of the transformed data and mapping the results back to the simplex using the inverse logratio transformation yields the geometric mean of the ratios:
g = 7.58 = = |
which is an altogether more satisfying result than in Section 2. Moving on to the 95% confidence intervals of the 206Pb/204Pb-ratios, we first determine the conventional confidence limits for the logratios:
u | su | t92.5 | +t997.5 | +|
log(206Pb/204Pb) | 5.72 | 1.66 | 4.53 | 6.91 |
After the inverse-logratio transformation, these values produce an asymmetric 95% confidence interval for the geometric mean 206Pb/204Pb-ratio of 305-212+695. This interval contains only strictly positive values, solving the problem of Section 3. The ALR transformation of Equation 4 can easily be generalised to more than three components. For example, if 207Pb is added to the mix, then the four-component 204|6|7Pb–238U-system can be mapped to the three component ln(204|6|7Pb/238U)-space (Figure 2).
The compositional nature of isotopic data embeds a covariant structure into the very DNA of geochronology: in a K-component system, increasing the absolute amount of one of the components automatically lowers the relative amount of the remaining (K-1) components (Chayes, 1960). To deal with this phenomenon, it is customary in compositional data analysis to process data in matrix form, using the full covariance matrix. For example, four components yield three logratios, which require a 3 × 3 covariance matrix to describe the uncertainties and uncertainty correlations.
This approach is now widely used in sedimentary geology, geochemistry and ecology (e.g., Weltje, 2002; Vermeesch, 2006; Pawlowsky-Glahn and Buccianti, 2011), and has recently been adopted for geochronological applications as well (Vermeesch, 2010, 2015; McLean et al., 2016). The logratio covariance matrix approach is also uniquely suited to capture the systematic uncertainties (i.e. the inter-spot error correlations) that are produced by the SIMS U–Pb calibration procedure (Section 4). In that case, the covariance matrix must be expanded to accommodate multiple spots (e.g., Vermeesch, 2015). For example, the covariance structure of three spots in a four component system can be captured in a 9 × 9 matrix.
In conclusion, the logratio transformation solves the statistical woes described in Sections 2, 3 and 4 of this paper. However there are two additional problems that require further remediation.
Many mathematical operations are easier in logarithmic space than in linear space: multiplication becomes addition, division becomes subtraction, and exponentiation becomes multiplication. These mathematical operations are very common in mass spectrometer data processing chains (Vermeesch, 2015). However there are exceptions. For example, background correction does not involve the multiplication but subtraction of two signals. For low intensity ion beams such as 204Pb it is possible, by chance, that the background exceeds the signal. This results in negative values of which one cannot take the logarithm.
The subtraction problem can be solved by using a different logarithmic change of variables:
| (6) |
where x and y are the signals and b is the background. The infinite space of βxy covers all possible values of x, y and b for which x > b and y > b. Thus, background correction should be done in β-space, given an appropriate error model as described in Section 7.
One caveat to this background correction method is that it does not account for isobaric interferences, which may result in ‘overcounted’ signals. The high mass resolution of SIMS instruments removes most but not all isobaric interferences. For example, spurious HfSi, REE dioxide, or long-chain hydrocarbon ions can interfere with 204Pb, which is generally the rarest isotopic species detected. If unaccounted for, these interferences lead to the proportion of non-radiogenic Pb being overestimated (and the proportion of radiogenic Pb underestimated), resulting in excessive common Pb corrections, and underestimated dates. The accuracy of the background measurements can be monitored via the use of isotopically homogeneous reference materials (Black, 2005). A correction can then be applied by choosing the ‘session blank’ that brings the common-Pb corrected 207Pb/206Pb-ratios in alignment with the reference values.
Another issue arises when SIMS signals are registered by Secondary Electron Multiplier (SEM) detectors. These record ion beams as discrete counts, i.e. as integers, which are incompatible with the logratio transformation. For example, it is not uncommon for SEM detectors to register zero counts for low intensity ion beams such as 204Pb. These zero counts blow up the logratio transformation, because ln(0) = -∞.
This and other issues are diagnostic of a fundamental difference between compositional data and counting data that has been previously recognised and solved in fission track dating (Galbraith, 2005) and in sedimentary petrography (Vermeesch, 2018b). The same solutions can be applied to mass spectrometric count data in general, and to U–Pb geochronology in particular (Section 7).
Standard data reduction procedures for geochronology assume normally distributed residuals. In compositional data analysis, these are replaced by logistic normal distributions. However, neither the normal nor the logistic normal distribution are perfectly suited to dealing with discrete count data. The multinomial distribution is a simple alternative that seems more appropriate for the task at hand. Before we proceed, let us define the following variables:
| (7) |
| (8) |
and
| (9) |
Then the probability of observing n4 counts at mass 204, n6 counts at mass 206 and nb counts of background is given by
| (10) |
Whereas the observations n4, n6 and nb are integers, the parameters θ4, θ6 and θb are decimal numbers that are constrained to a constant sum. In other words, they belong to the simplex. Thus, we can map the three multinomial parameters to two logratio parameters, thereby establishing a natural link between counting data and compositional data. For example:
β64 | ≡ ln = ln and | (11) |
β6b | ≡ ln = ln | (12) |
β64 and β6b can be estimated from n4, n6 and nb by maximising the likelihood defined by Equation 10.
The normal and logistic normal distributions are controlled by two sets of parameters: location parameters and shape parameters. In the case of the normal distribution, the location parameter is the mean and the shape parameter is the standard deviation (or covariance matrix). In contrast, the multinomial distribution has only one set of (θ) parameters. The precision of multinomial counts is governed by the number of observed counts (σ[n] = ). More sophisticated models are possible when the observed dispersion of the data exceeds that which is expected from the multinomial counting statistics (e.g., Galbraith and Laslett, 1993; Vermeesch, 2018b). However in this paper we will assume that such overdispersion is absent from the reference materials, and from the single-spot analyses.
It takes a few tens of nanoseconds for a secondary electron multiplier to record the arrival of an ion. During this ‘dead-time’, the detector is unable to register the arrival of additional ions. This phenomenon can significantly bias isotope ratio estimates that include ion beams of contrasting intensity. Fortunately, the dead-time effect can be easily corrected. It suffices that the dwell times are adjusted by the cumulative amount of time that the detectors were incapacitated. Let d′x be the ‘effective dwell time’ of ion beam x:
| (13) |
where δx is the (non extended) dead-time of the detector that measures x. Then the expected normalised beam counts can be re-defined as:
| (14) |
and the normalised beam intensities as
| (15) |
The difference between this approach and existing SIMS data reduction approaches is that conventional data reduction algorithms apply the dead-time correction to the raw data, before calculating isotopic data; whereas the new method applies the dead-time correction to θ′x and ϕ′x, which are unknown parameters that must be estimated from the data.
Thus far we have assumed that all ions are measured synchronously, which is the case in multicollector configurations. However in single collector experiments, the measurements are made asynchronously. This can cause biased results if the signals drift over time. In SHRIMP data processing, it is customary to correct this drift by normalising to a secondary beam monitor (SBM) signal2, followed by double interpolation of numerator and denominator isotopes (Bodorkos et al., 2020; Dodson, 1978). A unified data reduction algorithm for SHRIMP and Cameca instruments requires a different approach, in which the time dependency of the signals is parameterised using a log-linear model. For Faraday detectors:
| (16) |
where nxi is the ion beam intensity of the ith integration for mass x of element X evaluated at time τxi, σ is the standard deviation of the normally distributed data scatter around the best fit line, and ‘bkg’ is the background signal. This is usually a nominal value for Cameca instruments and an actual set of measurements (nbi) for SHRIMP data. Note that the intercept parameters (αx) are ionic mass-specific, whereas the slopes (γX) are element-specific. See the Appendix for an example. For SEM detectors, the scatter of the data around the log-linear fit is controlled by Poisson shot noise:
| (17) |
The background-corrected signal ratio (evaluated at τxi) for two ion beams x and y of different elements X and Y can then be drift corrected as follows:
| (18) |
where ϕxi and ϕyi are the dead time corrected normalised beam intensities for the ith integration of masses x and y, respectively, and ϕbi is the corresponding background value. See the Appendix for further details.
Mass spectrometer signals are recorded in Volt (for Faraday detectors) or Hertz (or secondary electron multipliers). The age equation, however, requires atomic ratios. In general, signal ratios do not equal atomic ratios, because they are affected by two types of fractionation:
In the context of SIMS U–Pb geochronology, mass-dependent fractionation is commonly ignored (but not always, e.g., Stern et al., 2009), because the most important isochemical ratio is that between 206Pb and 207Pb, which lie within 0.5% mass units of each other. This is unresolvable given typical analytical uncertainties. The mass fractionation is greater for 204Pb, but so it its analytical uncertainty. Therefore, the atomic 204Pb/206Pb, 207Pb/206Pb and 208Pb/206Pb ratios can be directly estimated from the (drift corrected) 204Pb/206Pb, 207Pb/206Pb and 208Pb/206Pb signal ratios. This is not the case for the 206Pb/238U and 208Pb/232Th-ratios, which are affected by strong elemental fractionation effects. This fractionation expresses itself in two ways:
Over the course of a SIMS spot analysis, the Pb/U and Pb/Th ratio changes as a function of time. This elemental fractionation can be modelled using a log-linear model that is similar to that used for the within-spot drift correction:
| (19) |
where 0βxy is the inferred logratio of the background-corrected signals at ‘time zero’, which can be found using the method of maximum likelihood (see Appendix). Note that γXY = 0 if X = Y and, if the sensitivity drift of the data is smooth and closely follows the trends defined by Equations 16 and 17, then 0βxy ≈ αy - αx and γXY ≈ γY - γX.
Using Equation 19, the isotopic logratios can be interpolated (or extrapolated) to any point in time (τ):
| (20) |
The most precise values of τβxy are obtained when τ is chosen in the middle of the analytical sequence. These values can be used for subsequent calculations. Alternatively, we can also use the time-zero intercepts 0βxy.
The Pb/U and Pb/Th signal ratios may vary between adjacent spots on the same isotopically homogenous reference material. This fractionation obeys the power law relationship given by Equation 2. Expressing this formula in terms of corrected signal ratios:
| (21) |
where (6∕4)c stands for the 206Pb/204Pb-ratio of the common Pb (see the footnote of Section 4). Recasting Equation 21 in terms of the interpolated logratio estimates:
| (22) |
where ‘o’ stands for the uranium oxide (238U16O2+ or 238U16O+) and ‘u’ stands for 238U+.
Having applied Equation 22 to a reference material with known atomic 206Pb/238U-ratio ar, the atomic 206Pb/238U-ratio of the sample is given by
| (23) |
where τβu6(sm) and τβuo(sm) are the interpolated logratio estimates of the sample. The 206Pb/238U-age is then obtained by plugging as into the age equation. Uncertainties are obtained by standard error propagation (see Appendix).
The following paragraphs will illustrate the SIMS U–Pb data reduction process using two datasets:
Figure 3 shows the time resolved SEM counts of one representative spot measurement for each dataset. Side-by-side comparison of these two datasets reveals some interesting similarities and differences. All of the high intensity signals exhibit clear transient behaviour, which is caused by changes in oxygen availability that occur during primary ion bombardment (Magee et al., 2017). The transience of the individual SEM signals biases the isotopic ratios. For example, there are 109 seconds between the 204Pb and 208Pb measurements in each SHRIMP cycle. During these 109 seconds, the 208Pb signal drops on average by 2%, resulting in an equivalent bias of the 204Pb/208Pb ratio (Section 9). A similar drop per cycle is observed for the Cameca data.
However, there is a key difference between the Cameca and SHRIMP datasets. For the Cameca data, the U and Pb signals both drift in the same direction, resulting in positive slopes for the example of Figure 3. But for the SHRIMP data, the U- and Pb-drifts act in opposite directions: the U-signal exhibits an increase in sensitivity with time, whereas the Pb-signals decrease in intensity with time. This marked difference in behaviour between the two instruments reflects a difference in their design, causing a difference in the energy window of the secondary ions analysed (Ireland and Williams, 2003).
For Faraday detectors, the within-spot drift correction uses a generalised linear model with a lognormal link function (Equation 16). For SEM data, it uses a Poisson link function (Equation 17). In either case, the model enforces strictly positive isotopic abundances. Isotopes of the same element (such as 204|6|7|8Pb) have the same slope parameter, but different intercepts. Isotopes of different elements are free to have different slopes (Figure 4).
Figure 5 applies another log-linear function (Equation 19) to model the within-spot fractionation of Temora2 spot 11. This function models the drift-corrected logratios as a linear function of analysis time. The slopes of the log-linear functions are a function of the within-spot fractionation between the numerator and denominator elements. Because there is no fractionation between two isotopes of the same element, the slope of the Pb/Pb ratios is zero.
The ability of logratio statistics to avoid negative ratios is apparent from the first panel of Figure 5. Even though some of the ratios of the background-corrected signals are zero or negative (because the background exceeded the signal), the generalised linear fit is strictly positive. The natural ability of compositional data analysis to rule out negative ratios avoids many problems further down the data processing chain.
The right hand side of Figure 5 maps the four (log)ratios back to five equivalent raw signals (one for each isotope). The last two panels of the figure show how the logratio approach manages to effectively capture subtle fluctuations of the U and UO signal intensities.
The logratio intercepts obtained by Equation 19 form a linear array of calibration data (Equation 22). Figure 6a fits a straight line through these points using the linear regression algorithm of York et al. (2004). Alternatively, instead of fitting a calibration line through the logratio intercepts (τ = 0, Figure 6a), it is also possible to interpolate or extrapolate the logratio composition to any other point in time. For example, the green ellipses in Figure 6b show the inferred logratio compositions at 544 seconds (i.e., τ = 544), which represents the midpoint of the analyses. The slope of this calibration line is notably different than that obtained by fitting a line through the compositions at 0 seconds. This change in slope reflects the different mechanisms that are responsible for elemental fractionation within and between SIMS spots.
Figure 7 applies the Pb/U-calibration (Equation 23) to 91500 zircon, using the Temora2 data for calibration. It shows only the purely random errors, i.e., ignoring the uncertainty of the calibration. Including the calibration errors does not only inflate the uncertainties, but also causes inter-spot error correlations (Figure 1). To demonstrate this phenomenon, let us revisit the Cameca data and swap the sample and reference materials around. Thus, we use 91500 for the calibration curve, and Temora2 as a sample (Figure 8). Table 2 shows the uncertainty budget of four selected aliquots from this sample.
ln[206Pb/238U] | int. unc. (%) | tot. unc. (%) | r[*,b] | r[*,c] | r[*,d] | |
a | -2.692 | 0.32 | 0.70 | 0.62 | 0.36 | -0.34 |
b | -2.685 | 0.31 | 0.44 | 0.36 | -0.17 | |
c | -2.693 | 0.30 | 0.36 | 0.019 | ||
d | -2.719 | 0.45 | 0.56 |
Propagating the calibration curve uncertainty increases the error estimates (Table 2) by different amounts for different spots. For aliquot c, which is located immediately below the mean of the 91500 data, the calibration uncertainty only mildly increases the standard error from 0.30% to 0.36%. However, for aliquot a, which is horizontally offset from the mean of the calibration data, the systematic calibration uncertainty more than doubles the standard error from 0.32% to 0.70%. The calibration error also causes the standard errors of the various aliquots to be correlated with each other. For example, the total uncertainties of aliquots a and b are positively correlated (r[a,b] = 0.62) because their UO2/U-ratios are both offset from the mean of the calibration in the same direction. In contrast, the uncertainties of aliquots a and d are negatively correlated (r[a,d] = -0.34) because they are offset in opposite directions from the mean of the calibration data.
The inter-spot error correlations are important when calculating weighted means (Vermeesch, 2015) and isochrons. Taking into account the full covariance structure of the data affects both the accuracy and the precision of any derived age information. For example, the conventional weighted mean can be replaced with a matrix expression that accounts for correlated uncertainties (Equation 92 of Vermeesch, 2015). Applying this modified algorithm to the positively correlated aliquots a and b of Table 2 changes the weighted mean from -2.6922 to -2.6854 and increases the standard error of that mean from 0.37% to 0.44%. Applying the same calculation to the negatively correlated aliquots a and d changes their weighted mean from -2.7083 to -2.7076 and reduces its uncertainty from 0.43% to 0.35%. To take full advantage of the covariance matrix will require the development of a new generation of high level data reduction software. For example, future versions of IsoplotR (Vermeesch, 2018a) will be modified to handle these rich data structures. A comprehensive discussion of this topic falls outside the scope of this paper.
simplex is an R package for SIMS data processing that implements the algorithm presented in this paper. The program can be run in three modes: online, offline, and from the command line. The online version can be accessed at https://isoplotr.es.ucl.ac.uk/simplex/. It contains three example U–Pb datasets, including the two datasets used in this paper, plus a Cameca monazite U–Th–Pb dataset.
simplex currently accepts raw data as Cameca .asc and SHRIMP .op and .pd files. Support for SHRIMP .xml files will be added later. The online version is a good place to try the look and feel of the software. However, it is probably not the most practical way to process lots of large data files. For a more responsive user experience, simplex can also be run natively on any operating system (Windows, Mac OS or Linux). To this end, the user needs to install R on their system (see https://r-project.org/ for details). Within R, the simplex package can be installed from the GitHub code-sharing platform using the remotes package, by entering the following commands at the console:
Once installed, simplex’ graphical user interface (GUI) can be started by entering the following command at the console:
A third and final way to use simplex is from the command line. This allows advanced users to create automation scripts and extend the functionality of the package. simplex comes with an extensive API (Application Programming Interface) of fully documented user functions. An overview of all these functions can be obtained by typing the following command at the console:
This paper introduced a new algorithm for SIMS U–Pb geochronology, in which raw mass spectrometer signals are processed using a combination of logratio analysis and Poisson counting statistics. In contrast with existing data reduction protocols, which handle each aliquot of an analytical sequence separately, the new algorithm simultaneously processes all of them in parallel. It thereby produces an internally consistent set of isotopic ratios and their associated covariance matrix. This covariance matrix is a rich source of information that captures both random and systematic uncertainties, including inter-spot error correlations that have hitherto been ignored in geochronology.
The example data of Section 12 showed that these inter-spot error correlation can be either positive or negative (see also Vermeesch, 2015; McLean et al., 2016). Ignoring them affects both the accuracy and precision of high end data processing steps such as isochron regression and concordia age calculation. Unfortunately, existing postprocessing software such as Isoplot (Ludwig, 2003) and IsoplotR (Vermeesch, 2018a) does not yet handle inter-spot error correlations. Further work is needed to extend these codes and take full advantage of the new algorithm. IsoplotR was designed with such future upgrades in mind: its input window contains a large number of spare columns that will accommodate covariance matrices in a future update. Once the aforementioned high end data reduction calculations have been updated, it will be possible to fully quantify the gain in precision and accuracy of the new algorithm compared to the previous generation of SIMS data reduction software.
The data reduction principles laid out in this paper are applicable not only to U–Pb geochronology, but also to other SIMS applications such as stable isotope analysis. In fact, simplex already handles such data for multicollector Cameca instruments. It is worth mentioning that the stable isotope functionality can also be used to correct 207Pb/206Pb ratio measurements for mass-dependent isotope fractionation, as was briefly discussed in Section 10. Future updates of the mass-dependent fractionation correction will also addresses the overcounted background problem that was mentioned in Section 6.
Besides U–Pb geochronology and stable isotopes, the new data reduction paradigm can also be adapted to other chronometers and other mass spectrometer designs, such as Thermal Ionisation Mass Spectrometry (TIMS, Connelly et al., 2021), noble gas mass spectrometry (Vermeesch, 2015), and LAICPMS (McLean et al., 2016). simplex already includes a function to export data to IsoplotR. Adding similar functionality to other data processing software will improve geochronologists’ ability to integrate multiple datasets whilst keeping track of systematic uncertainties, including those associated with reference materials and decay constants.
This Section provides further algorithmic details for the new U–Pb data processing workflow. It assumes that ions are recorded on SEM detectors, which is by far the most common configuration. The case of Faraday collectors is similar and, in fact, simpler.
Within-spot drift is modelled using a log-linear function (Equation 17) with a distinct intercept (αx) for each ion channel (x), and shared slopes (γX) between isotopes of the same element (X). To illustrate this concept, consider the case of 204|6|7Pb (inclusion of 208Pb is a trivial extension). Let xi be the time dependent parameter of the shot noise for 20xPb, where x ∈{4,6,7}:
| (24) |
then the log-likelihood function of the parameters given the data is defined as:
| (25) |
where N represents the number of cycles. The parameters α{x} = {α4,α6,α7} and γPb are estimated by maximising d with respect to them. Only γPb is used in subsequent calculations. The intercepts α{x} are discarded.
The next step of the data reduction extracts logratios from the raw data using a log-linear model that is similar to the within-spot drift correction (Equation 19). Here, in contrast with the drift correction, the intercepts are just as important as the slopes. For isochemical ratios such as 207Pb/206Pb, the slope of the drift-corrected logratios is zero and we only need to estimate the intercept. For multichemical ratios such as 238U/206Pb, both the slope and the intercept are non-zero. In order to keep track of covariances, it is useful to process all the isotopes together, using a common denominator. For example, using 206Pb (‘6’) as a common denominator and 204Pb (‘4’), 207Pb (‘7’), 238U (‘u’) and UO (‘o’) as numerators:
| (26) |
where ‘X’ stands for Pb if x ∈{4,6,7}, for U if x = u, and for UO if x = o. Then the normalised ion counts are given by:
| (27) |
for y ∈{4,7,u,o}, with
| (28) |
in which z ∈{4,6,7,u,o,b}. Then the log-likelihood is calculated as:
| (29) |
where γ64 = γ67 = 0 because there is no elemental fractionation between the Pb-isotopes. From the logratios with common denominator, it is easy to derive any other logratio:
| (30) |
One of the main advantages of the new data reduction method is its ability to keep track of the full covariance structure of the data, including inter-spot error correlations. This ability is derived from the fact that all parameters are derived by the method of maximum likelihood, which stipulates that the approximate covariance matrix of any set of estimated parameters can be obtained by inverting the negative matrix of second derivatives (i.e., the Hessian matrix) of the log-likelihood function with respect to said parameters:
| (31) |
For example, to estimate the covariance matrix of the logratio slopes and intercepts for a single spot analysis, the Hessian is a 6 × 6 matrix that includes the second derivatives of l with respect to β64, β67, β6u, β6o, γ6u and γ6o. Computing this matrix is tedious to do by hand but straightforward to do numerically.
Given the covariance matrix of the logratios, subsequent data reduction steps propagate the analytical uncertainties by conventional first order Taylor approximation. Thus, if y = f(x), then:
| (32) |
where Jf is the Jacobian matrix (and JfT its transpose) of partial derivatives of f with respect to x. For example, to estimate the m × m covariance matrix of m fractionation-corrected 206Pb/238U-ratios, error propagation of Equation 23 would involve an m × (2m + 3) Jacobian matrix and an (2m + 3) × (2m + 3) covariance matrix containing the uncertainties of A, B, lnar, as well as τβu6(j) and τβuo(j) (for j from 1 to m).
The idea for this work was born during an academic visit of the author to the Institute of Geology and Geophysics (IGG) at the Chinese Academy of Sciences (CAS) in Beijing. With no prior experience in handling SIMS data, the author relied on the expertise of Dr. Yang Li and Dr. Li-Guang Wu (IGG-CAS) to introduce him to the world of Cameca data processing, and of Dr. Simon Bodorkos, Dr. Charles Magee and Dr. Andrew Cross (Geoscience Australia) for SHRIMP data processing. The test data were kindly shared by Dr. Li and Dr. Bodorkos, who also provided detailed feedback on the manuscript prior to submission. Software development for the simplex data reduction program was supported by NERC Standard Grant #NE/T001518/1 (‘Beyond Isoplot’), which aims to develop an ‘ecosystem’ of inter-connected geochronological data processing software. The graphical user interface for simplex makes extensive use of the shinylight and dataentrygrid packages developed by software engineer Tim Band, who is employed on this NERC grant.
Aitchison, J.: The Statistical Analysis of Compositional Data, Journal of the Royal Statistical Society, 44, 139–177, 1982.
Aitchison, J.: The statistical analysis of compositional data, London, Chapman and Hall, 1986.
Black, L. P.: The use of multiple reference samples for the monitoring of ion microprobe performance during zircon 207Pb/206Pb age determinations, Geostandards and Geoanalytical Research, 29, 169–182, 2005.
Black, L. P., Kamo, S. L., Allen, C. M., Davis, D. W., Aleinikoff, J. N., Valley, J. W., Mundil, R., Campbell, I. H., Korsch, R. J., Williams, I. S., et al.: Improved 206Pb/238U microprobe geochronology by the monitoring of a trace-element-related matrix effect; SHRIMP, ID–TIMS, ELA–ICP–MS and oxygen isotope documentation for a series of zircon standards, Chemical Geology, 205, 115–140, 2004.
Bodorkos, S., Bowring, J., and Rayner, N.: Squid3: Next-generation Data Processing Software for Sensitive High Resolution Ion Micro Probe (SHRIMP), Geoscience Australia, 2020.
Chayes, F.: On ratio correlation in petrography, The Journal of Geology, 57, 239–254, 1949.
Chayes, F.: On correlation between variables of constant sum, Journal of Geophysical research, 65, 4185–4193, 1960.
Connelly, J., Bollard, J., Costa, M., Vermeesch, P., and Bizzarro, M.: Improved methods for high-precision Pb–Pb dating of extra-terrestrial materials, Journal of Analytical Atomic Spectrometry, 36, 2579–2587, 2021.
Dodson, M.: A linear method for second-degree interpolation in cyclical data collection, Journal of Physics E: Scientific Instruments, 11, 296, 1978.
Galbraith, R. F.: Statistics for fission track analysis, CRC Press, 2005.
Galbraith, R. F. and Laslett, G. M.: Statistical models for mixed fission track ages, Nuclear tracks and radiation measurements, 21, 459–470, 1993.
Hiess, J., Condon, D. J., McLean, N., and Noble, S. R.: 238U/235U systematics in terrestrial uranium-bearing minerals, Science, 335, 1610–1614, 2012.
Horstwood, M. S., Košler, J., Gehrels, G., Jackson, S. E., McLean, N. M., Paton, C., Pearson, N. J., Sircombe, K., Sylvester, P., Vermeesch, P., et al.: Community-Derived Standards for LA-ICP-MS U-(Th-) Pb Geochronology–Uncertainty Propagation, Age Interpretation and Data Reporting, Geostandards and Geoanalytical Research, 40, 311–332, 2016.
Ireland, T. R. and Williams, I. S.: Considerations in zircon geochronology by SIMS, Reviews in mineralogy and geochemistry, 53, 215–241, 2003.
Jeon, H. and Whitehouse, M. J.: A critical evaluation of U–Pb calibration schemes used in SIMS zircon geochronology, Geostandards and Geoanalytical Research, 39, 443–452, 2015.
Ludwig, K. R.: Decay constant errors in U–Pb concordia-intercept ages, Chemical Geology, 166, 315–318, 2000.
Ludwig, K. R.: User’s manual for Isoplot 3.00: a geochronological toolkit for Microsoft Excel, Berkeley Geochronology Center Special Publication, 4, 2003.
Magee, C., Danisik, M., and Mernagh, T.: Extreme isotopologue disequilibrium in molecular SIMS species during SHRIMP geochronology, Geoscientific Instrumentation, Methods and Data Systems, 6, 523–536, 2017.
McLean, N. M., Bowring, J. F., and Gehrels, G.: Algorithms and software for U-Pb geochronology by LA-ICPMS, Geochemistry, Geophysics, Geosystems, 17, 2480–2496, 2016.
Min, K., Mundil, R., Renne, P. R., and Ludwig, K. R.: A test for systematic errors in 40Ar/39Ar geochronology through comparison with U/Pb analysis of a 1.1-Ga rhyolite, Geochimica et Cosmochimica Acta, 64, 73–98, 2000.
Nasdala, L., Corfu, F., Valley, J. W., Spicuzza, M. J., Wu, F.-Y., Li, Q.-L., Yang, Y.-H., Fisher, C., Münker, C., Kennedy, A. K., et al.: Zircon M127–A homogeneous reference material for SIMS U–Pb geochronology combined with hafnium, oxygen and, potentially, lithium isotope analysis, Geostandards and Geoanalytical Research, 40, 457–475, 2016.
Ogliore, R., Huss, G., and Nagashima, K.: Ratio estimation in SIMS analysis, Nuclear instruments and methods in physics research section B: beam interactions with materials and atoms, 269, 1910–1918, 2011.
Pawlowsky-Glahn, V. and Buccianti, A.: Compositional data analysis: Theory and applications, John Wiley & Sons, 2011.
Renne, P. R., Swisher, C. C., Deino, A. L., Karner, D. B., Owens, T. L., and DePaolo, D. J.: Intercalibration of standards, absolute ages and uncertainties in 40Ar/39Ar dating, Chemical Geology, 145, 117–152, 1998.
Stern, R. A., Bodorkos, S., Kamo, S. L., Hickman, A. H., and Corfu, F.: Measurement of SIMS instrumental mass fractionation of Pb isotopes during zircon dating, Geostandards and Geoanalytical Research, 33, 145–168, 2009.
Vermeesch, P.: Tectonic discrimination diagrams revisited, Geochemistry, Geophysics, Geosystems, 7, https://doi.org/10.1029/2005GC001092, 2006.
Vermeesch, P.: HelioPlot, and the treatment of overdispersed (U-Th-Sm)/He data, Chemical Geology, 271, 108 – 111, https://doi.org/10.1016/j.chemgeo.2010.01.002, 2010.
Vermeesch, P.: Revised error propagation of 40Ar/39Ar data, including covariances, Geochimica et Cosmochimica Acta, 171, 325–337, 2015.
Vermeesch, P.: IsoplotR: a free and open toolbox for geochronology, Geoscience Frontiers, 9, 1479–1493, 2018a.
Vermeesch, P.: Statistical models for point-counting data, Earth and Planetary Science Letters, 501, 1–7, 2018b.
Weltje, G.: Quantitative analysis of detrital modes: statistically rigorous confidence regions in ternary diagrams and their use in sedimentary petrology, Earth-Science Reviews, 57, 211 – 253, https://doi.org/10.1016/S0012-8252(01)00076-9, 2002.
Wiedenbeck, M., Alle, P., Corfu, F., Griffin, W., Meier, M., Oberli, F. v., Quadt, A. v., Roddick, J., and Spiegel, W.: Three natural zircon standards for U-Th-Pb, Lu-Hf, trace element and REE analyses, Geostandards newsletter, 19, 1–23, 1995.
Williams, I. S.: U-Th-Pb geochronology by ion microprobe, Reviews in Economic Geology, 7, 1–35, 1998.
York, D., Evensen, N. M., Martínez, M. L., and De Basabe Delgado, J.: Unified equations for the slope, intercept, and standard errors of the best straight line, American Journal of Physics, 72, 367–375, 2004.