This article is part of the monograph on Trichloroethylene Toxicity.
Address correspondence to F.Y. Bois, INERIS, Parc ALATA, BP 2, 60550 Verneuil-en-Halatte, France. Telephone: 33 3 44 55 65 96. Fax: 33 3 44 55 68 99. E-mail: frederic.bois@ineris.fr
This work has been supported by the U.S. Environmental Protection Agency.
Received 20 October 1999; accepted 10 February 2000.
New physiologically based pharmacokinetic (PBPK) models of trichloroethylene (TCE) in mice and humans have recently been published (
1,2), together with new toxicokinetic data sets. These models, prior to their use in research or regulation, should be carefully calibrated by confrontation with the data. It is most efficient in terms of information use and from a statistical point of view to combine two forms of information: prior knowledge and data. Neither source of information is complete. If prior knowledge were sufficient, experiments would not need to be performed, but data alone are insufficient to pin down all parameters of a complex PBPK model to reasonable values. Prior knowledge can be summarized as prior parameter distributions. These can be obtained from the scientific literature, from specific
in vitro experiments, and, as is the case here, from fitting previous data. The data [in this case the new data provided to us by Fisher (
2)] are introduced through the use of a likelihood function. A Bayesian statistical framework gives rigorous rules to combine the prior distributions with the data likelihood, to yield posterior parameter distributions. In the case of complex models, like those described here, the posterior parameter distributions can be obtained by numerical Markov chain Monte Carlo (MCMC) techniques (
3,4). This statistical methodology is gaining interest and is establishing itself for the calibration and validation of PBPK models (
3,5-8). Beyond improving the fit, this method also provides distributions of prediction estimates directly usable as inputs for uncertainty analysis of cancer dose-response relationships.
An interesting aspect of the human data analyzed here is that they have been collected individually on a number of male and female volunteers. A hierarchical population model (4,5) deconvolves the various levels of variability present in the data. Such a model, of which the PBPK model is just a component, is easily calibrated with Bayesian numerical methods. This offers a unique opportunity to examine separately the important issues of variability and uncertainty in human toxicokinetics of trichloroethylene.
Data
Mice. Groups of male B6C3F1 mice (body weight [bw] 25-30 g) were exposed to TCE or its metabolites by intravenous (i.v.) injection (9) or by oral gavage (1).
All i.v. doses were equal to 100 mg of the compound administered per kilogram of body weight. After chloral hydrate (CH) administration, venous concentrations of CH, free trichloroethanol (TCOH), trichloroacetic acid (TCA), and dichloroacetic acid (DCA), as well as the amount of urine excreted metabolites, were recorded at various times after dosing. Similarly, after TCA i.v. dosing, venous concentrations of TCA and DCA, as well as the amount of DCA excreted in urine, were recorded. Following TCOH dosing, venous concentrations of CH, TCOH, TCA, and the urine-excreted amounts of TCOH and glucuronidated trichloroethanol (TCOG), were measured. Finally, after DCA dosing, venous concentrations and urine-excreted amounts of DCA were measured.
When trichloroethylene was administered by gavage in corn oil, doses of 300, 600, 1,200, and 2,000 mg/kg bw were used. TCE concentrations were measured in venous blood, liver, fat, lung, and kidney. CH and TCOH concentrations were recorded in venous blood, liver, lung, and kidney. TCA, DCA, and TCOG concentrations were obtained in venous blood, liver, and kidney. The quantities of TCA and TCOG excreted in urine were also recorded.
Additional data, published by Templin et al. (10), were partly used. Groups of four male B6C3F1 mice (bw 27 g) were gavaged with 3.8 mmol/kg (500 mg/kg) TCE. The venous blood concentrations of TCE, TCA, and DCA were measured at various times.
Humans. A group of 21 volunteers (10 females and 11 males) were exposed by inhalation to 50, 60, or 100 ppm TCE for 4 hr (2). Body weight and adiposity were recorded for each subject. The venous blood concentrations of TCE, TCOH, and TCA were measured during and after exposure. For some subjects, the exhaled air concentration of TCE was also recorded at various times after exposure. Finally, the cumulated quantities of TCA and TCOH glucuronide excreted in urine were recorded. The data for each individual were available to us.
Toxicokinetic and Statistical Models
Mice. The description of the physiological model used for mice can be found in Abbas and Fisher (1). The model equations were transcribed to a format suitable for MCSim (11). Three modifications were made to the model, after consultation with the original authors. a) Metabolism of TCE in the lung was turned off. b) For several parameters, the scaling power of body weight was changed from 0.74 to 0.75 for standardization with the model of Clewell et al. (12). c) The volumes of the "body" and poorly perfused compartments, and the blood flow to the body compartment, were computed by difference at each iteration so that the sum of the organ volumes equaled 82% of the body weight and the sum of organ flows equaled cardiac output. Given this reparameterization, the model had a total of 57 independent parameters.
The statistical model used for mice lumps uncertainty and variability, since the animal data were aggregated (only the mean and standard deviation [SD] for several animals were provided). For a schematic of the model, see for example, Bois (13). Various concentrations or quantities (y) were measured. The expected values of these measurements are a function (f) of exposure level (E), time (t), a set of physiological parameters of unknown values (
), and a set of measured, covariate parameters (
), such as body weight. E, t,
, and
are experiment specific. All animals in an experiment were supposed to have behaved similarly, from a toxicokinetic point of view. The function is the pharmacokinetic model described above. The concentrations or quantities actually observed are also affected by measurement error and interindividual variability. Only aggregated data were available, and the corresponding errors were assumed to be independent and log-normally distributed, with mean zero and variance
2 (on the log scale). The variance vector
2 had 27 components for mice, most of them known by repeating the experiments (Table 1). The unknown experimental variances were sampled with a standard non-informative prior distribution P(
i2) ~
i-2 (14). It was assumed that each
parameter was a priori distributed log-normally (except for PCCH, the fraction TCE converted to CH, distributed normally), with averages M and variances S (in log scale).
A major advantage of physiological modeling is to provide a priori information on several of the mean parameter values for mice. Values for the hyperparameters M were set on the basis of the parameter values used by Fisher (2), or when applicable, on the basis of the posterior parameter distributions obtained from the analysis of the Clewell et al. model (12,13). To set S, a distinction was made between the physiological parameters or partition coefficients (which are quite well known) and the other metabolic or pharmacokinetic parameters (which are model specific and little known a priori). For the first group of parameters uncertainties of the order of 20-50% were assigned (3,5,6). For the second group of parameters, a "vague" distribution was assumed and S was set to correspond to a factor between 2 ("quite uncertain") or 5 ("very uncertain"). All priors on µ were truncated to ±2
S or ±3
S to avoid reaching unrealistic values. Table 2 gives for mice the values used for exp(M) (i.e., the geometric means) and exp(S) (i.e., the geometric standard deviation [GSD]), both of which lie on the natural scale.
Humans. The PBPK model for humans is a simplification of the mouse model. Basically, in addition to TCE, only the distribution of TCOH and TCA in blood, liver, kidney, and rest of the body is described by physiological compartments. CH is not modeled explicitly: a fraction of TCE metabolized forms TCOH while the rest forms TCA. Formation of DCA is not modeled. A fraction of TCOH is glucuronidated to give TCOG, which is supposed to be immediately excreted in urine. Four modifications to the original model were introduced. For several parameters, the scaling power of body weight was changed from 0.74 to 0.75 for standardization with the model of Clewell et al. (12). The volumes of the body and poorly perfused compartments and the blood flow to the body compartment were computed by differences at each iteration so that the sum of the organ volumes equaled 82% of the body weight and the sum of organ flows equaled cardiac output; the quantity of TCOG excreted is in TCOH equivalents, rather than in TCOG equivalents. The Michaelis-Menten coefficient Km (labeled KMTCOH) for the formation of TCOG was reexpressed as the ratio Vmax/Km for easier Monte Carlo sampling (preliminary runs showed that Vmax and Km for that reaction are highly correlated). The model had 33 independent parameters.
The statistical model describing uncertainties and variabilities in human data was constructed using a hierarchical population approach (3,4). The model has two major components: the individual level and the population level. At the individual level, various concentrations or quantities (y) were measured. The expected values of these measurements are a function () of exposure level (E), time (t), a set of physiological parameters of unknown values (
), and a set of measured, covariate parameters (
), such as body weight. E, t,
, and
are experiment specific. The function is the human pharmacokinetic model. The concentrations or quantities actually observed are also affected by measurement errors. These errors were assumed to be independent and log-normally distributed, with mean zero and variance
2 (on the log scale). The variance vector
2 had six components for humans.
At the population level, each component of the
parameter set was assumed to be distributed log-normally, with averages µ and variances
2 (in log scale). A priori knowledge of µ and
2 is available under the form of standard values for some parameters. Since uncertainty in these average values and variances has to be acknowledged, a priori log-normal distributions were assumed for the population means µ (with hyperparameters M and S) and a standard inverse gamma distribution (15) for the population variances
2.
Values for the hyperparameters M were set on the basis of the parameter values used by Fisher (2), or when applicable, on the basis of the posterior parameter distributions obtained from the analysis of the Clewell et al. model (12,13). To set S, a distinction was made between the physiological parameters or partition coefficients (which are quite well known) and the other metabolic or pharmacokinetic parameters (which are model specific and little known a priori). For first group of parameters uncertainties of the order of 20-50% were assigned (3,5,6). For the second group of parameters, a vague distribution was assumed and S was set to correspond to a factor of 5. All priors on µ were truncated to ±2
S.
The ß coefficient of the inverse-gamma prior on
2 was set to the posterior values found in the previous analysis (13) when applicable and to 0.22 otherwise (this corresponds to a CV of 50%). The
coefficients of the inverse-gamma priors were set to 1 or 6 [the latter when a posterior distribution had already been obtained from analysis of the Clewell et al. data (13), which had several groups of subjects]. Table 3 gives for humans the values used for exp(M)--i.e., the geometric means--exp(S) --the GSD--and the
and ß coefficients. For
2, a standard noninformative prior distribution P(
12,...,
n2) ~
1-2
...
n-2 was used.
Statistical Computation of Posterior Parameter Distributions
From Bayes' theorem, the joint posterior distribution of the parameters to estimate, P(
,
2, µ,
2|y,
, E, t, M, S,
0), in the case of humans, is proportional to the likelihood of the data multiplied by the parameters' priors:
P(
,
2, µ,
2|y,
, E, t, M, S,
20) ~
P(y|
,
2,
,E,t)
P(
|µ,
2)
P(
2)
P(µ|M,S)
P(
2|
20) [1]
A similar, but simpler, expression can be obtained in the case of the mouse model. For mice and humans, the likelihood term is given by the normal measurement model:
log(y) ~ N(log (
,
,E,t,)
2) [2]
As mentioned above, the prior distribution for
2 is: P(
12,...,
n2) ~
1-2
...
n-2, with eventually some fixed components.
For humans, the prior distribution of each component of
is an independent normal distribution in log space:
log(
)~N(µ,
2) [3]
with truncation constraints. Each component of µ, or
2 is assigned an independent hyperprior distribution, µ ~ N(M,S2) and
2 ~ inverse-gamma(
,ß), as described in the previous section. For mice, the µ and
2 levels are not defined and prior distribution of
depends directly on M and S.
Current practice in Bayesian statistics is to summarize a complicated high-dimensional posterior distribution by random draws of the vector of parameters. This is currently the most effective way to perform high-dimensional numerical integration. Because there are many parameters to estimate, Metropolis-Hasting sampling was used to perform a random walk through the posterior distribution. This iterative sampling procedure is particularly convenient in the case of hierarchical models. It belongs to a class of MCMC techniques that has recently received much interest (3,16-21). Three independent MCMC runs were performed for each species. Convergence was monitored using the method of Gelman and Rubin (22).
Mice
For mice, 20,000 iterations were necessary to reach convergence of the sampler. One of every 5 of the last 5,000 simulations of three independent Markov chains were recorded, yielding 3,000 sets of parameter values from which the inferences and predictions presented in the following were made.
Quality of data adjustment. Figure 1 presents the data values observed compared with their predicted counterparts (data values are concentrations or excreted quantities). Predictions were made with the parameter set of highest posterior density. For a perfect fit, all points would fall on the diagonal (equality of predicted and observed values). Such an adjustment is not expected given the analytical measurement errors in the data. The graph is presented on log-log scale, since the errors are assumed to be log-normally distributed and the data span a wide range. Most of the residuals are contained within a factor of 3 along the diagonal, but quite a few reach a factor of 10 or even a factor of 100. Figures 2-7 show all the gavage data and corresponding predictions in the liver as a function of time. Basically the model reproduces correctly the profiles for TCE and TCA. TCOH and TCOG concentrations are less well predicted. CH and DCA concentrations and TCOG excreted are rather poorly fitted. The fact that each data point represents a different group of animals may explain the noise present in the data. However, some differences between model and data appear to be systematic (Figure 3, for example).
 |
Figure 1. Observed versus predicted mice data values (all concentrations or quantities) for the Monte Carlo iteration of highest posterior probability.
|
 |
Figure 2. Predicted (solid line) and observed (points) time course of TCE concentration in liver of mice dosed with various quantities of TCE in corn oil. Error bars correspond to ±2 SD.
|
 |
Figure 3. Predicted (solid line) and observed (points) time course of CH concentration in liver of mice dosed with various quantities of TCE in corn oil. Error bars correspond to ±2 SD.
|
 |
Figure 4. Predicted (solid line) and observed (points) time course of TCOH concentration in liver of mice dosed with various quantities of TCE in corn oil. Error bars correspond to ±2 SD.
|
 |
Figure 5. Predicted (solid line) and observed (points) time course of TCA concentration in liver of mice dosed with various quantities of TCE in corn oil. Error bars correspond to ±2 SD.
|
 |
Figure 6. Predicted (solid line) and observed (points) time course of DCA concentration in liver of mice dosed with various quantities of TCE in corn oil. Error bars correspond to ±2 SD.
|
 |
Figure 7. Predicted (solid line) and observed (points) time course of TCOG concentration in liver of mice dosed with various quantities of TCE in corn oil. Error bars correspond to ±2 SD.
|
Posterior parameter distributions. The joint distribution of all parameters is obtained in output of the MCMC simulations. This allows consideration of marginal distributions (distributions of the parameters considered individually) and also of correlations of any order. Table 1 summarizes the posterior distributions of the mouse parameter values obtained in the last 1,000 iterations of the three runs performed (results of the three runs are pooled, and the distributions are established with 3,000 values). The geometric means can be interpreted as representing the values for an "average" mouse. Note that the columns of geometric standard deviations represent the sum of group variability among mice and experimental error. Overall, the parameters retain physiologically plausible values.
Yet the posterior means for several parameters are quite far from the prior means. This is observed for metabolic parameters or for some partition coefficients. In particular, the value of KM for TCE is 20 times that found with the previous model and experiments. SDs about these geometric means are quite low and hover around 1.1 (corresponding to a 10% CV, which includes uncertainty and variability). The parameter values are therefore quite well identified by the data.
Very high correlations are observed between parameter estimates for mice (up to 0.95 for KFTCC and KFDCC). Any parameterization neglecting to estimate these covariances will produce incorrect predictions, since these parameters cannot be sampled independently without producing highly improbable combinations and hence highly improbable predictions.
Humans
More than 100,000 iterations were necessary to reach convergence of the sampler in the case of human data. One of every 10 of the last 50,000 simulations of three independent Markov chains were recorded, yielding 15,000 sets of parameter values from which the inferences and predictions presented in the following were made.
Quality of data adjustment. Figure 8 gives the observed data values versus their predicted counterparts (data values are concentrations or excreted quantities). Predictions were made with the parameter set of highest posterior density. The graph is presented on log-log scale, since the errors are assumed to be log-normally distributed and the data span a wide range. Most of the residuals are contained within a factor of 2 along the diagonal. The fit here is substantially better than for mice. Figures 9-11 show all the data and corresponding predictions as a function of time. The model was adjusted to each individual's data, using a population toxicokinetic approach. The data are reasonably well fitted, overall, and particularly the TCA data. However, the concentration of TCE in alveolar air and venous blood is not adjusted satisfactorily, given the quality of adjustment expected from a PBPK model: The curves do not "bend" enough (Figure 9). Another area of significant misfit is the prediction of TCOH concentration in venous blood. The predicted terminal half-life is clearly too short (data not shown). Yet, a good adjustment of TCOH can be obtained for the volunteers observed up to 22 hr (Figure 10). Most of the variability observed on the figures is due to differences in inhalation exposure levels (concentrations of 50, 60, or 100 ppm TCE were used). However, a large part of the variability in TCOG elimination to urine seems due to factors other than exposure levels (data not shown). Somewhat troublesome is the disagreement between measured (albeit, approximately) fractions of body weight as fat (VFC) and the corresponding toxicokinetic estimates. The estimates take into account the measured values as well as what can be inferred about the size of the fat compartment from the TCE and other concentration data. The measured values of VFC reach much more extreme values than the estimates (Figure 12).
 |
Figure 8. Observed versus predicted human data values (all concentrations or quantities) for the Monte Carlo iteration of highest posterior probability.
|
 |
Figure 9. Predicted (solid lines) and observed (points) time course of TCE concentration in venous blood of human volunteers exposed by inhalation to various concentrations of TCE. The error bars presented on a data point correspond to ±2 estimated measurement SD (the size of the error is the same for all points).
|
 |
Figure 10. Predicted (solid lines) and observed (points) time course of TCOH concentration in venous blood of human volunteers exposed by inhalation to various concentrations of TCE. These volunteers were followed for up to 22 hr. The error bars presented on a data point correspond to ±2 estimated measurement SD (the size of the error is the same for all points).
|
 |
Figure 11. Predicted (solid lines) and observed (points) time course of TCA concentration in venous blood of human volunteers exposed by inhalation to various concentrations of TCE. The error bars presented on a data point correspond to ±2 estimated measurement SD (the size of the error is the same for all points).
|
 |
Figure 12. Observed versus predicted fraction of body weight as fat in human volunteers exposed by inhalation to various concentrations of TCE. The error bars span the 95% confidence intervals of the predictions.
|
Posterior parameter distributions. Table 3 summarizes the posterior distributions of the population means and SDs for all human model parameters (the distributions were established with 15,000 values). The population means represent the values for an average person. They are affected by uncertainty and were each assigned a geometric mean and a GSD (note that an approximate CV can quickly be computed, since CV ~ GSD - 1). The population SDs measure between-subject variability. They are also affected by uncertainty (i.e., the population SD cannot be exactly computed from a finite sample).
The posterior means for several parameters are quite far from their prior estimates. The location of the blood over air partition coefficient, PB, is shifted from 13.7 (±20%) to 18 (±10%). The location of other partition coefficients (slowly perfused muscle over blood and richly perfused liver over blood) is also changed. The largest shifts are observed for metabolic parameters. The estimate of the scaling coefficient of TCE Vmax in liver (VMAXC) is 4.22 (±20%) instead of 43.8. A large uncertainty existed on that prior estimate, but 4.22 is still outside its prior 95% confidence interval (lower bound at 12.6). Therefore, there appears to be a conflict between the data studied here and the data studied previously. Note also that the posterior mean of the fraction TCE converted to TCOH (POH) is quite lower than a priori estimated. The posterior means of the scaling coefficient of Vmax for TCOH metabolism (VMTCOC) and of the scaling coefficient of the rate constant from TCOH to TCA (KOCHC) are much increased but not incompatible with the vague priors used. In terms of uncertainty, SDs about the posterior means are quite low, approximately 1.1 or 1.2 (corresponding to a 10-20% CV). The parameter values are overall quite well identified by the data.
Estimates of population variability are given by the posteriors of the population SDs,
. Volumes and flows appear to vary across subject by approximately 20% (CV). The variability is somewhat higher for partition coefficients (between 30 and 40% CVs). It is much higher for the metabolic parameters (SDs representing approximately a factor of 2 difference). For example, the lowest rate constant scaling coefficient for TCA metabolism (KTMETC) is 0.12
÷ 1.6 hr-1 (subject 110F) and the highest is 1.1
÷ 1.25 hr-1 (subject 101F, the subject with lowest TCA levels on Figure 11). For the scaling coefficient of the rate constant of TCA loss to urine (KRTCC), the values range from 0.13
÷ 1.2 hr-1 (subject 106M) to 3.4
÷ 1.2 hr-1 (subject 111F) (Figure 13). Subject 111F excreted the highest amount of TCA in urine (data not shown). Note that the estimates of variability given are themselves affected by uncertainty (up to 50% CV for the variability estimates of the metabolic parameters). For example, the 95% posterior confidence interval of S for KRTCC is [1.77, 2.85].
 |
Figure 13. Individual estimates of the rate constant scaling coefficient for TCA metabolism (KTMETC) and of the rate constant of TCA loss to urine (KRTCC). The error bars span the 95% confidence intervals of the estimates. Large individual differences are observed, with some degree of correlation.
|
Examination of the subjects' parameter values points to peculiar pharmacokinetic behaviors for some of them:
- Subject 106M has the highest fraction fat (VFC), particularly for a male (some female subjects approach his VF, but no males of the group studied approach it). He is also an "outlier" with a TCE fat over blood partition coefficient 3 times that of the others. His maximal rate scaling coefficient for TCE metabolism (VMAXC) and his rate constant of TCA loss to urine (KRTCC) are the lowest. Note that this subject shows the lowest amount of TCA excreted (data not shown), and his TCOH blood concentrations are not well modeled.
- Subject 201M is also outstanding for his highest alveolar ventilation rate scaling coefficient (QPC), highest fractional blood flow to the kidney (QKC), and high TCOH body over blood partition coefficient (POHB), particularly for a male (females tend to have higher POHBs, see below). This subject has the second lowest TCA excretion in urine and one of the lowest TCOG elimination. It seems that this subject can be characterized as "fast" in intake and elimination, at least from a physiological (if not enzymatic) point of view.
- Subject 103M has a Michaelis-Menten constant (KM) about 3 times as high as everyone else. He has one of the highest excretions of TCA in urine.
Splitting the subjects by sex reveals some differences between males and females. The visual examination of the marginal distributions of a given parameter for each subject reveals little pattern. For example, in Figure 14, it appears that female subjects exhibit higher values of KOCHC, the rate constant scaling coefficient for TCOH to TCA. However, a large part of the variability observed is due to uncertainty about the population mean. When the effect of the overall mean is removed, the difference is highly significant: P(KOCHCfemale > KOCHCmale) = 0.9998 (i.e., the female average was superior to the male average 99.98% of the times in the sample of 15,000 parameter vectors obtained by MCMC sampling).
 |
Figure 14. Individual estimates of the rate constant scaling coefficient for TCOH to TCA, grouped by sex. The error bars span the 95% confidence intervals of the estimates.
|
Other significant sex differences were found:
- The scaling coefficient of the alveolar ventilation rate is higher for males P(QPCmale > QPCfemale) = 0.9997. This implies that the data are sufficiently informative to show such a difference and also that the surface area correction is not sufficient to explain sex differences in alveolar ventilation. The level of physical training might be another covariate to consider for alveolar ventilation in human studies.
- The posterior estimates of the VFC are higher on average for females than for males: P(VFCfemale > VFCmale) > 0.9999. That was expected and is influenced by the covariate measurements of adiposity and by the toxicokinetic data.
- Female subjects also seem to have higher TCOH body over blood partition coefficients: P(POHBfemale > POHBmale) > 0.9999; they also have a higher Vmax over Km ratio (i.e., rate constant at low concentration) for TCOH glucuronidation: P(KMTCOHfemale > KMTCOHmale) > 0.9999, and a higher rate constant scaling coefficient for TCA urinary excretion: P(KRTCCfemale > KRTCCmale) > 0.9943.
- Males were found to have higher TCA body over blood partition coefficients: P(PTCBmale > PTCBfemale) = 0.992.
All other male versus female parameter averages do differ slightly, but the P values are less than 0.95, most of them being around 0.5.
The mice data show a rather large inter-individual or interlot variability. There is little to do about that, unless the experimental design were to allow the observation of individual animals. The data still permit an extensive calibration of the model. The human data have the advantage of being available for each individual. There appear to be, a posteriori, a few outlying points (for example, Figure 10), which could be checked and eventually removed. These few apparent outliers have little weight and should not affect sensibly the results presented here. There could be a problem with the measurement of fat content; this will be discussed below, in light of the model fit.
The mouse model is quite complicated, and yet some aspects of the data are not so well described by it (CH and DCA concentrations, and TCOG excreted). This is somewhat disappointing, given the complexity of the model, and one wonders what will have to be done to fit such an extensive data set. It is possible that some metabolic reactions do not obey simple Michaelis-Menten reactions or that prior opinions about the model parameters were for some of them overly confident. Note also that even in large models, model uncertainty can be large. The model, for example, does not include hepatic recycling or all possibilities of extrahepatic metabolism. Yet, overall, the model fits well a large part of the data, in particular TCE and TCA distribution. It would be interesting to see the performance of other models [e.g., those presented in (12,23)] with this data set, which would allow a formal comparison of the competing models on the basis of a common measure of goodness of fit, such as likelihood ratios.
The posterior parameter distributions obtained for mice are quite narrow (with CVs of about 10 or 20%), indicating that the data are strongly informative for most parts of the model. Indeed, as indicated above, some parameters or processes might need to be added for a better fit and the model is somewhat minimal with respect to the data. However, it should be kept in mind that the fit is not excellent, and that may overconstrain the posterior distributions. Part of the high covariance between parameters may also be due to overconstraining. It is possible to model statistically the lack of fit by including an autocorrelation between data points (24). This has not been attempted here and could be a useful improvement. Note that the posterior uncertainty for the metabolic parameters would have been further underestimated if all physiological parameters had been set to predefined values.
The human model, even though complex, also has difficulties in fitting all the data. This is true in particular for TCOH concentrations over a long period of time, and some improvement of the model in that respect may be needed. Similarly, the adiposity of the subjects (a measured covariate) does not fit well with the estimated fraction of body weight as fat. It is possible that the pharmacokinetic compartment "fat" is not well estimated by external adiposity measurements, in particular for extreme values. It also appears that the model may not be able to describe correctly outlying subjects like subject 106M. This could be due to the PBPK model, which lacks some component important for such a subject. The misfit for that subject could also be due to a lack of flexibility of the statistical model adopted here (log-normal distributions of the parameters in the population). A possibility for checking would be to fit the data of only that subject to determine if a good fit could be obtained.
The human posterior parameter distributions agree in general quite well with the corresponding priors, with reduced uncertainty (since information from the data has been gained). SDs about the posterior means are quite low and correspond to a 10-20% CV: The parameter values are overall quite well identified by the data. There is, however, a conflict for the values of VMAXC between the values previously found (3) and the ones obtained here. This is an important parameter and a good characterization is important. The difference could be due to conflicts between the data analyzed here and the previous data. For example, extensive TCOH data are available here. It is also possible that the conflict is due to differences in human model structures. A solution to this dilemma would be to take the model of Clewell et al. (12) and fit it to the data of Fisher (2) to obtain an estimate of VMAXC with the same data set. In any case, it is not obvious that the differences in parameter values would result in notable differences when predicting toxicologically relevant end points, such as internal TCA concentrations. This remains to be checked.
Volumes and flows appear to vary across subject by 20% (CV), approximately. The variability is somewhat higher for partition coefficients (between 30 and 40% CVs) and much higher for the metabolic parameters (SDs representing about a factor of 2 difference). This is similar to what was found for a small group of human volunteers exposed to tetrachloroethylene (5). Differences appear to exist between sexes in the toxicokinetics of TCE. There are differences between males and females in alveolar ventilation and adiposity for the population sample studied. This should be true for compounds other than TCE and shows that the model scaling could be improved. More important for TCE kinetics are the findings that females have higher TCOH body over blood partition coefficients, lower TCA body over blood partition coefficients, higher Vmax over Km ratios for TCOH glucuronidation, higher rate constant scaling coefficients for TCOH to TCA, and higher rate constant scaling coefficients for TCA urinary excretion. Note that the statistically highly significant differences found for these parameters should be interpreted with some caution. They are conditional on the model structure being correct. At least, at this point, it can be said that there are most certainly differences in the kinetic behavior of TCE between the males and females of the sample studied. It would have been hard to reach that conclusion without the statistical adjustment of a model, given the multiple exposure levels, differences in body weight, nonlinear kinetics, etc. Still, these differences may not be significant in terms of TCE toxicity (i.e., biologically significant). It would be interesting to assess by simulations whether internal metabolite concentrations are much different for males and females for the same TCE exposure.
REFERENCES AND NOTES
1. Abbas R, Fisher JW. A physiologically based pharmacokinetic model for trichloroethylene and its metabolites, chloral hydrate, trichloroacetate, dichloroacetate, trichloroethanol, and trichloroethanol glucuronide in B6C3F1 mice. Toxicol Appl Pharmacol 147:15-30 (1997).
2. Fisher JW. Personal communication, 1997.
3. Gelman A, Bois F, Jiang J. Physiological pharmacokinetic analysis using population modeling and informative prior distributions. J Am Stat Assoc 91:1400-1412(1996).
4. Wakefield JC. The Bayesian analysis of population pharmacokinetic models. J Am Stat Assoc 91:62-75(1995).
5. Bois FY, Gelman A, Jiang J, Maszle D, Zeise L, Alexeef G. Population toxicokinetics of tetrachloroethylene. Arch Toxicol 70:347-355 (1996).
6. Bois F, Jackson E, Pekari K, Smith M. Population toxicokinetics of benzene. Environ Health Perspect 104(suppl 6):1405-1411 (1996).
7. Fanning E, Bois FY, Rothman N, Bechtold B, Li G, Hayes R, Smith M. Population toxicokinetics of benzene and its metabolites. In: Society of Toxicology Annual Meeting, Cincinnati, Ohio, 1997.
8. Johanson G, Jonsson F, Bois F. Development of new technique for risk assessment using physiologically based toxicokinetic models. Am J Ind Med (suppl )1:101-03 (1999).
9. Abbas R, Seckel CS, Kidney JK, Fisher JW. Pharmacokinetic analysis of chloral hydrate and its metabolism in B6C3F1 mice. Drug Metab Dispos 24:1340-1346 (1996).
10. Templin MV, Parker JC, Bull RJ. Relative formation of dichloroacetate and trichloroacetate from trichloroethylene in male B6C3F1 mice. Toxicol Appl Pharmacol 123:1-8 (1993).
11. Maszle D, Bois FY. Program MCSim - User Manual. Available: ftp://rfs63.berkeley.edu/pub/mcsim, 1993.
12. Clewell HJ III, Gentry PR, Covington TR, Gearhart JM. Development of a physiologically based pharmacokinetic model of trichloroethylene and its metabolites for use in risk assessment. Environ Health Perspect 108(suppl 2):283-305 (2000).
13. Bois FY. Statistical analysis of Clewell et al. PBPK model of trichloroethylene kinetics. Environ Health Perspect 108(suppl 2):307-316 (2000).
14. Bernardo JM, Smith AFM. Bayesian Theory. New York:Wiley, 1994.
15. Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis. London:Chapman & Hall, 1995.
16. Gelfand AE, Smith AFM. Sampling-based approaches to calculating marginal densities. J Am Stat Assoc 85:398-409 (1990).
17. Gelfand AE, Hills SE, Racine-Poon A, Smith AFM. Illustration of Bayesian inference in normal data models using Gibbs sampling. J Am Stat Assoc 85:972-985 (1990).
18. Gelfand AE, Smith AFM, Lee T-M. Bayesian analysis of constrained parameter and truncated data problems using Gibbs sampling. J Am Stat Assoc 87:523-532 (1992).
19. Gelman A. Iterative and non-iterative simulation algorithms. Comput Sci Stat 24:433-438 (1992).
20. Tanner MA. Tools for Statistical Inference - Observed Data and Data Augmentation Methods. Vol 67. Berlin:Springer-Verlag, 1991.
21. Wakefield JC, Smith AFM, Racine-Poon A, Gelfand AE. Bayesian analysis of linear and non-linear population models using the Gibbs sampler. Appl Stat - J Royal Stat Soc Series C 43:201-221 (1994).
22. Gelman A, Rubin DB. Inference from iterative simulation using multiple sequences (with discussion). Stat Sci 7:457-511 (1992).
23. Stenner RD, Merdink JL, Fisher JW, Bull RJ. Physiologically-based pharmacokinetic model for trichloroethylene considering enterohepatic recirculation of major metabolites. Risk Anal 18:261-269 (1998).
24. Davidian M, Giltinan DM. Nonlinear Models for Repeated Measurement Data. London:Chapman & Hall, 1995.
Last Updated: May 2, 2000