Probit Analysis
Probit
Analysis is a specialized regression model of binomial response variables
(variable with only two outcomes). It transforms the sigmoid dose-response
curve to a straight line that can then be analyzed by regression either through
least squares or maximum likelihood.
Applications of probit analysis
Probit Analysis is commonly used in toxicology to determine the relative toxicity of chemicals to living organisms. This is done by testing the response of an organism under various concentrations of each of the chemicals in question and then comparing the concentrations at which one encounters a response. Here, the response is always binomial (e.g. death/no death) and the relationship between the response and the various concentrations is always sigmoid. Probit analysis transforms the sigmoid to linear and performs regression on the relationship. There are many endpoints used to compare the differing toxicities of chemicals, but the LC50 (liquids) or LD50 (solids) are the most widely used outcomes of the modern dose-response experiments. The LC50/LD50 represent the concentration (LC50) or dose (LD50) at which 50% of the population responds.
The Estimation of the Median Effective Dose:
In Probit analysis successive batches of insects are exposed to different concentrations of the poison/chemical for a constant time and, after a suitable interval, the numbers dead and alive are recorded.
The probit analysis transforms the experimental results in order to estimate µ and σ2 from the following equation.
(1)
The probit of the proportion P is defined as the probability P in a normal distribution with mean 5 and variance 1 i.e. the probit of P is Y, where
(2)
If equation (1) represents the distribution of tolerances of dosages, the expected proportion of insect killed by a dose x0 is given by
(3)
Comparing (2) and (3) for P shows that the probit of expected proportion killed is related to the dosage by the linear equation
(4)
By applying probit transformation, experimental results may be used to give an estimate of this equation, and the parameters of the tolerance distribution may then be estimated. The median effective dosage is estimated as that value of x which gives Y = 5
The Probit regression line
There are two methods of estimation of the parameters i.e. graphical method or statistical process Both depend upon the probit transformation. The graphical approach is rapid and is sufficiently good for many purposes, but for some ore complex problems, or when an accurate assessment of the precision of estimates is needed, the second method is preferred.
Method – I (graphical Approach)
Ø Find out the least tolerated (smallest) dose (100% mortality) and most tolerated (highest) dose (0% mortality) by hit and trial method
Ø Once these two doses are determined, select at least 5 doses in between them, and observe mortality due to these doses
Ø Calculate the percentage kill observed for each dose and convert them to probit.
Ø Apply correction factor to 0% and 100% mortality group [for 0% dead = 100 (0.25/n) and for 100% dead = 100x (n-0.25/n), where n = number of death]
Ø The percentage mortality values are converted to probit values by reading the corresponding probit units from the probit table
Ø The probits are then plotted against x (the logarithm of the dose) and a straight line drawn by eye to fit the points satisfactorily as possible.
Ø The log LD 50 is estimated from the line as m (intercept), the dosage at which Y = 5.
Ø The slope of the line, b, which is an estimate of 1/σ, is obtained.
Ø These two parameters are then substituted in equation (4) to give the estimated relationship between dosage and kill.
Ø To test whether the line is an adequate representation of the data, a χ2 test is performed.
Method 2 (Maximum Likelihood Estimation) (Procedure taken form Lei and Sun (2018)
1. Suppose in a population, n subjects are treated with doses (i) and r subjects show a response to each dose then the empirical proportion (p*) of responders is given by
(1)
where i = 1 to n and n is the number of doses
2. If we have a control group and response occurred in it with proportion C, the proportions of responders were corrected using the Abbott equation for each treatment dose
(2)
3. The corrected proportion (pi) was then converted to a probit value (yi)
(3)
where is inverse of Normal distribution
4. Establish a provisional regression line between yi and the logarithm of the dose (xi)
(4)
5. Calculate the expected probits (Yi)
(5)
6. Calculate the expected response proportion (Pi)
(6)
where Φ(Yi) is the cumulative probability of the standard normal distribution corresponding to (Yi), and C is the natural response proportion if exist
7. Calculate the working probit (yi) as
, (7)
Where (8)
8. Compute expected probits from weighted regression of working probits weighted on xi, with each yi . A weight, niwi,is assigned to each xi and yi where wi is the weighting coefficient computed as
(9)
where C is the natural response of control group
9. The slope β of the working probit regression equation is given by
(10)
10. The intercept α of the working probit regression equation
, (11)
Where (12)
11. The standard error of β is
(13)
and the standard error of α
(14)
12. The χ2 statistic of the probit regression equation is computed as
(15)
The significance level p of the χ2 statistic can be calculated as the right-tailed probability of the chi-squared distribution with n – 2 degrees of freedom (d.f.).
A significant χ2 statistic (p < 0.05) might indicate either that the population did not respond independently or that the fitted probit-log(dose) regression line did not adequately describe the dose-response relationship in the test samples.
To get an optimal fit of the probit-log10(dose) regression, we substituted α and β for α0 and β0 and repeated the calculations of equation (5) to (15) until a stable χ2 appeared, indicating convergence. This procedure is a maximum likelihood (ML) method
The significance of the slope was assessed using the z test
(16)
If h < 1, the model
provided a good fit to the data. Otherwise, standardized residuals will be
plotted to identify outliers or other possible causes of poorness of fit. Each
residual defined the difference between the observed ri and
the expected response number (niPi) for each
dose. The residuals were standardized by dividing them by their standard
errors, . For models providing a
good fit, the standardized residuals fell mostly between −2 and 2.
Standardized residuals distributed randomly showed no systematic patterns or
tendencies toward positive or negative sign.
Calculation of the lethal doses of toxicants or populations and their 95% CLs
In this step, we first calculated the logarithms of the doses (θπ) at which levels of interest (π) gave the expected response proportion
(17)
where yπ was the πth percentile of the probit distribution curve calculated in inverse of normal distribution for the probit distribution.
The πth lethal dose was then calculated as
and The standard error of θπ, σ(θπ)
can be calculated as
(18)
The 95% CL of the LDπ was then given as
10θπ±t0.05,k−2σ(θπ) (19)
Procedure to Perform Probit Analysis using Online Tool (OPSTAT)
Step 1: Enter the data of dose response in the text area provided or paste the data. The sequence of the data should be [dose][total no. of subject][No. of killed] in a single line corresponding to first dose. Enter the data for other doses in separate lines in the similar manner as above. In last line data for control group will be entered. as shown in example below
Conc No. Killed
1.01 50 44
0.89 49 42
0.71 46 24
0.58 48 16
0.41 50 6
0 50 0
Step 2. Submit your data by pressing submit button.
Step
3. Enter
the number of doses except control group in next page and press Analyse button.
The results will be displayed on separate webpage which can be printed or
saved.
References
Ø Finney, D. J., Ed. (1952).Probit Analysis. Cambridge, England, Cambridge University Press.
Ø Finney, D. J. and W. L. Stevens (1948). "A table for the calculation of working probits and weights in probit analysis." Biometrika35(1-2): 191-201.
Ø Greenberg, B. G. (1980). "Chester I. Bliss, 1899-1979." International Statistical Review / RevueInternationale de Statistique8(1): 135-136.
Ø Hahn, E. D. and R. Soyer."Probit and Logit Models: Differences in a Multivariate Realm." Retrieved May 28, 2008, from http://home.gwu.edu/~soyer/mv1h.pdf
Ø Lei, C and Sun, X (2018). Comparing lethal dose ratios using probit regression with arbitrary slopes. BMC Pharmacology and Toxicology 19:61.