Power Analysis and Sample Size Determination for Crossover Trials with Application to Bioequivalence Assessment of Topical Ophthalmic Drugs Using Serial Sampling Pharmacokinetic Data

YU Yong Pei; YAN Xiao Yan; YAO Chen; XIA Jie Lai

doi:10.3967/bes2019.079

Objective To develop methods for determining a suitable sample size for bioequivalence assessment of generic topical ophthalmic drugs using crossover design with serial sampling schemes. Methods The power functions of the Fieller-type confidence interval and the asymptotic confidence interval in crossover designs with serial-sampling data are here derived. Simulation studies were conducted to evaluate the derived power functions. Results Simulation studies show that two power functions can provide precise power estimates when normality assumptions are satisfied and yield conservative estimates of power in cases when data are log-normally distributed. The intra-correlation showed a positive correlation with the power of the bioequivalence test. When the expected ratio of the AUCs was less than or equal to 1, the power of the Fieller-type confidence interval was larger than the asymptotic confidence interval. If the expected ratio of the AUCs was larger than 1, the asymptotic confidence interval had greater power. Sample size can be calculated through numerical iteration with the derived power functions. Conclusion The Fieller-type power function and the asymptotic power function can be used to determine sample sizes of crossover trials for bioequivalence assessment of topical ophthalmic drugs.

HTML

INTRODUCTION

Bioequivalence studies have been widely adopted to demonstrate that two drug formulations have comparable bioavailability. The most commonly used endpoints in bioequivalence trials are pharmacokinetic (PK) parameters^[1]. Generally, bioequivalence is assessed by comparing the area under the concentration-time curve (AUC) and the maximal concentration (C_max)^[2]. According to regulatory guidelines, noncompartmental analysis (NCA) is recommended to estimate AUC and C_max. In a common NCA analysis, all of the subjects follow the same sampling regime, and each subject is sampled at each of a series of time points and drug plasma concentration time profiles. The PK parameters of each individual subject can be estimated based on the complete sampling data. Then, an equivalence test is conducted on the log-transformed PK parameters using Schuirmann's two one-sided tests (TOSTs)^[3]. Finally, bioequivalence can be concluded if the 90% confidence interval of the ratio of the geometric mean lies within the prespecified bioequivalence margin of 0.80 to 1.25.

A research problem has arisen from the development of generic topical ophthalmic drugs. Topical ophthalmic drugs usually have low bioavailability^[4]. Due to both rapid tear turnover and drainage from the ocular surface, formulations have a short contact time with the surface of the eye and only a small percentage of drug remains in the precorneal area after administration^[5]. The quantity of drug that reaches the aqueous humor (AH) is also limited because the corneal barrier^[6], as well as systemic absorption across the conjunctiva, restrict drug absorption into the inner eye^{[7, 8]}. Sampling difficulty is another challenge in ocular drug development^[9]. The AH sample can only be obtained when the eye is undergoing surgery, such as during cataract replacement. Sampling from tears may significantly reduce the amount of medication remaining in the eye. Due to these limitations, complete sampling data for each individual subject are unattainable in pharmacokinetic studies of topical ophthalmic drugs. In this case, a serial sampling design could be applied, which has been commonly used in bioavailability studies using small animals and for which blood sampling has been restricted. In a serial sampling design, only one sample is obtained per subject at one of the prespecified timepoints and the concentration time profile of the study formulation is obtained with multiple subjects sampled at different time points.

The US Food and Drug Administration (FDA) has issued several product-specific guidances (PSGs) to regulate the design issues in the bioequivalence assessment of topical ophthalmic drugs. Single-dose, in vivo, sparse-sampling AH studies with PK endpoints using either a crossover or parallel study design are recommended in these guidances^[1]. An NCA approach based on serial sampling design has been specified to estimate the AUC from 0 to the last time point (AUC_0-t). Ratios of the AUC_0-t, as well as C_max, are required for the bioequivalence assessment. The bioequivalence test for C_max can be conducted using Schuirmann's TOSTs in the same manner as that of complete data designs. The bioequivalence test for the AUC_0-t, however, is complicated because of the serial sampling data.

For parallel study designs, several methods have been proposed to estimate the confidence interval of the ratio of two AUCs with serial sampling data. Wolfsegger^[10] proposed a Fieller-type confidence interval for the ratio of two AUCs with serial sampling data to assess bioequivalence. Jaki et al.^[11] expanded the Fieller-type confidence interval to address the case of sparse sampling designs with flexible sampling regimes. In addition to the Fieller-type confidence interval, Wolfsegger and Jaki^[12] derived an asymptotic confidence interval using the delta method. Jaki et al.^[13] carried out a comparison of seven methods for construction of confidence intervals for ratios of AUCs and demonstrated that the Fieller-type confidence interval, asymptotic confidence interval, and bootstrap-t interval are superior to other methods. Hua et al.^[14] generated an excellent summary of statistical issues regarding bioequivalence assessment with serial sampling pharmacokinetic data.

For bioequivalence assessment in crossover designs with serial sampling data, bootstrap methods are most frequently used. Shen and Machado^[15] developed a nonparametric bootstrap method for bioequivalence assessment to address the case of serial sampling. Harigaya et al.^[4] reviewed studies of topical ophthalmic corticosteroid suspensions submitted to the FDA and showed that bootstrap methods were used in all of these studies. Bootstrap methods provide a straightforward way to derive estimates of standard errors and confidence intervals for complex estimators and can be applied in both parallel and crossover designs. The drawback of the bootstrap method is its relatively cumbersome study design, in which batches of simulations are needed in order to estimate sample size. In contrast, parametric methods have also been proposed. Based on the work of Wolfsegger and Jaki^[12], Jaki et al.^[16] expanded Fieller-type confidence intervals to the case of crossover design. With the method of Jaki et al., batch sampling regimes can be considered, which include serial sampling design as a special case. In a submitted literature of the authors, a simplified Fieller-type confidence interval is derived based on the work of Locke^[17]. This method provides an unbiased estimate of the ratios of AUCs and has greater simplicity than the method of Jaki et al., without a decrease in empirical power or an increase in Type Ⅰ errors.

Bioequivalence studies for topical ophthalmic drugs are often conducted using a large sample size^{[4, 15]}, which is a major challenge in the development of generics. In this way, adaptive designs that allow for minimizing the total sample size or increasing the chances of eventual success would be preferable. Determination of sample size is one of the bases of adaptive adjustment. Despite parametric methods for bioequivalence assessment in crossover designs with serial sampling data having been developed, methods for sample size determination are still limited. Hua et al.^[14] derived the power function for parallel designs with serial sampling data. The question of sample size determination in crossover designs with serial sampling data remains open. Moreover, Fieller's theorem is based on the normality assumption^[18], while pharmacokinetic data are generally considered to be log-normally distributed. Thus, the power of the Fieller-type interval on log normally distributed data requires further investigation.

In the present study, we derive the power function of the Fieller-type confidence interval and the asymptotic confidence interval in crossover designs with serial sampling data. Simulation studies are conducted to evaluate the derived power functions. The impacts of the log-normal distribution, variance, and intra-correlation on the power of the Fieller-type confidence interval and the asymptotic confidence interval are also investigated in the simulation studies. The remainder of the article is organized as follows. In the Methods section, we specify the study design and notation, present the process of the bioequivalence test using Fieller-type confidence intervals, and derive the power function. In the Results section, we describe the simulation results and present an application of this approach in a hypothetical example. Finally, we complete this manuscript with a discussion followed by our conclusions.

METHODS

Study Design and Notation

In this article, we focus on crossover designs with serial sampling schemes. Consider a two-sequence, two-period, two-treatment crossover study. Subjects are randomized into two sequence groups: TR and RT (T = test product; R = reference product). Subjects in each sequence are then randomly assigned to Q-sampling time points $\left( {{t_1}, \ldots, {t_Q}} \right)$. Let the number of subjects at each time point be the same and be denoted by ${n_q}$. Each subject provides one sample in each period. Let ${y_{ijkq}}$ denote the drug concentration of the $i{\rm{th}}$ subject at the $q{\rm{th}}$ time point (q = 1, …, Q) [of period j (j = 1, 2) in sequence k (k = 1, 2)]. Then, using Bailer's algorithm^[19], the estimate of the AUC from 0 to the last time point of sequence k in the $j{\rm{th}}$ period is approximated by:

$$AU{C_{jk}} = \mathop \sum \limits_{q = 1}^Q {c_q}{\mu _{jkq}}$$

(1)

where ${c_q}$ is equal to:

$$ {c_1} = \frac{1}{2}\left( {{t_2} - {t_1}} \right)\;\;{\rm{for}}\;\;q = 1, $$

$$ {c_q} = \frac{1}{2}\left( {{t_{q + 1}} - {t_{q - 1}}} \right)\;\;{\rm{for}}\;\;q = 2, \ldots , {\rm{ Q}} - 1, $$

$${c_Q} = \frac{1}{2}\left( {{t_Q} - {t_{Q - 1}}} \right)\;\;{\rm{for}}\;\;q = {\rm{ Q}}.$$

The $AU{C_{jk}}$ can be estimated by:

$${\widehat {AUC}_{jk}} = \mathop \sum \limits_{q = 1}^Q {c_q}{\bar y_{jkq}}$$

(2)

with ${\bar y_{jkq}} = \frac{1}{{{n_q}}}\mathop \sum \limits_{i = 1}^{{n_q}} {y_{ijkq}}$. Since subjects at different time points are independent, there are no covariance terms involved in the variance of ${\widehat {AUC}_{jk}}$, which is estimated by:

$${\hat s^2}\left( {{{\widehat {AUC}}_{jk}}} \right) = \hat \sigma _{jk}^2 = \mathop \sum \limits_{q = 1}^Q \frac{{c_q^2\hat s_{jkq}^2}}{{{n_q}}}$$

(3)

with $\hat s_{jkq}^2 = \frac{1}{{{n_q} - 1}}\mathop \sum\nolimits_{i = 1}^{{n_q}} \left( {{y_{ijkq}} - {{\bar y}_{jkq}}} \right).$ For simplicity, we will use notations for the AUCs in each sequence and period as listed in Table 1.

Sequence	Period 1 (j = 1)	Period 2 (j = 2)
Sequence TR (k = 1)	$\hat a$	$\hat b$
Sequence RT (k = 2)	$\hat c$	$\hat d$

Table 1. Notations of AUCs in Each Sequence and Period

To assess the bioequivalence between two products, we denote the AUCs of the test product and reference product by κ and λ, respectively, and define the ratio of the two AUCs as $\theta $ = κ/λ. We assume that there is no carry-over effect, as the washout period is long enough.

Bioequivalence Assessment

As specified by FDA guidance documents, the ratio of AUC_0-t from the test product to the AUC_0-t from the reference product is used to assess bioequivalence. Bioequivalence can be claimed if the ratios of AUCs lie within the prespecified equivalence range $\left( {{\theta _1}, {\rm{}}{\theta _2}} \right)$, which is frequently set to be (0.80, 1.25) as recommended by regulatory authorities. The hypotheses for the bioequivalence test of two AUCs can be stated as:

$$ {H_0}:\frac{{AU{C^T}}}{{AU{C^R}}} \le {\theta _1}\;or\;\frac{{AU{C^T}}}{{AU{C^R}}} \ge {\theta _2} $$

(4)

$$ {H_1}:{\theta _1} < \frac{{AU{C^T}}}{{AU{C^R}}} < {\theta _2} $$

(5)

Based on the work of Locke^[17], the estimators of κ and λ are given by:

$${\widehat {AUC}^T} = \hat \kappa = \frac{{\hat a + \hat d}}{2}$$

(6)

and

$${\widehat {AUC}^R} = \hat \lambda = \frac{{\hat b + \hat c}}{2}$$

(7)

The standard errors are then given by:

$${s^2}\left( {{{\widehat {AUC}}^T}} \right) = \hat \xi _\kappa ^2 = \frac{1}{4}\left( {\hat \sigma _a^2 + \hat \sigma _d^2} \right)$$

(8)

and

$$ {s^2}\left( {{{\widehat {AUC}}^R}} \right) = \hat \xi _\lambda ^2 = \frac{1}{4}\left( {\hat \sigma _b^2 + \hat \sigma _c^2} \right) $$

(9)

The covariance of $\hat \kappa $ and $\hat \lambda $ can be estimated as:

$${\hat \xi _{\kappa , \lambda }} = \frac{1}{4}\left( {{{\hat \sigma }_{a, b}} + {{\hat \sigma }_{c, d}}} \right)$$

(10)

Note that in the context of crossover designs, $\hat \kappa $ and $\hat \lambda $ include the individual drug effect and the mean of the fixed period effects and are not unbiased estimators of the individual effects of T and R if the period effects are accounted for^[20]. Thus, it is necessary to make an assumption that the mean period effect is 0, which is a reasonable model assumption in crossover trials. With this assumption, the parameter of interest $\hat \theta = \hat \kappa /\hat \lambda $ is an unbiased estimator of the ratio of the individual effects of T and R.

Using Fieller's method, a confidence interval for the parameter of interest $\theta $ is derived from the statistic:

$$T = \frac{{\hat \kappa - \theta \hat \lambda }}{{\sqrt {\hat \xi _\kappa ^2 + {\theta ^2}\hat \xi _\lambda ^2 - 2\theta {{\hat \xi }_{\kappa , \lambda }}} }}$$

(11)

The statistic T has a central t distribution. The corresponding degree of freedom can be obtained using the Satterthwaite approximation^{[16, 21]}, which gives:

$$ \hat \nu = \frac{{{{\left( {\hat \xi _\kappa ^2 + {\theta ^2}\hat \xi _\lambda ^2} \right)}^2}}}{{\hat \xi _\kappa ^4/\left( {{n_q} - 1} \right) + {\theta ^4}\hat \xi _\lambda ^4/\left( {{n_q} - 1} \right)}} $$

(12)

Hence, by resolving the following equation about T:

$$\left\{ {\theta {\rm{|}}{T^2} \le t_{\alpha , \hat \upsilon }^2} \right\}$$

(13)

a (1-2α) × 100% confidence interval for $\theta $ can be obtained. The two roots of this quadratic are the lower and upper limits of the Fieller-type confidence interval for $\theta $, which are given by:

$${\theta _L} = \left[ { - B - {{\left( {{B^2} - AC} \right)}^{1/2}}} \right]/A$$

(14)

and

$${\theta _U} = \left[ { - B + {{\left( {{B^2} - AC} \right)}^{1/2}}} \right]/A$$

(15)

where

$${\rm{A}} = {\hat \lambda ^2} - t_{\alpha , \hat \upsilon }^2\hat \xi _\lambda ^2$$

$${\rm{B}} = t_{\alpha , \hat \upsilon }^2{\hat \xi _{\kappa , \lambda }} - \hat \lambda \hat \kappa $$

$${\rm{C}} = {\hat \kappa ^2} - t_{\alpha , \hat \upsilon }^2\hat \xi _\kappa ^2$$

We reject ${H_0}$ at the ${\rm{ \mathsf{ α} }}$ level of significance if $\theta $₁ < ${\theta _L}$ and $\theta $₂ < $\theta $_U. As discussed by Fieller^{[11, 18, 22]}, obtaining an interpretable confidence interval requires that ${\hat \lambda ^2}/\hat \xi _\lambda ^2 > t_{\alpha /2}^2$ and ${\hat \kappa ^2}/\hat \xi _\kappa ^2 > t_{\alpha /2}^2$ are both satisfied. In other words, both $\kappa $ and $\lambda $ should be statistically significant compared to 0 in order to construct a Fieller-type confidence interval that contains no negative values.

Sample-size Determination

As presented by Hauschke et al.^[23] and Berger et al.^[24], a likelihood ratio test proposed by Sasabuchi^[25], referred to as the T₁/T₂ test, can always lead to the same decision on bioequivalence with the Fieller-type confidence interval. In this way, the power of the Fieller-type confidence interval can be analyzed using the power function of the T₁/T₂ test. Hua et al.^[14] demonstrated the power function of the T₁/T₂ test for parallel designs with serial sampling data. With a modification on the work of Hauschke et al.^[23], the power function of the T₁/T₂ test can be applied to address the case of crossover designs with serial sampling data.

The statistics of the T₁/T₂ test in the crossover design with serial-sampling data are given by:

$${T_l} = \frac{{\hat \kappa - {\theta _l}\hat \lambda }}{{\sqrt {\hat \xi _\kappa ^2 + \theta _l^2\hat \xi _\lambda ^2 - 2{\theta _l}{{\hat \xi }_{\kappa , \lambda }}} }}, l = 1, 2$$

(16)

Bioequivalence can be concluded if and only if ${T_1} > {t_{\alpha, \hat \upsilon }}$ and ${T_2} < - {t_{\alpha, \hat \upsilon }}$. Hence, when the variances of AUCs of the test product and reference are equal, the power of crossover design, referred to as 'Fieller-type power,' is given by:

$$ 1 - \beta = Pr({T_1} > {t_{{\rm{a, v}}}}\;{\rm{and}}\;\;{T_2} < - {t_{{\rm{a, v}}}}|{\theta _1} < \kappa /\lambda < {\theta _2}, \;{\xi ^2}_{\rm{ \mathsf{ κ} }}, {\xi ^2}_{\rm{ \mathsf{ λ} }}, \;\xi \kappa , \lambda , \;p) $$

(17)

where $\rho $ is the correlation coefficient between T₁ and T₂. Note that in a T₁/T₂ test, two statistics ${T_1}$ and T₂ are correlated. The random vector (T₁/T₂) follows a bivariate non-central t-distribution with non-centrality parameters φ₁, φ₂, and correlation coefficient $\rho $, which are given by:

$${\varphi _l} = \frac{{\kappa - {\theta _l}\lambda }}{{\sqrt {\hat \xi _\kappa ^2 + \theta _l^2\hat \xi _\lambda ^2 - 2{\theta _l}{{\hat \xi }_{\kappa , \lambda }}} }}, \;l = 1, 2$$

(18)

$$\rho \left( {{T_1}, {T_2}} \right) = \frac{{\hat \xi _\kappa ^2 + {\theta _1}{\theta _2}\hat \xi _\lambda ^2 - {{\hat \xi }_{\kappa , \lambda }}\left( {{\theta _1} + {\theta _2}} \right)}}{{\sqrt {\left( {\hat \xi _\kappa ^2 + \theta _1^2\hat \xi _\lambda ^2 - 2{\theta _1}{{\hat \xi }_{\kappa , \lambda }}} \right)\left( {\hat \xi _\kappa ^2 + \theta _2^2\hat \xi _\lambda ^2 - 2{\theta _2}{{\hat \xi }_{\kappa , \lambda }}} \right)} }}$$

(19)

The parameters φ₁ and φ₂ are related to n_q, which is the number of subjects per timepoint of each sequence. In this way, for a specific power level, the sample size n_q can be calculated numerically using Owen' Q function, as:

$$1 - \beta = Q\left( {\infty , - {t_{\alpha , \nu }}, {\varphi _1}, {\varphi _2}, \rho } \right) - Q\left( {{t_{\alpha , \nu }}, - {t_{\alpha , \nu }}, {\varphi _1}, {\varphi _2}, \rho } \right)$$

(20)

The details regarding Owen' Q function are described in Supplementary in File S1 (available in www.besjournal.com). If the two AUCs have equal variances, the non-centrality parameters φ₁ and φ₂ can be simplified by applying the pooled variance ${\hat \xi ^2}$ as a substitute of $\hat \xi _\kappa ^2$ and $\hat \xi _\lambda ^2$, as follows:

$${\varphi _l} = \frac{{\theta - {\theta _l}}}{{{{\widehat {CV}}_R}\sqrt {\frac{{1 + \theta _l^2 - 2{\theta _l}\hat r}}{{{n_q}}}} }}, l = 1, 2$$

(21)

where ${\widehat {CV}_R}$ is the coefficient of variance of AUC of the reference product and $\hat r = {\hat \xi _{\kappa, \lambda }}/{\hat \xi ^2}$. Then, the sample size of each timepoint in each sequence n_q can be calculated with $\theta $, $\hat r$ and $\widehat {CV}$.

As presented by Hirschberg et al.^[26], the asymptotic confidence interval is a common alternative of the Fieller-type confidence interval. The asymptotic confidence interval uses a delta method to obtain an approximate variance of the parameter of interest, $\theta $, which here we refer to as ${\sigma _\theta }$. Thus, the asymptotic confidence interval is given by:

$$\left( {\hat \theta - {Z_{1 - \alpha }}{{\hat \sigma }_\theta }, \hat \theta + {Z_{1 - \alpha }}{{\hat \sigma }_\theta }} \right)$$

(22)

where

$${\hat \sigma _\theta } = \sqrt {\frac{{\hat \xi _\kappa ^2 + {{\hat \theta }^2}\hat \xi _\lambda ^2 - 2\hat \theta {{\hat \xi }_{\kappa , \lambda }}}}{{{{\hat \lambda }^2}}}} $$

With an assumption of equal variance, the ${\hat \sigma _\theta }$ can be simplified as:

$${\hat \sigma _\theta } = {\widehat {CV}_R}\sqrt {\frac{{1 + {{\hat \theta }^2} - 2\hat \theta \hat r}}{{{n_q}}}} $$

(23)

Bioequivalence can be concluded if $\hat \theta - {Z_{1 - \alpha }}{\hat \sigma _\theta } > {\theta _1}$ and $\hat \theta + {Z_{1 - \alpha }}{\hat \sigma _\theta } < {\theta _2}$. In this way, the power of the asymptotic confidence-interval method, referred to as 'asymptotic power, ' can be derived based on a two-one sided test^[27], which is given by:

$$1 - \beta = {\rm{\Phi }}\left( {\frac{{\hat \theta - {\theta _1}}}{{{{\hat \sigma }_\theta }}} - {Z_{1 - \alpha }}} \right) + {\rm{\Phi }}\left( {\frac{{\hat \theta - {\theta _2}}}{{{{\hat \sigma }_\theta }}} - {Z_{1 - \alpha }}} \right) - 1$$

(24)

Analogously, n_q can be obtained by iterating the power function. An R program for sample-size estimation is provided in Suppelementary File S1.

DISCUSSION

In the development of generic topical ophthalmic drugs, serial sampling is the most common sampling scheme due to the limitations of low bioavailability and difficulties in sampling. Although parametric methods for bioequivalence assessment in crossover designs with serial sampling data have been developed^{[14, 16]}, methods for sample-size determination are still limited. In this paper, we derived the power function of the Fieller-type confidence interval and the power function of the asymptotic confidence interval in crossover designs with serial sampling data. Simulation studies revealed that both the Fieller-type power function and the asymptotic power function could provide a reliable estimate for the power of bioequivalence assessment when normality assumptions were satisfied, as the values of both power functions were very close to the corresponding empirical power drawn from normal data. Moreover, both the Fieller-type power function and the asymptotic power function would yield a conservative estimate of power in cases when the study data were log-normally distributed. Both of the two power functions could be applied in power analysis for bioequivalence assessment.

Our derived power functions provide a simple way for determination of sample size in bioequivalence assessment using crossover design with serial sampling data. In current practice, bootstrap methods are most frequently used in bioequivalence assessment of generic topical ophthalmic drugs^[4]. The sample size determination for bootstrap methods is a time-consuming process since it is carried out through batches of simulation studies. Through numerical iteration methods, the sample size for the Fieller-type confidence interval and the asymptotic confidence interval can be determined by the Fieller-type power function and the asymptotic power function, respectively. We recommend the use of the Fieller-type power if the expected ratio of AUCs is less than or equal to 1 since, in such case, the power of the Fieller-type confidence interval is larger than the asymptotic confidence interval. If the expected ratio of AUCs is larger than 1, the asymptotic confidence interval should be used since it has larger power. In practice, it might be difficult to make assumptions on the expected ratio of AUCs, especially without a previous pilot study. We consider that Fieller-type power could be preferable when the information on the test product is limited because the Fieller-type interval shows more robustness than the asymptotic interval in some extreme cases, such as when the variance of reference drug is too large, as suggested by Hirschberg et al.^[26] Also, it would be reasonable to assume that the relative bioavailability, e.g., the ratio of AUCs, is less than 1 in study design. To avoid a waste of resources due to the overestimate of sample size, we suggest that the nominal power should be at least 80% since the difference between empirical power and power function value would be minor due to log normality.

In vivo bioequivalence studies for topical ophthalmic drugs are often conducted using a large sample size. Harigaya et al.^[4] reviewed six bioequivalence studies of topical ophthalmic corticosteroid suspensions submitted to the FDA and showed that the numbers of subjects in each time point are around 70. Shen and Machado^[15] illustrated a bioequivalence study of Tobradex AF suspension and TOBRADEX ophthalmic suspension, in which 987 subjects (at least 75 patients per timepoint) were enrolled to establish bioequivalence. This is because, in a serial sampling design, each subject must be involved in only one time point, and the AUC would be a linear combination of the average concentration of each time point. Correspondingly, the variance of the AUC would be a weighted sum of the variance of each time point, and the sample size needed to demonstrate bioequivalence would be consequently large. Thus, adaptive designs that allow early stopping or sample size reestimation would be preferable because of their advantage with respect to resource-saving or increasing the chances of eventual success. Based on our derived power functions, the conditional power function^[29] and conditional target power^[30], which are essential to adaptive adjustment, can be derived. With the power function of bioequivalence tests for the crossover trials with serial sampling design, adaptive designs-such as adaptive sample size sequential methods^[31] and multiple-stage adaptive designs^[32-34]-could be further considered in the context of serial sampling design.

CONCLUSIONS

The Fieller-type power function and the asymptotic power function can provide precise power estimates when normality assumptions are satisfied, and they may yield conservative estimates of power in cases when data are log-normally distributed. With these power functions, the sample size needed for bioequivalence assessment using crossover design with serial sampling data can be determined through numerical iteration methods. Adaptive sample size sequential methods and multiple-stage adaptive designs for crossover designs with serial sampling schemes could be cultivated further based on the derived power functions.

Supplementary File S1: Owen's Q function.

In this paper, we use Owen's Q function to calculate the cumulative density of a bivariate non-central t-distribution. The structure of Owen's Q function is shown below:

$$ \begin{array}{l} Q\left( {{t_1}, {t_2}, {\varphi _1}, {\varphi _2}, \rho } \right) = \frac{{\sqrt {2{\rm{ \mathsf{ π} }}} }}{{\mathit{\Gamma} \left( {\nu /2} \right){2^{\left( {\upsilon - 2} \right)/2}}}}\int_0^\infty {\mathit{\Phi} _2}(\frac{{{t_1}x}}{{\sqrt \upsilon }}\\ \;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\; - {\varphi _1}, \frac{{{t_2}x}}{{\sqrt \upsilon }} - {\varphi _2}, \rho ){x^{\upsilon - 1}}\mathit{\Phi} '\left( x \right)dx, \end{array} $$

(25)

$$\begin{array}{l} {\mathit{\Phi} _2}\left( {x, y, \rho } \right)\\ = \frac{1}{{2{\rm{ \mathsf{ π} }}\sqrt {\left( {1 - {\rho ^2}} \right)} }}\int_{ - \infty }^x \int_{ - \infty }^y {\rm{exp}}\left( { - \frac{{{u^2} - 2\rho u\upsilon + {\upsilon ^2}}}{{2\left( {1 - {\rho ^2}} \right)}}} \right)dudv, \\ \;\;\;\;\;\;\;\;\;\;\;\;\;\mathit{\Phi} '\left( x \right) = \frac{1}{{\sqrt {2\pi } }}{\rm{exp}}\left( { - \frac{{{x^2}}}{2}} \right) \end{array}$$

(26)

where v is the degrees of freedom obtained using Satterthwaite approximation, φ₁, φ₂ are non-centrality parameters, and $\rho $ is correlation coefficient.

ACKNOWLEDGMENTS

We thank LetPub (www.letpub.com) for its linguistic assistance during the preparation of this manuscript.

AUTHOR CONTRIBUTIONS

YY carried out the derivation, performed the data analysis and drafted the manuscript. JX conceived of research questions and revised the manuscript. YY, XY, and CY designed and performed the simulation study. XY and CY critically commented and revised the manuscript. All authors read and approved the final version of the manuscript.

Suppelementary File 1. pgrograms

Program 1. R program for calculating the power of the Fieller-type confidence interval and the asymptotic confidence interval

power.serialsample<- function(auc=1, var.auc=1, cov.auc=1, nq=1)

{

  t.auc<-auc[1];

  r.auc<-auc[2];

  t.aucse2<-var.auc[1]/nq;

  r.aucse2<-var.auc[2]/nq;

  cov.aucse<-cov.auc/nq;

  feiller.t1<-(t.auc-0.80*r.auc)/sqrt(t.aucse2+0.80^2*r.aucse2-2*0.80*cov.aucse);

  feiller.t2<-(t.auc-1.25*r.auc)/sqrt(t.aucse2+1.25^2*r.aucse2-2*1.25*cov.aucse);

  corr.t1t2<-

(t.aucse2+0.80*1.25*r.aucse2-(0.80+1.25)*cov.aucse)/sqrt((t.aucse2+0.80^2*r.aucse2-2*0.80*cov.aucse)*(t.aucse2+1.25^2*r.aucse2-2*1.25*cov.aucse));

  theta<-t.auc/r.auc;

  df=(t.aucse2+theta^2*r.aucse2)^2/(t.aucse2^2/(2*nq-2)+theta^4*r.aucse2^2/(2*nq-2));

  asy.se<-sqrt((t.aucse2+theta^2*r.aucse2-2*theta*cov.aucse)/r.auc^2);

  asy.t1=(theta-0.80)/asy.se;

  asy.t2=(theta-1.25)/asy.se;

  powerasy.lower<-1-pt(qt(0.95, df), df, asy.t1);

  powerasy.upper<-pt(-1*qt(0.95, df), df, asy.t2);

  asy.power<-powerasy.upper+powerasy.lower-1;

  library(mvtnorm)

  delta<-c(feiller.t1, feiller.t2);

  df<-as.integer(df);

  rho<-corr.t1t2;

  corr<-diag(2);

  corr[1, 2]<-corr[2, 1]<-rho;

  upper<-c(Inf, 0);

  upper[2]<- -qt(1-0.05, df);

  power<-pmvt(upper=upper, delta=delta, corr=corr, df=df);

  upper[1]<- qt(1-0.05, df);

  power<- power-pmvt(upper=upper, delta=delta, corr=corr, df=df);

  fieller.power<-power;

  result<-data.frame(feiller.t1, feiller.t2, corr.t1t2, theta, df, asy.t1, asy.t2, asy.se, fieller.power, asy.power);

  return(result[c("fieller.power", "asy.power")])

}

Program 2. R program for calculating the sample size per timepoint for bioequivalence using the Fieller-type confidence interval or the asymptotic confidence interval

npergroup.serialsample<- function(auc=1, var.auc=1, cov.auc=1, target.power=0.8, ci.method="fieller")

{

  nq<-5;

  if (!(tolower(ci.method) %in% c("fieller", "asymptotic"))){

    warning("Method unkown. Will use the lower one the Fieller-type power and the Asymptotic power.\n", call.=FALSE, immediate.=TRUE)

  }

  method.list<-c("Fieller-type power", "Asymptotic power");

  repeat{

      if (tolower(ci.method)=="fieller"){

         method.flag=1;

         calc.power<-power.serialsample(auc=auc, var.auc=var.auc, cov.auc=cov.auc, nq=nq)[method.flag];

         power.method<-method.list[1];

      }

      if (tolower(ci.method)=="asymptotic"){

         method.flag=2;

         calc.power<-power.serialsample(auc=auc, var.auc=var.auc, cov.auc=cov.auc, nq=nq)[method.flag];

         power.method<-method.list[2];

      }

      if (!(tolower(ci.method) %in% c("fieller", "asymptotic"))){

          calc.power<-power.serialsample(auc=auc, var.auc=var.auc, cov.auc=cov.auc, nq=nq);

          method.flag=which.min(calc.power);

          calc.power<-calc.power[which.min(calc.power)];

          power.method<-method.list[method.flag];

      }

      nq<-nq+1;

      if(calc.power[1]>=0.8){

        npergroup<-nq-1;

        target.power<-calc.power;

        names(target.power)<-"Power";

        break

      }

    }

      result<-data.frame(power.method, target.power, npergroup);

    return(result)

}

p1<-npergroup.serialsample(auc=c(118853.61, 126004.00), var.auc=c(1489997446.5, 3109615770.9), cov.auc=815789682.12, target.power=0.8, ci.method="fieller")

p2<-npergroup.serialsample(auc=c(118853.61, 126004.00), var.auc=c(1489997446.5, 3109615770.9), cov.auc=815789682.12, target.power=0.8, ci.method="asymptotic")

Reference (34)

[1]	Yu LX, Li BV. FDA bioequivalence standards. New York: AAPS Press/Springer, 2014.
[2]	Chow SC, Liu JP. Design and analysis of bioavailability and bioequivalence studies. Boca Raton: CRC Press, 2009.
[3]	Endrenyi L, Blume HH, Tothfalusi L. The Two Main Goals of Bioequivalence Studies[J]. AAPS J, 2017, 19(): 885-90. doi: 10.1208/s12248-017-0048-x
[4]	Harigaya Y, Jiang X, Zhang H. Bioequivalence Study Methods with Pharmacokinetic Endpoints for Topical Ophthalmic Corticosteroid Suspensions and Effects of Subject Demographics[J]. Pharmaceutical Res, 2018, 36(): 13-.
[5]	Zhu H, Chauhan A. Effect of viscosity on tear drainage and ocular residence time[J]. Optom Vis Sci, 2008, 85(): 715-25. doi: 10.1097/OPX.0b013e3181824dc4
[6]	Li M, Wang ZL, Gou LY. Evaluation of the protein requirement in Chinese young adults using the indicator amino acid oxidation technique[J]. Biomed Environ Sci, 2013, 26(): 655-62.
[7]	Ahmed I, Patton TF. Importance of the noncorneal absorption route in topical ophthalmic drug delivery[J]. Invest Ophthalmol Vis Sci, 1985, 26(): 584-7.
[8]	Zhang YP, Peng XY, Li ZH. Hyperglycemic effects of a periocular dexamethasone injection in diabetic patients after vitreoretinal surgery[J]. Biomed Environ Sci, 2012, 25(): 311-6.
[9]	Deng F, Ranta VP, Kidron H. General Pharmacokinetic Model for Topically Administered Ocular Drug Dosage Forms[J]. Pharmaceutical Research, 2016, 33(): 2680-90. doi: 10.1007/s11095-016-1993-2
[10]	Wolfsegger MJ. Establishing bioequivalence in serial sacrifice designs[J]. J Pharmacokinet Pharmacodyn, 2007, 34(): 103-13. doi: 10.1007/s10928-006-9037-x
[11]	Jaki T, Wolfsegger MJ, Lawo JP. Establishing Bioequivalence in Complete and Incomplete Data Designs Using AUCs[J]. J Biopharm Stat, 2010, 20(): 803-20. doi: 10.1080/10543401003618835
[12]	Wolfsegger MJ, Jaki T. Assessing Systemic Drug Exposure in Repeated Dose Toxicity Studies in the Case of Complete and Incomplete Sampling[J]. Biom J, 2009, 51(): 1017-29.
[13]	Jaki T, Wolfsegger MJ, Ploner M. Confidence intervals for ratios of AUCs in the case of serial sampling:a comparison of seven methods[J]. Pharm Stat, 2009, 8(): 12-24. doi: 10.1002/pst.321
[14]	Hua SY, Hawkins DL, Zhou J. Statistical considerations in bioequivalence of two area under the concentration-time curves obtained from serial sampling data AU-Hua, Steven Y[J]. J Applied Stat, 2013, 40(): 1140-54. doi: 10.1080/02664763.2013.780234
[15]	Shen MY, Machado SG. Bioequivalence evaluation of sparse sampling pharmacokinetics data using bootstrap resampling method[J]. J Biopharm Stat, 2017, 27(): 257-64. doi: 10.1080/10543406.2016.1265543
[16]	Jaki T, Pallmann P, Wolfsegger MJ. Estimation in AB/BA crossover trials with application to bioequivalence studies with incomplete and complete data designs[J]. Stat Med, 2013, 32(): 5469-83. doi: 10.1002/sim.5886
[17]	Locke CS. An exact confidence interval from untransformed data for the ratio of two formulation means[J]. J Pharmacokinet Biopharm, 1984, 12(): 649-55. doi: 10.1007/BF01059558
[18]	Herson J. Fieller's theorem vs. The delta method for significance intervals for ratios[J]. J Stat Computation Simulation, 1975, 3(): 265-74. doi: 10.1080/00949657508810091
[19]	Bailer AJ. Testing for the equality of area under the curves when using destructive measurement techniques[J]. J Pharmacokine Biopharm, 1988, 16(): 303-9. doi: 10.1007/BF01062139
[20]	Jones B, Kenward MG. Design and analysis of cross-over trials. Boca Raton: CRC Press/Taylor & Francis, 2014.
[21]	Satterthwaite FE. An Approximate Distribution of Estimates of Variance Components[J]. Biometrics Bulletin, 1946, 2(): 110-4. doi: 10.2307/3002019
[22]	Fieller EC. Some Problems in Interval Estimation[J]. Journal of the Royal Statistical Society Series B (Methodological), 1954, 16(): 175-85. doi: 10.1111/j.2517-6161.1954.tb00159.x
[23]	Hauschke D, Kieser M, Diletti E. Sample size determination for proving equivalence based on the ratio of two means for normally distributed data[J]. Stat Med, 1999, 18(): 93-105. doi: 10.1002/(SICI)1097-0258(19990115)18:1<93::AID-SIM992>3.0.CO;2-8
[24]	Berger RL, Hsu JC. Bioequivalence Trials, Intersection-Union Tests and Equivalence Confidence Sets[J]. Stat Sci, 1996, 11(): 283-302. doi: 10.1214/ss/1032280304
[25]	Sasabuchi S. A Test of a Multivariate Normal Mean with Composite Hypotheses Determined by Linear Inequalities[J]. Biometrika, 1980, 67(): 429-39. doi: 10.1093/biomet/67.2.429
[26]	Hirschberg J, Lye J. A Geometric Comparison of the Delta and Fieller Confidence Intervals[J]. Am Stat, 2010, 64(): 234-41. doi: 10.1198/tast.2010.08130
[27]	Julious SA. Sample sizes for clinical trials with Normal data[J]. Stat Med, 2004, 23(): 1921-86. doi: 10.1002/sim.1783
[28]	Chiambaretta F, Garraffo R, Elena PP. Tear concentrations of azithromycin following topical administration of a single dose of azithromycin 0.5%, 1.0%, and 1.5% eyedrops (T1225) in healthy volunteers[J]. Eur J Ophthalmol, 2008, 18(): 13-20. doi: 10.1177/112067210801800103
[29]	Mehta CR, Pocock SJ. Adaptive increase in sample size when interim results are promising:A practical guide with examples[J]. Stat Med, 2011, 30(): 3267-84. doi: 10.1002/sim.4102
[30]	Maurer W, Jones B, Chen Y. Controlling the type Ⅰ error rate in two-stage sequential adaptive designs when testing for average bioequivalence[J]. Stat Med, 2018, 37(): 1587-607. doi: 10.1002/sim.7614
[31]	Potvin D, DiLiberti CE, Hauck WW. Sequential design approaches for bioequivalence studies with crossover designs[J]. Pharmaceutical Stat, 2008, 7(): 245-62. doi: 10.1002/pst.294
[32]	Kieser M, Rauch G. Two-stage designs for cross-over bioequivalence trials[J]. Stat Med, 2015, 34(): 2403-16. doi: 10.1002/sim.6487
[33]	Xu J, Audet C, DiLiberti CE. Optimal adaptive sequential designs for crossover bioequivalence studies[J]. Pharmaceutical Stat, 2016, 15(): 15-27. doi: 10.1002/pst.1721
[34]	Yan F, Zhu H, Liu J. Design and inference for 3-stage bioequivalence testing with serial sampling data[J]. Pharmaceutical Stat, 2018, 17(): 458-76.

Sample Size per Timepoints		Type-Ⅰ Error		Power
Sample Size per Timepoints		Ratio = 0.80	Ratio = 1.25	Ratio = 0.95	Ratio = 1.00	Ratio = 1.05
20
	>Log-normal Fieller EP	3.46	4.40	73.30	85.50	78.50
	Normal Fieller EP	4.94	5.08	69.08	81.28	73.84
	Fieller-type power	4.99	5.00	67.70	79.89	73.37
	Log-normal asymptotic EP	3.56	6.04	70.64	84.36	80.04
	Normal asymptotic EP	4.20	6.52	67.08	81.60	77.36
	Asymptotic power	5.00	4.98	65.87	81.10	78.63
30
	Log-normal Fieller EP	3.10	4.50	88.92	95.58	91.04
	Normal Fieller EP	5.34	4.78	84.86	95.82	88.80
	Fieller-type power	5.00	5.00	84.89	94.84	88.81
	Log-normal asymptotic EP	3.20	5.84	87.22	95.04	91.82
	Normal asymptotic EP	4.76	5.84	83.42	96.16	90.66
	Asymptotic power	5.00	5.00	81.55	94.59	93.48
*Note.* EP: Empirical power.

Sequence	Period 1	Period 2
Sequence (TR)	96022.94	104076.84
Sequence (RT)	147931.16	141684.28

Power Analysis and Sample Size Determination for Crossover Trials with Application to Bioequivalence Assessment of Topical Ophthalmic Drugs Using Serial Sampling Pharmacokinetic Data

doi: 10.3967/bes2019.079

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Power Analysis and Sample Size Determination for Crossover Trials with Application to Bioequivalence Assessment of Topical Ophthalmic Drugs Using Serial Sampling Pharmacokinetic Data

doi: 10.3967/bes2019.079

1. College of Military Preventive Medicine, the Fourth Military Medical University, Xi'an 710032, Shaanxi, China

2. Peking University Clinical Research Institute, Peking University Health Science Center, Beijing 100191, China

Corresponding author: XIA Jie Lai, Tel:86-13571999716, E-mail:xiajielai@fmmu.edu.cn