Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models

Genest, Christian; Rémillard, Bruno

doi:10.1214/07-AIHP148

Genest, Christian ; Rémillard, Bruno

Annales de l'I.H.P. Probabilités et statistiques, Tome 44 (2008) no. 6, pp. 1096-1127.

Résumé
Abstract

Pour tester qu’une loi $P$ donnée provient d’une famille paramétrique $𝒫$ , on est souvent amené à comparer une estimation non paramétrique $A_{n}$ d’une fonctionnelle $A$ de $P$ à un élément $A_{θ_{n}}$ correspondant à une estimation $θ_{n}$ de $θ$ . Dans bien des cas, la loi asymptotique de statistiques de tests bâties à partir du processus $n^{1 / 2} (A_{n} - A_{θ_{n}})$ dépend de la loi inconnue $P$ . On montre ici que si les suites $A_{n}$ et $θ_{n}$ d’estimateurs sont régulières dans un sens précis, le recours au rééchantillonnage paramétrique conduit à des approximations valides des seuils des tests. Autrement dit si $A_{n}^{*}$ et $θ_{n}^{*}$ sont des analogues de $A_{n}$ et $θ_{n}$ déduits d’un échantillon de loi $P_{θ_{n}}$ , les processus empiriques $n^{1 / 2} (A_{n} - A_{θ_{n}})$ et $n^{1 / 2} (A_{n}^{*} - A_{θ_{n}^{*}})$ convergent alors conjointement en loi vers des copies indépendantes de la même limite. Ce résultat est employé pour valider l’approche par rééchantillonnage paramétrique dans le cadre de tests d’adéquation pour des familles de lois et de copules multivariées. Deux types de tests sont envisagés : les uns comparent la version empirique d’une loi ou d’une copule et son estimation paramétrique sous l’hypothèse nulle ; les autres mesurent la distance entre les estimations paramétrique et non paramétrique de la loi associée à la transformation intégrale de probabilité classique. La validité du rééchantillonnage à deux degrés est aussi démontrée dans les cas où l’estimation paramétrique est difficile à calculer. La méthodologie est illustrée au moyen d’un nouveau test d’adéquation de copules fondé sur une fonctionnelle de Cramér-von Mises du processus de copule empirique.

In testing that a given distribution $P$ belongs to a parameterized family $𝒫$ , one is often led to compare a nonparametric estimate $A_{n}$ of some functional $A$ of $P$ with an element $A_{θ_{n}}$ corresponding to an estimate $θ_{n}$ of $θ$ . In many cases, the asymptotic distribution of goodness-of-fit statistics derived from the process $n^{1 / 2} (A_{n} - A_{θ_{n}})$ depends on the unknown distribution $P$ . It is shown here that if the sequences $A_{n}$ and $θ_{n}$ of estimators are regular in some sense, a parametric bootstrap approach yields valid approximations for the $P$ -values of the tests. In other words if $A_{n}^{*}$ and $θ_{n}^{*}$ are analogs of $A_{n}$ and $θ_{n}$ computed from a sample from $P_{θ_{n}}$ , the empirical processes $n^{1 / 2} (A_{n} - A_{θ_{n}})$ and $n^{1 / 2} (A_{n}^{*} - A_{θ_{n}^{*}})$ then converge jointly in distribution to independent copies of the same limit. This result is used to establish the validity of the parametric bootstrap method when testing the goodness-of-fit of families of multivariate distributions and copulas. Two types of tests are considered: certain procedures compare the empirical version of a distribution function or copula and its parametric estimation under the null hypothesis; others measure the distance between a parametric and a nonparametric estimation of the distribution associated with the classical probability integral transform. The validity of a two-level bootstrap is also proved in cases where the parametric estimate cannot be computed easily. The methodology is illustrated using a new goodness-of-fit test statistic for copulas based on a Cramér-von Mises functional of the empirical copula process.

MR Zbl | 4 citations dans Numdam

DOI : 10.1214/07-AIHP148

Classification : 62F05, 62F40, 62H15
Mots clés : copula, goodness-of-fit test, Monte Carlo simulation, parametric bootstrap, P-values, semiparametric estimation

@article{AIHPB_2008__44_6_1096_0,
     author = {Genest, Christian and R\'emillard, Bruno},
     title = {Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models},
     journal = {Annales de l'I.H.P. Probabilit\'es et statistiques},
     pages = {1096--1127},
     publisher = {Gauthier-Villars},
     volume = {44},
     number = {6},
     year = {2008},
     doi = {10.1214/07-AIHP148},
     mrnumber = {2469337},
     zbl = {1206.62044},
     language = {en},
     url = {http://archive.numdam.org/articles/10.1214/07-AIHP148/}
}

TY  - JOUR
AU  - Genest, Christian
AU  - Rémillard, Bruno
TI  - Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models
JO  - Annales de l'I.H.P. Probabilités et statistiques
PY  - 2008
SP  - 1096
EP  - 1127
VL  - 44
IS  - 6
PB  - Gauthier-Villars
UR  - http://archive.numdam.org/articles/10.1214/07-AIHP148/
DO  - 10.1214/07-AIHP148
LA  - en
ID  - AIHPB_2008__44_6_1096_0
ER  -

%0 Journal Article
%A Genest, Christian
%A Rémillard, Bruno
%T Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models
%J Annales de l'I.H.P. Probabilités et statistiques
%D 2008
%P 1096-1127
%V 44
%N 6
%I Gauthier-Villars
%U http://archive.numdam.org/articles/10.1214/07-AIHP148/
%R 10.1214/07-AIHP148
%G en
%F AIHPB_2008__44_6_1096_0

Genest, Christian; Rémillard, Bruno. Validity of the parametric bootstrap for goodness-of-fit testing in semiparametric models. Annales de l'I.H.P. Probabilités et statistiques, Tome 44 (2008) no. 6, pp. 1096-1127. doi : 10.1214/07-AIHP148. http://archive.numdam.org/articles/10.1214/07-AIHP148/

Bibliographie
Cité par

[1] P. Barbe, C. Genest, K. Ghoudi and B. Rémillard. On Kendall's process. J. Multivariate Anal. 58 (1996) 197-229. | MR | Zbl

[2] R. Beran. Minimum distance procedures. In Nonparametric Methods 741-754. Handbook of Statistics 4. North-Holland, Amsterdam, 1984. | MR | Zbl

[3] R. Beran and P. W. Millar. A stochastic minimum distance test for multivariate parametric models. Ann. Statist. 17 (1989) 125-140. | MR | Zbl

[4] P. J. Bickel and J.-J. Ren. The bootstrap in hypothesis testing. In State of the Art in Probability and Statistics (Leiden, 1999) 91-112. IMS Lecture Notes Monogr. Ser. 36. Inst. Math. Statist., Beachwood, OH, 2001. | MR

[5] P. J. Bickel and M. J. Wichura. Convergence criteria for multiparameter stochastic processes and some applications. Ann. Math. Statist. 42 (1971) 1656-1670. | MR | Zbl

[6] W. Breymann, A. Dias and P. Embrechts. Dependence structures for multivariate high-frequency data in finance. In Selected Proceedings from Quantitative Methods in Finance, 2002 (Cairns/Sydney) 3 1-14, 2003. | MR

[7] S. Demarta and A. J. Mcneil. The t copula and related copulas. Internat. Statist. Rev. 73 (2005) 111-129. | Zbl

[8] J. Dobrić and F. Schmid. A goodness of fit test for copulas based on Rosenblatt's transformation. Comput. Statist. Data Anal. 51 (2007) 4633-4642. | MR | Zbl

[9] J. Durbin. Weak convergence of the sample distribution function when parameters are estimated. Ann. Statist. 1 (1973) 279-290. | MR | Zbl

[10] J.-D. Fermanian. Goodness-of-fit tests for copulas. J. Multivariate Anal. 95 (2005) 119-152. | MR | Zbl

[11] J.-D. Fermanian, D. Radulović and M. H. Wegkamp. Weak convergence of empirical copula processes. Bernoulli 10 (2004) 847-860. | MR | Zbl

[12] P. Gänßler and W. Stute. Seminar on Empirical Processes. Birkhäuser Verlag, Basel, 1987. | MR | Zbl

[13] C. Genest, K. Ghoudi and L.-P. Rivest. A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82 (1995) 543-552. | MR | Zbl

[14] C. Genest, J.-F. Quessy and B. Rémillard. Tests of serial independence based on Kendall's process. Canad. J. Statist. 30 (2002) 441-461. | MR | Zbl

[15] C. Genest, J.-F. Quessy and B. Rémillard. Goodness-of-fit procedures for copula models based on the probability integral transformation. Scand. J. Statist. 33 (2006) 337-366. | MR | Zbl

[16] C. Genest, B. Rémillard and D. Beaudoin. Goodness-of-fit tests for copulas: A review and a power study. Insurance Math. Econom. 43 (2008). In press. | MR | Zbl

[17] C. Genest and L.-P. Rivest. Statistical inference procedures for bivariate Archimedean copulas. J. Amer. Statist. Assoc. 88 (1993) 1034-1043. | MR | Zbl

[18] K. Ghoudi and B. Rémillard. Empirical processes based on pseudo-observations. In Asymptotic Methods in Probability and Statistics (Ottawa, ON, 1997) 171-197. North-Holland, Amsterdam, 1998. | MR | Zbl

[19] K. Ghoudi and B. Rémillard. Empirical processes based on pseudo-observations. II. The multivariate case. In Asymptotic Methods in Stochastics 381-406. Fields Inst. Commun. 44. Amer. Math. Soc., Providence, RI, 2004. | MR | Zbl

[20] N. Henze. Empirical-distribution-function goodness-of-fit tests for discrete models. Canad. J. Statist. 24 (1996) 81-93. | MR | Zbl

[21] M. N. Jouini and R. T. Clemen. Copula models for aggregating expert opinions. Oper. Res. 44 (1996) 444-457. | Zbl

[22] C. A. J. Klaassen and J. A. Wellner. Efficient estimation in the bivariate normal copula model: Normal margins are least favourable. Bernoulli 3 (1997) 55-77. | MR | Zbl

[23] Y. Malevergne and D. Sornette. Testing the Gaussian copula hypothesis for financial assets dependences. Quant. Finance 3 (2003) 231-250. | MR

[24] D. Pollard. The minimum distance method of testing. Metrika 27 (1980) 43-70. | MR | Zbl

[25] J. H. Shih and T. A. Louis. Inferences on the association parameter in copula models for bivariate survival data. Biometrics 51 (1995) 1384-1399. | MR | Zbl

[26] W. Stute, W. González-Manteiga and M. Presedo-Quindimil. Bootstrap based goodness-of-fit tests. Metrika 40 (1993) 243-256. | MR | Zbl

[27] H. Tsukahara. Semiparametric estimation in copula models. Canad. J. Statist. 33 (2005) 357-375. | MR | Zbl

[28] A. W. Van Der Vaart and J. A. Wellner. Weak Convergence and Empirical Processes. Springer, New York, 1996. | MR | Zbl

[29] W. Wang and M. T. Wells. Model selection and semiparametric inference for bivariate failure-time data (with discussion). J. Amer. Statist. Assoc. 95 (2000) 62-76. | MR | Zbl

Cité par Sources :