Preventive vaccines are an effective public health intervention for reducing the burden of infectious diseases, but have yet to be developed for several major infectious diseases. Vaccine sieve analysis studies whether and how the efficacy of a vaccine varies with the genetics of the infectious pathogen, which may help guide future vaccine development and deployment. A standard statistical approach to sieve analysis compares the effect of the vaccine to prevent infection and disease caused by pathogen types defined dichotomously as genetically near or far from a reference pathogen strain inside the vaccine construct. For example, near may be defined by amino acid identity at all amino acid positions considered in a multiple alignment and far defined by at least one amino acid difference. An alternative approach is to study the efficacy of the vaccine as a function of genetic distance from a pathogen to a reference vaccine strain where the distance cumulates over the set of amino acid positions. We propose a nonparametric method for estimating and testing the trend in the effect of a vaccine across genetic distance. We illustrate the operating characteristics of the estimator via simulation and apply the method to a recent preventive malaria vaccine efficacy trial.
Keywords: vaccines, competing risks, causal inference, marginal structural model, Hamming distance
@article{JSFS_2020__161_1_164_0, author = {Benkeser, David and Juraska, Michal and Gilbert, Peter B.}, title = {Assessing trends in vaccine efficacy by pathogen genetic distance}, journal = {Journal de la soci\'et\'e fran\c{c}aise de statistique}, pages = {164--175}, publisher = {Soci\'et\'e fran\c{c}aise de statistique}, volume = {161}, number = {1}, year = {2020}, mrnumber = {4125253}, zbl = {1443.62376}, language = {en}, url = {http://archive.numdam.org/item/JSFS_2020__161_1_164_0/} }
TY - JOUR AU - Benkeser, David AU - Juraska, Michal AU - Gilbert, Peter B. TI - Assessing trends in vaccine efficacy by pathogen genetic distance JO - Journal de la société française de statistique PY - 2020 SP - 164 EP - 175 VL - 161 IS - 1 PB - Société française de statistique UR - http://archive.numdam.org/item/JSFS_2020__161_1_164_0/ LA - en ID - JSFS_2020__161_1_164_0 ER -
%0 Journal Article %A Benkeser, David %A Juraska, Michal %A Gilbert, Peter B. %T Assessing trends in vaccine efficacy by pathogen genetic distance %J Journal de la société française de statistique %D 2020 %P 164-175 %V 161 %N 1 %I Société française de statistique %U http://archive.numdam.org/item/JSFS_2020__161_1_164_0/ %G en %F JSFS_2020__161_1_164_0
Benkeser, David; Juraska, Michal; Gilbert, Peter B. Assessing trends in vaccine efficacy by pathogen genetic distance. Journal de la société française de statistique, Volume 161 (2020) no. 1, pp. 164-175. http://archive.numdam.org/item/JSFS_2020__161_1_164_0/
[1] On least squares and linear combination of observations, Proceedings of the Royal Society of Edinburgh, Volume 55 (1936), pp. 42-48 | DOI | Zbl
[2] A phase 3 trial of RTS,S/AS01 malaria vaccine in African infants., New England Journal of Medicine, Volume 367 (2012) no. 24, p. 2284-95 | DOI
[3] Improved estimation of the cumulative incidence of rare outcomes, Statistics in Medicine, Volume 37 (2018) no. 2, pp. 280-293 | DOI | MR
[4] survtmle: Targeted Minimum Loss-Based Estimation for Survival Analysis in R (2017) https://CRAN.R-project.org/package=survtmle | DOI
[5] Stacked regressions, Machine learning, Volume 24 (1996) no. 1, pp. 49-64 | DOI | Zbl
[6] Multiple outputation: Inference for complex clustered data, Biometrics, Volume 59 (2003), pp. 420-429 | DOI | MR | Zbl
[7] The two-sample problem for failure rates depending on a continuous mark: An application to vaccine efficacy, Biostatistics, Volume 9 (2008) no. 2, pp. 263-276 | DOI | Zbl
[8] Statistical methods for assessing differential vaccine protection against human immunodeficiency virus types, Biometrics, Volume 54 (1998) no. 3, pp. 799-814 | DOI | Zbl
[9] Sieve analysis: methods for assessing from vaccine trial data how vaccine efficacy varies with genotypic and phenotypic pathogen variation, Journal of Clinical Epidemiology, Volume 54 (2001) no. 1, pp. 68-85 | DOI
[10] The influence curve and its role in robust estimation, Journal of the American Statistical Association, Volume 69 (1974) no. 346, pp. 383-393 | DOI | MR | Zbl
[11] Toward causal inference with interference, Journal of the American Statistical Association, Volume 103 (2008) no. 482, pp. 832-842 | DOI | MR | Zbl
[12] Statistical inference for data adaptive target parameters, The International Journal of Biostatistics, Volume 12 (2016) no. 1, pp. 3-19 | DOI | MR
[13] Mark-Specific Hazard Ratio Model with Multivariate Continuous Marks: An Application to Vaccine Efficacy, Biometrics, Volume 69 (2013) no. 2, pp. 328-337 | DOI | MR | Zbl
[14] Mark-specific hazard ratio model with missing multivariate marks, Lifetime Data Analysis, Volume 22 (2016) no. 4, pp. 606-625 | DOI | MR | Zbl
[15] Genetic diversity and protective efficacy of the RTS,S/AS01 malaria vaccine, New England Journal of Medicine, Volume 373 (2015) no. 21, pp. 2025-2037 | DOI
[16] Causality: Models, Reasoning, and Inference, Cambridge University Press, New York, NY, 2009 | DOI | MR
[17] Diagnosing and responding to violations in the positivity assumption, Statistical Methods in Medical Research, Volume 21 (2010) no. 1, pp. 31-54 | DOI | MR
[18] First Results of Phase 3 Trial of RTS,S/AS01 Malaria Vaccine in African Children, New England Journal of Medicine, Volume 365 (2011) no. 20, pp. 1863-1875 (PMID: 22007715) | arXiv | DOI
[19] Increased HIV-1 vaccine efficacy against viruses with genetic signatures in Env V2, Nature, Volume 490 (2012) no. 7420, pp. 417-420 | DOI
[20] Proportional hazards models with continuous marks, Annals of Statistics, Volume 37 (2009) no. 1, p. 394 | MR | Zbl
[21] Mark-specific proportional hazards model with multivariate continuous marks and its application to HIV vaccine efficacy trials, Biostatistics, Volume 14 (2012) no. 1, pp. 60-74
[22] Targeted maximum likelihood estimation of effect modification parameters in survival analysis, The International Journal of Biostatistics, Volume 7 (2011) no. 1, pp. 1-34 | DOI | MR
[23] Asymptotic optimality of likelihood-based cross-validation, Statistical Applications in Genetics and Molecular Biology, Volume 3 (2004) no. 1, pp. 1-23 | DOI | MR | Zbl
[24] Super learner, Statistical Applications in Genetics and Molecular Biology, Volume 6 (2007) no. 1, pp. 1-23 | DOI | MR | Zbl
[25] Targeted maximum likelihood learning, The International Journal of Biostatistics, Volume 2 (2006) no. 1, pp. 1-40 | MR
[26] Stacked generatlization, Neural Networks, Volume 5 (1992) no. 2, pp. 241-259 | DOI