Journal articles

* Student or intern collaborator; + corresponding author

  1. D. Lee*, S. Yang+, L. Dong, X. Wang, D. Zeng, J.W. Cai (2021). Improving trial generalizability using observational studies, Biometrics, doi:10.1111/biom.13609. [arxiv]

    ** Winner of the 2020 ENAR Distinguished Student Paper Competition Award

  2. Y. Zhang*, S. Yang, W. Ye, Douglas E. Faries, I. Lipkovich, Z. Kadziola (2021). Best practices of double score matching for estimating causal effects, Statistics in Medicine, tentatively accepted.
  3. B. J. Reich, S. Yang, and Y. Guan (2021). Discussion on “Spatial+: a novel approach to spatial confounding” by Dupont, Wood and Augustin, Biometrics, accepted.
  4. D. Kong, S. Yang, and L. Wang (2021). Muti-cause causal inference with unmeasured confounding and binary outcome. Biometrika, doi:10.1093/biomet/asab016. [arxiv]
  5. S. Yang (2021). Semiparametric efficient estimation of structural nested mean models with irregularly spaced observations. Biometrics, [arxiv]
  6. B. J. Reich, S. Yang, Y. Guan, A. B. Giffin, M. J. Miller and A. G. Rappold (2021). A review of spatial causal inference methods for environmental and epidemiological applications. International Statistical Review, 89, 605-634. [arxiv]
  7. M.Y. Huang and S. Yang+ (2021). Robust inference of conditional average treatment effects using dimension reduction. Statistica Sinica, doi:10.5705/ss.202020.0409. [arxiv]
  8. S. Yang, J. K. Kim, and Youngdeok Hwang (2021). Integration of data from probability surveys and big found data for finite population inference using mass imputation. Survey Methodology, 47, 29–58.
  9. F. Cools, D. Johnson, A. J. Camm, J. P. Bassand, F. Verheugt, S. Yang, A. Tsiatis, D. A. Fitzmaurice, S. Z. Goldhaber, G. Kayani, S. Goto, S. Haas, F. Misselwitz, A. Turpie, K. Fox, K. Pieper, A. K. Kakkar (2021). Risks associated with discontinuation of oral anticoagulation in newly diagnosed patients with atrial fibrillation: results from the GARFIELD-AR Registry. Journal of Thrombosis and Hemostasis, doi:10.1111/jth.15415. (Collaboration work)
  10. S. Yang, Y. Zhang, G. Liu, and Q. Guan (2021). SMIM: a unified framework of Survival sensitivity analysis using Multiple Imputation and Martingale. Biometrics, 10.1111/biom.13555. [arxiv]
  11. S. Yang, J. K. Kim, and R. Song (2020). Doubly robust inference when combining probability and non-probability samples with high-dimensional data, Journal of the Royal Statistical Society: Series B, 82, 445–465.
  12. S. Yang, K. Pieper, and F. Cools (2020). Semiparametric estimation of structural failure time model in continuous-time processes, Biometrika, 107, 123-136.
  13. N. Corder* and S. Yang (2020). Estimating average treatment effects utilizing fractional imputation when confounders are subject to missingness, Journal of Causal Inference, 8, 249-271.
  14. L. Dong*, E. Laber, Y. Goldberg, R. Song, S. Yang (2020). Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness, Statistics in Medicine, doi: 10.1002/sim.8678.
  15. S. Yang and P. Ding (2020). Combining multiple observational data sources to estimate causal effects, Journal of American Statistical Association, 115, 1540–1554.
  16. S. Yang and J. K. Kim (2020). Statistical data integration in survey sampling: a review, Japanese Journal of Statistics and Data Science, 3, 625–650.
  17. W. Li*, S. Yang,+ and P. Han (2020). Robust estimation for moment condition models with data missing not at random, Journal of Statistical Planning and Inference, 207, 246–254.
  18. S. Yang and J. K. Kim (2020). Asymptotic theory and inference of predictive mean matching imputation using a superpopulation model framework. Scandinavian Journal of Statistics, 47, 839–861.
  19. S. Chen, S. Yang, and J.K. Kim (2020). Nonparametric mass imputation for data integration. Journal of Survey Statistics and Methodology,
  20. S. Yang (2019). Book reviews: Flexible imputation of missing data, 2nd ed. Journal of American Statistical Association, 114, 1421–1421.
  21. S. Yang, L. Wang, and P. Ding (2019). Causal inference with confounders missing not at random, Biometrika, 106, 875–888.
  22. S. Yang and D. Zeng (2018). Discussion on penalized spline of propensity methods for treatment comparison by Zhou, Elliott and Little, Journal of American Statistical Association, 114, 30–32.
  23. S. Yang  and J. J. Lok (2018). Sensitivity analysis for unmeasured confounding in coarse structural nested mean models, Statistica Sinica, 28, 1703–1723.
  24. S. Yang (2018). Propensity score weighting for causal inference with clustered data, Journal of Causal Inference,
  25. S. Yang and J. K. Kim (2018). Nearest neighbor imputation for general parameter estimation in survey sampling, Advances in Econometrics, 39, 211–236.
  26. S. Yang and P. Ding (2018). Asymptotic inference of causal effects with observational studies trimmed by the estimated propensity scores, Biometrika, 105, 487–493.
  27. Z. Wang, J. K. Kim, and S. Yang (2018). An approximate Bayesian inference under informative sampling, Biometrika, 105, 91–102.
  28. J. Lok, S. Yang, B.Sharkey, Hughes, M (2018). Estimation of the cumulative incidence function under multiple dependent and independent censoring mechanisms, Lifetime Data Analysis, 24, 201–223.
  29. S. Yang, A. A. Tsiatis, and M. Blazing (2018). Modeling survival distribution as a function of time to treatment discontinuation: a dynamic treatment regime approach, Biometrics, 74, 900–909.
  30. S. Yang and J. K. Kim (2017). A semiparametric inference to regression analysis with missing covariates in survey data, Statistica Sinica, 27, 261–285.
  31. J. K. Kim and S. Yang (2017). A note on multiple imputation under complex sampling, Biometrika, 104, 221–228.
  32. S. Yang  and J. K. Kim (2017). Discussion: dissecting multiple imputation from a multi-phase inference perspective: what happens when god’s, imputer’s and analyst’s models are uncongenial? by X. Xie and X. L. Meng, Statistica Sinica, 27, 1568–1573.
  33. S. Yang, and J. J. Lok (2016). A goodness-of-fit test for structural nested mean models, Biometrika, 103, 734–741.
  34. S. Yang, and J. K. Kim (2016). Fractional imputation in survey sampling: a comparative review, Statistical Science, 31, 415–432.
  35. S. Yang, G. Imbens, Z. Cui, D. Faries and Z. Kadziola (2016), Propensity score matching and stratification in observational studies with multi-level treatments, Biometrics, 72, 1055–1065. With R package available “multilevelMatching“.
  36. S. Yang and J. K. Kim (2016). A note on multiple imputation for method of moments estimation, Biometrika103, 244–251.
  37. S. Yang and J. K. Kim (2015). Likelihood-based inference with missing data under missing-at-random, Scandinavian Journal of Statistics, 43, 436–454.

    ** Winner of the 2014 JSM Student Paper Competition Award

  38. L. Peyer, G. Welk, L. B. Davis, S. Yang, and J. K. Kim (2015). Factors associated with parent concern for child weight and parenting behaviors, Childhood Obesity, 11, 269–274. (Collaboration work)
  39. S. Yang and Z. Zhu (2015). Variance estimation and kriging prediction for a class of non-stationary spatial models, Statistica Sinica,25, 135–149.
  40. J. K. Kim and S. Yang (2014). Fractional hot deck imputation for robust estimation under item nonresponse in survey sampling, Survey Methodology40, 211–230.
  41. J. K. Kim, Z. Zhu, and S. Yang (2013). Improved estimation for June Area Survey incorporating several information, Proceedings 59th ISI World Statistics Congress, Hong Kong, China, 199–204.
  42. S. Yang, J. K. Kim and D. W. Shin (2013). Imputation methods for quantile estimation under missing at random, Statistics and Its Interface6, 369–377.
  43. S. Yang, J. K. Kim and Z. Zhu (2013). Parametric fractional imputation for mixed models with nonignorable missing data, Statistics and Its Interface6, 339–347.


Technical Reports 


  1. D. Johnson, K. Pieper, and S. Yang+. Treatment-specific Marginal Structural Cox Model for the Effect of Treatment Discontinuation. [arxiv]
  2. J.Y. Wang, R Wong, S. Yang, and G. Chan. Estimation of Partially Conditional Average Treatment Effect by Hybrid Kernel-covariate Balancing. [arxiv]
  3. S. Yang, D. Zeng, X. Wang. Improved Inference for Heterogeneous Treatment Effects Using Real-World Data Subject to Hidden Confounding. [arxiv]
  4. S. Yang, X. Wang, and D. Zeng. Elastic integrative analysis of randomized trial and real-world data for treatment heterogeneity estimation. [arxiv]
  5. B. Colnet, I. Mayer, G. Chen, A. Dieng, R. Li, G. Varoquaux, J.P. Vert, J. Josse+, S. Yang+. Causal inference methods for combining randomized trials and observational studies: a review. [arxiv]
  6. Y. Guan, G. L. Page, B. J. Reich, M. Ventrucci and Shu Yang. A spectral adjustment for spatial confounding. [arxiv]
  7. A. B. Giffin*, B. J. Reich , S. Yang+, and A. Rappold. Generalized propensity score approach to causal inference with spatial interference. [arxiv]

    ** Winner of the 2021 ENAR Distinguished Student Paper Competition Award

  8. A. B. Giffin*, B. J. Reich, S. Yang, and A. Rappold. Instrumental variables, spatial confounding and interference. [arxiv]
  9. A. B. Giffin*, W. Gong, S. Majumder, A. Rappold, B. J. Reich, and S. Yang. Estimating intervention effects on infectious disease control: the effect of community mobility reduction on Coronavirus spread. [arxiv]
  10. A. Larsen*, S. Yang, A. Rappold, and B. Reich. A spatial causal analysis of wildland fire-contributed PM2.5 using numerical model output. [arxiv]
  11. L. Wu, S. Yang, B. J. Reich, and A. Rappold. Estimating spatially varying health effects in app-based citizen science research. [arxiv]

    ** Winner of the 2021 ASA Section on Statistics in Epidemiology Young Investigator Award

  12. Q. Guan* and S. Yang+. A unified framework for causal inference with multiple imputation using martingale. [arxiv]
  13. Z. Jiang, S. Yang, and P. Ding. Multiply robust estimation of causal effects under principal ignorability. [arxiv]
  14. S. Yang and Y. Zhang. Multiply robust matching estimators of average and quantile treatment effects. [arxiv]
  15. S. Tang*, S. Yang+, T. Wang, Z. Cui. L. Li, D. Faries. Causal inference of hazard ratio based on propensity score matching. [arxiv]

    ** Winner of the 2021 ENAR Distinguished Student Paper Competition Award

  16. X. Mao, Z. Wang, and S. Yang. Matrix completion for survey data prediction with multivariate missingness. [arxiv]
  17. S. Yang and Z. Zhu. Semiparametric estimation of spectral density and variogram with irregular observations, Journal of Statistical Planning and Inference, revision. [arxiv]




  • S. Yang (2014). Fractional imputation method of handling missing data and spatial statistics. Iowa State University. [Link]