个人简介
学习经历
1993.9-1997.6,中国科大数学系计算数学及其应用软件专业,获理学学士学位。
1997.9-2000.6,中国科大数学系应用数学专业攻读硕士学位(导师:赵林城教授),获理学硕士学位。
2001.2-2003.12,中国科大统计与金融系概率论与数理统计专业(导师:应志良教授、赵林城教授),获理学博士学位。
工作经历
2000.7-2011,中国科大统计与金融系,助教/讲师。
2011.2-2018.5,复旦大学生物统计学研究所,研究员、博导。
2018.6至今,中国科大统计与金融系,教授、博导。
博士后研究
2004.3-2005.4,乔治华盛顿大学统计系(postdoctoral scientist)
2006.8-2007.11,耶鲁大学医学院(postdoctoral associate)
2009.1-2010.12,美国NIH国立癌症研究所(postdoctoral visiting fellow)
在Biometrika、Biometrics、Bioinformatics、Biostatistics、Annals of Applied Statistics、Statistics in Medicine、Statistica Sinica等国内外统计学期刊上发表论文50多篇,获得近600次引用(google scholar检索,其中大部分为他引),单篇引用次数超过50的论文有5篇;参与撰写的教材《遗传学中的统计方法》(科学出版社,2006年)被多所高校采用。
研究领域
健康大数据分析和统计计算。目前研究兴趣为医学领域内大数据(电子病历档案、高通量测序数据)的统计方法学研究及其应用,并开发相应R软件包。
部分研究项目
高通量生物医学数据的统计方法学研究,引进人才启动基金,100万,2018.6.1-2020.5.31,主持。
基于新一代高通量测序数据的若干统计方法学研究,国家自然科学基金委(面上项目),48万,2018.1.1-2021.12.31,主持。
基于新一代测序数据的统计遗传学新理论、方法与应用,973计划,900万, 2012.1.1-2016.8.31,主持。
全基因组DNA甲基化研究中的统计学方法,国家自然科学基金面上项目, 50万,2014.1.1-2017.12.31,课题骨干。
人类基因关联分析的若干问题,国家自然科学基金青年项目,15万,2008.1.1-2010.12.31,主持。
近期论文
查看导师新发文章
(温馨提示:请注意重名现象,建议点开原文通过作者单位确认)
[01] Zhang H, Chatterjee N, Rader D, Chen J (2018). Adjustment of non-confounding covariates in case-control genetic associaiton studies. Annals of Applied Statistics 12(1):200-221.
[02] Zhou B, Wen S, Wang L, Jin L, Li H, Zhang H# (2017). AntCaller: An accurate variant caller incorporating ancient DNA damage. Molecular Genetics and Genomics 292(6):1419-1430.
[03] Wang C, Shen Q, Du L, Xu J, Zhang H# (2016). armDNA: A functional beta model for detecting age-related genomewide DNA methylation marks. Statistical Methods in Medical Research DOI: 10.1177/0962280216683571.
[04] Kang G, Du L, Zhang H# (2016). multiDE: a dimension reduced model based statistical method for di
erential expression analysis using RNA-sequencing data with multiple treatment conditions. BMC Bioinformatics 17: 248.
[05] Shen Q, Hu J, Jiang N, Hu X, Luo Z, Zhang H# (2016). contamDE: Difierential expression analysis of RNA-seq data for contaminated tumor samples. Bioinformatics 32(5): 705-712.
[06] Hu J, Li T, Xiu Z, Zhang H# (2015). MAFsnp: A Multi-sample Accurate and Flexible SNP Caller Using Next-generation Sequencing Data. PLoS ONE 10(8): e0135332. doi:10.1371/journal.pone.0135332.
[07] Zhang H#, Xu J, Jiang N, Hu X, Luo Z (2015). PLNseq: a multivariate Poisson lognormal distribution for high-throughput matched RNA-sequencing read count data. Statistics in Medicine 34: 1577-1589.
[08] Zhang H, Qin J, Landi M, Caporaso N, Yu K (2013). A copula-model based semiparametric interaction test under the case-control design. Statistica Sinica. 23: 1505-1521.
[09] Zhang H, Zeng D, Olschwang S, Yu K (2013). Semiparametric inference on the penetrances of rare genetic mutations based on a case-family design. Journal of Statistical Planning and Inference. 143: 368-377.
[10] Zhang H, Wacholder S, Qin J, Hildesheim A, Yu K (2011). Improved genetic association tests for an ordinal outcome representing the disease progression process. Genetic Epidemiology. 35:499-505.
[11] Zhang H, Ahn J, Yu K (2011). Comparing statistical methods for removing seasonal variation from vitamin D measurements in case-control studies. Statistics and Its Interface. 4:85-93.
[12] Li T, Li Z, Ying Z, Zhang H#. (2010). Influence of population stratification on population based marker-disease association analysis. Annals of Human Genetics. 74:351-360.
[13] Zhang H, Olschwang S, Yu K. (2010). Statistical inference on the penetrances of rare genetic mutations based on a case-family design. Biostatistics. 11:519-532.
[14] Wu C*, Zhang H*, Liu X, DeWan A, Dubrow R, Ying Z, Yang Y, Hoh J (2009). Detecting essential and removable interactions in genome-wide association studies. Statistics and Its Interface. 2:161-170.
[15] Zhang H, Chen H, Li Z. (2009). Large sample interval mapping method for genetic trait loci infinite regression mixture models. Journal of Statistical Planning and Inference. 139:764-779.
[16] Yan T, Yang Y, Cheng X, DeAngelis MM, Hoh J, Zhang H# (2009). Genotypic association analysis using discordant-relative-pairs. Annals of Human Genetics. 73:84-94.
[17] Francis P*, Zhang H*, DeWan A, Hoh J, Klein M (2008). Joint e
ects of polymorphisms in the HTRA1, LOC387715/ARMS2, and CFH genes on AMD in a Caucasian population. Molecular Vision. 14:1395-1400.
[18] Zhang H, Morrison MA, DeWan A, Adams S, Andreoli M, Huynh N, Regan M, Brown A, Miller JW, Kim IK, Hoh J, DeAngelis MM (2008). The NEI/NCBI dbGAP database: Genotypes and haplotypes that may specifically predispose to risk of neovascular age-related macular degeneration. BMC Medical Genetics. 9:51.
[19] Chen K*, Ying Z*, Zhang H*, Zhao L* (2008). Analysis of least absolute deviation. Biometrika. 95(1):107-122.
[20] Zhang H, Chen H, Li Z. (2008). An explicit representation of the limit of the LRT for interval mapping of quantitative trait loci. Statistics and Probability Letters. 78:207-213.
[21] Zhang Han, Zhang Hong#, Li Zhaohai, Zheng Gang (2007). Statistical methods for haplotype-based matched case-control association studies. Genetic Epidemiology. 31(4):316-326.
[22] Zhang H, Zheng G, Li Z (2006). Statistical analysis for haplotype-based matched case-control studies. Biometrics, 62:1124-1131.