Hello! I am Martin, a postdoc at Harvard School of Public Health working with Prof. Alkes Price on statistical genetics. Currently, I am interested in analyzing the UK biobank whole exome sequencing (WES) data, and potential methods for combining GWAS with single-cell RNA-seq. Before that, I did my PhD at Stanford with Prof. David Tse and Prof. James Zou on statistics, machine learning, and computational biology. Some topics I worked on during my PhD include empirical Bayes, multiple hypothesis testing, multi-armed bandits, and single-cell RNA-seq.
I publish under Martin Jinye Zhang. I also go under Jinye Zhang (张金野).
Last updated: 04/14/2021
4/2021 Our paper on identifying aging signatures using the Tabula Muris Senis data was accepted by eLife.
2019 - Present, T.H. Chan School of Public Health, Harvard University,
2014 - 2019, Department of Electrical Engineering, Stanford University,
Doctor of Philosophy (PhD)
2014 - 2017, Department of Electrical Engineering, Stanford University,
Master of Science (MS)
2010 - 2014, Department of Electronic Engineering, Tsinghua University,
Bachelor of Engineering (B.Eng.)
Manuscripts under review
Bandit-PAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits.
Mo Tiwari, Martin Jinye Zhang, James Mayclin, Sebastian Thrun, Chris Piech, Ilan Shomorony.
Deep longitudinal multiomics profiling reveals two biological seasonal patterns in California.
M Reza Sailani, Ahmed A Metwally, Wenyu Zhou, Sophia Miryam Schüssler-Fiorenza Rose, Sara Ahadi, Kevin Contrepois, Tejaswini Mishra, Martin Jinye Zhang, Łukasz Kidziński, Theodore J Chu, Michael P Snyder.
Nature Communications (2020).
A single-cell transcriptomic atlas characterizes ageing tissues in the mouse.
The Tabula Muris Consortium.
Nature (2020). Contributed to differential expression analysis (Fig. 2f-h) and cluster diversity score (Fig. 4c-f).
Polymicrobial periodontal disease triggers a wide radius of effect and unique virome.
Li Gao, Misun Kang, Martin Jinye Zhang, M. Reza Sailani, Ryutaro Kuraji, April Martinez, Changchang Ye, Pachiyappan Kamarajan, Charles Le, Ling Zhan, Hélène Rangé, Sunita P. Ho, Yvonne L. Kapila.
npj Biofilms and Microbiomes (2020).
Determining sequencing depth in a single-cell RNA-seq experiment.
Martin J. Zhang*, Vasilis Ntranos*, David Tse.
Nature Communications (2020). Selected as 2020 Top 50 Life and Biological Sciences Articles
(Preliminary version: "One read per cell per gene is optimal for single-cell RNA-seq". [pdf])
Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits.
Martin J. Zhang, James Zou, David Tse.
Fast and covariate-adaptive method amplifies detection power in large-scale multiple hypothesis testing. [software] [code to reproduce paper]
Martin J. Zhang, Fei Xia, James Zou.
Nature Communications (2019). Preliminary version accepted as the Cell Systems best paper in RECOMB 2019 and received the RECOMB Best Paper Award
(Preliminary version: "AdaFDR: a Fast, Powerful and Covariate-Adaptive Approach to Multiple Hypothesis Testing". [pdf])
Longitudinal multi-omics of host–microbe dynamics in prediabetes.
Wenyu Zhou*, M. Reza Sailani*, Kévin Contrepois*, Yanjiao Zhou*, Sara Ahadi*, Shana Leopold, Martin J. Zhang, ..., George M. Weinstock, Michael Snyder.
Nature (2019). Contributed 3 panels in 2 figures.
Exploring Patterns Unique to a Dataset with Contrastive Principal Component Analysis.
Abubakar Abid*, Martin J. Zhang*, Vivek K. Bagaria, James Zou.
Nature Communications (2018).
Medoids in Almost Linear Time via Multi-armed Bandits.
Vivek Bagaria*, Govinda Kamath*, Vasilis Ntranos*, Martin J. Zhang*, David Tse.
NeuralFDR: learning decision threshold from hypothesis features.
Fei Xia*, Martin J. Zhang*, James Zou, David Tse.
Block-wise MAP Inference for the Determinantal Point Processes with Application to Change Point Detection.
Martin J. Zhang, Zhijian Ou.
On the Theoretical Analysis of Cross Validation in Compressive Sensing.
Jinye Zhang, Laming Chen, Petros T. Boufounos, and Yuantao Gu.
Minimax Optimality of Sign Test for Paired Heterogeneous Data.
Martin J. Zhang, Meisam Razaviyayn, and David Tse.
adafdr: covariate-adaptive multiple testing.
sceb: sequencing-depth aware estimators for single-cell RNA-seq analysis via empirical Bayes.
Meddit: an almost linear algorithm for computing the medoid for a set of n points via adaptive sampling.
contrastive: a python library for performing unsupervised machine learning on datasets with learning (e.g. PCA) in contrastive settings, where one is interested in patterns (e.g. clusters or clines) that exist one dataset, but not the other.