Yan Sun
|
Assistant Professor,
Department of Mathematical Science,
New Jersey Institute of Technology,
Newark, NJ.
E-mail: yan.sun@njit.edu
|
About me
I am an assistant professor in the Department of Mathematical Science at New Jersey Institute of Technology.
Prior to joining NJIT, I was a Postdoctoral Researcher in the Department of Statistics and Data Science at the University of Pennsylvania,
jointly mentored by Prof. Edgar Dobriban,
Prof. Ian Barnett,
and Prof. Pratik Chaudhari,
working on model calibration and uncertainty quantification of machine learning models.
Before that, I was an applied scientist at Amazon.
I obtained my Ph.D. in 2022 from the Department of Statistics at Purdue University, supervised by Prof. Faming Liang.
I received my bachelor’s degree in 2017 from the Department of Mathematics in Chu Kochen Honors College, Zhejiang University.
Research Interests
I am interested in connecting statistical theories with advanced Machine Learning models, and with theoretical guidance, develop effective methods to improve practice.
Some topics that I have worked on include:
Sparse Deep Learning: Consistency, High Dimensional Variable Selection, Network Pruning.
Bayesian Statistics: Bayesian Neural Network, Posterior Concentration, Markov Chain Monte Carlo.
Uncertainty Quantification: Model Calibration, Asymptotic Distribution.
Large Language Models: Watermarking, Sampling
Preprints
Foundations of Top-k Decoding For Language Models
Georgy Noarov, Soham Mallick, Tao Wang, Sunay Joshi, Yan Sun, Yangxinyu Xie, Mengxin Yu, Edgar Dobriban
[PDF]
An Analytical Characterization of Sloppiness in Neural Networks: Insights from Linear Models
Jialin Mao, Itay Griniasty, Yan Sun, Mark K Transtrum, James P Sethna, Pratik Chaudhari
[PDF]
Statistical Inference for Generative Model Comparison
Zijun Gao, Yan Sun
[PDF],
[Code]
Watermarking language models with error correcting codes
Patrick Chao, Yan Sun, Edgar Dobriban, Hamed Hassani
[PDF]
A Confidence Interval for the \( {\ell_2} \) Expected Calibration Error
Yan Sun, Pratik Chaudhari, Ian Barnett, Edgar Dobriban
[PDF],
[Code]
Publications
Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior
Mingxuan Zhang, Yan Sun, Faming Liang
Journal of Data Science (2024).
[PDF]
Extended fiducial inference: toward an automated process of statistical inference
Faming Liang, Sehwan Kim, Yan Sun
Journal of the Royal Statistical Society: Series B (Statistical Methodology) (2024).
[PDF]
Deep network embedding with dimension selection
Tianning Dong, Yan Sun, Faming Liang
Neural Networks (2024).
[PDF]
Sparse Deep Learning for Time Series Data: Theory and Applications
Mingxuan Zhang, Yan Sun, Faming Liang
Advances in neural information processing systems (2023).
[PDF]
Nonlinear Sufficient Dimension Reduction with a Stochastic Neural Network
Siqi Liang, Yan Sun, Faming Liang
Advances in neural information processing systems (2022).
[PDF],
[Code]
A Kernel-Expanded Stochastic Neural Network.
Yan Sun, Faming Liang
Journal of the Royal Statistical Society: Series B (Statistical Methodology) (2022).
[PDF],
[Code]
Sparse Deep Learning: A New Framework Immune to Local Traps and Miscalibration.
Yan Sun, Wenjun Xiong, Faming Liang
Advances in neural information processing systems (2021).
[PDF],
[Code]
Learning Sparse Deep Neural Networks with Spike-and-Slab Priors.
Yan Sun, Qifan Song, Faming Liang
Statistics & Probability Letters (2021).
[PDF],
[Code]
Consistent Sparse Deep Learning: Theory and Computation.
Yan Sun*, Qifan Song*, Faming Liang
Journal of the American Statistical Association (2021).
[PDF],
[Code]
Extended stochastic gradient Markov chain Monte Carlo for large-scale Bayesian variable selection.
Qifan Song, Yan Sun, Mao Ye, Faming Liang
Biometrika, (2020)
[PDF]
Variable selection via penalized neural network: a drop-out-one loss approach.
Mao Ye*, Yan Sun*
International Conference on Machine Learning (2018).
[PDF]
|