Office Hours Spring 2016Tuesday/Thursday 09:00AM - 11:00AM
Courses in Spring 2016
MAT220: Statistics for Biological Sciences
Biostatistics: A Foundation for Analysis in the Health Sciences, 10th Edition, by Wayne W. Daniel, Chad L. Cross, Wiley. 2013
MAT488/STA588: Statistical Data Mining
1.An Introduction to Statistical Learning: with Applications in R, by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani, Springer, 2013. (USM library has free e-copy of this book. You can also buy a soft cover copy thorough USM library webpage at $24.99 )
2.Data Mining and Business Analytics with R, by Johannes Ledolter. Wiley, 2013.
Upcoming New Courses
MAT 486: Introduction to Big Data Analytics
STA 586: Predictive Modeling with Big Data
MAT 105D: Mathematics for Quantitative Decision Making
MAT 108: College Algebra
MAT 120D: Introduction to Statistics
MAT 140: Pre-calculus
MAT 152D: Calculus A
MAT 220: Statistics for Biological Sciences
MAT 252: Calculus C
MAT 264: Statistical Computer Packages
MAT 281: Probability
MAT 282: Statistical Inference
MAT 295: Linear Algebra
MAT 364: Numerical Analysis
MAT 380: Probability and Statistics
MAT 386: Sampling Techniques
MAT 387: Introduction to Bio/Applied Statistics
MAT 485: Introduction to Applied Regression
MAT 487: Introduction to Categorical Data Analysis
STA 574: Statistical Computing
STA 579: Probability Models
STA 580: Statistical Inference
STA 580N: Bio/Applied Statistical Methods
STA 582 Introduction to Longitudinal Data Analysis
STA 583: Sampling Methods
STA 585: Regression Analysis
STA 585N: Regression and Forecasting
STA 587: Categorical Data Analysis
STA 588: Introduction to Biostatistics
STA 589: Survival Analysis
My primary research focuses on the applied statistical methodology and data analysis. I am particularly interested in statistical process control, reliability and survival modeling, statistical methods for environmental study, data visualization and statistical data mining. I am also interested in working with researchers in other disciplines on the applications of advanced statistical modeling.
. Xu, J and Peng, C. (2014). Fitting and Testing Two-sample Marshall-Olkin Extended Weibull Model with Randomly Censored Data, Journal of Applied Statistics, Vol. 41 Issue 12, pp 2577-2595. DOI:10.1080/02664763.2014.922166 (published online first, May 2014)
. Gupta, R. C. and Peng, C. (2013). Proportional Odds Frailty Model and Stochastic Comparisons. Annals of the Institute of Statistical Mathematics. Vol.66, pp 897–912. DOI 10.1007/s10463-013-0432-y (published online first in October, 2013)
. Peng, C. and Xu, J. (2012). Nonparametric Confidence Limits of Quantile-based Process Capability Indices. International journal of Quality, Statistics, and Reliability. Volume 2012 (2012), Article ID 985152
. Fisher, M. C. , Hochberg, M. C., El-Taha, M., Kremer, J. M., Peng, C., Greenberg, J. D. (2012). Smoking, Smoking Cessation, and Disease Activity in a Large Cohort of Patients with Rheumatoid Arthritis. Journal of Rheumatology , 39(5):904-909. Epub 2012 Mar 15
. Anandarajah, A. P., El-Taha, M, Peng, C, Reed, G, Greenberg, J and Ritchlin, C. T. (2011). Association between focal erosions and generalised bone loss in psoriatic arthritis. Annals of the Rheumatic Disease, 70(7):1345-7. Epub 2011 Feb 16
. Guan, Z. and Peng, C. (2011). Two-sample Semi-parametric (Reverse) Proportional Hazard Model. In JSM Proceedings, Section on Nonparametric Statistics. Alexandria, VA: American Statistical Association. 2067-2080.
. Guan, Z. and Peng, C. (2011). A rank-based empirical likelihood approach to two-sample proportional odds model and its goodness-of-fit. Journal of Nonparametric Statistics. Vol. 23(3), 763-780. DOI: 10.1080/10485252.2011. 559726
. Gupta, R. C., Lvin, S. and Peng, C. (2010). Estimating Turning Points of Failure Rate of the Extended Weibull Distribution. Computational Statistics and Data Analysis. Vol. 54(4), 924-934.
. Peng, C. (2010). Interval estimation of population parameters based on environmental data with detection limits, Environmetrics. Vol. 21(6), 645-658.
. Peng, C. (2010). Parametric Lower Confidence Limits of Quantile-Based Process Capability Indices. Quality Technology & Quantitative Management, Vol. 7(3), 199-214.
. Peng, C. (2010). Estimating and Testing Quantile-based Process Capability Indices for One-specification Interval and Skewed Distributions. Journal of Data Science, Vol. 8(2). 253 – 268.
. Gupta, R. C. and Peng, C. (2009). Estimating reliability in Proportional Odds Ratio Models. Computational Statistics and Data Analysis. Vol 53(4), 1495-1510.
I am interested in projects from almost all areas that involve significant statistical components. I usually work with collaborators / clients closely throughout the entire phase of consultation: from the initial stage of sampling and study design to modeling, outcome interpretation and the writing of formal and rigorous statistical report including both technical justifications and practical interpretations.
A partial list of models involved in numerous projects completed in last few years include linear regression models, binary logistic regression model, family of multi-category logit models, Poisson and loglinear regression models, lifetime regression models (non-parametric Kaplan-Meier, semi-parametric Cox proportional hazard and various parametric models including accelerated failure time and competing risk models) based on both censored and completely observed data, time series modeling (exponential smoothing family of models, ARIMA and ARIMAX) and various random-effect regression models (including repeated measure ANOVA) for longitudinal data, models with high dimensional data (principal component analysis, factor analysis and structural equation modeling), linear and nonlinear discriminate analysis, cluster analysis, etc.
Programming / Computing