STATS 504 - Practice and Communication in Applied Statistics
This page contains my course work from STATS 504 (Winter 2023)
- Programming language: Python, R
- Frameworks / Library: Numpy, Pandas, Seaborn, Matplotlib, Statmodels, SciPy, Lifelines
- Topics: Linear regression, Dimension reduction regression, SIR(Sliced Inverse Regression), Multilevel regression, GLM(Generalized Linear Model), GEE(Generalized Estimating Equations), Survival analysis, MCA(Multiple correspondence analysis), PCA(Principal Components Analysis), Power analysis, Data depth, Tail distribution, Pareto tail plot
Homeworks
1. Comparative analysis of survival function by region
Topic: Kaplan-Meier model, Log-rank test, Cox PH model
2. Association between demographic variables and birth counts by counties
Topic: GLM(Generalized Linear Model), GEE(Generalized Estimating Equations), PCA(Principal Component Analysis), Score test
3. Association between demographic, body measurment variables and blood pressure
Topic: SIR(Sliced Inverse Regression), Score test
4. Association between the passage of time and change in plant habitats
Topic: Multilevel regression, AIC
5. Analyze the functional relationship between temperature, salinity and ocean depth
Topic: Data depth
6. Analysis of changes in tail distribution of X-ray flux
Topic: Tail distribution, Pareto tail plot