Tuesday, August 19, 2014
Using a GBM for Classification in R from Wallace Campbell on Vimeo.
Saturday, December 21, 2013
It's been a while since my last video, with good reason: I started a full time job as a senior statistician about 2 months ago! Not that I spend my entire day coding in R, but I wouldn't be nearly as useful if I didn't know how to use it.
Anyway, while in grad school, I helped my friends with data analysis for thesis and dissertation projects. In return, they brought me cookies and beer. This video explains some of the most common tasks that are necessary in the analysis of experimental data.
R code and data available on GitHub .
Wednesday, September 25, 2013
In this video, I demonstrate how to use k-fold cross validation to obtain a reliable estimate of a model's out of sample predictive accuracy as well as compare two different types of models (a Random Forest and a GBM). I use data Kaggle's Amazon competition as an example.
Tuesday, August 27, 2013
If you're not programming in parallel, you're only using a fraction of your computer's power! I demonstrate how to run "for" loops in parallel using the mclapply function from the multicore library. The code can be scaled to any number of available cores.
Monday, August 26, 2013
I describe how to estimate the Weibull accelerated failure time model and the Cox proportional hazards model, test the assumptions, make predictions, and plot survival functions using each model.