DataMiningCamp


 

 

Camp Presentation Materials

Camp Presentation

R Code

Description of Talk

 

 

 

 

Home Page For the Beginning Data Mining/Machine Learning Class

 

Install R 

To get started using R you can download it from   http://cran.r-project.org/ 

 

Complete instructions for downloading R and installing it on eclipse are here:  To run R on Eclipse

 

 

Reading Material 

To understand the mathematics behind linear regression check out these slides:  MultivariateLinearRegression.pdf

 

To understand why you need to normalize the data before the regression: RidgeBiasDiscussion.txt

 

 PredictionErrorvsModelComplexity This chart is from the Elements of Statistical Learning text book

 

Ridge Regression Section 3.4.1 on page 61 of the :

"The Elements of Statistical Learning - Data Mining, Inference, and Prediction"  by Trevor Hastie, Robert Tibshirani, and Jerome Friedman.      http://www-stat.stanford.edu/~tibs/ElemStatLearn

 

 

Regression(stats315a).pdf

PracticalRegression_R.pdf

 

 

 

Examples to Run 

Here are some examples you might find interesting:

SimpleRegressionExample.R      problem statement  this is a ridiculously simple example

sonar_regress_crossval.R   example using cross-validation

RidgeRegSonarData.R        example of Ridge Regression

 

 

Data to Download

Data Required for these programs

sonar_test.csv

sonar_train.csv

Data for Camp Example

winequality-red.csv