| 
  • If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • Social distancing? Try a better way to work remotely on your online files. Dokkio, a new product from PBworks, can help your team find, organize, and collaborate on your Drive, Gmail, Dropbox, Box, and Slack files. Sign up for free.

View
 

DataMiningCamp

Page history last edited by PBworks 9 years, 6 months ago

 

 

Camp Presentation Materials

Camp Presentation

R Code

Description of Talk

 

 

 

 

Home Page For the Beginning Data Mining/Machine Learning Class

 

Install R 

To get started using R you can download it from   http://cran.r-project.org/ 

 

Complete instructions for downloading R and installing it on eclipse are here:  To run R on Eclipse

 

 

Reading Material 

To understand the mathematics behind linear regression check out these slides:  MultivariateLinearRegression.pdf

 

To understand why you need to normalize the data before the regression: RidgeBiasDiscussion.txt

 

 PredictionErrorvsModelComplexity This chart is from the Elements of Statistical Learning text book

 

Ridge Regression Section 3.4.1 on page 61 of the :

"The Elements of Statistical Learning - Data Mining, Inference, and Prediction"  by Trevor Hastie, Robert Tibshirani, and Jerome Friedman.      http://www-stat.stanford.edu/~tibs/ElemStatLearn

 

 

Regression(stats315a).pdf

PracticalRegression_R.pdf

 

 

 

Examples to Run 

Here are some examples you might find interesting:

SimpleRegressionExample.R      problem statement  this is a ridiculously simple example

sonar_regress_crossval.R   example using cross-validation

RidgeRegSonarData.R        example of Ridge Regression

 

 

Data to Download

Data Required for these programs

sonar_test.csv

sonar_train.csv

Data for Camp Example

winequality-red.csv

 

Comments (0)

You don't have permission to comment on this page.