e1071: CVE returns linear separation, Hold-out returns large error

April 15, 2016, 8:00 pm

I’m using the Adult dataset that can be found here: http://archive.ics.uci.edu/ml/datasets/Adult

After taking a sample of the dataset, I use the svm function of e1071 to obtain the accuracy with a linear kernel.

library(e1071)  # provides svm()

adult.df = read.csv("sample_adult.csv")
adult.df$X = NULL
Income = adult.df$Income...50k
summary(svm(formula = factor(Income) ~ ., data = adult.df,
            type = "C-classification", cost = 1,
            kernel = "linear", cross = 10))

This returns:

Number of Classes:  2 

Levels: 
  <=50K  >50K

10-fold cross-validation on training data:

Total Accuracy: 100 
Single Accuracies:
 100 100 100 100 100 100 100 100 100 100
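A perfect score on every fold is unusual for this dataset, so it may be worth checking which columns the `.` in the formula expands to. The sketch below uses a made-up four-row data frame (a stand-in, not the real sample) and base R's `terms()` to list the predictors. It assumes, as in the call above, that `Income` is a copy of the `Income...50k` column kept outside the data frame, so that column still counts as a predictor.

```r
# Toy stand-in for adult.df (made-up rows, not taken from the Adult data).
df <- data.frame(
  age = c(25, 40, 31, 58),
  hours = c(40, 50, 38, 45),
  Income...50k = c("<=50K", ">50K", "<=50K", ">50K")
)

# Same pattern as in the question: the response is a copy held outside df.
Income <- df$Income...50k
leaky <- attr(terms(factor(Income) ~ ., data = df), "term.labels")

# Alternative: name the response column itself on the left-hand side,
# so that "." expands to the remaining columns only.
clean <- attr(terms(Income...50k ~ ., data = df), "term.labels")

print(leaky)  # predictor list still contains the label column
print(clean)  # predictor list without the label column
```

If the real data frame behaves the same way, the classifier is given the answer as an input, which would explain a linearly separable result.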

However, I’ve implemented a holdout method of testing the accuracy (this is rather tailored to the dataset):

holdout <- function(data, params) {
  # randomize the dataset
  data <- data[sample(1:nrow(data)), ]
  # we use a 50/50 split: train on 50% of the data, test on the other 50%
  training.set = data[1:(nrow(data)/2), ]
  t.income = training.set$Income
  testing.set = data[(nrow(data)/2 + 1):nrow(data), ]
  # train a model on the training set
  model = NULL
  if (is.null(params$degree)) {
    model = svm(formula = t.income ~ ., data = training.set,
                type = params$type, cost = params$cost,
                kernel = params$kernel, cross = 10)
  } else {
    model = svm(formula = t.income ~ ., data = training.set,
                type = params$type, cost = params$cost,
                kernel = params$kernel, degree = params$degree, cross = 10)
  }
  print(summary(model))
  # test each point in the testing set
  wrong = 0
  for (i in 1:nrow(testing.set)) {
    prediction = predict(model, testing.set[i, ])
    if (prediction != training.set[i, length(training.set)]) {
      wrong = wrong + 1
    }
  }
  return(wrong/nrow(testing.set))
}
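Two details in the function above may be worth a second look: the response `t.income` is held outside `training.set` while the formula uses `.`, and the loop compares each prediction with a row of `training.set` rather than `testing.set`. The following is a minimal sketch of one alternative, not the original function. The helper name `holdout_error`, its `label` and `train_frac` arguments, and the two-class `iris` subset used as demo data are all assumptions made for illustration.

```r
library(e1071)  # assumed to be installed; provides svm()

# Hypothetical helper: `label` is the name of the response column inside
# `data`, and `...` is passed on to svm() (type, cost, kernel, degree, ...).
holdout_error <- function(data, label, ..., train_frac = 0.5) {
  # Shuffle the row indices, then split them into training and testing rows.
  idx <- sample(nrow(data))
  n_train <- floor(train_frac * nrow(data))
  training.set <- data[idx[seq_len(n_train)], ]
  testing.set  <- data[idx[-seq_len(n_train)], ]

  # Make sure the response is a factor so svm() treats this as classification.
  training.set[[label]] <- factor(training.set[[label]])

  # Build "label ~ ." from the column name, so "." covers only the other columns.
  f <- as.formula(paste(label, "~ ."))
  model <- svm(f, data = training.set, ...)

  # Predict all testing rows at once and compare with the testing labels.
  predicted <- predict(model, testing.set)
  mean(as.character(predicted) != as.character(testing.set[[label]]))
}

# Demo on a two-class subset of iris (a stand-in for the Adult sample).
two <- droplevels(subset(iris, Species != "setosa"))
set.seed(42)
err <- holdout_error(two, "Species",
                     type = "C-classification", cost = 1, kernel = "linear")
print(err)
```

For the data in the question, the call would presumably look like `holdout_error(adult.df, "Income...50k", type = "C-classification", cost = 1, kernel = "linear")`, assuming that is the name of the label column.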

If I run the holdout on the same SVM:

> holdout(adult.df, list(type = "C-classification", cost = 1, kernel = "linear"))
...
10-fold cross-validation on training data:

Total Accuracy: 100 
Single Accuracies:
 100 100 100 100 100 100 100 100 100 100 



[1] 0.39

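As you can see, the holdout and cross-validation (CVE) results are entirely different: cross-validation reports 100% accuracy on every fold, while the holdout reports an error rate of 0.39. I think my holdout code is correct and that my use of the svm function is the problem. Any help would be appreciated.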
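One possible reading of the 0.39 figure: if the model can see the label at prediction time and so predicts every testing row perfectly, but each prediction is compared with the label of an unrelated training row, the mismatch rate should be close to 2p(1 - p), where p is the share of `>50K` rows. The simulation below is a sketch under that assumption. The value p = 0.24 is roughly the share in the full Adult data and may not match the sample used here.

```r
set.seed(1)  # arbitrary seed, only to make the sketch reproducible

n <- 10000
p <- 0.24  # assumed share of ">50K" rows; the question's sample may differ

# Two independent halves with the same class balance.
classes <- c("<=50K", ">50K")
training.labels <- sample(classes, n, replace = TRUE, prob = c(1 - p, p))
testing.labels  <- sample(classes, n, replace = TRUE, prob = c(1 - p, p))

# Pretend the model predicts every testing row perfectly.
predicted <- testing.labels

err_vs_training <- mean(predicted != training.labels)  # compared with the wrong half
err_vs_testing  <- mean(predicted != testing.labels)   # zero by construction

print(c(err_vs_training, err_vs_testing, 2 * p * (1 - p)))
```

With p = 0.24 the formula gives about 0.36, which is in the same range as the reported 0.39, so the large holdout error may come from the comparison step rather than from the SVM itself.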

Thank you!
