Why my score accuracy is 0.0 ? is there something wrong with my dataset or what ?

Question

1.00/5 (1 vote)

See more:

Here the code from begin :

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import numpy as np
import warnings
warnings.filterwarnings('ignore')
import seaborn as sns
sns.set_style("whitegrid")
from sklearn.model_selection import train_test_split #function
from xgboost import XGBRegressor
from sklearn.preprocessing import OneHotEncoder
from sklearn.metrics import accuracy_score
from sklearn.metrics import r2_score #used to evaluate models and find error score accuracies

This is head of dataset :
Name Year Seller_type Transmission km_driven selling_price owner
0 X-Trail 2015 Individu Automatic 128729 191000000 1st owner
1 Terios 2019 Individu Automatic 76361 202000000 1st owner
2 HR-V 2017 Individu Automatic 45992 266000000 1st owner
3 City 2021 Individu Automatic 3544 269000000 1st owner
4 BR-V 2018 Individu Automatic 85512 179000000 1st owner

Tail of dataset :
Name Year Seller_type Transmission km_driven selling_price owner
694 Terios 2021 Individu Automatic 104087 198422198 1st owner
695 Terios 2013 Individu Automatic 172253 283860358 1st owner
696 Xenia 2020 Showroom Automatic 62043 405074059 2nd owner
697 Terios 2021 Individu Automatic 28409 120411290 2nd owner
698 Terios 2014 Individu Automatic 187904 244024577 2nd owner

Separate the independent & dependent
X = car_dataset.drop('selling_price',axis=1)
y = car_dataset['selling_price'] #Store it feature selling price into Y

Split training & test data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.20, random_state=42)

MODEL TRAINING

xg = XGBRegressor()
xg.fit(X_train, y_train)

y_pred = xg.predict(X_test)
accuracy_score(Y_test, y_pred)
Result : 0.0

R2_score
r2_score(y_test, y_pred)
Result : -0.0441435169799782

What I have tried:

Notes : Before separate the data become independent & dependent, i have convert the text into numerical values.

Please i need ur help guys.

Posted 31-Mar-23 3:40am

Alva Rizky

Add a Solution

Comments

[no name] 31-Mar-23 13:03pm

What is it supposed to tell you? It's not a Magic 8 Ball.

Alva Rizky 31-Mar-23 17:34pm

I tried to predict the price of used car using xgboost

[no name] 1-Apr-23 11:28am

You've identified a "dependent" variable (selling price). I see nothing that defines the independent variables; e.g. mileage, brand, year, etc.

Member 15627495 31-Mar-23 18:32pm

debug is about identifying 'errors'.

by the way , you have a lot of vars to check.

as start :
- you, do you have an idea about the location of the fail in your script ?
- do you verify the content values of the vars before happens this failure ?
- "unitary tests" are codes to achieve debug too
- trace you vars contents and values all along your code.
/- or display the vars contents as much as possible ( easy but gross ), and see where the failure happens
- check all 'T Types' when 'functions and their inputs' are used
- you have twice import numpy
- a test with 0.0 can be a success of maths ? or a 'input Type serial' out of bounds ? ( leading to bug 0.0 as result .. )
- create another case in your dataset

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)