I was working on a dataset and computed the Variance Inflation Factor (VIF) for each feature in Python. The values I got are very high. From what I have read online, a VIF above 10 is usually taken as a sign of multicollinearity, which would mean that every feature in my dataset is multicollinear. Here is an example of my results:

Feature    VIF
A          1250.76
B          130.65
c          190.45
D          20.76

Is there anything wrong with these results, and what should I do about them? Any suggestions or example code would be appreciated.

What I have tried:

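Here is what I have so far:

    import pandas as pd
    from statsmodels.stats.outliers_influence import variance_inflation_factor

    # X is a DataFrame holding the feature columns
    vif_data = pd.DataFrame()
    vif_data["feature"] = X.columns
    # one VIF per column, computed on the raw feature matrix
    vif_data["VIF"] = [variance_inflation_factor(X.values, i)
                       for i in range(len(X.columns))]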

Updated 12-Mar-21 6:48am

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)
