Click here to Skip to main content
16,018,904 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
Hi all,

One of my independent variable in a data that I want to use for Multinomial logistic regression has more than one text values in some of the cells i.e like high,higher or high,higher,highest .

Can I use this variable as it is or is there any transformation that I need to do before modelling?

Thanks

What I have tried:

I have researched , but have not found an answer for this specific use case.
Posted
Updated 28-Oct-22 19:58pm

1 solution

It's called "binning"; do what you need to do to the data to have it make sense for your particular "model".

e.g. Height is almost meaningless (for most) unless it's in terms of short, medium, tall (i.e. range binning).

https://en.wikipedia.org/wiki/Data_binning
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900