Posts

Showing posts from May, 2021

ordinal and label encoding

There are normally two types of categorical data:
1. Ordinal data: data whose values have a natural order (rank). E.g. for a degree column, a bachelor's degree is higher than high school, and a master's is higher than a bachelor's. In such a case we can label master as 3, bachelor as 2, high school as 1 and school as 0.
2. Nominal data: data whose values have no order or rank. E.g. car transmission type, i.e. automatic or manual; gender: either male or female...
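A minimal sketch of both encodings in plain Python, using the degree ranks from the note above. The sample lists and the `label_encode` helper are made up for illustration; they are not from the original notebook.

```python
# Ordinal data: the order matters, so the mapping is written by hand.
degree_rank = {"school": 0, "high school": 1, "bachelor": 2, "master": 3}

degrees = ["bachelor", "school", "master", "high school"]  # hypothetical sample column
degree_encoded = [degree_rank[d] for d in degrees]
print(degree_encoded)  # [2, 0, 3, 1]


# Nominal data: no order, so the integer codes are only identifiers.
def label_encode(values):
    """Give each distinct value an integer code (alphabetical, no ranking implied)."""
    codes = {v: i for i, v in enumerate(sorted(set(values)))}
    return [codes[v] for v in values], codes


gearbox = ["manual", "automatic", "manual"]  # hypothetical sample column
gearbox_encoded, gearbox_codes = label_encode(gearbox)
print(gearbox_encoded)  # [1, 0, 1]
print(gearbox_codes)    # {'automatic': 0, 'manual': 1}
```

With label codes on nominal data, a model may still read 1 > 0 as a ranking, so the codes should be treated as names rather than magnitudes.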

Feature scaling-Standardization

Every feature in the dataset is rescaled to mean = 0 and standard deviation = 1, i.e. the data are centred at the origin (mean = 0). The rescaled values are called z-scores, which is why this is also referred to as the Z-distribution. Refer: untitled7 in Jupyter notebook.
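Standardization computes z = (x - mean) / standard deviation for each value. Below is a small sketch using only the standard library; the `standardize` function and the `heights` sample are assumptions for illustration, not the contents of the notebook.

```python
from statistics import mean, pstdev


def standardize(values):
    """Return z-scores: (x - mean) / population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        raise ValueError("all values are identical, cannot standardize")
    return [(x - mu) / sigma for x in values]


heights = [150.0, 160.0, 170.0, 180.0, 190.0]  # hypothetical feature column
z = standardize(heights)

print([round(v, 3) for v in z])
print(round(mean(z), 6), round(pstdev(z), 6))  # mean and std of the scaled values
```

After scaling, the column has mean 0 and standard deviation 1 (up to floating-point rounding), whatever its original units were.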