What best practices would you recommend for data cleaning?
Data preprocessing/ Data cleaning/ Data wrangling is a ritual that every data scientist has to perform before the data is used for any machine learning model. some simplified steps for preprocessing our data are:
- Finding and handling missing values
- Data Formatting
- Data Normalization
- Data Binning / Converting Numerical data to Categorical Data
- Converting Categorical data to Numerical data