What best practices would you recommend for data cleaning?

What best practices would you recommend for data cleaning?

1 Like

Data preprocessing/ Data cleaning/ Data wrangling is a ritual that every data scientist has to perform before the data is used for any machine learning model. some simplified steps for preprocessing our data are:

  1. Finding and handling missing values
  2. Data Formatting
  3. Data Normalization
  4. Data Binning / Converting Numerical data to Categorical Data
  5. Converting Categorical data to Numerical data

ref:https://medium.com/analytics-vidhya/keep-your-data-clean-8450e35f4895