How Do You Design an Email Spam Filter in Machine Learning?

  1. Understand the business model: Try to understand the related attributes for the spam mail
  2. Data acquisitions: Collect the spam mail to read the hidden pattern from them
  3. Data cleaning: Clean the unstructured or semi structured data
  4. Exploratory data analysis: Use statistical concepts to understand the data like spread, outlier, etc.
  5. Use machine learning algorithms to make a model: can use naive bayes or some other algorithms as well
  6. Use unknown dataset to check the accuracy of the model