Introduction
This dataset is downloaded from the UCI machine learning repository (https://archive.ics.uci.edu/ml/datasets/Wine+Quality). The dataset is related to white variants of the Portuguese “Vinho Verde” wine.
In this problem, I’m going to classify different wine into different quality groups with different characteristics of wine (eg. acidity, sweetness, density, pH, etc.). Specifically, I’m going to group the quality score into two groups, with the quality score greater or equal to 6 being classified as 1: good quality and the rest quality score below 6 as 0: poor quality and will use Logistic Regression and Naive Bayes for the classification.
HAVE A GREAT OPPORTUNITY FOR ME? |
Feel free to drop me a line