DAT203.2x: Lab 5 - Diabetes Classification (R)

November 14, 2017
Starting point for DAT203.2x Lab 5
• Prepare the dataset for analysis.
• Investigate relationships in the dataset with visualization, using custom R or Python code.
• Create a two-class logistic classification model.
• Evaluate the performance of the classification model.

In this lab, I have created a classification model using the Two-Class Logistic Regression algorithm. An initial evaluation suggests that the model performs better than random guessing, but more evaluation is needed, and the model may still be improvable. You will explore techniques for improving models later in this course.
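As a rough illustration of what "better than random guessing" means for a two-class logistic model, here is a minimal Python sketch. It is not the lab's own experiment. The data is synthetic (two made-up numeric features, not the diabetes dataset). The helper names (make_synthetic, train_logistic, accuracy, auc) are hypothetical. The model is fitted with plain gradient descent rather than the lab's Two-Class Logistic Regression algorithm. The point is only to show the evaluation idea: a classifier that ranks cases at random has an AUC of about 0.5, so a useful model should score clearly above that on held-out data.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def make_synthetic(n, rng):
    # ASSUMPTION: hypothetical stand-in data, not the lab's diabetes dataset.
    # Two numeric features; the label depends on them through a logistic link.
    rows = []
    for _ in range(n):
        x1, x2 = rng.gauss(0, 1), rng.gauss(0, 1)
        p = sigmoid(1.5 * x1 - 1.0 * x2)
        y = 1 if rng.random() < p else 0
        rows.append(([x1, x2], y))
    return rows


def train_logistic(rows, lr=0.1, epochs=200):
    # Full-batch gradient descent on the mean log-loss.
    w, b, n = [0.0, 0.0], 0.0, len(rows)
    for _ in range(epochs):
        gw, gb = [0.0, 0.0], 0.0
        for x, y in rows:
            err = sigmoid(w[0] * x[0] + w[1] * x[1] + b) - y
            gw[0] += err * x[0]
            gw[1] += err * x[1]
            gb += err
        w = [w[0] - lr * gw[0] / n, w[1] - lr * gw[1] / n]
        b -= lr * gb / n
    return w, b


def predict_proba(model, x):
    w, b = model
    return sigmoid(w[0] * x[0] + w[1] * x[1] + b)


def accuracy(model, rows, threshold=0.5):
    # Fraction of rows whose thresholded prediction matches the label.
    hits = sum((predict_proba(model, x) >= threshold) == (y == 1) for x, y in rows)
    return hits / len(rows)


def auc(model, rows):
    # Probability that a random positive case is scored above a random
    # negative case (ties count as half). Random ranking gives about 0.5.
    pos = [predict_proba(model, x) for x, y in rows if y == 1]
    neg = [predict_proba(model, x) for x, y in rows if y == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


rng = random.Random(0)  # fixed seed so the run is repeatable
data = make_synthetic(600, rng)
train, test = data[:400], data[400:]  # hold out a third for evaluation

model = train_logistic(train)
test_accuracy = accuracy(model, test)
test_auc = auc(model, test)
print(f"accuracy={test_accuracy:.3f}  AUC={test_auc:.3f}  (random ranking: AUC about 0.5)")
```

Scoring on rows held out from training matters here: metrics computed on the training rows tend to look better than the model deserves. A single train/test split is still only an initial check, which is why further evaluation is called for.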