Prediction of Breast Cancer using Logistic Regression/Decision Trees/Boosted Decision Trees
For any person suffering from Breast Cancer , the diagnosis whether the cancer is benign or malignant is imperative.We are going to predict whether the tumor is malignant or benign using Machine Learning Models
We have developed Logistic Regression/ Decision Tree/ Boosted Tree Classifier Models to classify the diagnosis of Breast Cancer cases into malignant and benign.
Biopsy: For this test, the doctor removes tissue or fluid from your breast. They look at it under a microscope to check for if cancer cells and, if they’re there, learn which type they are. Common procedures include:
Data collection
We are using the following Kaggle Dataset https://www.kaggle.com/uciml/breast-cancer-wisconsin-data.Contains Fine-needle aspiration results.It Contains 32 columns and 569 rows.
About the Dataset
Attribute Information:
1) ID number
2) Diagnosis (+1 = malignant, -1 = benign)
3) Ten real-valued features are computed for each cell nucleus:
List of important features :
Accuracy on Training Data : 0.98
Accuracy on Test Data : 1.00
Logistic Regression with L2 regularization
Accuracy on Training Data : 0.97
Accuracy on Test Data : 0.95
Comparing Boosted Tree Models with different number of iterations
Accuracy on Training Data : 0.94
Accuracy on Test Data : 0.96
Rajarshi Maity
rajarshimaity3235@gmail.com