- By Mayank, Pritika and Shivam | 04/21 - 07/21 | Major Project
Dataset Used:- https://www.kaggle.com/fedesoriano/stroke-prediction-dataset
-
- Evaluated and analyzed the dataset using seaborn, plotly and matplotlib libraries.
-
- Compared various ML classifiers based on accuracy on our dataset.
-
- Performed k-fold validation and Hyper Parameter Optimization.
-
- Prepared the backend using Flask and linked with front end made using HTML & CSS.
-
- Hosted the project on heroku.
- READING DATA AND EXPLORING DATASET
- BASIC DATA ANALYSIS
- PANDAS PROFILING
- CORRELATION
- ANOMALY DETECTION
- MISSING VALUES
- ENCODING
- TRAIN TEST SPLIT
- TRAINING THE MODEL
- SCORES OF MODELS
- EVALUATION OF MODEL
- K-FOLD VALIDATION
- HYPER PARAMETER OPTIMIZATION
- PREPARATION OF USER GUI
- INTEGRATION WITH GUI
- HOSTING
-
- Gaussian Naive Bayes
-
- Bernoulli naïve Bayes
-
- Logistic Regression
-
- Support Vector Machines
-
- Random Forest
-
- Decision Tree classifier
-
- K-Nearest Neighbor
-
- Gradient Boosting
-
- Stochastic Gradient Descent
-
- Neural nets
- click==8.0.0
- Flask==2.0.0
- gunicorn==20.1.0
- itsdangerous==2.0.0
- Jinja2==3.0.0
- joblib==1.0.1
- Markdown==3.3.4
- matplotlib==3.4.2
- matplotlib-inline==0.1.2
- numpy==1.19.5
- pandas==1.2.4
- pandas-profiling==3.0.0
- plotly==4.14.3
- plotly-geo==1.0.0
- preprocessing==0.1.13
- requests==2.25.1
- requests-oauthlib==1.3.0
- scikit-learn==0.24.2
- seaborn==0.11.1
- Werkzeug==2.0.0
- wordcloud==1.8.1
- xgboost==1.4.2
-
- Check the requirements
-
- Clone the repository
-
- Open the project directory
-
- Open and run app.py file
-
- A web page will open in the browser, in which you can fill the details and submit
-
- The result of prediction is shown on the next page based on input.
- Open "strokemsp.herokuapp.com"
- Fill in the details asked in the form
- Submit the details, and wait for a while.
- The model will show whether you are prone to stroke or not.