PCOS Detection Machine Learning Project

Project Overview

This project aims to build a machine learning model to predict the likelihood of Polycystic Ovary Syndrome (PCOS) using various health indicators. The dataset is in CSV format, and the solution involves data preprocessing, feature scaling, and training several machine learning models.

Dataset

The dataset is provided in CSV format. It contains multiple features related to health indicators that may influence PCOS diagnosis.

Target Variable:

PCOS Diagnosis (binary classification: 1 indicates PCOS, 0 indicates no PCOS)

Sample Features:

Age
BMI
Insulin levels
Hormonal indicators
Menstrual cycle information
PCOS status (the target variable described above, not an input feature)
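
As a rough illustration of how such a CSV might be loaded and split into inputs and target, here is a minimal sketch. The column names (`Age`, `BMI`, `Insulin`, `Cycle`, `PCOS`) and the inline sample rows are hypothetical stand-ins; the real dataset's headers and values may differ.

```python
import io

import pandas as pd

# Hypothetical stand-in for the project's CSV file; the real column
# names and values may differ.
csv_text = """Age,BMI,Insulin,Cycle,PCOS
29,24.1,5.2,R,0
34,31.7,14.8,I,1
26,22.3,,R,0
31,29.9,12.1,I,1
"""

df = pd.read_csv(io.StringIO(csv_text))

# Separate the binary target (1 = PCOS, 0 = no PCOS) from the inputs so
# the label never ends up in the feature matrix.
y = df["PCOS"]
X = df.drop(columns=["PCOS"])

# Count missing values per column before deciding how to handle them.
missing_per_column = X.isna().sum()

print(X.shape)
print(missing_per_column.to_dict())
```

With a real file, `io.StringIO(csv_text)` would be replaced by the path to the CSV placed in the project directory.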

Features

Data loading and exploratory analysis
Data preprocessing (handling categorical and missing data)
Feature scaling using StandardScaler
Model training and hyperparameter tuning using GridSearchCV
Performance evaluation metrics (accuracy, precision, recall, F1-score)
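
The steps listed above can be sketched roughly as follows. This is not the notebook's code: the synthetic data from `make_classification`, the choice of `LogisticRegression`, and the grid of `C` values are all assumptions made for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic binary data standing in for the PCOS dataset (assumption).
X, y = make_classification(n_samples=300, n_features=6, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Fit the scaler on the training split only, then apply it to both splits.
scaler = StandardScaler()
X_train_s = scaler.fit_transform(X_train)
X_test_s = scaler.transform(X_test)

# Grid search over the regularization strength with 5-fold cross-validation.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},
    scoring="f1",
    cv=5,
)
search.fit(X_train_s, y_train)

# Evaluate the refitted best model on the held-out test split.
y_pred = search.predict(X_test_s)
metrics = {
    "accuracy": accuracy_score(y_test, y_pred),
    "precision": precision_score(y_test, y_pred),
    "recall": recall_score(y_test, y_pred),
    "f1": f1_score(y_test, y_pred),
}
print(search.best_params_, metrics)
```

One caveat with this layout: the scaler is fitted on the full training split before cross-validation, so each validation fold has already influenced the scaling statistics. The effect is usually small, but it is one reason to wrap preprocessing and model in a single estimator.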

Technologies Used

Programming Language: Python
Libraries:
- Pandas
- Scikit-Learn
- NumPy
- Matplotlib (for visualizations)

Installation

Clone the repository:

git clone https://github.com/yourusername/pcos-detection.git
cd pcos-detection

Install required dependencies:
```
pip install -r requirements.txt
```
Place your dataset CSV file in the project directory.

Usage

Open the notebook FINAL_PCOS.ipynb.
Run the cells sequentially to preprocess the data, train the model, and evaluate its performance.
Modify hyperparameters or model types if needed.

Results

Initial results indicate that the model achieved the following performance metrics:

Accuracy: 91%
F1-Score: 91%

Further optimization is suggested to improve model performance.
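
For readers unfamiliar with how these metrics are defined, the sketch below computes accuracy, precision, recall, and F1 from confusion-matrix counts. The helper `binary_metrics` and the toy labels are hypothetical and for illustration only; they are not the project's predictions and do not reproduce the figures above.

```python
def binary_metrics(y_true, y_pred):
    """Return accuracy, precision, recall, and F1 for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Toy labels for illustration only (1 = PCOS, 0 = no PCOS).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]

scores = binary_metrics(y_true, y_pred)
print(scores)
```

Accuracy and F1 can diverge when the classes are imbalanced, so it is worth checking the class balance of the dataset when reading the two numbers side by side.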

Optimizations

Data Preprocessing:
- Consider target encoding for high-cardinality categorical variables.
Hyperparameter Tuning:
- Use RandomizedSearchCV for faster tuning when dealing with larger parameter spaces.
Pipeline Creation:
- Integrate the preprocessing and model training steps into a Scikit-Learn pipeline for cleaner and more maintainable code.
Cross-Validation:
- Increase the number of cross-validation folds to obtain a more reliable estimate of how well the model generalizes.
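
A minimal sketch of how the pipeline, randomized search, and cross-validation suggestions might fit together is shown below. The synthetic data, the median imputer, the `LogisticRegression` classifier, the search range for `C`, and the fold count of 10 are all assumptions chosen for illustration, not the project's settings.

```python
from scipy.stats import loguniform
from sklearn.datasets import make_classification
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RandomizedSearchCV, StratifiedKFold
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic binary data standing in for the PCOS dataset (assumption).
X, y = make_classification(n_samples=300, n_features=6, random_state=0)

# Preprocessing and model live in one estimator, so imputation and scaling
# are refitted inside every cross-validation fold.
pipe = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
    ("clf", LogisticRegression(max_iter=1000)),
])

# Sample 10 candidate values of C from a log-uniform range instead of
# evaluating a full grid.
search = RandomizedSearchCV(
    pipe,
    param_distributions={"clf__C": loguniform(1e-3, 1e2)},
    n_iter=10,
    scoring="f1",
    cv=StratifiedKFold(n_splits=10, shuffle=True, random_state=0),
    random_state=0,
)
search.fit(X, y)

print(search.best_params_, round(search.best_score_, 3))
```

Parameters of a pipeline step are addressed as `<step name>__<parameter>`, which is why the search space uses the key `clf__C`.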

Contributing

Contributions are welcome. Please create a pull request or open an issue for discussion.

License

This project is licensed under the MIT License.
