This project predicts sales revenue based on advertising budgets using machine learning. The dataset includes advertising expenditures on TV, Radio, and Newspaper, and the goal is to build a model to predict sales based on these investments.
The dataset consists of 200 entries with the following columns:
TV
β Advertising budget for TV (in $1000s)Radio
β Advertising budget for Radio (in $1000s)Newspaper
β Advertising budget for Newspaper (in $1000s)Sales
β Sales revenue generated (in $1000s) (Target variable)
- Python π
- Pandas & NumPy (Data Processing)
- Matplotlib & Seaborn (Data Visualization)
- Scikit-learn (Machine Learning β Linear Regression)
β
Data Cleaning and Preprocessing
β
Exploratory Data Analysis (EDA)
β
Sales Prediction using Linear Regression
β
Model Evaluation Metrics
- Load and explore the dataset.
- Perform Exploratory Data Analysis (EDA) to visualize trends in advertising and sales.
- Train a Linear Regression model to predict sales.
- Evaluate the model's performance using:
- RΒ² Score
- Mean Squared Error (MSE)
The notebook includes:
β
Pairplots for feature relationships
β
Correlation Heatmap to find important variables
β
Regression Plot to visualize predictions
Contributions are welcome! π
If youβd like to contribute, please:
- Fork the repository
- Create a new branch (
feature-branch
) - Submit a pull request
This project is licensed under the MIT License.