YukiP7/end-to-end-regression-project

🔥 Algerian Forest Fires — Predicting Fire Weather Index with Machine Learning

📌 Overview

This end-to-end machine learning project predicts the Fire Weather Index (FWI) from the Algerian Forest Fires dataset. The goal is to anticipate wildfire risk from weather and environmental patterns in two regions of Algeria.


📁 Dataset

The dataset contains daily meteorological data from two regions:

  • Bejaia Region
  • Sidi Bel-Abbes Region

Each entry includes features such as temperature, wind speed, humidity, and rain, along with the corresponding FWI value.


🔍 Project Highlights

✅ Data Cleaning & Preprocessing

  • Merged region-wise data into a single structured dataset
  • Removed irrelevant columns and handled missing values
  • Standardized column names and converted dates
  • Encoded region and date-based features for modeling
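
The cleaning steps above could look roughly like the following. This is a minimal sketch, not the project's actual notebook code: the two small tables, their column names (including the stray whitespace), the `Region` codes 0 and 1, and the helper `clean_regions` are all hypothetical stand-ins chosen for illustration.

```python
import pandas as pd

# Hypothetical miniature stand-ins for the two regional tables.
# Column names and values are invented; the real dataset may differ.
bejaia = pd.DataFrame({
    "day": [1, 2], "month": [6, 6], "year": [2012, 2012],
    " Temperature": [29, 31], " RH": [57, None], "FWI ": [0.5, 0.4],
})
sidi = pd.DataFrame({
    "day": [1, 2], "month": [6, 6], "year": [2012, 2012],
    " Temperature": [32, 30], " RH": [71, 73], "FWI ": [3.9, 1.2],
})


def clean_regions(bejaia: pd.DataFrame, sidi: pd.DataFrame) -> pd.DataFrame:
    """Merge two regional tables and apply basic cleaning (illustrative)."""
    # Tag each table with a numeric region code, then stack them.
    merged = pd.concat(
        [bejaia.assign(Region=0), sidi.assign(Region=1)],
        ignore_index=True,
    )
    # Standardize column names by trimming stray whitespace.
    merged.columns = merged.columns.str.strip()
    # Drop rows with missing values (one of several possible strategies).
    merged = merged.dropna().reset_index(drop=True)
    # Build a datetime column from the day/month/year parts.
    merged["date"] = pd.to_datetime(merged[["year", "month", "day"]])
    return merged


df = clean_regions(bejaia, sidi)
print(df)
```

Dropping incomplete rows is only one way to handle missing values; imputation may be preferable when the gaps are more frequent.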

📊 Exploratory Data Analysis (EDA)

  • Correlation heatmaps to identify linear relationships
  • Pair plots and KDE plots for feature distribution analysis
  • Line plots for temperature, wind speed, and humidity trends over time
  • Regional comparison of weather patterns and fire risk
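
As a small illustration of the correlation and regional-comparison steps above, the sketch below computes each feature's linear correlation with FWI and the per-region means using pandas only. The six rows of data are made up for the example and the column names are assumptions; plotting the result (for instance as a heatmap) is left out to keep the sketch dependency-light.

```python
import pandas as pd

# Made-up values for illustration only; not taken from the real dataset.
df = pd.DataFrame({
    "Region": [0, 0, 0, 1, 1, 1],
    "Temperature": [29, 31, 33, 30, 34, 36],
    "RH": [60, 55, 50, 58, 45, 40],
    "FWI": [0.5, 1.5, 3.0, 1.0, 5.0, 7.5],
})

# Pearson correlation of every feature with the target.
corr_with_fwi = df.drop(columns="Region").corr()["FWI"].drop("FWI")

# Average conditions per region for a side-by-side comparison.
regional_means = df.groupby("Region")[["Temperature", "RH", "FWI"]].mean()

print(corr_with_fwi)
print(regional_means)
```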

🧠 Feature Engineering

  • Detected multicollinearity using heatmaps
  • Created additional features to improve prediction accuracy
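
One common way to act on multicollinearity is to flag any feature whose absolute correlation with an earlier feature exceeds a threshold. The sketch below assumes that approach; the function name `highly_correlated`, the 0.85 threshold, and the toy columns `a`, `b`, `c` are hypothetical and not taken from the project.

```python
import numpy as np
import pandas as pd


def highly_correlated(features: pd.DataFrame, threshold: float = 0.85) -> list:
    """Return columns whose |correlation| with an earlier column exceeds threshold."""
    corr = features.corr().abs()
    # Keep only the strict upper triangle so each pair is checked once.
    mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    upper = corr.where(mask)
    return [col for col in upper.columns if (upper[col] > threshold).any()]


# Toy data: "b" is almost exactly 2 * "a", while "c" is weakly related to both.
demo = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.1, 3.9, 6.2, 8.0, 9.9],
    "c": [5.0, 1.0, 4.0, 2.0, 3.0],
})

to_drop = highly_correlated(demo)
reduced = demo.drop(columns=to_drop)
print(to_drop)
```

The threshold is a judgment call: a lower value removes more redundancy but risks discarding features that carry independent signal.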

🤖 Model Building

  • Trained and evaluated the following regression models:
    • Linear Regression
    • Ridge, Lasso, and ElasticNet Regression
  • Pickled the trained model for reuse
  • Evaluated models using:
    • R² Score
    • Mean Absolute Error (MAE)
    • Root Mean Squared Error (RMSE)
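
The training and evaluation loop can be sketched with scikit-learn as follows. Synthetic data from `make_regression` stands in for the real features and FWI target, and the hyperparameters (`alpha`, `l1_ratio`), the 75/25 split, and the use of `StandardScaler` are assumptions made for this example rather than the project's actual settings.

```python
import pickle

import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import ElasticNet, Lasso, LinearRegression, Ridge
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the cleaned features (X) and FWI target (y).
X, y = make_regression(n_samples=200, n_features=5, noise=5.0, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

# Fit the scaler on the training split only to avoid leaking test statistics.
scaler = StandardScaler().fit(X_train)
X_train_s = scaler.transform(X_train)
X_test_s = scaler.transform(X_test)

models = {
    "Linear": LinearRegression(),
    "Ridge": Ridge(alpha=1.0),
    "Lasso": Lasso(alpha=0.1),
    "ElasticNet": ElasticNet(alpha=0.1, l1_ratio=0.5),
}

results = {}
for name, model in models.items():
    model.fit(X_train_s, y_train)
    pred = model.predict(X_test_s)
    results[name] = {
        "r2": r2_score(y_test, pred),
        "mae": mean_absolute_error(y_test, pred),
        # RMSE is the square root of the mean squared error.
        "rmse": float(np.sqrt(mean_squared_error(y_test, pred))),
    }
    print(name, results[name])

# Pickle round trip: serialize a fitted model and restore it.
blob = pickle.dumps(models["Ridge"])
restored = pickle.loads(blob)
```

To persist a model to disk instead of to bytes, `pickle.dump(model, file)` with a file opened in `"wb"` mode does the same job; the fitted scaler would need to be saved alongside it so new inputs are transformed consistently.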

📈 Visualizations

  • Correlation matrix and heatmaps
  • Region-wise line plots for environmental variables
  • Actual vs. predicted FWI plots
  • Feature importance charts from ensemble models

🧰 Tech Stack

  • Python
  • Pandas, NumPy
  • Matplotlib, Seaborn
  • Scikit-learn
  • Jupyter Notebook

🚀 Future Scope

  • Deploying the model with Flask or Streamlit for public use
  • Implementing time-series forecasting for FWI trends
  • Adding real-time data ingestion for dynamic prediction

🔗 GitHub Repository

Click here to view the project


🧠 Author

Yukti Garg
Passionate about machine learning, data science, and solving real-world problems through data.
