Skip to content

Arun-George-Zachariah/XGBoost-Spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

XGBoost-Spark

XGBoost stands for eXtreme Gradient Boosting. It is a highly optimized and distributed implementation of gradient boosting decision trees, to handle large and complicated datasets. In this project, we would we using XGBoost4J-Spark to build simple Classification and Regression models.

Setup

Requirements:

On Ubuntu, you can use scripts/setup.sh to setup the pre-requisites.

Build

sbt clean assembly

Execute

  • Classification
spark-submit --class edu.missouri.XGBoost.ClassifierPipeline target/scala-2.12/XGBoost-Spark-assembly-0.1.jar
  • Regression
spark-submit --class edu.missouri.XGBoost.RegressionPipeline target/scala-2.12/XGBoost-Spark-assembly-0.1.jar

References:

About

XGBoost4J-Spark for Classification and Regression.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published