Skip to main content

XGBoost

XGBoost (Extreme Gradient Boosting) is a powerful machine learning framework for gradient boosting. Pick xgboost on AIchor when you want distributed XGBoost training across multiple worker pods.

How to use

Select XGBoost by setting spec.operator: xgboost in your manifest. The full field-by-field specification lives in the Manifest Reference, and more complete examples are in the Manifest Examples.

spec:
operator: "xgboost"
image: "xgboost-demo"
command: "python src/train.py"

types:
worker:
count: 2
resources:
cpus: 8
ramRatio: 2
...

Injected environment variables

The following environment variables are injected into every worker container to set up the distribution between the different containers:

VariableDescriptionExample
MASTER_PORTPort the master listens on for the distributed rendezvous.9999
MASTER_ADDRAddress of the master container that the others connect to.xgboost-dist-demo-master-0
WORLD_SIZETotal number of containers in the run.3
RANKRank of the current container, from 0 to WORLD_SIZE - 1.1

External documentation

Demo project

The AIchor team has shared this demo project that can be cloned and used for AIchor experiments using XGBoost: