Having trouble reproducing Large Model Training #595

Answered by ilyes319
daniel-sintef asked this question in Q&A

Hey @daniel-sintef ,

I realized we uploaded the wrong script for the L=2 model to the mace-mp repo. Here is the correct one:

srun python /pscratch/sd/c/cyrusyc/mace/mace/cli/run_train.py \
    --name=$exp_name \
    --train_file="../../dataset/mptrj-gga-ggapu-train" \
    --valid_file="../../dataset/mptrj-gga-ggapu-val" \
    --statistics_file="../../dataset/mptrj-gga-ggapu-statistics.json" \
    --loss='universal' \
    --energy_weight=1 \
    --forces_weight=10 \
    --compute_stress=True \
    --stress_weight=100 \
    --stress_key='stress' \
    --eval_interval=1 \
    --error_table='PerAtomMAE' \
    --model="ScaleShiftMACE" \
    --interaction_first="RealAgnosticResidualInteractionBlock…

Answer selected by ilyes319
Category
Q&A
Converted from issue

This discussion was converted from issue #591 on September 18, 2024 07:27.