Training API Reference

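Reference for the training infrastructure: trainers, their configuration objects, optimizer and scheduler helpers, and utilities for multi-GPU (distributed) training.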

SHCTrainer

Main trainer class for SHC models.

TrainingArgs

Configuration for training.

DistillationTrainer

Trainer for SSM distillation.
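
This reference does not state which objective DistillationTrainer optimizes. As a rough illustration of what distillation usually involves, the sketch below computes the classic temperature-softened KL divergence between teacher and student logits. The names `softmax` and `distillation_loss` and the default temperature are hypothetical and are not part of this API.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of floats."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Hypothetical helper: the loss actually used by DistillationTrainer
    is not documented in this reference.
    """
    p = softmax(teacher_logits, temperature)  # teacher (target) distribution
    q = softmax(student_logits, temperature)  # student distribution
    kl = sum(
        pi * (math.log(pi) - math.log(qi))
        for pi, qi in zip(p, q)
        if pi > 0.0
    )
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    return kl * temperature ** 2


if __name__ == "__main__":
    teacher = [1.0, 2.0, 3.0]
    student = [3.0, 2.0, 1.0]
    print(distillation_loss(student, teacher))
```

The loss is zero when student and teacher logits agree and grows as the two softened distributions diverge.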

DistillationConfig

Configuration for distillation.

Optimizer Utilities

create_optimizer

create_scheduler
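
The signature of create_scheduler and the schedule it produces are not documented here. For orientation only, the sketch below shows one common choice, linear warmup followed by cosine decay, as a plain function of the step index. The name `warmup_cosine_lr` and all of its parameters are assumptions made for illustration.

```python
import math


def warmup_cosine_lr(step, base_lr, warmup_steps, total_steps, min_lr=0.0):
    """Learning rate at `step` under linear warmup then cosine decay.

    Hypothetical helper; not the documented behavior of create_scheduler.
    """
    if warmup_steps > 0 and step < warmup_steps:
        # Ramp linearly so the last warmup step reaches base_lr.
        return base_lr * (step + 1) / warmup_steps
    # Fraction of the decay phase completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(1.0, max(0.0, progress))
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return min_lr + (base_lr - min_lr) * cosine


if __name__ == "__main__":
    for step in (0, 9, 10, 55, 100):
        print(step, warmup_cosine_lr(step, 1e-3, 10, 100))
```

A function of this shape can be wrapped by most frameworks' lambda-style schedulers, which is one reason to keep the schedule itself free of framework dependencies.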

Distributed Utilities

setup_distributed

cleanup_distributed
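
The arguments and return values of setup_distributed and cleanup_distributed are not listed in this reference. As a minimal sketch of the first step such a setup typically performs, the code below reads the RANK, LOCAL_RANK, and WORLD_SIZE environment variables that launchers such as torchrun export, and falls back to single-process values when they are absent. `DistEnv` and `read_dist_env` are hypothetical names, not part of this API.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class DistEnv:
    """Process identity as exported by the launcher (hypothetical type)."""
    rank: int        # global index of this process
    local_rank: int  # index of this process on its node (usually the GPU index)
    world_size: int  # total number of processes

    @property
    def is_distributed(self):
        return self.world_size > 1

    @property
    def is_main_process(self):
        return self.rank == 0


def read_dist_env(environ=None):
    """Read launcher variables, defaulting to a single-process run."""
    env = os.environ if environ is None else environ
    return DistEnv(
        rank=int(env.get("RANK", 0)),
        local_rank=int(env.get("LOCAL_RANK", 0)),
        world_size=int(env.get("WORLD_SIZE", 1)),
    )


if __name__ == "__main__":
    print(read_dist_env())
```

In a PyTorch setting, a setup routine would normally go on to call `torch.distributed.init_process_group`, and the matching cleanup would call `torch.distributed.destroy_process_group`. Whether the utilities documented here do exactly that is an assumption.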