Posts Tagged "GPU memory, gradient checkpointing"

Gradient checkpointing

Gradient checkpointing enables you to run a more powerful model on your machine - beneficial under training.