5 Partitioning Methods for Multi-GPU Training | NanoGPT