Performance Benchmarks for OOD Generalization | NanoGPT