Multi-Fidelity Optimization for Neural Networks | NanoGPT