Guide to Optimizing Multimodal Pipeline Benchmarks | NanoGPT