How Load Balancing Reduces AI Deployment Costs | NanoGPT