Top 7 Low-Latency Inference Techniques | NanoGPT