DeepSeek R1 model | NanoGPT