DeepSeek V4 Flash (Thinking) model | NanoGPT