Qwen Long 10M model | NanoGPT