simonw on Feb 2, 2025 \| on: Recent results show that LLMs struggle with compos...

Huh, yeah that's a good point. The various distilled R1 models are definitely regular transformer-based LLMs, because the GGUF file versions of them work without any upgrades to the underlying llama.cpp library.
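One rough way to check this for a given file is to read the `general.architecture` metadata key that GGUF files carry. The sketch below is a minimal stdlib-only reader, not llama.cpp's own loader: the function name `gguf_architecture` is made up for illustration, and it assumes a little-endian GGUF file of version 2 or later (64-bit counts and string lengths). For a distilled R1 model I would expect a value naming an existing architecture such as `llama` or `qwen2`, but that is an expectation, not something verified here.

```python
import struct

# Byte sizes of the fixed-width GGUF metadata value types (type id -> size).
SCALAR_SIZES = {0: 1, 1: 1, 2: 2, 3: 2, 4: 4, 5: 4, 6: 4, 7: 1, 10: 8, 11: 8, 12: 8}
STRING, ARRAY = 8, 9  # type ids for variable-length values


def _read_u32(f):
    return struct.unpack("<I", f.read(4))[0]


def _read_u64(f):
    return struct.unpack("<Q", f.read(8))[0]


def _read_string(f):
    # GGUF strings are a uint64 byte length followed by UTF-8 bytes.
    return f.read(_read_u64(f)).decode("utf-8")


def _read_value(f, vtype):
    if vtype == STRING:
        return _read_string(f)
    if vtype == ARRAY:
        item_type, count = _read_u32(f), _read_u64(f)
        return [_read_value(f, item_type) for _ in range(count)]
    # Scalars are skipped as raw bytes; this sketch does not decode them.
    return f.read(SCALAR_SIZES[vtype])


def gguf_architecture(f):
    """Return the general.architecture string from an open binary file, or None."""
    if f.read(4) != b"GGUF":
        raise ValueError("not a GGUF file")
    _version = _read_u32(f)
    _tensor_count, kv_count = _read_u64(f), _read_u64(f)
    for _ in range(kv_count):
        key = _read_string(f)
        value = _read_value(f, _read_u32(f))
        if key == "general.architecture":
            return value
    return None
```

To try it, open a model file in binary mode (`open(path, "rb")`) and pass the file object to `gguf_architecture`. If the value names an architecture that llama.cpp already supported before the model was released, that fits the observation that no library upgrade was needed.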