So far I only have the “cold start” data posted, but I’m planning on posting a full distillation dataset.
https://huggingface.co/datasets/dleemiller/lm25
There’s a package called “unsloth” that integrates with huggingface’s TRL library that can help.
So far I only have the “cold start” data posted, but I’m planning on posting a full distillation dataset.
https://huggingface.co/datasets/dleemiller/lm25