Hi Ronan, how is watching your video different from the fine-tuning repo on trelis.com? Did you skip any parts of the code in the video, or are they the same?
Sorry for the slow reply, and thanks for the question on the livestream today. The answer is that the repos provide the code that goes with the videos. If you want all the details of the code, or to modify it, that's what the repos provide, along with support via GitHub issues. Generally I try to cover most of the content in the videos, but often there isn't time to cover every detail.
Hey question on GPT OSS. Have you been able to get it to work in GRPO with TRL / vLLM?
Howdy!
I haven't run GRPO with TRL / vLLM, but in principle it should work. I'm planning to wait on Unsloth support and perhaps do a run of that kind then.
Thanks! That would be great. It seems like there's some issue right now -- the SFT works, but using vLLM with TRL seems buggy (possibly something with the MoE, or the weights not being set up so they can be easily updated on the vLLM side?)
It seems like a great model size for GRPO training on a single node, though.
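For context, a single-node GRPO run with TRL's vLLM-backed generation is typically set up along these lines. This is a minimal sketch, assuming a recent TRL release where `GRPOConfig` exposes `use_vllm`; the model name, dataset, and reward function are illustrative placeholders, not anything from the thread above:

```python
# Sketch of a GRPO run using TRL with vLLM for generation.
# Assumes recent trl + vllm installs and a GPU node; the model name,
# dataset, and reward function below are placeholders.
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

def reward_len(completions, **kwargs):
    # Toy reward: prefer shorter completions (stand-in for a real reward).
    return [-float(len(c)) for c in completions]

dataset = load_dataset("trl-lib/tldr", split="train")

config = GRPOConfig(
    output_dir="grpo-run",
    use_vllm=True,       # generation served by vLLM; note that syncing updated
                         # policy weights into vLLM is the step that can be
                         # fragile for some architectures (e.g. MoE models)
    num_generations=8,   # completions sampled per prompt for the group baseline
)

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",  # placeholder single-node-sized model
    reward_funcs=reward_len,
    args=config,
    train_dataset=dataset,
)
trainer.train()
```

The vLLM path matters here because GRPO samples many completions per prompt each step, so generation throughput dominates; the weight-update path from trainer to vLLM is exactly where the MoE issue described above would surface.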