Have you tuned a model?

venusaur@lemmy.world · 3 months ago

Have you tuned a model?

lunarwingorg@lemmy.world · 1 month ago

Unfortunately, I did not notice much of a difference with model tuning, and it took a pretty decent chunk of time. My most powerful PC, which is where I run most models (the lower-end machines with worse GPUs run embedded text models), is a fairly powerful machine with a single 4090. I have had better luck just downloading differently tuned variants of the same model from others.

venusaur@lemmy.world · 1 month ago

Bummer. Do you think it was the training data or just the nature of fine-tuning? Something else? What were you tuning it for, if you don’t mind my asking?

lunarwingorg@lemmy.world · 1 month ago

Just the nature of them being quite old models without proper tool-calling functionality. What actually DID help was setting up middleware and custom Python servers/clients with proper JSON mapping so that the right tools get selected. So, literally zero model tuning required in the end.
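
A minimal sketch of what that kind of middleware could look like, assuming the model emits something like `{"tool": ..., "args": {...}}` somewhere in its free-form text. The tool names, the alias table, and the `dispatch` function are all hypothetical and only illustrate the idea; this is not the actual setup described above.

```python
import json
import re

# Hypothetical tools; names and behavior are illustrative only.
def get_time(timezone: str = "UTC") -> str:
    return f"time requested for {timezone}"

def add(a: float, b: float) -> float:
    return a + b

TOOLS = {"get_time": get_time, "add": add}

# Assumed aliases an older model might emit instead of the real tool name.
ALIASES = {"time": "get_time", "sum": "add", "calculator": "add"}

def extract_json(text: str):
    """Return the first JSON object embedded in model output, or None."""
    decoder = json.JSONDecoder()
    for match in re.finditer(r"\{", text):
        try:
            obj, _ = decoder.raw_decode(text, match.start())
        except json.JSONDecodeError:
            continue  # not valid JSON from this brace, try the next one
        if isinstance(obj, dict):
            return obj
    return None

def dispatch(model_output: str):
    """Map raw model text to a tool call; return None if nothing valid is found."""
    call = extract_json(model_output)
    if call is None:
        return None
    name = call.get("tool")
    if not isinstance(name, str):
        return None
    func = TOOLS.get(ALIASES.get(name, name))
    args = call.get("args", {})
    if func is None or not isinstance(args, dict):
        return None
    try:
        return func(**args)
    except TypeError:
        return None  # arguments did not match the tool's signature
```

With this in place, `dispatch('Sure! {"tool": "sum", "args": {"a": 2, "b": 3}}')` resolves the alias `sum` to `add` and calls it, while output with no usable JSON simply returns `None` so the caller can retry or fall back. The point of the design is that the mapping lives outside the model, so no tuning is involved.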

venusaur@lemmy.world · 30 days ago

Got it. Do you think tuning again after calibrating tool calling would be beneficial?

lunarwingorg@lemmy.world · 23 days ago

It doesn't have anything to do with my external calibration stuff, so no.