Have you tuned a model?

venusaur@lemmy.world · 3 months ago

Have you tuned a model?

lunarwingorg@lemmy.world · 1 month ago

Personally, I’ve adjusted dozens of sampler values and written middleware, llama-server scripts, configuration-loading mechanisms, OpenAI API-compatible HTTP proxies, and even a Python 3 API for accessing context information and switching models on the fly. I’ve even created a benchmark script for local model performance.

But apart from running scripts that others wrote to tune a model with specific input parameters, not really. Honestly, I have a lot to learn.

venusaur@lemmy.world · 1 month ago

I’ve heard a lot about LoRA and tools like Unsloth. Have you used anything like that?

lunarwingorg@lemmy.world · 1 month ago

LoRA, yes. Mostly custom scripts downloaded from Hugging Face that automatically handle a lot of the complicated stuff. To be honest, I’m not totally sure how it actually works under the hood.

venusaur@lemmy.world · edit-2 1 month ago

Did you notice a big improvement in the tasks you were tuning for? What is your hardware setup and how long did it take?

Thanks!

lunarwingorg@lemmy.world · 1 month ago

Unfortunately, I did not notice much of a difference from model tuning, and it took a pretty decent chunk of time. My most powerful PC, which is where I run most models, is a fairly powerful machine with a single 4090 (the lower-end machines with worse GPUs run embedded text models). I have had better luck just downloading differently tuned variants of the same model from others.

venusaur@lemmy.world · 1 month ago

Bummer. Do you think it was the training data or just the nature of fine-tuning? Something else? What were you tuning it for, if you don’t mind my asking?

lunarwingorg@lemmy.world · 1 month ago

Just the nature of them being quite old models without proper tool-calling functionality. What actually DID help was setting up middleware and custom Python servers/clients with proper JSON mapping so that the right tools get selected. So, literally zero model tuning required in the end.
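
For anyone wondering what that kind of JSON-mapping middleware could look like, here is a rough sketch. It is not the setup described above; the tool names (`get_time`, `add`), the `{"tool": ..., "arguments": ...}` shape, and the helper functions are all hypothetical and only meant to show the idea: pull a JSON object out of the model's free-form text, check it against a known tool registry, and call the matching Python function.

```python
import json
import re
from typing import Any, Callable, Dict, Optional


# Hypothetical tools; names and behavior are illustrative assumptions.
def get_time(timezone: str = "UTC") -> str:
    return f"time requested for {timezone}"


def add(a: float, b: float) -> float:
    return a + b


# Registry mapping the tool name the model emits to a Python callable.
TOOLS: Dict[str, Callable[..., Any]] = {"get_time": get_time, "add": add}


def extract_tool_call(model_output: str) -> Optional[dict]:
    """Return the first JSON object in the text that names a known tool."""
    decoder = json.JSONDecoder()
    # Try decoding a JSON value at every "{" so surrounding chatter is ignored.
    for match in re.finditer(r"\{", model_output):
        try:
            obj, _ = decoder.raw_decode(model_output, match.start())
        except json.JSONDecodeError:
            continue
        if not isinstance(obj, dict):
            continue
        name = obj.get("tool")
        args = obj.get("arguments", {})
        if isinstance(name, str) and name in TOOLS and isinstance(args, dict):
            return {"tool": name, "arguments": args}
    return None


def dispatch(model_output: str) -> Any:
    """Run the tool the model asked for, or return None if no valid call is found."""
    call = extract_tool_call(model_output)
    if call is None:
        return None
    return TOOLS[call["tool"]](**call["arguments"])
```

Usage would be something like `dispatch('Sure! {"tool": "add", "arguments": {"a": 2, "b": 3}}')`, which calls `add(a=2, b=3)`. Older models without native tool calling tend to wrap the JSON in extra text, which is why the sketch scans for an object instead of parsing the whole reply.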

venusaur@lemmy.world · 30 days ago

Got it. You think if you tuned again after calibrating tool calling it would be beneficial?

lunarwingorg@lemmy.world · 23 days ago

It doesn’t have anything to do with my external calibration stuff, so no.