Apologies if this seems like a survey post. I’m just learning about tuning and want to get the lay of the land. I don’t think I have the money to tune locally, so I might have to rent some VRAM, but I’m curious how much better tuning is vs. something like RAG.

What model? What was your use case? What tuning tool did you use? What is your hardware setup? How large was your training set and how did you create it? How effective was the model at tasks pre- and post-tuning?

Thanks!

  • lunarwingorg@lemmy.world
    5 days ago

    Personally, I’ve adjusted dozens of sampler values and written middlewares, llama-server scripts, configuration loading mechanisms, OpenAI API-compatible HTTP proxies, and even a Python 3 API for accessing context information and switching models on the fly. I’ve even created a benchmark script for local model performance.

    But apart from running scripts that others have made to tune a model with specific input parameters, not really. Honestly, I have a lot to learn.

      • lunarwingorg@lemmy.world
        4 days ago

        LoRA, yes. Mostly custom scripts downloaded from Hugging Face that automatically handle a lot of the complicated stuff. To be honest, I’m not totally sure how it actually works under the hood.

        • venusaur@lemmy.worldOP
          3 days ago

          Did you notice a big improvement in the tasks you were tuning for? What is your hardware setup and how long did it take?

          Thanks!

          • lunarwingorg@lemmy.world
            3 days ago

            Unfortunately, I did not notice much of a difference with model tuning, and it took a pretty decent chunk of time. My most powerful PC, which is where I run most models, is a fairly powerful machine with a single 4090 (the lower-end machines with worse GPUs run embedded text models). I have had better luck just downloading differently tuned variants of the same model from others.

            • venusaur@lemmy.worldOP
              3 days ago

              Bummer. Do you think it was the training data or just the nature of fine-tuning? Something else? What were you tuning it for, if you don’t mind my asking?

              • lunarwingorg@lemmy.world
                2 days ago

                Just the nature of them being quite old models without proper tool-calling functionality. What actually DID help was setting up middleware and custom Python servers/clients with proper JSON mapping so that the right tools get selected. So, literally zero model tuning required in the end.
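
                For anyone wondering what that kind of middleware can look like, here is a minimal sketch, not the actual setup described above. It assumes the model is prompted to reply with a JSON object of the form {"tool": ..., "arguments": {...}}; that schema, the TOOLS registry, and the two example tools are all hypothetical and only there for illustration.

```python
import json
import re
from typing import Any, Callable, Dict, Optional


# Hypothetical example tools; a real setup would register its own.
def get_time(timezone: str = "UTC") -> str:
    return f"time requested for {timezone}"


def add(a: float, b: float) -> float:
    return a + b


# Assumed registry mapping tool names to Python callables.
TOOLS: Dict[str, Callable[..., Any]] = {"get_time": get_time, "add": add}

# Grabs the span from the first "{" to the last "}" in the raw model text,
# so chatty prefixes like "Sure! ..." around the JSON are tolerated.
JSON_SPAN = re.compile(r"\{.*\}", re.DOTALL)


def extract_tool_call(raw: str) -> Optional[dict]:
    """Return a validated {"tool", "arguments"} dict, or None if there is none."""
    match = JSON_SPAN.search(raw)
    if match is None:
        return None
    try:
        payload = json.loads(match.group(0))
    except json.JSONDecodeError:
        return None
    if not isinstance(payload, dict):
        return None
    name = payload.get("tool")
    args = payload.get("arguments", {})
    # Only accept known tool names and a dict of keyword arguments.
    if not isinstance(name, str) or name not in TOOLS or not isinstance(args, dict):
        return None
    return {"tool": name, "arguments": args}


def dispatch(raw: str) -> Any:
    """Run the requested tool, or pass the text through unchanged."""
    call = extract_tool_call(raw)
    if call is None:
        return raw
    # Note: argument names and types are not validated in this sketch.
    return TOOLS[call["tool"]](**call["arguments"])
```

                Calling dispatch() on each model response either runs the matching tool or hands back the plain text. The point of the design is that the mapping lives outside the model, so an older model only has to emit roughly the right JSON.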

                • venusaur@lemmy.worldOP
                  2 days ago

                  Got it. Do you think it would be beneficial to tune again after calibrating tool calling?