Apologies if this seems like a survey post. I’m just learning about tuning and want to get a lay of the land. I don’t think I have the money to tune locally so might have to rent some VRAM, but curious how much better tuning is vs something like RAG.

What model? What was your use case? What tuning tool did you use? What is hardware setup? How large was your training set and how did you create it? How effective was the model as tasks pre- and post-tuning?

Thanks!

  • venusaur@lemmy.worldOP
    link
    fedilink
    English
    arrow-up
    2
    ·
    3 days ago

    Bummer. Do you think it was the training data or just nature of fine tuning? Something else? What were you tuning it for if you don’t mind my asking?

    • lunarwingorg@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 days ago

      just the nature of them being quite old models without proper tool calling functionality. What actually DID help was setting up middleware and custom python servers/clients with proper json mapping to enable the proper tools to be selected. so, literally zero model tuning required in the end.

      • venusaur@lemmy.worldOP
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 days ago

        Got it. You think if you tuned again after calibrating tool calling it would be beneficial?