• chrash0@lemmy.world
    2 days ago

    it’s kind of frustrating to have to keep explaining to people how these models work, mostly because of how intensely oversold they are.

    on the one hand you have people who think it’s literally just a normal computer program doing database lookups with conditional logic and decision trees plus some sort of hand-wavy magic. it’s not.

    on the other hand you have people who think it’s a literal brain that can stub its toe and change the way it walks thereafter. it won’t.

    every attempt at “agent memory” or whatever has thus far been desperate bullshit. i don’t care how many markdown files and vector databases and prompt engineering hacks you implement; you’ll never change the fact that these models have limited context and frozen weights. reading a markdown file or querying a database is not “remembering”.
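
    a toy sketch of the point above, not how any real model or agent framework is implemented. `FrozenModel`, `call_with_memory`, and the word-counted `CONTEXT_LIMIT` are all made-up names for illustration. the "memory" is just text pasted back into the prompt on every call; the weights never change, and whatever doesn't fit in the window is gone.

```python
from dataclasses import dataclass

# Hypothetical context window, counted in whitespace-separated words
# (real models count tokens, and the limit is far larger).
CONTEXT_LIMIT = 12


@dataclass(frozen=True)
class FrozenModel:
    """Toy stand-in for an LLM: weights are set once and never updated."""
    weights: tuple

    def generate(self, prompt: str) -> str:
        # Only the last CONTEXT_LIMIT words are visible; earlier ones are dropped.
        visible = prompt.split()[-CONTEXT_LIMIT:]
        return f"saw {len(visible)} words: {' '.join(visible)}"


def call_with_memory(model: FrozenModel, notes: list, user_message: str) -> str:
    # "Memory" is nothing more than notes prepended to the new message.
    prompt = "\n".join(notes + [user_message])
    return model.generate(prompt)


model = FrozenModel(weights=(0.1, 0.2))

# With the note pasted in, the model "knows" the preference.
print(call_with_memory(model, ["user prefers tabs"], "format this file"))

# Without it, the same model has no trace of that preference.
print(model.generate("format this file"))

# Enough other notes push the preference out of the window entirely.
long_notes = ["user prefers tabs"] + ["filler"] * 20
print(call_with_memory(model, long_notes, "format this file"))
```

    nothing in `model` differs between the three calls; the only thing that changed is what text got stuffed into the prompt.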

    • DishaweslemOride@lemmy.org
      2 days ago

      I find it’s easier to just assume I’m always talking to a fresh agent. It’s annoying to have to be repetitive, but that way has given me the best results.

      It’s just like working with a junior developer… except they never learn anything and it’s always a different person every day. :(