• Nighed@feddit.uk
    English
    arrow-up
    5
    arrow-down
    1
    ·
    21 hours ago

    The almost endless Opus usage couldn’t last, to be fair. I assume those costs are closer to the real cost to provide the service, ouch!

    • Log in | Sign up@lemmy.world
      English
      arrow-up
      1
      ·
      9 hours ago

      I assume those costs are closer to the real cost to provide the service, ouch!

      Not even close. Most AI companies are running at something like a 95% loss, according to an article posted here a while ago. So if you’re an AI provider whose revenue covers only about 5% of costs, you’d need to multiply prices by roughly 20 just to break even, before you even begin to eat into your insane levels of debt.

    • Alex@lemmy.ml
      English
      arrow-up
      2
      arrow-down
      4
      ·
      20 hours ago

      On the potentially bright side, maybe this will make people think harder about which model to use for which task. You don’t need to feed your entire codebase into Opus when a Gemini Flash sub-agent can do a perfectly fine job of running grep and compiling a summary for the main agent.

      • boonhet@sopuli.xyz
        English
        arrow-up
        9
        ·
        20 hours ago

        If I have to think about that, I’d honestly rather just do the work myself. It’s less effort, and I trust the author more.