OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors’ billions. In its announcement blog, OpenAI says its Agent “can now do work for you using its own computer,” but CEO Sam Altman warns that the rollout presents unpredictable risks.

[…]

OpenAI research lead Lisa Fulford told Wired that she used Agent to order “a lot of cupcakes,” which took the tool about an hour, because she was very specific about the cupcakes.

    • wise_pancake@lemmy.ca
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      5 hours ago

      I use agents a lot and have written several MCP servers now, the tasks I automate aren’t things like order cupcakes, it’s mainly the glue between complex things.

      I still can’t get Claude to nicely open a JIRA ticket for me, but I can get it to read through a sequence of connected documents and filter that into.

      I don’t think agents are ready for the main event and these are some poor examples of their power.

      I’m not saying they won’t improve, but using the right tool for the right job is critical. An hour to order cupcakes is silly even for an llm.

      • Evotech@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        4 hours ago

        It’s examples for the common guy in the streets who don’t know what an mcp server is.