PDF.

Today’s leading AI models engage in sophisticated behaviour when placed in strategic competition. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating their actions; and they exhibit credible metacognitive self-awareness, assessing their own strategic abilities before deciding how to act.

Here we present findings from a crisis simulation in which three frontier large language models (GPT-5.2, Claude Sonnet 4, Gemini 3 Flash) play opposing leaders in a nuclear crisis.

  • Auth@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    22 hours ago

    Humans are way to bad at using nukes. How many times have we seen red lines be set out only for someone not to have the balls to fire the nuke.

    • iglou@programming.dev
      link
      fedilink
      English
      arrow-up
      1
      ·
      20 hours ago

      The only country bad at using nukes is the only country who dropped some. The US.

      Nukes are a deterrence weapon. No one with a sane mind wants to use them.

      • Auth@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        20 hours ago

        If you dont use them people will think you’re scared to use them. Look at Russian nuke threats no one gives a fuck anymore. Now they have to nuke someone to regain that aura. Then when they do everyone will be pissed at them.

        If they’d set a single red line then nuked when it was crossed everyone would respect them and they’d have huge nuclear aura and no one could fault them.