• Ech
    link
    fedilink
    English
    1011 months ago

    I think I saw something about it that. It was an extended prisoner’s dilemma game, right? I wouldn’t say that’s directly applicable to every gaming genre.

    • @[email protected]
      link
      fedilink
      English
      611 months ago

      Without being in the room, we can only go off what the article lays out. These are wargaming scenarios though, so escalation is a very real concern. If both sides are running these models to provide recommendations and both are pushing for greater conflict, you find yourself in a prisoner’s dilemma real quick.

      • @[email protected]
        link
        fedilink
        English
        411 months ago

        These aren’t simulations that are estimating results, they’re language models that are extrapolating off a ton of human knowledge embedded as artifacts into text. It’s not necessarily going to pick the best long term solution.

        • @[email protected]
          link
          fedilink
          English
          311 months ago

          Language models can extrapolate but they can also reason (by extrapolating human reasoning).

          • @[email protected]
            link
            fedilink
            English
            411 months ago

            I want to be careful about how the word reasoning is used because when it comes to AI there’s a lot of nuance. LLMs can recall text that has reasoning in it as an artifact of human knowledge stored into that text. It’s a subtle but important distinction that’s important for how we deploy LLMs.

      • Ech
        link
        fedilink
        English
        211 months ago

        The models used by the writers of the article and those used by the military are going to be radically different.

        • @[email protected]
          link
          fedilink
          English
          1
          edit-2
          11 months ago

          The writers of the article are reporting on use of these models by the military. They aren’t using the models. If I remember right they called out some models developed by one of the defense contractors like palantir

          • Ech
            link
            fedilink
            English
            411 months ago

            The researchers tested LLMs such as OpenAI’s GPT-3.5 and GPT-4, Anthropic’s Claude 2 and Meta’s Llama 2

            All these AIs are supported by Palantir’s commercial AI platform – though not necessarily part of Palantir’s US military partnership

            Also, they’re reporting on a Stanford study of how these platforms could be used militaristically, not the military’s actual use of them.

            • @[email protected]
              link
              fedilink
              English
              2
              edit-2
              11 months ago

              You’re right. I was focused on this part above. I made like an AI and jumped the gun

              These results come at a time when the US military has been testing such chatbots based on a type of AI called a large language model (LLM) to assist with military planning during simulated conflicts, enlisting the expertise of companies such as Palantir and Scale AI. Palantir declined to comment and Scale AI did not respond to requests for comment.

      • @[email protected]
        link
        fedilink
        English
        111 months ago

        The way you said that tells me you don’t know what a prisoner’s dilemma is. It’s not “a situation where both sides have escalated”.

        • @[email protected]
          link
          fedilink
          English
          3
          edit-2
          11 months ago

          I’m not sure where our disconnect is. We have a situation where both sides can cooperate, one side can defect, or both sides can defect. Call it whatever you want, it’s the same scenario.

          Here it’s with planning for military force. Do you risk a nuclear strike to save yourself from one? If you can get a first strike (defect), then you win. If you both refrain (cooperate), then you stay alive. If you both attempt a first strike (defect), you all lose.

          Change the words around and it’s the same.

          Both suspects don’t tell (cooperate), both get minimum or no jail time. One tells on the other (defects), that one gets off but the other gets maximum. Both tell on each other (defect), both get some jail time.