• @[email protected]
    16 months ago

    Even at temperature 0 the model will not be deterministic, because the output depends on the seed used as well as on things like numerical noise.
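
To make "numerical noise" concrete, here is a minimal sketch of one thing it could mean: floating-point addition is not associative, so summing the same terms in a different order can give a slightly different result. The scores and the rival value are made up for illustration, and whether this effect actually shows up in any particular hosted model is an assumption, not something this toy demonstrates.

```python
# Floating-point addition is not associative: the grouping of the
# additions changes the rounded result.
terms = [0.1, 0.2, 0.3]
score_forward = (terms[0] + terms[1]) + terms[2]   # summed left to right
score_backward = terms[0] + (terms[1] + terms[2])  # summed right to left

orders_agree = score_forward == score_backward

# Hypothetical near-tie: token "A" has the score computed above,
# token "B" has a fixed rival score. Greedy picking takes the larger one.
rival = 0.6
pick_forward = "A" if score_forward > rival else "B"
pick_backward = "A" if score_backward > rival else "B"

print(orders_agree, pick_forward, pick_backward)
```

If two candidate tokens are close enough, a difference in the last bit is enough to flip a greedy choice, which is the only point this sketch tries to make.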

    • Turun
      6 months ago

      Yeah no, that’s not how this works.

      Where in the process does that seed play a role, and what do you even mean by numerical noise?

      Edit: I feel like I should add that I am very interested in learning more. If you can provide me with any sources showing that GPTs are inherently random, I am happy to eat my own hat.

        • Turun
          16 months ago

          I appreciate the constructive comment.

          Unfortunately the API docs are incomplete (insert Obi-Wan meme here). The seed value is both optional and irrelevant when the temperature is set to 0. I just tested it.
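
A toy sketch of why the seed would not matter at temperature 0, assuming temperature 0 is implemented as greedy (argmax) decoding. `sample_token` and the logits are hypothetical and are not any vendor's API; this is not the test described above, just an illustration of the idea.

```python
import math
import random

def sample_token(logits, temperature, seed=None):
    """Toy next-token picker (hypothetical, for illustration only)."""
    if temperature == 0:
        # Greedy decoding: take the argmax, no random draw is made,
        # so the seed is never consulted.
        return max(range(len(logits)), key=lambda i: logits[i])
    rng = random.Random(seed)
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]

logits = [1.0, 2.5, 0.3, 2.4]  # made-up scores for four tokens

# Temperature 0: the same token comes back for every seed.
greedy_picks = {sample_token(logits, 0, seed=s) for s in range(100)}

# Temperature 1 with a fixed seed: the draw is random but repeatable.
seeded_picks = [sample_token(logits, 1.0, seed=42) for _ in range(5)]

print(greedy_picks, seeded_picks)
```

In this toy either setting alone gives repeatable output: temperature 0 removes the random draw, and a fixed seed makes the draw repeatable.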

        • Turun
          16 months ago

          Addendum:

          The docs say

          For reproducible outputs, set temperature to 0 and seed to a number:

          But what they should say is

          For reproducible outputs, set temperature to 0 or seed to a number:

          Easy mistake to make