Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’

Experts are starting to doubt it, and even OpenAI CEO Sam Altman is a bit stumped.

  • dudeami0@lemmy.dudeami.win · 10 points · 2 years ago

    Disclaimer: I am not an AI researcher and just have an interest in AI. Everything I say is probably gibberish, just my amateur understanding of the AI models used today.

    It seems these LLMs use a clever trick of probability to give words meaning via statistics on their usage, so any result is just a statistical chance that those words will work well with each other. The number of indexes used to index “tokens” (in this case words), along with the number of layers in the model used to correlate usage of those tokens, seems to drastically increase the “intelligence” of the responses. This doesn’t seem able to overcome unknown circumstances; it does what AI does and relies on probability to answer the question, so in those cases the next closest thing from the training data is substituted and considered “good enough”. I think some kind of confidence variable is what current LLMs truly need: they seem capable of giving meaningful responses, but give a “hallucinated” response when not enough data is available to answer the question.

    Overall, I would guess this is a limitation in the LLM’s ability to map words to meaning. Imagine reading everything ever written; you’d probably be able to give intelligent responses to most questions. Now imagine you were asked about something you had never read, but were still expected to answer. That, I personally feel, is what these “hallucinations” are: the LLM’s best approximation. You can only answer what you know reliably; otherwise you are just guessing.
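    The “confidence variable” idea above can be sketched in a few lines. This is a toy illustration with made-up logits and a hypothetical cutoff, not how any real LLM decides to answer:

```python
import math

# Toy next-token scores (logits) a model might assign to candidate
# words after the prompt "The sky is". Numbers are invented.
logits = {"blue": 4.0, "clear": 2.5, "falling": 0.5}

# Softmax turns raw scores into probabilities that sum to 1.
total = sum(math.exp(v) for v in logits.values())
probs = {tok: math.exp(v) / total for tok, v in logits.items()}

# A crude "confidence variable": only answer when the top token
# clearly beats the alternatives, otherwise admit uncertainty.
best = max(probs, key=probs.get)
CONFIDENCE_THRESHOLD = 0.5  # hypothetical cutoff
answer = best if probs[best] >= CONFIDENCE_THRESHOLD else "(not sure)"
```

    Here “blue” wins with high probability, so the sketch answers; with a flatter distribution it would fall back to “(not sure)” instead of guessing.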

    • drem@lemmy.world · 7 points · 2 years ago

      I have experience creating supervised learning networks (not large language models). I don’t know what tokens are; I assume they are output nodes. In that case, I don’t think increasing the number of output nodes makes the AI a lot more intelligent. You could measure confidence with the output nodes if they are designed accordingly (one node corresponds to one word, and confidence can be measured from the output strength). AIs are popular because they can overcome unknown circumstances (in most cases), like when you phrase a question a slightly different way.

      I agree with you that AI has a problem understanding the meaning of words. The AI’s correct answers happened to be correct because the order of the output words happened to match the order of the correct answer’s words. I think “hallucinations” happen when there is no sufficient answer to the given problem, so the AI gives an answer from a few random contexts pieced together in the most likely order. I think you have a mostly good understanding of how AIs work.

      • Scubus@sh.itjust.works · 1 point · 2 years ago

        You seem like you are familiar with back-propagation. From my understanding, tokens are basically just bits of information that are each assigned a predicted fitness, and the token with the highest fitness is then used for back-propagation.

        ELI5: I’m making a recipe. At step 1, I decide on a base ingredient. At step 2, based on my starting ingredient, I speculate what would go well with it. Step 3 is to add that ingredient. Step 4 is to start over at step 2. Each “step” here would be a token.
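        The recipe analogy maps neatly onto a loop. This sketch uses a made-up pairing table in place of a real model; each pass through the loop plays the role of producing one token:

```python
# Made-up pairing table standing in for a model's "what goes well
# with this?" judgment.
PAIRINGS = {
    "pasta": "garlic",
    "garlic": "olive oil",
    "olive oil": "basil",
}

def next_ingredient(so_far):
    # Step 2: speculate what goes well with the latest ingredient.
    return PAIRINGS.get(so_far[-1])

recipe = ["pasta"]           # Step 1: decide on a base ingredient.
while len(recipe) < 4:
    nxt = next_ingredient(recipe)
    if nxt is None:
        break
    recipe.append(nxt)       # Step 3: add it; Step 4: loop back to step 2.
```

        Each appended ingredient depends only on what came before it, which is the same shape as token-by-token generation.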

        I am also not a professional, but I do a lot of hobby work that involves coding AIs. So if I am incorrect or phrased that poorly, feel free to correct me.

        • drem@lemmy.world · 2 points · 2 years ago

          I did manage to write a back-propagation algorithm, though at this point I don’t fully understand the math behind it. Generally, back-propagation algorithms take the activation and calculate the delta(?) from the activation and the target output (only on the last layer). I don’t know where tokens come in; from your comment it sounds like they have something to do with unsupervised learning networks. I am also not a professional, so sorry if I didn’t really understand your comment.
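          For reference, the output-layer delta being described looks something like this for a single sigmoid unit with squared error. All the numbers are illustrative, and real networks do this over vectors of weights:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

activation = sigmoid(0.8)   # what the output node actually produced
target = 1.0                # what it should have produced

# Output-layer delta: error times the derivative of the sigmoid,
# where sigmoid'(x) = sigmoid(x) * (1 - sigmoid(x)).
delta = (activation - target) * activation * (1.0 - activation)

# Each incoming weight then moves against the gradient.
learning_rate = 0.1
input_value = 0.5           # activation feeding into this node
weight_update = -learning_rate * delta * input_value
```

          Since the activation falls short of the target here, the delta is negative and the weight update is positive, nudging the node toward firing more strongly next time.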

    • kromem@lemmy.world · 3 points · 2 years ago

      This is a common misconception that I’ve even seen from people who have a background in ML but just haven’t been keeping up to date on the emerging research over the past year.

      If you’re interested in the topic, this article from a joint MIT/Harvard team of researchers on their work looking at what a toy model of GPT would end up understanding in its neural network might be up your alley.

      The TLDR is that it increasingly seems like, when you reach a certain complexity of network, the emergent version that best predicts text isn’t simply mapping some sort of frequency table, but is actually performing more abstract specialization in line with whatever generated the original training materials in the first place.

      So while yes, it trains on being the best to predict text, that doesn’t mean the thing that best does that can only predict text.

      You, homo sapiens, were effectively trained across many rounds of “don’t die and reproduce.” And while you may be very good at doing that, you picked up a lot of other skills along the way as complexity increased which helped accomplish that result, like central air conditioning and Netflix to chill with.

    • BehindTheBarrier@lemmy.world · 3 points · 2 years ago

      Also not a researcher, but I believe hallucinations are simply an artifact of being able to generate responses that aren’t pure reproductions of the training data, aka the generalization we want. The problem is that we have something that generalizes without the ability to judge what it comes up with.

      It will, in my opinion, never go away, but I’m sure it can be improved significantly.