@[email protected]M to

[email protected] • 10 months ago

ChatGPT provides false information about people, and OpenAI can’t correct it

321

ChatGPT provides false information about people, and OpenAI can’t correct it

@[email protected]M to

[email protected] • 10 months ago

noyb today filed a complaint against the ChatGPT maker OpenAI with the Austrian DPA

It’s clear that companies are currently unable to make chatbots like ChatGPT comply with EU law, when processing data about individuals. If a system cannot produce accurate and transparent results, it cannot be used to generate data about individuals. The technology has to follow the legal requirements, not the other way around.

Chat

@[email protected]
link
fedilink
English
42•10 months ago

They are capable of accessing and recalling the contents of that information repository, and relaying information from that repository to an end user.

This is not correct based on my understanding of LLMs, but I am certainly not an expert. As I understand it, it’s basically a statistics exercise in how they determine what order to put words into. They don’t ‘look stuff up’ in their training data. They probably don’t even have access to their training data once the model is complete. These models are trained on terabytes of data but are small enough to fit in memory, so it’s impossible for them to still have access to all that. But it wouldn’t matter if they did, because that’s not how they work.
- @[email protected]
  link
  fedilink
  7•10 months ago
  LLMs do not look stuff up (except when they have an API that allows them to), but I think OP’s point still stands. The statistical next token predictor metaphor is useful , but in many regards that’s what text and language are. If you can understand that certain words are linked to certain other words, then you should be able to appreciate that certain groups of words can be associated in a way that is functionally the same as data.
  
  I have not memorized the pytorch documentation, but I can use what I understand about pytorch and other libraries to infer specific aspects of the library that I am not familiar with. Functionally, this is no different than if I accessed the documentation directly. If I communicate this information to others I have functioned as a data repository. The repository works on a more abstract and error-prone level, but it works nonetheless.
  
  Here is another very concrete example: LLMs know George Washington’s birthday. Not because they look up that information, but because of the learned associations between George Washington, birthday, and his actual date of birth.
  - @[email protected]
    link
    fedilink
    English
    12•10 months ago
    
    I can use what I understand about pytorch and other libraries to infer specific aspects of the library that I am not familiar with.
    
    This is what LLM’s can’t do though. They can’t use what they understand because they don’t understand anything. They can’t infer, they can’t reason, they can’t evaluate or compare. They can spit out words that make it look like they did those things, but they didn’t.
    - @[email protected]
      link
      fedilink
      2•
      edit-2
      10 months ago
      Here I think you are behind on the literature. LLMs can infer and reason, and there are whole series of papers that evaluate LLMs for these properties the exact same way we evaluate humans. So if you can’t trust the metrics, then you cannot even assert that humans can reason and infer and understand.
      - @[email protected]
        link
        fedilink
        English
        6•10 months ago
        https://arxiv.org/html/2403.04121v1
        
        Good read from a group of computer scientists at Arizona State. Their conclusions are the same as mine but they illustrate the problems better than I ever could.
        
        @[email protected]
        link
        fedilink
        1•10 months ago
        You linked a paper on planning in LLMs. Planning is largely in the domain of reinforcement learning. The paper you linked conflates reasoning with planning, alongside the obviously biased prose, so the author really doesn’t seem credible. I prefer nuanced and careful evaluations such as: https://www.sciencedirect.com/science/article/pii/S2949719123000298
        
        @[email protected]
        link
        fedilink
        2•10 months ago
        Without commenting on the content of the paper,
        
        so the author really doesn’t seem credible. I prefer nuanced and careful evaluations
        
        Novelis
        
        https://novelis.io/
        
        Innovation in action: Optimizing Business Efficiency
        
        Our mission is to revolutionize business efficiency through Innovation. Leveraging AI, automation and process intelligence, we transform operations, supercharge performance, and unlock growth for businesses worldwide
        
        Hm. 🤔
        
        @[email protected]
        link
        fedilink
        1•9 months ago
        Notice that there are methods, data, and peer reviews that I can freely scrutinize. All things your opinion piece lacks.
- @[email protected]OPM
  link
  fedilink
  7•10 months ago
  To me that sounds like a distinction without a difference. A jpeg is not an image, but a set of data that can be algorithmically processed and rendered as an image - which is why it can fit in a smaller space than a bmp. Despite the technical differences, a jpg and a bmp are legally equivalent. If something is illegal in a bmp, it’s also illegal in a jpg. The same laws apply to EVs and gas vehicles. The same laws apply to vinyl records and cassette tapes. The law does not care about the mechanism.*
  
  *for the most part
  - @[email protected]
    link
    fedilink
    English
    32•10 months ago
    You’re illustrating the issue so many people have with this technology. Without a fundamental understanding of how it works, people will attempt to use it in ways it shouldn’t be used, and won’t understand why it isn’t giving them correct information. It simply doesn’t have the ability to do anything but put words in an order that statistically will resemble how a human might answer the question.
    
    LLMs don’t know anything. They can’t tell fact from fiction (and are incapable of even trying), and don’t understand concepts such as verifying info when requested. That’s the problem, they don’t ‘understand’ anything, including what they are telling you. But they do spit out words in a statistically probable order, even if the result is complete bullshit. They do it so well that they can fool most people into thinking the computer actually knows what it’s telling you.
  - @[email protected]
    link
    fedilink
    24•10 months ago
    Here’s a better metaphor because yours completely misses the mark when it comes to the difference between an LLM and an actual encyclopedia.
    
    A painter will spend years honing his craft by studying other paintings as well as photos and real life. If you ask him to paint you a house from memory and try to build it with what he gives you, that just makes you an idiot, it doesn’t make him a bad architect.
    
    Chatgpt is not an encyclopedia and any thing it says that is remotely important to your personal or work life should be verified. They explicitly tell you it can and will give false responses.
  - @[email protected]
    link
    fedilink
    English
    18•10 months ago
    LLM isn’t a compilation of its training data, anymore than a cake is a pile of eggs, flower and sugar.
  - @[email protected]
    link
    fedilink
    12•10 months ago
    Jpg is a lossy compression algorithm. Statistical probability of words occuring in sequence is not compression. That’s like saying generative images are compression, they aren’t. It’s not producing blurry matches of images, it’s producing something “novel”. Otherwise, that would be considered over fitting the data.

[email protected]

[email protected]

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Welcome to the News community!

Rules:

1. Be civil

Attack the argument, not the person. No racism/sexism/bigotry. Good faith argumentation only. This includes accusing another user of being a bot or paid actor. Trolling is uncivil and is grounds for removal and/or a community ban. Do not respond to rule-breaking content; report it and move on.

2. All posts should contain a source (url) that is as reliable and unbiased as possible and must only contain one link.

Obvious right or left wing sources will be removed at the mods discretion. Supporting links can be added in comments or posted seperately but not to the post body.

3. No bots, spam or self-promotion.

Only approved bots, which follow the guidelines for bots set by the instance, are allowed.

4. Post titles should be the same as the article used as source.

Posts which titles don’t match the source won’t be removed, but the autoMod will notify you, and if your title misrepresents the original article, the post will be deleted. If the site changed their headline, the bot might still contact you, just ignore it, we won’t delete your post.

5. Only recent news is allowed.

Posts must be news from the most recent 30 days.

6. All posts must be news articles.

No opinion pieces, Listicles, editorials or celebrity gossip is allowed. All posts will be judged on a case-by-case basis.

7. No duplicate posts.

If a source you used was already posted by someone else, the autoMod will leave a message. Please remove your post if the autoMod is correct. If the post that matches your post is very old, we refer you to rule 5.

8. Misinformation is prohibited.

Misinformation / propaganda is strictly prohibited. Any comment or post containing or linking to misinformation will be removed. If you feel that your post has been removed in error, credible sources must be provided.

9. No link shorteners.

The auto mod will contact you if a link shortener is detected, please delete your post if they are right.

10. Don't copy entire article in your post body

For copyright reasons, you are not allowed to copy an entire article into your post body. This is an instance wide rule, that is strictly enforced in this community.

4.05K users / day
8.52K users / week
15K users / month
26.9K users / 6 months
25.2K subscribers
17.6K Posts
410K Comments
Modlog