To add to this, the AI is trained (for the most part) on images paired with descriptions. Since most descriptions focus on the main concepts, they generally don’t include the actual text that appears in the image. Without the text being included in the descriptions, the AI has a hard time learning the meaning of the squiggles in the images. In addition, those squiggles can represent a lot of different things, so even if it grows to “understand” letters, it’s really hard for it to “understand” their meaning, which leads to a lot of weird words/text.
It’s pretty fun to look at how they almost get it right in some cases. For example, if you prompt “birthday” you might get some text that almost looks like “happy birthday” followed by a smudge that is supposed to be a name, but also probably some actually correct numbers, because those are much more predictable!