cross-posted from: https://lemmy.ca/post/37011397

[email protected]

The popular open-source VLC video player was demonstrated on the floor of CES 2025 with automatic AI subtitling and translation, generated locally and offline in real time. Parent organization VideoLAN shared a video on Tuesday in which president Jean-Baptiste Kempf shows off the new feature, which uses open-source AI models to generate subtitles for videos in several languages.

  • @[email protected]
    link
    fedilink
    English
    31 day ago

    It is not the lack of training material that is the issue, it doesn’t understand context and cultural references. Someone commented here that crunchyroll AI subtitles translated Asura Hall a name to asshole.

    • @[email protected]
      link
      fedilink
      English
      121 hours ago

      It would be able to behave like it understands context and cultural references it if it had the appropriate training data, no problem.

          • @[email protected]
            link
            fedilink
            English
            12 hours ago

            You said that with training data it will be able to understand. I mean that even with training data it will take years and it also has other problems like hallucinations. I admit, I didn’t word it correctly.

            • @[email protected]
              link
              fedilink
              English
              12 hours ago

              *would, not will.

              It is not know if the needed training data will ever even exist. But if it did, training an AI with that data would result in great, cultural subtitle generation.

              • @[email protected]
                link
                fedilink
                English
                12 hours ago

                Are you sure it is would? In the sentence you are referring to the AI understanding culture from language which is future tense.

                • @[email protected]
                  link
                  fedilink
                  English
                  147 minutes ago

                  Will is future tense in a way that it is definitely gonna happen. Would just means there is the possibility.

                  And yes, I am sure, that one could brute force a solution with having enough computing power and learning data. If it would make sense (ethical and sustainably wise) is a whole other question.

                  I am sure it can, because LLM are statistically systems as humans are as well for a great factor (just not as strict as a machine). If you have enough data with action and response to such cultural traditions, there is nothing that would suggest that a LLM would fail to replicate that.