Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

  • Empricorn@feddit.nl
    link
    fedilink
    English
    arrow-up
    25
    ·
    24 days ago

    Uh… Are you saying simply using social media is endorsing stealing personal information to train LLMs? Because that’s a wild take, if so. Personally, I feel like there’s no stopping them, so what am I supposed to do, stay silent? Not engage with anything, even anonymously?

    • FaceDeer@fedia.io
      link
      fedilink
      arrow-up
      5
      ·
      24 days ago

      Uh… Are you saying simply using social media is endorsing stealing personal information to train LLMs?

      Of course not. No “stealing” is happening. People are posting content on an open protocol that permits anyone to read it. Exactly as intended.

      Personally, I feel like there’s no stopping them, so what am I supposed to do, stay silent?

      If you do not want to be heard then yes, I suppose you could stay silent. That would indeed accomplish that.

      You could also find a social media platform whose content is locked behind a walled garden of some sort that makes it more difficult for your posts to be seen by the public. But that’s antithetical to how the Fediverse works, you want someplace very different from here if that’s how you want to approach this.

      Basically, you are on a platform that’s specifically designed to broadcast your comments far and wide without restriction, and then you’re getting upset that someone you didn’t want to hear your comments is hearing your comments. I’m not sure what you expected.

      • Feyd@programming.dev
        link
        fedilink
        arrow-up
        6
        ·
        24 days ago

        Participating in a public forum that has no technical way of preventing data from being used by a particular class of actor does not preclude having an opinion that a particular class of actor should have rules about what data they are allowed to use.

        • FaceDeer@fedia.io
          link
          fedilink
          arrow-up
          2
          ·
          24 days ago

          People can have whatever opinions they want to have. In this case that opinion flies in the face of obvious reality and I’m pointing that out.

          It’s like trying to drive your car across the Atlantic ocean and then griping about how the car failed to stay above the water because you really thought it should be able to handle that.

          • Feyd@programming.dev
            link
            fedilink
            arrow-up
            4
            ·
            24 days ago

            It doesn’t matter how many pithy analogies you make. You need to recognize the difference between “I know they’re scraping this website because they can” and “I don’t think they should be allowed to scrape this website”. You’re arguing that they’re incompatible when they’re not.

            • FaceDeer@fedia.io
              link
              fedilink
              arrow-up
              2
              ·
              24 days ago

              As I said, people can have whatever opinion they want. Reality is under no obligation to respect those opinions.

              Analogies are merely explanatory.

              • Feyd@programming.dev
                link
                fedilink
                arrow-up
                3
                ·
                24 days ago

                If you understand, then you should be able to understand that your “they were dressed like they wanted it” level argument bullshit is completely unnecessary.

                • FaceDeer@fedia.io
                  link
                  fedilink
                  arrow-up
                  2
                  ·
                  23 days ago

                  Ah, the “people who disagree with me are supporting rape” argument, how classy.

                  It’s not that they’re “dressed like they wanted it.” The ActivityPub protocol explicitly and deliberately does this. If you post a comment on a Fediverse community then by design that comment is going to be broadcast to every instance with a subscription and displayed in public to anyone who wants to see it. That’s what the protocol is for. There should be no misunderstanding or misinterpretation here.

                  • Feyd@programming.dev
                    link
                    fedilink
                    arrow-up
                    1
                    ·
                    edit-2
                    23 days ago

                    Ok? That doesn’t mean that everyone has to agree that AI companies should be allowed to train on the data. Are you seriously so dense you can’t distinguish between technology and social issues?

                    Ps: I very obviously didn’t say you support rape, but drew the very obvious comparison to what you’re saying. Use your head for 2 seconds.