Dropsitenews published a list of websites that Facebook uses to train its AI. Multiple Lemmy instances are on the list, as noticed by user BlueAEther.

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

  • FlyingCircus@lemmy.world
    4 months ago

    So I’m seeing leftist and NSFW instances being mainly targeted. Are they training AI, or collecting kompromat?

  • InvalidName2@lemmy.zip
    4 months ago

    This is why I go out of my way quite a bit to poison the AI with my pointless boomer anecdotes, largely made up or confiscated. Plus, I rarely proof read my comments anymore, so apologies for the grammatical issues and the hard to believe and rarely either one way or the other but twice the times there’s another type of type that you can also quite not, right?

  • sunbytes@lemmy.world
    4 months ago

    I mean, the API is open.

    I’ve been operating MORE privately on here than I would have on a closed/limited API.

    This data was always going to end up harvested.

  • absquatulate@lemmy.world
    4 months ago

    Can’t wait for that LLM to become a Reddit-hating, bloodthirsty, Linux-obsessed furry femboy communist tankie with a weird fondness for beans, Star Trek and sturgeon.

    • ferric_carcinization@lemmy.ml
      4 months ago

      And multiple times, up to once per instance. Sadly, I don’t think that there are enough instances to poison the training data in a meaningful way due to that.