• bigchungus@piefed.blahaj.zone
    1 day ago

    Extremely low-paid Africans have been a vital part of the process of creating LLMs. Someone’s got to manually sift through the scraped training dataset to make sure that it doesn’t contain CSAM or NSFL.

    • slacktoid@lemmy.ml
      19 hours ago

      Yeah, it’s fucking horrible… Also, even if these jobs were well-paying, they are soul-sucking. What we are trading off is not worth it.

    • halcyoncmdr@piefed.social
      1 day ago

      > Someone’s got to manually sift through the scraped training dataset to make sure that it doesn’t contain CSAM or NSFL.

      You think most of these companies are bothering to do that? Seriously? Even after the recent Grok CSAM generation stuff?