• halcyoncmdr@piefed.social
    link
    fedilink
    English
    arrow-up
    2
    ·
    13 hours ago

    Someone’s got to manually sift through the scraped training dataset to make sure that it doesn’t contain CSAM or NSFL.

    You think most of these companies are bothering to do that? Seriously? Even after the recent Grok CSAM generation stuff?