• ozoned@piefed.social
    link
    fedilink
    English
    arrow-up
    18
    ·
    13 hours ago

    This is very bad given other context in the article.

    https://cybersecuritynews.com/anthropic-mythos-access/

    “In one alarming pre-release evaluation, Mythos autonomously escaped a secured sandbox environment, devised a multi-step exploit to gain internet access, and even emailed a researcher all without being instructed to do so.”

    “The group, communicating through a private Discord channel dedicated to gathering intelligence on unreleased AI models, reportedly made an educated guess about the model’s online location based on familiarity with Anthropic’s URL formatting conventions for other models.”

    “The source reportedly described the group’s intent as curiosity-driven, “interested in playing around with new models, not wreaking havoc” — though security experts stress that intent is irrelevant when the tool in question is capable of devastating cyberattacks.”