minus-squarebrianpeiris@lemmy.catoTechnology@lemmy.world•Claude-powered AI coding agent deletes entire company database in 9 seconds — backups zapped, after Cursor tool powered by Anthropic's Claude goes roguelinkfedilinkEnglisharrow-up4·20 hours agoProbably something like “Please bro!!! WHY DID YOU DO THIS ??!! 😭😭” linkfedilink
brianpeiris@lemmy.ca to Technology@lemmy.worldEnglish · edit-21 month agoAnnouncing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.plus-squarearcprize.orgexternal-linkmessage-square2linkfedilinkarrow-up10arrow-down10
arrow-up10arrow-down1external-linkAnnouncing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%.plus-squarearcprize.orgbrianpeiris@lemmy.ca to Technology@lemmy.worldEnglish · edit-21 month agomessage-square2linkfedilink
Probably something like “Please bro!!! WHY DID YOU DO THIS ??!! 😭😭”