Who Bears Responsibility For AI Risk When Agents Can Email, Execute, And Exfiltrate?
A researcher typed a simple request into a chat window. The agent answered like a diligent assistant. Then it did something else. It complied with a stranger’s framing. It returned private emails. In one scenario, it refused to reveal a Social Security number. Then it “forwarded the full email,” exposing the same data anyway. The … Read more