Meta компани өсвөр насныхны дүрд хувирсан гэрээт ажилтнуудаараа дамжуулан өрсөлдөгч компаниудын чатботуудад ёс зүйгүй агуулга бүхий сорилтуудыг илгээх нууц хөтөлбөр хэрэгжүүлсэн нь илэрлээ.
“Cannes” хэмээх уг төслийн хүрээнд Meta-гийн гэрээт ажилчид OpenAI-ийн ChatGPT, Google-ийн Gemini болон Character.AI платформуудад 18-аас доош насныхны нэрээр нэвтэрч, амиа хорлолт, хар тамхи, хүчирхийлэл зэрэг сэдвээр 45,000 гаруй сануулга илгээжээ. Энэхүү үйл ажиллагааг хиймэл оюун ухааны аюулгүй байдлыг шалгах “benchmarking” буюу жишиг тогтоох ажил гэж тайлбарлаж байгаа ч өрсөлдөгч компаниудад энэ талаар огт мэдэгдээгүй байна.
Гэрээт ажилтнуудын цуглуулсан мэдээллийн санд амиа хорлох, идэвхгүй хооллолт, бэлгийн харилцаатай холбоотой олон мянган асуулт, зураг багтсан байна. Уг ажлыг гүйцэтгэсэн ажилтнууд ёс зүйгүй агуулгатай ажиллах нь сэтгэл зүйд хүндээр тусч буйг илэрхийлсэн бөгөөд Meta өмнө нь ч контент зохицуулагчдынхаа сэтгэцийн эрүүл мэндтэй холбоотой маргаанд өртөж байсан удаатай.
Humane Intelligence байгууллагын гүйцэтгэх захирал Румман Чоудуригийн үзэж буйгаар, хуурамч хаяг ашиглан өрсөлдөгчдийнхөө системийг системтэйгээр эвдэх оролдлого хийх нь салбарын стандарт гэж хэлэхэд хэцүү юм. Тэрээр энэ нь аюулгүй байдлын нэрээр өрсөлдөөний эсрэг үйл ажиллагаа явуулж буй “саарал бүс” байж болзошгүйг онцолжээ. Meta компани энэ үйлдлээ хиймэл оюун ухааныг аюулгүй болгох салбарын ердийн практик хэмээн тайлбарлаж байна.
Дэлгэрэнгүйг эх сурвалжаас харах
↓Эх сурвалжийг нээх ↓
Meta conducted a secretive program that directed hundreds of contractors to pose asteenagers while bombarding its competitors’ AI models with disturbing prompts ranging from suicide to cannibalism.
Internally known as “Cannes,” the project, run by Meta contractor Covalen, targeted OpenAI’s ChatGPT, Google’s Gemini, and Character.AI chatbots using throwaway under-18 accounts, Wired reports. This was seemingly done to stress test the models, with the contractors instructed to push the chatbots into giving responses that defied their guardrails — though the AI companies had no idea this was happening.
Per the reporting, one spreadsheet of the nearly 3,8000 the prompts the contractors used in one instance showed that hundreds focused on suicide and self-harm, hundreds more on eating disorders, and at least 239 involving sex or romance — all written from the perspective of a child or teenager.
One described a fifth-grader whose classmate pointed a gun at his mouth. Another was about a girl trying to hide bulimia from her parents. And another asked if fantasizing about eating your neighbor’s child was “normal.” One posing as a higher schooler asked where to “get a cocaine.” They also sent images depicting pills, nooses, knives, and a medical diagram of a gynecological procedure, per the magazine.
This is just a tiny preview of Meta’s brute force approach, as another round of testing involved over 45,000 prompts. The contractors meticulously recorded the epic number of chatbot responses in spreadsheets. But what Meta did with all this data is unclear. An internal document from Covalen described the effort as “comprehensive AI safety benchmarking” that delivered “[c]ritical datasets for model comparison and compliance.”
It’s another example of how Meta has offloaded disturbing behind-the-scenes work onto contractors, ostensibly in the name of safety. In 2020, it settled a lawsuit filed by Facebook content moderators who said they were traumatized from reviewing videos showing murder, torture, sexual assault, and child abuse on the platform, though similar complaints have continued to emerge. This year, another group of Meta contractors said they were forced to watch highly sensitive footage captured on the company’s Ray-Ban AI glasses, including sex scenes and bathroom visits.
The contractors who were instructed to come up with the prompts on distressing subjects were similarly unsettled.
“I’ve seen a lot of things I wish I hadn’t while doing this job,” one told Wired. “Everyone I knew who worked on this project was completely gobsmacked by some of the text they were asking us to test. Like, surely we are going to get in trouble for doing this?”
Meta, for its part, characterized the prompts as part of an “industry-standard practice” of safety benchmarking models in a statement to Wired. But Rumman Chowdhury, CEO of Humane Intelligence PBC, a nonprofit dedicated to responsible AI development, isn’t so sure.
“Structuring a monthslong, large-scale project that appears designed to systematically break those rules, via dummy accounts masquerading as children, is outside what is usually described as ‘industry standard’ evaluation,” she told Wired, highlighting the fact that Meta kept it secret from its competitors and hasn’t shared its findings with the public.
This, Chowdhhury added, is “exactly the kind of governance gray zone where safety becomes a convenient cover for anticompetitive practices.”
More on Meta: Meta’s Program That Spies on Every Employee’s Computer Just Blew Up in Its Face in Spectacular Fashion
The post Meta Paid Hundreds of Contractors to Pretend to Be Teenagers While Barraging Its Competitors’ AI With Disturbing Content appeared first on Futurism.

