Anthropic компани болон АНУ-ын засгийн газар хиймэл оюун ухааны аюулгүй байдлын асуудлаар зөрчилдөж байна

Хиймэл оюун ухааны Claude Fable 5 загварыг аюулгүй байдлын сул талтай гэж үзэн экпортын хяналтад оруулсан нь талуудын маргааныг хурцатгав.

Anthropic компани өөрийн дэвшилтэт AI загвар болох Claude Fable 5-ыг дахин зах зээлд гаргахын тулд аюулгүй байдлын сул талуудыг бүрэн арилгах шаардлагатай гэж Трампын засаг захиргаа мэдэгдлээ. Өнгөрсөн долоо хоногт тус загварыг “jailbreaking” буюу хиймэл оюун ухааны хамгаалалтын давхаргыг тойрч гарах аргад өртөмтгий гэж үзэн хязгаарлалт тогтоосон юм. Anthropic компани засгийн газрын санаа зовнилыг хэтрүүлсэн гэж үзэж байгаагаа даваа гарагт Худалдааны яам болон Үндэсний кибер аюулгүй байдлын газарт болсон уулзалтын үеэр илэрхийлжээ.

Гэвч Үндэсний аюулгүй байдлын агентлаг (NSA) Fable 5-ын хамгаалалтын механизмыг эвдэж, кибер аюулгүй байдал, хими, биологийн салбарын нууцлагдмал чадавхид нэвтрэх боломжтойг тогтоосон байна. Засаг захиргаа энэ асуудлыг компани өөрөө шийдвэрлэх ёстой гэж үзэж байгаа бөгөөд засгийн газрын байгууллагууд зах зээлд гарч буй бүх загварын сул талыг хянах боломжгүй гэдгийг онцолжээ. Иймд Anthropic компаниас зөвхөн Fable 5 төдийгүй бүх бүтээгдэхүүнээ идэвхтэй сорилтоор шалгаж, болзошгүй эрсдэлийг өөрсдөө илрүүлж байхыг шаардаж байна.

Кибер аюулгүй байдлын шинжээчид AI загварын хамгаалалтыг бүрэн дүүрэн шийдэл гэж үзэхгүй байгаа бөгөөд туршлагатай хэрэглэгчид болон ирээдүйн AI загварууд ямар ч хязгаарлалтыг тойрч гарах арга олох магадлал өндөр гэж дүгнэж байна. Энэхүү нөхцөл байдал нь нэг талаас технологийн хөгжил, нөгөө талаас төрийн зохицуулалтын хоорондын зөрчилдөөнийг улам тодотгож байна.

Үүний зэрэгцээ, Трампын засаг захиргааны Үндэсний тагнуулын газрын (DNI) удирдлагын томилгоотой холбоотой асуудал үргэлжилж байна. Анх томилогдсон Билл Пултегийн туршлага хангалтгүй гэх шүүмжлэлийн дараа Жэй Клэйтоныг нэр дэвшүүлсэн ч, Сенатын Бүгд найрамдах намын гишүүдтэй хийх сонсголын тов тодорхойгүй байдалд ороод байна. Энэхүү дотоод маргаан нь тагнуулын байгууллагын үйл ажиллагаанд сөргөөр нөлөөлж болзошгүйг эх сурвалжууд анхааруулж байна.

Дэлгэрэнгүйг эх сурвалжаас харах

Эх сурвалжийг нээх ↓

The Trump administration’s disagreement with Anthropic over its most advanced AI models appears to be fast coming to a head.

Trump officials tell Inner Loop that if Anthropic wants to rerelease Claude Fable 5, the AI model that they took offline with export controls last week over concerns about jailbreaking—a method of using prompts to get around a model’s safeguards—the company will need to take steps to actually address what the government alleges are vulnerabilities.

Anthropic has said for days that the administration’s concerns are overblown and that the effects of the jailbreaks are minimal. It reiterated this position to the Commerce Department and the Office of the National Cyber Director, Sean Cairncross, in a technical meeting on Monday.

But officials say they are past arguing whether the jailbreaks are significant, since the National Security Agency concluded that there are ways to disable guardrails on Fable 5, which are put in place to prevent users from accessing capabilities of the Mythos model related to cybersecurity, chemistry, and biology

At this stage, the administration essentially views the situation as Anthropic’s problem to fix, according to three people familiar with discussions.

Neither the Commerce Department’s Center for AI Standards and Innovation nor the National Security Agency has the staff or the bandwidth to be drawn into chasing down every conceivable jailbreak on every model that reaches the market, the people said.

As a result, the administration believes that Anthropic should be more proactive about continually testing not just Fable 5 but all of its frontier AI models to find potential jailbreaks and flag them to the government themselves.

But on a more fundamental level, it remains unclear how Anthropic is supposed to prevent jailbreaking.

Independent cybersecurity experts have increasingly taken the view that guardrails on AI models are only a stopgap solution, since skilled users and future AI models will find ways to bypass constraints—meaning that what the White House appears to want cannot be done.

A White House spokesperson declined to comment.

DNI = Do Not Invite

At the start of the week, Trump’s pick to serve as Acting Director of National Intelligence, Bill Pulte, was on track to never even start the job. Now, Trump has thrown him a lifeline—and it’s the permanent DNI nominee, Jay Clayton, who now faces the prospect of never serving in the role.

To recap: Trump initially named Pulte, his housing finance chief, to replace outgoing DNI Tulsi Gabbard.

Faced with bipartisan pushback because Pulte doesn’t have the national security experience required by law for the role and because he flagged allegedly questionable mortgage fraud accusations against Trump’s political enemies, Trump announced Clayton, the US attorney for the Southern District of New York, as his nominee for a permanent DNI.

Gabbard was scheduled to depart June 18, with Pulte’s first day set for June 19. But Senate Republicans wondered, if Clayton could have his hearing fast-tracked to June 17 and start by June 22, would Pulte even get into the building?

On Wednesday, Trump blew up the plan. As part of a wider feud with Senate Republican leadership over the filibuster, Trump announced Clayton’s hearing would be delayed indefinitely, in an apparent effort to prevent Pulte from getting jumped. Senate Republicans then announced that the hearing would proceed, unless Clayton didn’t appear or his nomination was withdrawn.

The situation may be a body blow for the Office of the Director of National Intelligence, which Trump has directed Pulte to vastly downsize, and staffers have been unimpressed by what they see as Pulte’s minimal effort to get to know the agency and lack of regular briefings, people familiar with the matter said.

- Зар сурталчилгаа -

Anthropic компани болон АНУ-ын засгийн газар хиймэл оюун ухааны аюулгүй байдлын асуудлаар зөрчилдөж байна

DNI = Do Not Invite

Та юу гэж бодож байна? Cancel reply

Холбоотой

Хүн дүрст роботыг алсаас удирдах технологийн шинэ боломж

Google ухаалаг чанга яригчдаа Gemini хиймэл оюун ухааныг нэвтрүүллээ

Anthropic компани хиймэл оюун ухааны загваруудаа түр зогсоолоо

Ирээдүйн орон сууц: Технологи ба амьдрах орчны хувьсал

Шинэ

Дэлхийн аваргад Португаль тэнцэж, Конго түүхэн оноогоо авлаа

Нью-Йорк хот Никс багийн аваргалсныг тэмдэглэнэ

Дэлхийн хоёрдугаар дайны ахмад дайчин байлдааны танктайгаа дахин уулзлаа

Бразилд болсон нисдэг тэрэгний ослоос продюсер Виктор Вао азаар амьд гарчээ