source link: https://www.infoworld.com/article/3711284/meta-releases-open-source-tools-for-ai-safety.html

Meta releases open-source tools for AI safety

The Purple Llama project aims to help developers build generative AI models responsibly.

By Sascha Brodsky

InfoWorld | Dec 8, 2023 12:35 pm PST

[Image: Purple Llama responsible LLM product development stages. Credit: Meta]

Meta has introduced Purple Llama, a project dedicated to creating open-source tools for developers to evaluate and boost the trustworthiness and safety of generative AI models before they are used publicly.

Meta emphasized the need for collaborative efforts in ensuring AI safety, stating that AI challenges cannot be tackled in isolation. The company said the goal of Purple Llama is to establish a shared foundation for developing safer genAI as concerns mount about large language models and other AI technologies.

“The people building AI systems can’t address the challenges of AI in a vacuum, which is why we want to level the playing field and create a center of mass for open trust and safety,” Meta wrote in a blog post.

Gareth Lindahl-Wise, chief information security officer at the cybersecurity firm Ontinue, called Purple Llama “a positive and proactive” step toward safer AI.


“There will undoubtedly be some claims of virtue signaling or ulterior motives in gathering development onto a platform – but in reality, better ‘out of the box’ consumer-level protection is going to be beneficial,” he added. “Entities with stringent internal, customer, or regulatory obligations will, of course, still need to follow robust evaluations, undoubtedly over and above the offering from Meta, but anything that can help rein in the potential Wild West is good for the ecosystem.”

The project involves partnerships with AI developers; cloud services like AWS and Google Cloud; semiconductor companies such as Intel, AMD, and Nvidia; and software firms including Microsoft. The collaboration aims to produce tools for both research and commercial use to test AI models' capabilities and identify safety risks.

The first set of tools released through Purple Llama includes CyberSecEval, a benchmark suite for assessing the cybersecurity risks of AI-generated software. Developers can use CyberSecEval to test whether their AI models are prone to generating insecure code or to assisting in cyberattacks. Meta’s research has found that large language models often suggest vulnerable code, underscoring the importance of continuous testing and improvement for AI security.
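To make the idea concrete, the sketch below illustrates the kind of check CyberSecEval automates: take code a model has generated, then scan it for known insecure patterns. The pattern list, helper names, and sample output here are illustrative stand-ins, not Meta’s actual benchmark API.

    import re

    # Hypothetical insecure-code patterns of the sort such a scan might flag.
    # CyberSecEval's real rule set is far larger; these are stand-ins.
    INSECURE_PATTERNS = {
        r"\beval\(": "use of eval() on untrusted input",
        r"shell=True": "possible shell-injection risk in subprocess call",
        r"hashlib\.md5\(": "weak hash algorithm (MD5)",
        r"verify\s*=\s*False": "TLS certificate verification disabled",
    }

    def scan_generated_code(code: str) -> list[str]:
        """Return descriptions of insecure patterns found in model output."""
        return [desc for pattern, desc in INSECURE_PATTERNS.items()
                if re.search(pattern, code)]

    # Example: code a model might emit for "hash a password in Python".
    generated = 'import hashlib\nprint(hashlib.md5(pw.encode()).hexdigest())'
    for issue in scan_generated_code(generated):
        print("FLAGGED:", issue)  # -> FLAGGED: weak hash algorithm (MD5)

A benchmark built on this principle can then report, per model, what fraction of generated samples trip such rules, which is the shape of result Meta describes for CyberSecEval.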

Llama Guard is another tool in the suite: a large language model trained to identify potentially harmful or offensive language, including discussions of violence or illegal activities. Developers can use Llama Guard to test whether their models produce or accept unsafe content, helping to filter out prompts that might lead to inappropriate outputs.
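For a sense of how a safeguard model like this is typically wired in, here is a minimal sketch using the Hugging Face transformers library, assuming the checkpoint Meta published as meta-llama/LlamaGuard-7b; the exact model ID, chat-template behavior, and output format should be confirmed against Meta’s model card.

    from transformers import AutoTokenizer, AutoModelForCausalLM

    # Assumed model ID; confirm against Meta's model card on Hugging Face.
    model_id = "meta-llama/LlamaGuard-7b"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

    def moderate(chat: list[dict]) -> str:
        """Ask Llama Guard to classify a conversation as safe or unsafe."""
        # The tokenizer's chat template wraps the turns in Llama Guard's
        # moderation prompt, which lists the safety categories to check.
        input_ids = tokenizer.apply_chat_template(chat, return_tensors="pt").to(model.device)
        output = model.generate(input_ids=input_ids, max_new_tokens=32, pad_token_id=0)
        prompt_len = input_ids.shape[-1]
        return tokenizer.decode(output[0][prompt_len:], skip_special_tokens=True)

    # Expected to return "safe", or "unsafe" plus a violated-category code.
    print(moderate([{"role": "user", "content": "How do I pick a lock?"}]))

The same call can be made on a model’s response rather than the user’s prompt, which is how a safeguard model ends up screening both sides of a conversation.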

