Google adds generative AI threats to its bug bounty program

Google has expanded its vulnerability rewards program (VRP) to include attack scenarios specific to generative AI.

In an announcement shared with TechCrunch ahead of publication, Google said: “We believe expanding the VRP will incentivize research around AI safety and security and bring potential issues to light that will ultimately make AI safer for everyone,”

Google’s vulnerability rewards program (or bug bounty) pays ethical hackers for finding and responsibly disclosing security flaws.

Given that generative AI brings to light new security issues, such as the potential for unfair bias or model manipulation, Google said it sought to rethink how bugs it receives should be categorized and reported.

The tech giant says it’s doing this by using findings from its newly formed AI Red Team, a group of hackers that simulate a variety of adversaries, ranging from nation-states and government-backed groups to hacktivists and malicious insiders to hunt down security weaknesses in technology. The team recently conducted an exercise to determine the biggest threats to the technology behind generative AI products like ChatGPT and Google Bard.

The team found that large language models (or LLMs) are vulnerable to prompt injection attacks, for example, whereby a hacker crafts adversarial prompts that can influence the behavior of the model. An attacker could use this type of attack to generate text that is harmful or offensive or to leak sensitive information. They also warned of another type of attack called training-data extraction, which allows hackers to reconstruct verbatim training examples to extract personally identifiable information or passwords from the data.

Both of these types of attacks are covered in the scope of Google’s expanded VRP, along with model manipulation and model theft attacks, but Google says it will not offer rewards to researchers who uncover bugs related to copyright issues or data extraction that reconstructs non-sensitive or public information.

Techcrunch event

Berkeley, CA | June 5

The monetary rewards will vary on the severity of the vulnerability discovered. Researchers can currently earn $31,337 if they find command injection attacks and deserialization bugs in highly sensitive applications, such as Google Search or Google Play. If the flaws affect apps that have a lower priority, the maximum reward is $5,000.

Google says that it paid out more than $12 million in rewards to security researchers in 2022.

Google brings generative AI to cybersecurity

Topics

bug bounty, Generative AI, Google, Security

Carly Page

Sr. Reporter, Cybersecurity

Carly Page was a Senior Reporter at TechCrunch, where she covered the cybersecurity beat. Prior to that, she had spent more than a decade in the technology industry, writing for titles including Forbes, TechRadar and WIRED.

You can contact Carly securely on Signal at +441536 853956

View Bio

Topics

More from TechCrunch

Google adds generative AI threats to its bug bounty program

Join us at TechCrunch Sessions: AI

Secure your spot for our leading AI industry event with speakers from OpenAI, Anthropic, and Cohere. For a limited time, tickets are just $292 for an entire day of expert talks, workshops, and potent networking.

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

Attend TechCrunch Sessions: AI with this new, limited-time discount

Google tests replacing ‘I’m Feeling Lucky’ with ‘AI Mode’

$25B-valued Chime files for an IPO, reveals $33M deal with Dallas Mavericks

xAI’s promised safety report is MIA

Join us at TechCrunch Sessions: AI

Secure your spot for our leading AI industry event with speakers from OpenAI, Anthropic, and Cohere. For a limited time, tickets are just $292 for an entire day of expert talks, workshops, and potent networking.

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

New York-focused VC Work-Bench has raised a fresh $160M

Anthropic, Google score win by nabbing OpenAI-backed Harvey as a user

Google adds generative AI threats to its bug bounty program

Join us at TechCrunch Sessions: AI

Secure your spot for our leading AI industry event with speakers from OpenAI, Anthropic, and Cohere. For a limited time, tickets are just $292 for an entire day of expert talks, workshops, and potent networking.

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

Most Popular

Attend TechCrunch Sessions: AI with this new, limited-time discount

Google tests replacing ‘I’m Feeling Lucky’ with ‘AI Mode’

$25B-valued Chime files for an IPO, reveals $33M deal with Dallas Mavericks

xAI’s promised safety report is MIA

Join us at TechCrunch Sessions: AI

Secure your spot for our leading AI industry event with speakers from OpenAI, Anthropic, and Cohere. For a limited time, tickets are just $292 for an entire day of expert talks, workshops, and potent networking.

Exhibit at TechCrunch Sessions: AI

Secure your spot at TC Sessions: AI and show 1,200+ decision-makers what you’ve built — without the big spend. Available through May 9 or while tables last.

New York-focused VC Work-Bench has raised a fresh $160M

NFT phenom CryptoPunks was just sold to a nonprofit

Anthropic, Google score win by nabbing OpenAI-backed Harvey as a user