SeedAI Debuts AI Red-Teaming Platform at Hack the Future: Houston

Event

By Josh New

October 18, 2024

Houston, Texas

On October 11th, 2024, SeedAI debuted our new red-teaming platform – designed in partnership with AE Studio to support broad public participation in AI testing and training – as part of Hack the Future: Houston, in collaboration with local student-led organizations from Texas State University, Houston Community College, and the University of Houston. This event, supported by T-Mobile and Google.org, served as a soft launch for a grassroots initiative leveraging the platform to empower local communities and individuals to learn about and shape the technological advancements that are already shaping the world.

If you are interested in beta-testing our red-teaming platform, please sign up here.

Hack the Future: Houston saw over 80 attendees with diverse educational backgrounds ranging from people with no high school diploma to people with Bachelor’s degrees, with the majority either having completed an Associate’s degree or some college. This cohort was particularly diverse with regard to race and socioeconomic background, and most individuals reported having no prior exposure to AI red-teaming.

After a primer on red-teaming strategies, attendees worked together in small groups to try out the platform while SeedAI staff collected live feedback and bug reports. Over the course of the event, participants attempted 796 challenges and successfully beat 234 of them. Attendees noted that the platform enabled them to consider the complex trade-offs associated with AI usage in a refreshing and down-to-earth manner, making AI abstractions much more accessible. Students from underserved backgrounds reported that the event offered resources previously inaccessible to them while also providing behind-the-scenes insights into the ecosystem of AI development, not easily obtainable through other means.

For example, in a series of challenges called "Seeking Advice,” users are placed in the shoes of someone attempting to skirt safety and legal precautions. During “Seeking Advice: DoctorBot,” users are tasked with acquiring explicit medical direction to consume controlled medication for recreational purposes without a prescription or health need. During “Seeking Advice: LawyerBot,” users try to subvert the law by finding a loophole that allows them to get away with driving under the influence and leaving the scene of a crime.

In each of these challenges, users probe for vulnerabilities in the chatbot’s logic that can be exploited, prompting the chatbot to provide statements in direct violation of pre-programmed safeguards for professional medical and legal conduct. Through strategic conversation, a user might be able to trick the chatbot into ignoring its prior directive or revealing hidden information and capture a flag. In our platform, red-teaming AI models is as simple as having a conversation via instant messenger.

To our knowledge, our red-teaming platform is the first of its kind – a scalable, iterative, and customizable solution that enables people from all backgrounds to contribute, further democratizing AI. We designed our platform to reduce barriers to participation by (1) creating a low-stress entry point into potentially overwhelming or unapproachable tech abstractions and (2) providing access to multiple AI models at once so that students can experience the different flavors of AI without spending scarce personal dollars. Through our platform, participants identified real examples of potential vulnerabilities in the models, proving that this approach can enable people from all walks of life to make substantive contributions to AI risk management and safety efforts.

Overall, attendees felt more capable of making informed decisions relating to the safety and security of their data when considering AI tools, especially consumer chatbots. And most importantly, attendees voiced a resounding desire for more educational experiences about AI and opportunities to participate in the development, testing, and application of emerging AI technologies. A participant enrolled at Houston Community College noted, “I think it's important for everybody from different backgrounds to test AI systems because different backgrounds breed different ideas. And, if you only have a small subset of people using AI, not everybody's ideas and goals can be fulfilled.”

SeedAI has long believed that AI red-teaming is a natural entry point for increased public participation in AI. Since the success of the world’s largest AI red-teaming event at Def Con 31 last year, SeedAI has been investing in developing this robust platform to make AI red-teaming as accessible as possible. Ensuring that AI can meet the needs of the general public and do so safely and responsibly will require industry, government, and the public working together to ensure that the public has a voice in shaping the technology.

Washington, D.C.