Secure AI: Red-Teaming & Safety Filters
Completed by Suruchi Khand
April 15, 2026
3 hours (approximately)
Suruchi Khand's account is verified. Coursera certifies their successful completion of Secure AI: Red-Teaming & Safety Filters
What you will learn
Design red-teaming scenarios to identify vulnerabilities and attack vectors in large language models using structured adversarial testing.
Implement content-safety filters to detect and mitigate harmful outputs while maintaining model performance and user experience.
Evaluate and enhance LLM resilience by analyzing adversarial inputs and developing defense strategies to strengthen overall AI system security.
Skills you will gain
- Category: Continuous Monitoring
- Category: Security Controls
- Category: AI Personalization
- Category: Responsible AI
- Category: Vulnerability Scanning
- Category: Security Strategy
- Category: Vulnerability Assessments
- Category: Exploitation techniques
- Category: LLM Application
- Category: Cyber Security Assessment
- Category: Prompt Engineering
- Category: Threat Modeling

