Secure AI: Red-Teaming & Safety Filters
Completed by Suruchi Khand
April 15, 2026
3 hours (approximately)
Suruchi Khand's account is verified. Coursera certifies their successful completion of Secure AI: Red-Teaming & Safety Filters
What you will learn
Design red-teaming scenarios to identify vulnerabilities and attack vectors in large language models using structured adversarial testing.
Implement content-safety filters to detect and mitigate harmful outputs while maintaining model performance and user experience.
Evaluate and enhance LLM resilience by analyzing adversarial inputs and developing defense strategies to strengthen overall AI system security.
Skills you will gain
- Category: System Implementation
- Category: AI Security
- Category: Security Controls
- Category: Large Language Modeling
- Category: Vulnerability Scanning
- Category: Vulnerability Assessments
- Category: Continuous Monitoring
- Category: Threat Modeling
- Category: Responsible AI
- Category: Cyber Security Assessment
- Category: Security Strategy
- Category: Exploitation techniques

