AI Safety Expert - Red Team
About the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: AI Safety Experts — English & Urdu
Type: Contract
Compensation: $20–$22/hour
Location: Remote
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Follow established taxonomies, benchmarks, and playbooks so that testing stays consistent.
- Document findings reproducibly, producing reports, datasets, and attack cases that customers can act on.
- Review AI outputs on sensitive topics like bias and misinformation, with optional participation in higher-sensitivity projects.
Qualifications
Must-Have
- Native fluency in English and Urdu.
- Prior experience in red teaming (AI adversarial work, cybersecurity, socio-technical probing).
- Ability to explain risks clearly to both technical and non-technical stakeholders.
Preferred
- Experience in Adversarial ML, Cybersecurity, or socio-technical risk analysis.
- Background in psychology, acting, or writing that supports creative probing and unconventional adversarial thinking.
Application Process (Takes 20–30 mins to complete)
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
- For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
first seen 2026-06-19 20:24:01 · last verified 2026-06-21 12:24:01