AI Safety Expert - Red Team
$ cat job-description.txt
About the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .
Position: AI Safety Experts — English & Telugu
Type: Contract
Compensation: $20–$22/hour
Location: Remote
Role Responsibilities
- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.
- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.
- Document reproducibly by producing reports, datasets, and attack cases that customers can act on.
- Work independently and asynchronously to improve AI model performance and ensure flexible hours .
Qualifications
Must-Have
- Fluent in English & Telugu .
- Prior red teaming experience in AI adversarial work , cybersecurity , or socio-technical probing.
- Ability to explain risks clearly to both technical and non-technical stakeholders.
Preferred
- Experience with adversarial ML , cybersecurity , or socio-technical risk analysis.
- Skills in creative probing such as psychology, acting, or writing.
Compensation & Legal
- Hourly contractor
- Paid weekly via Stripe Connect
Application Process (Takes 20–30 mins to complete)
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
- For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
first seen 2026-07-02 00:24:01 · last verified 2026-07-02 00:24:01
pentestcareers.com // breach the job market