AI Safety Expert - Red Team

Mercor· Toronto, Ontario· Posted 1h ago· via Talent.com
region Canada
salary CAD 20
Apply Now

$ cat job-description.txt

About the job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .

Position: AI Safety Experts — English & Telugu

Type: Contract

Compensation: $20–$22/hour

Location: Remote

Role Responsibilities

- Red team conversational AI models and agents to identify jailbreaks, prompt injections, and misuse cases.

- Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.

- Apply structure by following taxonomies, benchmarks, and playbooks to ensure consistent testing.

- Document reproducibly by producing reports, datasets, and attack cases that customers can act on.

- Work independently and asynchronously to improve AI model performance and ensure flexible hours .

Qualifications

Must-Have

- Fluent in English & Telugu .

- Prior red teaming experience in AI adversarial work , cybersecurity , or socio-technical probing.

- Ability to explain risks clearly to both technical and non-technical stakeholders.

Preferred

- Experience with adversarial ML , cybersecurity , or socio-technical risk analysis.

- Skills in creative probing such as psychology, acting, or writing.

Compensation & Legal

- Hourly contractor

- Paid weekly via Stripe Connect

Application Process (Takes 20–30 mins to complete)

- Upload resume

- AI interview based on your resume

- Submit form

Resources & Support

- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome

- For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

first seen 2026-07-02 00:24:01 · last verified 2026-07-02 00:24:01

pentestcareers.com // breach the job market

Get new pentesting jobs in your inbox