Bilingual Evaluator - Fully Remote | Up to $30/hr

Taiwan | Posted about 1 month ago
Description

About the job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Bilingual Traditional Chinese Generalist Evaluator Expert
Type: Contract
Compensation: $25–$30/hour
Location: Remote
Duration: 2–4 months
Commitment: 20+ hours/week

Role Responsibilities

  • Create detailed prompts in Traditional Chinese and/or English with multiple constraints and instructions. Ensure natural phrasing and real-world relevance for users in Hong Kong and Taiwan.
  • Establish high-level expectations for correct responses in consumer contexts. Develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.
  • Run prompts through models and assess outputs for accuracy, fluency, and cultural fit in Traditional Chinese. Compare results against English where needed.
  • Collaborate in QA review processes to ensure prompts, tasks, and rubrics meet the required standard of rigor. Maintain consistency and reliability across Traditional Chinese–language benchmarks before integration into official evaluations.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

  • Native-level fluency in Traditional Chinese (written), specific to Hong Kong or Taiwan usage, with strong reading/writing ability in English.
  • Must be native to Hong Kong or Taiwan and have lived in or spent significant time in-country, with deep cultural and linguistic familiarity.
  • BS or BA from a reputable institution (completed or in progress).
  • Strong writing and critical thinking skills.
  • Ability to work independently and meet deadlines.
  • Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
  • Based in Hong Kong or Taiwan (or able to reliably produce Hong Kong- or Taiwan-specific, culturally accurate Traditional Chinese).

Preferred

  • Experience in teaching, research, editing, or academic writing.
  • Experience creating evaluation criteria, rubrics, or grading guidelines.
  • Familiarity with LLMs, prompting, or model evaluation.

Application Process (Takes 20–30 mins to complete)

  • Complete an AI-led interview (about 15 minutes).
  • If approved, complete a paid assessment focused on writing and rubric creation.
  • If selected, you will be invited to work on the project.

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
  • For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

About Mercor

Founded: 2023 (over 3 years ago)
People: 51-200 employees
Industry: Software Development
Type: Privately Held