AI Benchmarking Spec. - Chinese, International Seller Growth
Amazon · Shanghai, China · Editorial, Writing, & Content Management
About this role
Amazon is hiring a mid-level Data Annotator in the operations function, based in Shanghai, China. The posting calls out experience with LLMs, testing, and machine learning.
- Role: Data Annotator
- Function: operations
- Level: mid
- Track: Individual contributor
- Employment: Full-time
- Location: Shanghai, China
- Department: Editorial, Writing, & Content Management
- Posted: Apr 8, 2026
Job description
From Amazon careers

The Seller AI team within the International Seller Services organization is focused on equipping sellers with the right set of Gen-AI/LLM-powered tools and agentic solutions to accelerate their business growth on Amazon. Our primary focus is handling annotations for training, measuring, and improving Artificial Intelligence (AI) and Large Language Models (LLMs), enabling Amazon to deliver a superior experience to sellers worldwide.

The AI Benchmarking Associate supports the evaluation of AI systems by designing and executing benchmarking and audit activities to assess model quality, compliance, robustness, and fairness. The role combines elements of AI auditing, quality assurance, and traditional audit-style documentation and stakeholder communication. By joining us, you will play a pivotal role in shaping the future of selling on Amazon for sellers worldwide.

Key job responsibilities

As part of your role, you will have the opportunity to:
• Assist in planning and executing benchmarking exercises for AI models, including defining test plans, metrics, and acceptance criteria across accuracy, robustness, bias, and reliability.
• Support content accuracy, relevancy, and privacy checks by reviewing datasets, model outputs, and data handling practices, escalating potential regulatory risks.
• Validate data based on specific annotation guidelines, ensuring the accuracy…