Data & AI Platform Infrastructure Engineer
Apple · Beijing, China · Machine Learning and AI
At Apple, we believe in hard work, a fun environment, and the creativity and innovation that only come about when hardworking people from diverse backgrounds approach problems from varying perspectives. The people here at Apple don’t just build products; they craft the kind of wonder that’s revolutionized entire industries. It’s the diversity of those people and their ideas that encourages the innovation that runs through everything we do, from amazing technology to industry-leading environmental efforts. Join Apple, and help us leave the world better than we found it! Building this environment starts with YOU!
At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish.
This is a visible and important role at Apple in China and will have a great impact on the Sales team. The individual in this role will be responsible for interpreting quantitative data and developing statistical models to forecast and monitor sales demand and supply for the sales analytics team.
We are building a unified data‑and‑AI platform that serves engineers. The platform runs on multiple cloud platforms and provides reliable, scalable, and cost‑effective infrastructure services (compute, storage, networking, security, monitoring, CI/CD, etc.).
As an Infrastructure Engineer, you will be responsible for:
- Cloud‑Native Resource Management
* Provision, configure, and maintain AWS services (EC2, S3, EKS, Lambda, ECS, etc.) and Alibaba Cloud services (ECS, OSS, ACK, Function Compute, ECI, etc.).
* Write and maintain Infrastructure‑as‑Code scripts (Terraform, CloudFormation, ROS, Ansible, etc.) to automate the lifecycle of resources.
* Cost Optimization: apply reserved instances, autoscaling, and right‑sizing to reduce cloud spend.
- Platform Component Build‑out
* Deliver shared services such as data lakes, metadata stores, job schedulers, logging & monitoring stacks, and identity‑access management.
* Design and implement CI/CD pipelines for code, configuration, and infrastructure delivery.
- Operations & Monitoring
* Build observability (Prometheus/Grafana, CloudWatch, Log Service, etc.) and alerting systems.
* Perform day‑to‑day troubleshooting, performance tuning, and cost optimization.
- Cross‑Team Collaboration
* Work closely with Engineering, Product, and Business teams to translate requirements into platform solutions.
* Produce clear documentation, runbooks, and best‑practice guides to help users adopt the platform quickly.
Minimum Qualifications
Bachelor’s degree or higher in Computer Science, Software Engineering, Information Technology, or a related field.
3+ years of professional experience working with cloud platforms (AWS and/or Alibaba Cloud).
Familiar with Linux command-line tools, shell scripts, IaC tools, containers and orchestration, networking concepts, Git concepts, CI/CD fundamentals, and Kubernetes.
Proficient in at least one of the following programming languages: Python, Rust, or Go.
Strong communication and teamwork abilities; a self‑motivated learner able to pick up new technologies quickly; an analytical mindset for troubleshooting and problem solving.
Preferred Qualifications
Detailed understanding of OLTP and OLAP systems
Experience with big data technologies (Kafka, Spark, Flink, Hive, etc.)
Experience with job schedulers (Airflow, Step Functions, etc.)
Familiar with ML foundations (PyTorch, TensorFlow, model serving, etc.)
Familiar with LLM foundations (LLM internals, vLLM, SGLang, Unsloth, etc.)