Yuezhou Liu

Senior Cloud and AI Infrastructure Engineer with 17+ years of experience spanning telecom core networks, cloud platforms (Kubernetes, OpenStack), and GPU/AI infrastructure. Proven track record in building and operating large-scale distributed systems, leading technical teams, and landing AI workloads on cloud infrastructure. Currently focused on SAP AI Core deployment on Alibaba Cloud. Deep expertise in Kubernetes, bare-metal provisioning (Metal3), GPU cluster networking (InfiniBand/RDMA), and LLM deployment.

Skills

Cloud & Container Orchestration
  • Kubernetes
  • OpenStack
  • Docker
  • Metal3
  • Helm
  • Rook-Ceph
AI / GPU Infrastructure
  • NVIDIA BCM
  • InfiniBand
  • GPFS
  • RDMA
  • GPU Operator
  • vLLM
Programming & Automation
  • Shell/Bash
  • Python
  • Ansible
  • Erlang
  • C++
  • Java
Telecom & Networking
  • 3GPP (2G-5G)
  • Core Network (SGSN/MME)
  • IP Networking
  • OVS/DPDK/SR-IOV
  • DNS
  • Load Balancing
DevOps & Platform
  • CI/CD
  • Linux
  • Git
  • Prometheus
  • Grafana
  • Alibaba Cloud

Experience

Senior DevOps Engineer

China Datacom (SAP Cloud China)

Landing SAP AI Core in China on Alibaba Cloud.

  • Responsible for infrastructure deployment and operations of SAP’s AI platform in the Chinese public cloud environment

September 2025 - Present

Principal DevOps Engineer

Yoocar

Built and operated AI GPU cluster infrastructure from the ground up.

  • Designed and deployed H20 GPU server clusters managed by NVIDIA Base Command Manager (BCM)
  • Implemented high-performance networking with InfiniBand, RDMA, and IBM GPFS high-speed storage
  • Built Kubernetes platform on bare-metal with automated provisioning based on Metal3/ClusterAPI
  • Deployed and fine-tuned Large Language Models (DeepSeek, LLaMA) using vLLM, NIM, and Ollama
  • Participated in drafting National Standards for Intelligent Computing Terms

April 2023 - July 2025

Senior Cloud Support Engineer / Team Leader

Ericsson

Led cloud infrastructure team for Ericsson’s telecom cloud products (CCD & CEE).

  • Technical lead for CCD (CNCF-certified Kubernetes platform) and CEE (Mirantis OpenStack platform)
  • Kubernetes lifecycle management automation development (Heat/Ansible)
  • OpenStack components (Nova/Neutron) customization for China Mobile (Python/Ansible/Shell)
  • Prototype research on bare-metal Kubernetes deployment based on Metal3
  • Supported China Mobile 5GC CP1 performance testing, ranking #1 among all vendors
  • Customized CEE (OpenStack) to support 1,500 physical nodes for China Mobile
  • NFV virtual layer networking (OVS/DPDK/SR-IOV) design, development, and automation
  • Supported KDDI Japan CEE deployment

September 2016 - March 2023

Senior Support Engineer

Ericsson

Expert-level 2G/3G/4G/5G core network technical support.

  • Analyzed core network signaling and helped operators configure and troubleshoot SGSN/MME
  • 24/7 R&D Level 3 emergency support for global telecom operators
  • Deployed and tested virtual core network (vEPC) on Ericsson OpenStack cloud platform
  • On-site support for 2016 G20 Summit (Zhejiang Telecom core network)
  • China Mobile NB-IoT test specification drafting
  • SoftBank VoLTE pre-launch functional testing

July 2012 - August 2016

Software Developer

Tellabs

Developed Tellabs 6300 Network Management System (NMS).

  • Module design and development using C++, Shell, and Tcl

October 2011 - July 2012

Software Developer

HP (Hewlett-Packard)

Developed HP OpenView TeMIP (Telecom Management Information Platform).

  • Server alarm module development (Shell/C++)
  • Windows client GUI development (MFC)
  • Web client development (Java)

February 2008 - September 2011

Education

Jiangsu University

Bachelor of Engineering
Information Security

2004 - 2008