Projects and Engineering Portfolio

Browse selected work across AI infrastructure, embodied AI, systems engineering, and digital applications

Browse by Type

AI Infrastructure, Model Serving, and Training Optimization

15 projects

2025.08.18 — Present
Type: Open-Source Contribution + Product Delivery
Role: Contributor / MLsys Engineer
GitHub Stars: 416

vLLM-Kunlun (Baidu Kunlun Chip Inference Framework)

vLLM inference framework for Kunlun XPU, used to bring high-performance LLM serving to Baidu Kunlun P800 hardware.

PythonPyTorchvLLMKunlun XPULLM Inference
Source Code
2026.05.19
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 16,139

FunASR

End-to-end speech recognition toolkit and open-source pretrained model library.

PythonASRVADPunctuationTesting
Source Code
2026.05.05
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 80,503

vLLM

High-throughput LLM inference and serving engine.

PythonvLLMTokenizersTransformers
Source Code
2026.04.28
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 373,294

OpenClaw

Cross-platform personal AI assistant.

TypeScriptAI AssistantSchemaTesting
Source Code
2026.04.10 — 2026.05.08
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 4,636

ExecuTorch

On-device AI runtime for PyTorch.

PyTorchExecuTorchXNNPACKDynamic ShapesTesting
Source Code
2026.04.07 — 2026.04.09
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 3,882

LitServe

AI inference serving framework maintained by Lightning AI for multi-API and multi-model deployments.

PythonModel ServingAuthenticationHealth Checks
Source Code
2026.04.08
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 2,316

Olive

Microsoft toolkit for model finetuning, conversion, quantization, and deployment optimization.

PythonOliveQuantizationHQQRTN
Source Code
2026.04.08
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 5,755

anomalib

Open-source anomaly detection library for industrial inspection and vision workloads.

PythonanomalibPandasDataFrameVision
Source Code
2026.04.07
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 2,828

PyTorch AO

PyTorch model optimization and quantization toolkit for compiler and inference workflows.

PythonPyTorchPT2EQuantization
Source Code
2026.04.06
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 28,032

SGLang

High-performance LLM and multimodal serving framework focused on optimized runtime and deployment.

PythonPyTorchLLM ServingAuthenticationTesting
Source Code
2026.04.04
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 5,544

RLLM

RL-for-LLM research framework with integrations such as verl for training workflows.

PythonRLHFverlTraining
Source Code
2026.03.05 — 2026.05.06
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 4,815

vLLM-Omni

Omni and multimodal inference extension built on vLLM.

PythonvLLMMultimodalGradioLogging
Source Code
2026.04.03
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 9,525

OpenRLHF

Distributed RLHF training framework for alignment experiments on large models.

PythonRayRLHFNCCL
Source Code
2026.03.17 — 2026.04.02
Type: Closed-Source Baidu AI Inference System
Role: MLsys / Inference Engineer

Baidu AIAK-SGLang DeepSeek / GLM5 PD Deployment

PD deployment, benchmarking, and operator-level performance debugging

Built Prefill/Decode-disaggregated deployments for DeepSeek-V3.2 and GLM5 on Kunlun P800 clusters with the AIAK-customized SGLang stack.

SGLangAIAKKunlun P800Kubernetesaiakperf
2026.03.30
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 160,788

Transformers

Hugging Face's flagship model library for model implementations, training, and inference tooling.

PythonTransformersPILTorchVision
Source Code