Projects and Engineering Portfolio

Browse selected work across AI infrastructure, embodied AI, systems engineering, and digital applications

Browse by Type

AI Infrastructure, Model Serving, and Training Optimization

15 projects

2025.08.18 — Present
Type: Open-Source Contribution + Product Delivery
Role: Contributor / MLsys Engineer
GitHub Stars: 431

vLLM-Kunlun (Baidu Kunlun Chip Inference Framework)

vLLM inference framework for Kunlun XPU, used to bring high-performance LLM serving to Baidu Kunlun P800 hardware.

PythonPyTorchvLLMKunlun XPULLM Inference
Source Code
2026.05.19
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 18,054

FunASR

End-to-end speech recognition toolkit and open-source pretrained model library.

PythonASRVADPunctuationTesting
Source Code
2026.05.05
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 83,002

vLLM

High-throughput LLM inference and serving engine.

PythonvLLMTokenizersTransformers
Source Code
2026.04.28
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 378,924

OpenClaw

Cross-platform personal AI assistant.

TypeScriptAI AssistantSchemaTesting
Source Code
2026.04.10 — 2026.05.08
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 4,732

ExecuTorch

On-device AI runtime for PyTorch.

PyTorchExecuTorchXNNPACKDynamic ShapesTesting
Source Code
2026.04.07 — 2026.04.09
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 3,893

LitServe

AI inference serving framework maintained by Lightning AI for multi-API and multi-model deployments.

PythonModel ServingAuthenticationHealth Checks
Source Code
2026.04.08
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 2,333

Olive

Microsoft toolkit for model finetuning, conversion, quantization, and deployment optimization.

PythonOliveQuantizationHQQRTN
Source Code
2026.04.08
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 5,839

anomalib

Open-source anomaly detection library for industrial inspection and vision workloads.

PythonanomalibPandasDataFrameVision
Source Code
2026.04.07
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 2,860

PyTorch AO

PyTorch model optimization and quantization toolkit for compiler and inference workflows.

PythonPyTorchPT2EQuantization
Source Code
2026.04.06
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 29,064

SGLang

High-performance LLM and multimodal serving framework focused on optimized runtime and deployment.

PythonPyTorchLLM ServingAuthenticationTesting
Source Code
2026.04.04
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 5,618

RLLM

RL-for-LLM research framework with integrations such as verl for training workflows.

PythonRLHFverlTraining
Source Code
2026.03.05 — 2026.05.06
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 5,155

vLLM-Omni

Omni and multimodal inference extension built on vLLM.

PythonvLLMMultimodalGradioLogging
Source Code
2026.04.03
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 9,646

OpenRLHF

Distributed RLHF training framework for alignment experiments on large models.

PythonRayRLHFNCCL
Source Code
2026.03.17 — 2026.06.11
Type: Closed-Source Baidu AI Inference System
Role: MLsys / Inference Engineer

Baidu AIAK-SGLang DeepSeek / GLM5 PD Deployment

PD deployment, benchmarking, and operator-level performance debugging

Built Prefill/Decode-disaggregated deployments for DeepSeek-V3.2, DeepSeek-V4-Flash, and GLM5 on Kunlun P800 clusters with the AIAK-customized SGLang stack.

SGLangAIAKKunlun P800Kubernetesaiakperf
2026.03.30
Type: Open-Source Contribution
Role: Contributor
GitHub Stars: 161,620

Transformers

Hugging Face's flagship model library for model implementations, training, and inference tooling.

PythonTransformersPILTorchVision
Source Code