Infrastructure Engineer

Beijing Tranxmart Technology Co., Ltd. | Beijing, China | Dec. 2025 – Present

  • Customized Linux kernel for NVIDIA Drive Orin platform: selectively configured kernel modules (virtualization, real-time mechanisms, etc.), completed kernel compilation and system image packaging
  • Designed and implemented an autonomous driving Minimum Risk Condition (MRC) system compliant with ISO 26262 functional safety standards
  • Deployed L4-level MRC on ultra-low-compute embedded chip TC397 using pure rule-based code, covering the full perception-planning-control-localization pipeline for safe pull-over maneuvers and emergency braking

Senior AI Infrastructure Engineer

Moore Threads Intelligent Technology Co., Ltd. | Beijing, China | Apr. 2025 – Oct. 2025

  • Developed fused operators and distributed data sharding strategies for Alibaba’s Wan2.1/Wan2.2 video generation models on Moore Threads GPUs, reducing single inference latency to one-quarter of baseline
  • Adapted the RoPE operator for Moore Threads hardware, which lacks native complex arithmetic support; decomposed complex rotary position encoding into equivalent floating-point operations to leverage the hardware’s FP acceleration units
  • Ported open-source inference frameworks xdit and long-context-attention to Moore Threads by replacing cuDNN SDPA with muDNN SDPA implementation
  • Conducted a feasibility study on porting CUTLASS core operators by analyzing architectural differences between CUDA PTX and MUSA instruction sets

Convex Optimization Algorithm Engineer

Shenzhen Poisson Software Technology Co., Ltd. | Beijing, China | Jun. 2024 – Feb. 2025

  • Implemented L-BFGS quasi-Newton optimizer in C++17 with polymorphic memory allocators (PMR) and template compilation mechanisms, significantly reducing memory overhead and improving runtime performance
  • Built a dense linear algebra library in C++17 covering LU/QR decomposition, matrix arithmetic, and SIMD vectorized multi-issue optimization
  • Modernized the open-source pocketfft library to interface with STL generic containers, reducing integration complexity

Planning & Control Algorithm Engineer

Beijing Phigent Technology Co., Ltd. | Beijing, China | Jun. 2023 – Jun. 2024

  • Invented a quintic spline interpolation algorithm (C4 smoothness, O(n) complexity) with a dedicated sparse matrix solver, significantly improving motion control precision on high-curvature roads
  • Designed and implemented dynamically expandable sparse matrix libraries (CscMatrix/CsrMatrix/CooMatrix/LilMatrix) providing efficient data structures for large-scale convex optimization formulation
  • Proposed a Workspace+Task architectural design decoupling mathematical algorithms from business logic, enabling modular refactoring of the planning & control system

Software Engineer

Beijing Qingtian Truck Technology Co., Ltd. | Beijing, China | Jul. 2022 – Jun. 2023

  • Developed specialized sparse matrix solvers to improve matrix inversion efficiency
  • Introduced higher-order boundary condition control to enhance trajectory smoothness
  • Conducted Linux-based C++ development using the Bazel build system and Docker containerized deployment