Head of AI · LLM Platform & Decision Systems Leadership

PHILIP CHEUNG

AI-first technology leader who ships production LLM systems end-to-end: strategy, architecture, governance, and runtime.

Head of AI at Weidmann & Cie. AG. Leads Qorinix, LaSpend, and Fixxmi platform delivery with multi-provider AI routing and audit-grade decision workflows.

MSc Computer Science, AI & Data Science (Merit)

  • Builds enterprise AI platforms that connect business goals to deployable systems
  • LLM routing · RAG · guardrails · human-review gates · sub-100ms inference engineering
  • Leads cross-functional AI teams across product, compliance, and engineering
Philip Cheung

Professional Summary

Philip Cheung is an AI-first technology leader focused on turning LLM capability into measurable business outcomes. He operates at the Head of AI level across strategy, architecture, execution, and governance.

He builds production AI systems for high-stakes environments: multi-provider model routing, policy-aware prompt orchestration, RAG with evidence traceability, human-review escalation, and cost/performance telemetry.

At Weidmann & Cie. AG, he leads Qorinix, LaSpend, and Fixxmi AI platform execution, combining low-latency inference engineering with auditability and compliance-ready controls. He regularly aligns engineering design with legal, risk, and operations stakeholders.

His background spans AI engineering, platform design, and technical leadership across cloud-native systems. He is strongest when a company needs both AI depth and delivery discipline to move from prototype to resilient production.

Core Competencies

Head of AI Leadership

AI roadmap, org design, operating model, cross-functional execution

LLM Platform Architecture

Model routing, prompt registry, RAG pipelines, reliability engineering

AI Governance & Safety

Guardrails, policy enforcement, audit trail, human-in-the-loop approvals

ML/Inference Engineering

Sub-100ms serving, scaling, observability, continuous evaluation

Product & Delivery

Translate business priorities into shipped AI capabilities and KPIs

Stakeholder Communication

Executive narrative, hiring support, architecture reviews, mentoring

Notable Achievements

Head of AI Leadership

Defined and executed AI roadmap across platform, governance, and delivery. Built an operating model where engineering, product, compliance, and business stakeholders align on one decision system.

Qorinix LLM Program

Led end-to-end design, training, and deployment of Qorinix LLM for production decision support. Focused on reliability, latency, and auditability in a high-stakes operating context.

Multi-Provider AI Platform

Delivered model routing, fallback policy, prompt registry, and evaluation workflow so teams can adopt AI safely without locking into a single provider.

Audit-Grade AI Governance

Implemented policy checks, evidence traceability, and human-review escalation for regulated decision flows. Every material decision can be traced, reviewed, and explained.

Inference & Runtime Performance

Architected low-latency inference services with observability and failover patterns, enabling stable production behavior under live workload pressure.

Team Building & Mentorship

Mentored engineers and set technical standards for AI delivery, including model lifecycle management, release discipline, and cross-functional review rituals.

Research Foundation

MSc in AI & Data Science (Merit), with work spanning LLM systems, machine learning, and production-grade AI architecture.

Business Impact

Converted AI from experimentation to operational capability with measurable impact on decision speed, engineering quality, and organisational confidence in AI rollout.

Professional Experience

Head of AI · LLM Platform & Decision Systems Architect

Weidmann & Cie. AG

London, UK

Dec 2024 - Present

Responsibilities

  • Owns the AI strategy and execution roadmap across Qorinix, LaSpend, and Fixxmi product lines.
  • Built a multi-provider LLM platform with routing, fallback policy, prompt registry, and cost/latency observability.
  • Established production RAG workflows with evidence packaging and policy checks for high-stakes decisions.
  • Implemented guardrails and human-review escalation patterns for regulated or sensitive workflows.
  • Led architecture reviews and translated leadership priorities into delivery plans and release gates.
  • Shipped low-latency inference patterns with reliability controls, canary rollout, and rollback playbooks.

Key Outcomes

  • Operationalised AI decision workflows with audit traceability and governance checkpoints.
  • Standardised model lifecycle practices: evaluation baselines, release criteria, and drift monitoring.
  • Improved cross-team AI delivery speed by introducing reusable platform capabilities instead of one-off integrations.

Skills & Technologies

Head of AI
LLM Platform
Model Routing
RAG
AI Governance
MLOps
Evaluation
Observability
Cloudflare
AWS
Lead AI Engineer & Technical Architect

Pacific Cloud Computing Ltd.

Hong Kong & Remote UK

Jan 2015 - Dec 2024

Responsibilities

  • Led AI transformation and built intelligent systems processing millions of predictions daily.
  • Designed distributed inference and data pipelines with production reliability targets.
  • Built RAG and NLP systems for faster information retrieval and richer decision support.
  • Introduced MLOps practices: retraining workflow, staged rollout, and continuous quality checks.
  • Mentored engineers and established architecture and coding standards.

Key Outcomes

  • Scaled ML services into stable production operations.
  • Reduced retrieval latency and improved information quality for internal teams.
  • Delivered enterprise systems with high availability and operational discipline.

Skills & Technologies

Python
FastAPI
LangChain
NLP
RAG
MLOps
AWS
Docker
PostgreSQL
Redis
Senior Product Manager (Technical) / Web Manager

Groupon.com

Hong Kong

Apr 2013 - Dec 2014

Responsibilities

  • Led strategic account management and merchant coordination.
  • Managed project planning, estimation, and execution tracking.
  • Improved operational process and delivery quality.

Key Outcomes

  • Delivered measurable revenue and operational improvements.

Skills & Technologies

Product Management
Operations
Planning
Head of AI · Platform Delivery

Flagship AI Programs

A curated view of AI leadership, LLM platform architecture, and production decision systems delivered by Philip.

Head of AI Operating Model
Expand
Leadership

Head of AI Operating Model

Defined an execution model that aligns product, engineering, compliance, and business stakeholders around one AI roadmap. Built delivery rhythms, review gates, and accountability paths so AI work moves from prototype to reliable production.

Qorinix LLM, In-House Program Leadership
Expand
LLM Program

Qorinix LLM, In-House Program Leadership

Led end-to-end design, training, and deployment of Qorinix LLM. Emphasis was on controlled runtime behavior, policy-safe decision support, and measurable latency/throughput gains for high-stakes workflows.

Multi-Provider LLM Routing Control Plane
Expand
Platform

Multi-Provider LLM Routing Control Plane

Built model routing and fallback policy across providers, with prompt versioning and performance/cost visibility. This reduced lock-in risk and allowed teams to scale usage without fragile one-off integrations.

Audit-Grade AI Decision Loop
Expand
Governance

Audit-Grade AI Decision Loop

Designed closed-loop decision flow with policy checks, evidence retrieval, action runtime, and audit ledger. Every critical decision is traceable, reviewable, and aligned with governance requirements.

Sub-100ms Inference Mesh
Expand
Infra

Sub-100ms Inference Mesh

Engineered low-latency inference services with routing, autoscaling, and health-weighted failover. Supports high-volume production usage while maintaining stable performance and operational resilience.

Real-Time AI Operations Dashboard
Expand
Operations

Real-Time AI Operations Dashboard

Built operational observability across model performance, cost, policy events, and service reliability. The dashboard enables faster incident response and executive-level reporting on AI system health.

Unified AI Delivery Stack
Expand
Platform

Unified AI Delivery Stack

Integrated Python, TypeScript, FastAPI, Next.js, PostgreSQL/TimescaleDB, Redis, Docker, and cloud services into a cohesive AI delivery stack. Designed to support both rapid iteration and production-grade controls.

LLM-Augmented Research to Production Pipeline
Expand
LLM Program

LLM-Augmented Research to Production Pipeline

Converted advanced research concepts into deployable systems through evaluation discipline, release gates, and runtime monitoring. Focus on repeatability and business impact, not one-off experiments.

Skills & Expertise

Programming & Frameworks

Python (FastAPI, data/ML tooling)93%
TypeScript / JavaScript91%
React / Next.js90%
SQL (PostgreSQL, TimescaleDB)87%
C++ (performance-critical services)80%
MQL5 / domain scripting78%
HTML/CSS / Tailwind / shadcn/ui90%

AI/ML Capabilities

LLM Platform Architecture95%
Model Routing & Fallback Orchestration94%
RAG Pipelines & Evidence Traceability92%
Prompt Registry / Policy Prompting90%
Evaluation & Model Quality Gates89%
Deep Learning (Transformers, PyTorch)88%
NLP / Sentiment / Information Extraction87%
MLOps (Canary, Drift Monitoring, Rollback)86%

AI Leadership & Strategy

Head of AI Roadmap Design95%
Cross-functional AI Operating Model93%
AI Governance & Risk Controls92%
Executive Communication & Decision Support90%
Hiring, Mentorship, and Team Scaling89%

LLM Platform & Runtime

Low-Latency Inference Architecture92%
API Gateways & Runtime Control Plane90%
Observability (Tracing, Metrics, Cost)90%
Reliability Engineering & Failover88%
Security-Aware Service Design86%

Backend & Data

FastAPI / Node.js / Service APIs90%
PostgreSQL / Redis / MongoDB88%
Vector and Hybrid Retrieval Patterns86%
Data Pipelines / ETL / Streaming84%
Workflow Automation82%

Cloud & Delivery Tooling

Cloudflare (Pages, Workers, DNS)90%
AWS (EC2, Lambda, RDS, S3)86%
Docker / CI-CD / Release Workflow86%
Git / GitHub / Branch Strategy92%
Monitoring & Incident Response85%

AI System Design

AI Solution Architecture94%
Governance by Design92%
Evaluation Frameworks & Benchmarks90%
Cost-Performance Optimisation88%
Production Rollout Playbooks87%

Domain & Stakeholder Skills

Regulated/High-Stakes Decision Workflows90%
Business-to-Architecture Translation92%
Stakeholder Alignment (Product/Risk/Legal)89%
Multilingual Communication (EN/Cantonese/Mandarin)88%
AI Adoption Enablement90%

Education & Certifications

Education

MSc in Computer Science (AI & Data Science, Merit)
2023 - 2025

AI Systems, Applied ML, and LLM Decision Workflows

University of Wolverhampton

Visit Website

Focused on production-oriented AI, machine learning system design, and evidence-based decision support architecture.

Completed
Bachelor of Business Administration (BBA)
1995

Marketing (with IS minor)

The Hong Kong University of Science & Technology

Visit Website

Foundation in business strategy, communication, and cross-functional leadership.

Completed

Certifications & Qualifications

Gemini Certified Educator
2025

Applied AI enablement and practical adoption coaching.

Google Cloud MLOps Specialization
2024

Google Cloud

Visit Website

Production lifecycle practices for ML systems.

Harvard CS50x
2024

Harvard University

Visit Website

Computer science foundations and software engineering discipline.

Licensing Examination for Securities and Futures Intermediaries (LE)
2021

SFC / HKSI

Visit Website

Passed Papers 1, 7, 8, 12.

Frequently Asked Questions

Common hiring and execution questions about Philip's Head of AI profile

Resources & Playbooks

Head of AI 90-Day Execution Framework
Playbook2026
Author: Philip Cheung
philip.pm

A practical framework for first-90-day AI leadership: baseline assessment, platform foundation, and high-value use-case delivery plan.

LLM Platform Governance Design
Playbook2026
Author: Philip Cheung
philip.pm

Policy-first architecture for LLM deployments, covering prompt controls, human-review escalation, and auditability requirements.

Evaluation and Release Gates for AI Systems
Playbook2026
Author: Philip Cheung
philip.pm

How to set measurable release criteria, benchmark drift, and maintain quality over time in production AI systems.

Schedule an Interview
Book a time slot that works for you using my Calendly scheduling system.

Loading booking calendar...

If the embedded widget is blank, please use direct booking.

Open Calendly

Contact Me

Email

Use contact form only

Phone

Use contact form only

Location

London, United Kingdom