About

Hey! I'm currently living in SF and building Instaplay, AI for multiplayer games.

Before this, I was an Associate Product Manager at Salesforce, where I spearheaded RL Environments and was Product Lead for Agentforce Tableau's agentic data analysis platform.

I received my Bachelor's and Master's degrees in Computer Science concurrently from Harvard University through the AB/SM program, focusing on ML Fairness and Distributed Systems and learning from Jim Waldo, Stephen Chong, Yiling Chen, and Cynthia Dwork.

Previously, I co-founded an LLMOps startup building custom LLM Evals and raised from Pear VC and Unusual Ventures.

Outside of work, I play tennis, eat spicy Sichuan food, and DJ.

Gary Wu portrait

Research

Graduate-level coursework and research projects.

Benchmarking Explainability of ImageNet Classification Against Data Suboptimality

CS 2822R: ML Interpretability - 2024

We explore interpretable machine learning techniques to help stakeholders understand ML models and their decision-making processes. This work focuses on capturing the underlying reasoning of black box methods through various interpretability approaches.

Stable Compute Marketplace Problem with Multi-dimensional Preference Ordering

CS 236r: Advanced Topics in Econ/CS - 2024

We address the challenge of creating stable compute marketplaces for machine learning, where users can rent computational resources for training large models. Our work implements pricing and matching algorithms that handle multi-dimensional preference reporting in practice.

Incentive-Aligned Agent Selection Strategy for Federated Learning

CS 243: Advanced Networking for ML Systems - 2023

We present PRIMES, a novel incentive-aligned approach for federated learning that addresses the misreporting and free-riding problem. By selecting clients based on next-step loss and paying them based on global performance, we incentivize truthful reporting and high-quality contributions.

Homomorphic Encryption for Neural Network Deployment

CS 254: Advanced Topics in Security - 2025

We implement a homomorphic encryption-based neural network using polynomial activation approximation to enable secure inference on encrypted data. We compare the security posture against standard unencrypted networks, focusing on membership inference, model extraction, and gradient-based input recovery attacks.

Achieving Fairness in the Multi-Round Online Gaming Context

CS 226: Algorithmic Fairness - 2024

We investigate how fairness breaks down in online learning settings with strategic gaming, where agents may modify their reported features for favorable outcomes. We introduce algorithmic changes to ensure fairness across multiple metrics, accounting for different cost functions between advantaged and disadvantaged agents.

Metric-Based LLM-Router Using KMeans Clustering

Neuro 240: Biological & Artificial Intelligence - 2024

We explore the feasibility of Large Language Model routers that can intelligently recommend the best LLM for a given task. Using KMeans clustering on prompts, we build a model to predict which LLM will provide the best response.

Experience

Associate Product Manager - Salesforce

1 of 12 APM's in cohort. Pitched and led RL Environments. Product Lead for Agentforce Tableau, building the agentic data analysis platform.

Associate Product Manager Intern - Salesforce

Owned multi-modal prompt engineering integration from 0-1: led customer calls, wrote requirements, drove stakeholder alignment. Convinced execs to increase project priority and shipped API MVP, unlocking Agentforce's Service Agent. Doubly operated as TPM, spearheading API design and partnered with multiple engineering teams. Collaborated with Sales/AE teams to build and pitch customized demos for client C-suite audience in $XXm ACV AI deals.

Co-Founder & Head of Product - Verita

Founded LLM Evals startup with Harvard classmates - powering businesses to test and improve their fine-tuned LLM's. Engineered MVP, conducted user/competitive research and led product growth. Raised funding from Pear VC and Unusual Ventures and participated in Unusual Academy.

Product Manager Intern - Teradata

Collaborated with Engineering to design a migration system from PySpark to Teradata's ML python package, enabling users to port over existing jobs, enabling and increasing onboarding efficiency. Engineered end-to-end machine learning (XGBoost) product demo within the Teradata MLOps platform, from data preparation and model training to automating deployment & evaluation.

Software Engineering Intern - Alpaca Markets

Engineered a Slack App using Python, PostgreSQL, and Flask, integrating multiple API's, OAuth 2.0, and backend logic, allowing users to submit trades using Slack commands, such that "/buy AAPL 10" would buy the user 10 shares of Apple. Wrote 3 technical articles of algorithmic trading strategies, reaching 1000's of customers. Created feature requirements and managed end-to-end development of the above projects.

Contact