CV
Tong Zhu
Summary
Currently employed at UCLA. A PhD Candidate in Biostatistics.
Education
- Doctor of Philosophy (PhD) in BiostatisticsUniversity of California, Los AngelesCourses: Uncertainty in LLMs, Agent-based AI
- Master of Science in Computer Science2024-05-01Northeastern UniversityGPA: 4.0Courses: Algorithm, Distributed Database
- Master of Science in Statistics and Operations Research2020-06-01University of North Carolina at Chapel HillCourses: Applied Statistics, Machine Learning, Time Series Forecasting
Work Experience
- Software Engineer Intern2024-06-01 - 2024-08-01Amazon
- Developed key features for Amazon B2B in AWS platform to automate event-driven applications using EventBridge
- Implemented enhancements in the Visibility Service using Java, JavaScript and TypeScript, improving real-time tracking and monitoring of important business transactions across the platform
- Conducted integration tests for all Outbound services and events, ensuring smooth and error-free deployment
- Designed and tested dashboards using CloudWatch to monitor service performance metrics
- Data Scientist2021-02-01 - 2022-08-01ByteDance Ltd.
- Collaborated with cross-function team to deploy dynamic subscription tool to improve customer experience in platform
- Verified product feasibility and deployed XGBoost model to select the important indicators for customers to help design product features
- Applied interrupted time series to estimate the potential revenue impact, and evaluate the risk of restricting creator's quotes strategy in advance
- Conducted attribution analysis to evaluate marketing campaign performance which provided powerful evidence to spur on product marketing
- Data Scientist2020-08-01 - 2020-12-01Blingby
- Built data ETL pipelines through Apache Spark to transform raw data into features by combining business sense and statistical knowledge
- Developed, maintained web-based dashboards with Tableau to update daily data analysis report, which increased 20% daily work efficiency
- Machine Learning Intern2020-06-01 - 2020-08-01TouchSuite
- Queried and cleaned terabyte-sized order data from Azure SQL using pyodbc
- Conducted online analytical processing (OLAP) to display critical sales performance from different dimensions
- Developed item-based approaches to handle cold-start problems and tuned the model hyper-parameters through SparkML cross-evaluation toolbox which reduced root mean square errors by 10%
Skills
Programming Languages
- Python
- R
- Java
- JavaScript
- SQL
Tools and Platforms
- MySQL
- Tableau
- AWS
- Spark
Publications
- Incentivizing Truthful Language Models via Peer Elicitation Games2025NeurIPS 2025This paper introduces Peer Elicitation Games (PEG), a training-free, game-theoretic framework for aligning LLMs.
Teaching
- Biostat 203A2025University of California, Los Angeles, Biostatistics DepartmentRole: TAIntroduction to Data Management and Statistical Computing
- Biostat 2012025University of California, Los Angeles, Biostatistics DepartmentRole: TATopics in Applied Regression
Portfolio
- Portfolio item number 1