Yuanzheng (Jack) Yu 「虞源正」

| CV | Email | Google Scholar |
| GitHub | LinkedIn |

I am a freshman at the Carnegie Mellon University, advised by Cy Gage. I currently major in Statistics and Machine Learning. My research interest lies in applying computational and statistical knowledge in fields like sports analysis and sustainability to make people's lives better.

Goal: Build intelligent systems that meaningfully augment human decision-making and interaction in the real world.

Focus: Designing data-driven and learning-based systems that can reason over context, adapt to changing environments, and translate statistical insight into reliable real-world behavior.

Method: Combining statistical modeling, machine learning, and full-stack system design to build scalable, interpretable, and deployable intelligent systems.

Robots: I'm particularly interested in embodied and interactive AI systems, including robotics, where perception, learning, and decision-making must work together under real-world constraints.

Email: jacky2@andrew.cmu.edu


  News

  Publications
sym

Analysis and Prediction of CBA Player Position Data Characteristics Based on Machine Learning
Yuanzheng Yu*
AJAMS 2024

journal | PDF | abstract | bibtex

This study uses player performance statistics from the Chinese Basketball Association for six seasons, from 2017 to 2022, to evaluate the statistical characteristics of guards, forwards, and centers. 20 key performance indicators including points per game, rebounds, assists, shooting percentages, etc. are employed to provide empirical evidence to identify the singular traits that have come to be associated with each position. The study uses eight different machine learning models -- Decision Tree, Linear Discriminant Analysis, Multinomial Logistic Regression, Naive Bayes, Neural Network, Random Forest, Support Vector Machine, and XGBoost -- for position prediction of players from their performance data. From the results, it can be learned that guards are much more adept at scoring, assists, steals, and three-point shooting; forwards are better rebounders and three-point shooters; centers are proficient in rebounding, blocking, and field goal percentage. Among all the considered predictive models, Random Forest and XGBoost have the best test accuracies, while some models are clearly overfitted. This study suggests that using an ensemble machine-learning approach on performance data in the CBA context works particularly well when predicting player’s position. The study contributes to a better understanding of positional attributes in professional basketball and provides methodological references for future research in the field of sports analytics.

  @article{yu2024cba,
    title={Analysis and Prediction of CBA Player Position Data Characteristics Based on Machine Learning},
    author={Yu, Yuanzheng},
    journal={American Journal of Applied Mathematics and Statistics},
    volume={12},
    number={4},
    pages={75--79},
    year={2024},
    publisher={Science and Education Publishing},
    doi={10.12691/ajams-12-4-1},
    url={https://pubs.sciepub.com/ajams/12/4/1/index.html}
  }
  Projects
CarbonTrack_logo

CarbonTrack Youth Environmental Education Platform 「CarbonTrack校园碳账户」

Website Code | iOS App Code | Official IG

A student-led initiative that uses a personal carbon accounting platform designed for teenagers to help them cultivate sustainable living habits as part of youth environmental education. 10+ schools worldwide have used CarbonTrack in their campus.

  Reviewer Service
Not Yet 2025




Website template from here.