Haoyang Ling is a second-year master’s student at the University of Michigan, pursuing a degree in Information Science with a focus on Big Data Analytics. His interests include artificial intelligence, natural language processing, information retrieval, and programmable matter. With a strong foundation in both computer science and data science, he has worked on many data science-related course projects that have real-life applications with an eagerness to apply knowledge and expertise to make meaningful contributions to this field.
MSc in Information Science (GPA 4.0/4.0), 2024
University of Michigan
BSc in Electrical and Computer Engineering (GPA 3.92/4.0), 2023
Shanghai Jiao Tong University
NPHardEval serves as a comprehensive benchmark for assessing the reasoning abilities of large language models (LLMs) through the lens of computational complexity classes. This repository contains datasets, data generation scripts, and experimental procedures designed to evaluate LLMs in various reasoning tasks. The benchmark offers several advantages compared with current benchmarks:
It contains a React-based summarizer built with ChatGPT whose primary purpose of the application is to generate a summary of paragraphs with highlighting of the relevant keywords. It aims to facilitate the comprehension of the original text and to enhance user trust in the generated summary. I also make efforts in protecting the personally identifiable information (PII) with Presidio.
Augmented graph data with random masking and self-attention after comparative analyses
to enhance the model’s robustness, surpassing baseline models in 5 out of 9 datasets.
This is a project for music recommendations with Spark.
This project involves working with a dataset from the Cook County Assessor’s Office in Illinois, which contains over 500,000 records describing houses sold in the area in recent years. The dataset is split into training and test sets.
In Part 1, Exploratory Data Analysis (EDA) ais performed. In Part 2, the focus is on advanced prediction with machine learning. The criterion for evaluation is L2 loss, and the baseline model is ridge regression.
Responsibilities include:
Responsibilities include:
Responsibilities include:
Responsibilities include:
My research interests include artificial intelligence, information retrieval, and programmable matter. If you are interested in working together or have any questions, please feel free to contact me using the information below. I would love to hear about any opportunities that may be a good fit for my skills and experience.