PhD. Candidate in Computer Science and Technology, 2021-now
MEng. in Computer Science and Technology, 2018-2021
BEng. in Electrical Engineering and its Automation, 2018
Predicting conversion rate (e.g., the probability that a user will purchase an item) is a fundamental problem in machine learning based recommender systems. However, accurate conversion labels are revealed after a long delay, which harms the timeliness of recommender systems. Previous literature concentrates on utilizing early conversions to mitigate such a delayed feedback problem. In this paper, we show that post-click user behaviors are also informative to conversion rate prediction and can be used to improve timeliness. We propose a generalized delayed feedback model (GDFM) that unifies both post-click behaviors and early conversions as stochastic post-click information, which could be utilized to train GDFM in a streaming manner efficiently. Based on GDFM, we further establish a novel perspective that the performance gap introduced by delayed feedback can be attributed to a temporal gap and a sampling gap. Inspired by our analysis, we propose to measure the quality of post-click information with a combination of temporal distance and sample complexity. The training objective is re-weighted accordingly to highlight informative and timely signals. We validate our analysis on public datasets, and experimental performance confirms the effectiveness of our method.
From an engineering perspective, a design should not only perform well in an ideal condition, but should also resist noises. Such a design methodology, namely robust design, has been widely implemented in the industry for product quality control. To achieve data-efficient robust design, we propose Robust Inverse Design under Noise (RID-Noise), which can utilize existing data to train a conditional invertible neural network. With the visual results from experiments, we clearly justify how RID-Noise works by learning the distribution and robustness from data. Further experiments on several real-world benchmark tasks with noises confirm that our method is more effective than other state-of-the-art inverse design methods.
Conversion rate (CVR) prediction is one of the most critical tasks for digital display advertising. However, conversions usually do not happen immediately after a user click. We propose Elapsed-Time Sampling Delayed Feedback Model (ES-DFM), which models the relationship between the observed conversion distribution and the true conversion distribution.
Program Committee Member (PC member): AAAI'21, AAAI'22, AAAI'23
Advanced C++ Programming. Fall, 2018.
Digital Signal Processing. Fall, 2021.
Scholar Award: NeurIPS'22, AAAI'22
ACM/ICPC Asian Regional, HongKong Onsite, Gold Award, Problem A First Blood, 2016
China Collegiate Programming Contest(CCPC), South China Invitational Contest, Gold Award, 2016
Alibaba Global Scheduling Algorithm Competition, Champion(1/2116), 2018