Abstract: In this paper, based on the open source UC dataset, the original dataset was first subjected to data preprocessing operations such as logarithmic transformation, outlier and missing value ...