情报科学 ›› 2021, Vol. 39 ›› Issue (8): 126-131.

• 业务研究 • 上一篇    下一篇

基于时间特性的信息用户行为特征挖掘研究 

  

  • 出版日期:2021-08-01 发布日期:2021-08-05

  • Online:2021-08-01 Published:2021-08-05

摘要: 【目的/意义】当前的信息用户行为特征挖掘方法无法将数据统一整合,且无法准确计算出时间序列内滑动
窗口内的数据均值,导致特征挖掘精度偏低。为此,提出了基于时间特性的信息用户行为特征挖掘方法。【方法
/
程】计算时间序列内滑动窗口内的数据均值,得出起始序列向量,再将用户行为划分成若干等值的时间片,通过取
样统计各种用户群体,得出用户的行为状态定性。以平均查询频率作为标准,观察用户的查询行为特征,输出信息
挖掘结果。【结果
/结论】实验结果表明:所提方法挖掘出夜晚用户行为信息多于白天,休息日比工作日多,且在网络
波动下,虽然耗时增加,不过处于合理范围内。与传统方法相比,所提方法具有更低的挖掘误差,应用性较强。以
上实验结果证明了基于时间特性的信息用户行为特征挖掘研究能获取更准确的用户行为意向,提高用户兴趣预测
准确度,优化网络服务效果。【创新
/局限】为进一步提高网络信息特征挖掘的效率,后续将重点研究多个网络用户
行为的并行分析,使该方法更适用于网络海量信息处理。

Abstract: Purpose/significanceFor the current information user behavior feature mining methods can not integrate the data, and
can not accurately calculate the mean value of the data in the sliding window of the time series, resulting in the low accuracy of feature mining. Therefore, a method of mining information user behavior characteristics based on time characteristics is proposed.
Method/processThe average value of the data in the sliding window within the time series is calculated to obtain the initial sequence vector,and then the user behavior is divided into several equivalent time slices. Through sampling and statistics of various user groups, the qualitative status of user behavior is obtained. The average query frequency is taken as the standard to observe the user's query behav⁃ior characteristics and output the information mining results.Result/conclusionThe experimental results show that the proposed method can mine more user behavior information at night than in the day, and more rest days than working days. In the case of network fluctuation, the time consumption is increased, but it is within a reasonable range. Compared with the traditional method, the proposed
method has lower mining error and strong applicability. The above experimental results prove that the research of information user be⁃havior characteristic mining based on time characteristics can obtain more accurate user behavior intention, improve the accuracy of user interest prediction and optimize the effect of network service.
Innovation/limitationIn order to further improve the efficiency of network information feature mining, we will focus on the parallel analysis of multiple network users' behavior in the future, so that this method is more suitable for massive network information processing.