首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
目前协同过滤被广泛应用于数字图书馆、电子商务等领域的个性化服务系统.最近邻算法则是最早提出和最主要的协同过滤推荐算法,但用户评分数据稀疏性严重影响推荐质量.针对上述问题,提出了一种基于Rough集理论的最近邻协同过滤算法,以用户评分项并集作为用户相似性计算基础,并将非目标用户区分为无推荐能力和有推荐能力两种类型;对于无推荐能力用户不再计算用户相似性以改善推荐实时性,对于有推荐能力用户则提出一种基于Rough集理论的评分预测方法来填补用户评分项并集中的缺失值,从而降低数据稀疏性.实验结果表明新算法能有效提高推荐质量.  相似文献   

2.
Information filtering is an area getting more important as we have long been flooded with too much information, where product brokering in e-commerce is a typical example. Systems which can provide personalized product recommendations to their users (often called recommender systems) have gained a lot of interest in recent years. Collaborative filtering is one of the commonly used approaches which normally requires a definition of user similarity measure. In the literature, researchers have proposed different choices for the similarity measure using different approaches, and yet there is no guarantee for optimality. In this paper, we propose the use of machine learning techniques to learn the optimal user similarity measure as well as user rating styles for enhancing recommendation acurracy. Based on a criterion function measuring the overall prediction error, several ratings transformation functions for modeling rating styles together with their learning algorithms are derived. With the help of the formulation and the optimization framework, subjective components in user ratings are removed so that the transformed ratings can then be compared. We have evaluated our proposed methods using the EachMovie dataset and succeeded in obtaining significant improvement in recommendation accuracy when compared with the standard correlation-based algorithm.  相似文献   

3.
基于组合加权评分的Item-based协同过滤算法   总被引:1,自引:0,他引:1  
针对Item-based协同过滤算法中用户评分数据稀疏性严重影响推荐质量的问题,提出一种基于组合加权评分的Item-based协同过滤算法,以用户评分项并集作为用户相似性计算基础,并提出一种组合加权评分方法来对并集中的未评分项进行计算和填补,从而降低了数据稀疏性。实验结果表明该算法能有效提高推荐质量。  相似文献   

4.
适应用户兴趣变化的协同过滤增量更新机制   总被引:1,自引:0,他引:1  
高维、稀疏的用户-项目评分矩阵对基于项目的协同过滤推荐算法造成严峻的可扩展性问题.传统的解决方法是离线计算项目相似性并保存在系统中以供算法调用,但是不能充分利用最新评分数据以体现用户兴趣的变化.针对上述问题,提出了适合在线应用的协同过滤项目相似性增量更新机制,使得推荐系统在当前用户提交项目评分之后,能够实时完成相应项目与其他项目之间的相似性数据更新,从而推荐系统可以基于最新的项目相似性数据进行推荐处理,以适应用户兴趣的变化.实验结果表明,本文提出的项目相似性增量更新机制能够有效提高基于项目的协同过滤算法可扩展性.  相似文献   

5.
针对高校图书馆场景存在的无显式反馈、借阅数据稀疏和传统推荐算法效果不好问题,提出基于时间上下文优化协同过滤的推荐算法,包含读者阅读行为评分、时间上下文和内容兴趣变迁3个要素。在数据准备阶段,通过制定评分转化规则、设计标准化函数来构建一种基于用户行为操作的兴趣评分模型,以解决用户评分缺失问题;在推荐召回阶段,提出一种非线性的时间衰减模型来对评价矩阵进行优化,以提高推荐效果;在推荐排序阶段,提出一种兴趣捕捉模型对召回结果按照图书类别进行精排序,以缓解数据稀疏问题并进一步提高推荐效果。实验结果表明,文章提出的优化算法在Top5的F值较未经优化的协同过滤提升增幅达141%。  相似文献   

6.
数字图书馆中主动信息过滤系统的构建研究   总被引:6,自引:0,他引:6       下载免费PDF全文
设计了一个结合使用协作过滤和基于内容过滤的主动信息过滤的实验系统。其结构框架的主要部分有:智能代理、检索服务器、用户需求文档数据库、过滤服务器、结果处理器和推送服务器。它采用机器学习的机制来预测用户新的兴趣。  相似文献   

7.
基于属性值偏好矩阵的协同过滤推荐算法   总被引:7,自引:2,他引:5  
传统的协同过滤推荐算法面临用户评分数据稀疏性和冷启动问题的挑战.针对上述问题,提出了基于属性值偏好矩阵的协同过滤推荐算法,首先采用奇异值分解(SVD)对用户-项目评分矩阵降维得到目标用户的初始邻居用户集,生成新的用户-项目评分矩阵;然后将用户评分映射到相应的项目属性值上,生成每个用户的属性值偏好矩阵,并基于属性值偏好矩阵进行用户相似性度量,从而缓解了评分数据稀疏性;将新项目的属性值与用户的属性值偏好矩阵进行匹配,从而找出匹配度最高的前N个用户作为新项目的推荐受众.实验结果表明了该算法的有效性.  相似文献   

8.
为了帮助用户有效地发现、过滤和利用信息 ,信息过滤技术应运而生。协作过滤作为其中一种技术也得到迅速发展 ,但传统的协作过滤算法存在矩阵稀疏性等问题 ,影响预测效果。本文给出一种基于信息项的新算法 ,它从矩阵的列来考虑 ,能有效地解决矩阵稀疏性等问题 ,并提高预测准确性。  相似文献   

9.
利用云模型改进基于项目的协同过滤推荐算法   总被引:1,自引:1,他引:1  
基于项目的协同过滤推荐算法能有效解决传统的基于用户的协同过滤推荐系统可扩展性差、缺乏稳定性的缺点,但仍然不能解决数据稀疏的问题,在数据极度稀疏的情况下,传统的项目相似性度量方法无法实现准确度量,导致推荐效果急剧下滑。本文借鉴基于云模型的云相似性度量方法来实现基于知识层面的项目相似性度量,提出了一种新的基于项目的协同过滤推荐算法。实验结果表明即使在数据极度稀疏的情况下,改进后的算法仍然能取得较好的推荐效果。  相似文献   

10.
陶剑文  潘红艳 《情报学报》2008,27(2):199-204
推荐系统是电子商务系统中最重要的技术之一.随着电子商务系统用户数目和商品数目日益增加,在整个商品空间上用户评分数据极端稀疏,传统的相似性度量方法均存在各自的弊端,导致推荐系统的推荐质量急剧下降.针对用户评分数据极端稀疏情况下传统相似性度量方法的不足,本文提出了一种基于相似项目与用户评分预测的协同过滤推荐算法,综合利用相似项目和相似用户评分信息预测用户对未评分项目的评分.通过聚类算法形成用户候选近邻集,减小了算法搜索空间,降低了最近邻用户的搜索时间,从而增强了算法的扩展性.实验结果表明,本算法可以有效解决用户评分数据极端稀疏情况下传统协同推荐算法存在的问题,显著提高推荐系统的推荐质量.  相似文献   

11.
针对传统协同过滤推荐算法的不足,依据现实生活经验,认为在协同过滤推荐过程中考虑用户的专家信任因素十分必要。详细阐述专家信任的概念以及利用用户评分数据计算专家信任度的方法,提出一种基于专家优先信任的协同过滤推荐算法。在公开数据集GroupLens上的实验结果表明,该算法预测用户评分的精度和成功率都明显优于传统的最近邻法。  相似文献   

12.
Collaborative filtering is a general technique for exploiting the preference patterns of a group of users to predict the utility of items for a particular user. Three different components need to be modeled in a collaborative filtering problem: users, items, and ratings. Previous research on applying probabilistic models to collaborative filtering has shown promising results. However, there is a lack of systematic studies of different ways to model each of the three components and their interactions. In this paper, we conduct a broad and systematic study on different mixture models for collaborative filtering. We discuss general issues related to using a mixture model for collaborative filtering, and propose three properties that a graphical model is expected to satisfy. Using these properties, we thoroughly examine five different mixture models, including Bayesian Clustering (BC), Aspect Model (AM), Flexible Mixture Model (FMM), Joint Mixture Model (JMM), and the Decoupled Model (DM). We compare these models both analytically and experimentally. Experiments over two datasets of movie ratings under different configurations show that in general, whether a model satisfies the proposed properties tends to be correlated with its performance. In particular, the Decoupled Model, which satisfies all the three desired properties, outperforms the other mixture models as well as many other existing approaches for collaborative filtering. Our study shows that graphical models are powerful tools for modeling collaborative filtering, but careful design is necessary to achieve good performance.  相似文献   

13.
Collaborative filtering is a popular recommendation technique. Although researchers have focused on the accuracy of the recommendations, real applications also need efficient algorithms. An index structure can be used to store the rating matrix and compute recommendations very fast. In this paper we study how compression techniques can reduce the size of this index structure and, at the same time, speed up recommendations. We show how coding techniques commonly used in Information Retrieval can be effectively applied to collaborative filtering, reducing the matrix size up to 75 %, and almost doubling the recommendation speed. Additionally, we propose a novel identifier reassignment technique, that achieves high compression rates, reducing by 40 % the size of an already compressed matrix. It is a very simple approach based on assigning the smallest identifiers to the items and users with the highest number of ratings, and it can be efficiently computed using a two pass indexing. The usage of the proposed compression techniques can significantly reduce the storage and time costs of recommender systems, which are two important factors in many real applications.  相似文献   

14.
[目的/意义] 为解决高校图书推荐过程中面临的“数据稀疏”和“冷启动”问题,研究表明:优化读者评价矩阵和相似度模型是提高图书推荐质量的关键。[方法/过程] 提出一种协同过滤改进方法,以图书分类为项目生成用户评价矩阵,并考虑借阅方式、借阅时间和图书相似度对用户兴趣度的影响,优化矩阵中的样本数据;同时,在计算读者相似度时融入读者特征和图书特征。[结果/结论] 实验结果表明,该方法可有效解决“数据稀疏”和“冷启动”问题,显著降低计算量。与基本协同过滤和聚类改进的协同过滤方法相比,无论是在推荐准确率还是在用户满意率上都有较大的提高,综合推荐效果更好。  相似文献   

15.
基于群体兴趣偏向度的数字图书馆协同过滤技术研究*   总被引:1,自引:1,他引:1  
针对数字图书馆协同过滤推荐系统所面临的用户评分数据稀疏性问题,提出群体兴趣偏向度的计算方法,对用户-项目评分矩阵空缺值进行预测。实验结果表明,该算法能有效提高推荐质量。  相似文献   

16.
协同过滤是推荐系统中广泛使用的最成功的推荐技术,但是随着系统中用户数目和商品数目的不断增加,整个商品空间上的用户评分数据极端稀疏,传统协同过滤算法的最近邻搜寻方式存在很大不足,导致推荐质量急剧下降。针对这一问题,本文提出了一种基于项类偏好的协同过滤推荐算法。首先为目标用户找出一组项类偏好一致的候选邻居,候选邻居与目标用户兴趣相近,共同评分较多,在候选邻居中搜寻最近邻,可以排除共同评分较少用户的干扰,从整体上提高最近邻搜寻的准确性。实验结果表明,该算法能有效提高推荐质量。  相似文献   

17.
When speaking of information retrieval, we often mean text retrieval. But there exist many other forms of information retrieval applications. A typical example is collaborative filtering that suggests interesting items to a user by taking into account other users’ preferences or tastes. Due to the uniqueness of the problem, it has been modeled and studied differently in the past, mainly drawing from the preference prediction and machine learning view point. A few attempts have yet been made to bring back collaborative filtering to information (text) retrieval modeling and subsequently new interesting collaborative filtering techniques have been thus derived. In this paper, we show that from the algorithmic view point, there is an even closer relationship between collaborative filtering and text retrieval. Specifically, major collaborative filtering algorithms, such as the memory-based, essentially calculate the dot product between the user vector (as the query vector in text retrieval) and the item rating vector (as the document vector in text retrieval). Thus, if we properly structure user preference data and employ the target user’s ratings as query input, major text retrieval algorithms and systems can be directly used without any modification. In this regard, we propose a unified formulation under a common notational framework for memory-based collaborative filtering, and a technique to use any text retrieval weighting function with collaborative filtering preference data. Besides confirming the rationale of the framework, our preliminary experimental results have also demonstrated the effectiveness of the approach in using text retrieval models and systems to perform item ranking tasks in collaborative filtering.  相似文献   

18.
基于社会化标签系统的个性化信息推荐探讨   总被引:4,自引:0,他引:4  
针对用户个人特征并向其提供准确恰当信息的个性化信息推荐研究,一直是学术界和产业界所关注的热点。结合后控词表,对用户分散的、个性化的标注进行处理,并将用户兴趣用向量表示,然后借鉴协同过滤算法的思想,寻找出相似用户集及其内部的资源集。在此基础上,采用相对匹配策略,提出一种基于社会化标签系统的个性化推荐方法。  相似文献   

19.
随着数字图书馆的文献数量和种类高速增长,数字图书馆用户迫切需要有效的个性化推荐工具来帮助其在众多文献中发现对其有价值的文献。协同过滤方法是推荐系统广泛采用的推荐技术,但数据稀疏性是影响其推荐效果的关键因素之一。在文献推荐领域,这一问题更加显著。文章提出了一个利用文献间共被引关系的协同过滤文献推荐方法。实验表明所提方法具有较好的推荐性能。  相似文献   

20.
基于Hadoop开源分布式计算框架和Mahout协同过滤推荐引擎技术构建图书推荐引擎系统,并利用云模型和Pearson系数对传统协同过滤推荐算法进行改进,改善传统单机推荐算法在高维稀疏矩阵上进行运算所导致的系统性能不佳及推荐结果不准确的问题。利用实验对分布式推荐平台的整体性能及改善后的协同过滤推荐算法进行测试评估,发现当虚拟机节点不断增加时,协同过滤推荐引擎的计算时间不断减少,这表明推荐引擎系统的总体性能较传统单机推荐引擎得到提升;利用MAE分别对原始协同过滤推荐效果和改进后的推荐算法进行测评,发现改进后的推荐引擎算法的推荐准确率较改进前提高13.1%。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号