首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于并行 Apriori算法的电网日志故障挖掘系统
引用本文:潘,磊.基于并行 Apriori算法的电网日志故障挖掘系统[J].教育技术导刊,2009,19(9):186-189.
作者姓名:  
作者单位:南京工程学院 计算机工程学院,江苏 南京 211167
基金项目:南京工程学院基础研究专项基金项目(JCYJ201825)
摘    要:为提升电网系统日志故障诊断效率,在 Spark 环境下,基于并行 Apriori 算法构建分布式日志故障挖掘系统,针对电网系统相关设备后台日志数据,构建频繁项集并挖掘关联规则,形成系统故障规则库,用于系统故障诊断。系统对 50 万条真实日志数据进行检验。结果表明,该系统可有效发现相关故障日志。同时,该系统在 80G 内存、10 个虚拟节点的集群上以 50s 的速度完成了故障挖掘工作,准确率达 90%,同时提升了原单机系统效率,实现了预期效果。

关 键 词:日志挖掘  关联规则挖掘  频繁项集  Apriori  Spark  
收稿时间:2020-07-28

Power Log Fault Mining System Based on Parallel Apriori Algorithm
PAN Lei.Power Log Fault Mining System Based on Parallel Apriori Algorithm[J].Introduction of Educational Technology,2009,19(9):186-189.
Authors:PAN Lei
Institution:School of Computer Engineering,Nanjing Institute of Technology,Nanjing 211167,China
Abstract:In order to improve the efficiency of log fault diagnosis in the power system,this paper constructs a distributed log fault mining system based on the parallel Apriori algorithm in the Spark environment,which can build frequent itemsets and mine association rules for the log data of related equipment in the power system. A rule base of system faults is formed to diagnose system faults. A real log data containing 14 million records is verified on the system,and the results show that the system can find related fault logs effectively. At the same time,the system can complete the frequent item set mining in 20 seconds on a cluster of 80G memory and 10 virtual nodes,with an accuracy rate of 90% . Therefore the system can improve the efficiency of the original stand-alone system and achieve the expected results.
Keywords:log mining  association rule mining  frequent itemsets  Apriori  Spark  
点击此处可从《教育技术导刊》浏览原始摘要信息
点击此处可从《教育技术导刊》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号