引用本文
  • 朱春江,陆宇旻,李陶深,杜衡斌,唐晟.基于Web挖掘自动分类器的设计与实现[J].广西科学院学报,2008,24(4):310-312,316.    [点击复制]
  • ZHU Chun-jiang,LU Yu-min,LI Tao-shen,DU Heng-bin,TANG Sheng.Design and Implementation of the Auto Classifier based on Web Mining[J].Journal of Guangxi Academy of Sciences,2008,24(4):310-312,316.   [点击复制]
【打印本页】 【在线阅读全文】【下载PDF全文】 查看/发表评论下载PDF阅读器关闭

←前一篇|后一篇→

过刊浏览    高级检索

本文已被:浏览 438次   下载 472 本文二维码信息
码上扫一扫!
基于Web挖掘自动分类器的设计与实现
朱春江, 陆宇旻, 李陶深, 杜衡斌, 唐晟
0
(广西大学计算机与电子信息学院, 广西南宁 530004)
摘要:
分析分布式实时网络行为监控系统中Web网页安全性挖掘问题,设计实现一个基于Web挖掘的自动分类器,并构造一个实验环境来检测分类器的性能。该自动分类器利用特征提取算法实现对每个样本的特征向量提取和待分类文本的特征向量提取,利用基于k个"最近邻"(KNN)分类算法实现对网页的分类,能够提取出带有不安全信息的网页,分类效果良好。
关键词:  网络行为监控  Web网页挖掘  分类器  KNN分类算法  特征提取
DOI:
投稿时间:2008-10-12
基金项目:广西科技攻关项目(桂科攻关033008-9)资助。
Design and Implementation of the Auto Classifier based on Web Mining
ZHU Chun-jiang, LU Yu-min, LI Tao-shen, DU Heng-bin, TANG Sheng
(School of Computer, Electronics and Information, Guangxi University, Nanning, Guangxi, 530004, China)
Abstract:
This paper analyzes Web security mining problem in distributed real-time network behavior monitoring system.An auto classifier based on Web minning was designed and implemented.An experiment environment to test the performance of the classifier was constructed.This classfier extracts the feature vector of each samples and documents to be classified by using the feature extraction algorithm.Web page was classfied by using the K-Nearest-Neighbor(KNN) classification algorithm.The experimental results show that this auto classifier based on Web minning can fetch insecurity Web pages,and its classification is effective.
Key words:  network behavior monitoring  Web page minning  classifier  KNN classification algorithm  feature extraction

用微信扫一扫

用微信扫一扫