Cite this article: |
- 朱春江,陆宇旻,李陶深,杜衡斌,唐晟.基于Web挖掘自动分类器的设计与实现[J].广西科学院学报,2008,24(4):310-312,316.
- ZHU Chun-jiang,LU Yu-min,LI Tao-shen,DU Heng-bin,TANG Sheng.Design and Implementation of the Auto Classifier based on Web Mining[J].Journal of Guangxi Academy of Sciences,2008,24(4):310-312,316.
|
|
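Abstract: |
This paper analyzes the problem of mining Web pages for security in a distributed real-time network behavior monitoring system, designs and implements an automatic classifier based on Web mining, and builds an experimental environment to test the classifier's performance. The classifier uses a feature extraction algorithm to extract a feature vector from each sample and from each text to be classified, and uses the k-nearest-neighbor (KNN) classification algorithm to classify Web pages. It can pick out Web pages that carry unsafe information, and the classification results are good. |
Keywords: network behavior monitoring; Web page mining; classifier; KNN classification algorithm; feature extraction |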
DOI: |
Received: 2008-10-12 |
Funding: Supported by the Guangxi Key Science and Technology Project (桂科攻关033008-9). |
|
Design and Implementation of the Auto Classifier based on Web Mining |
ZHU Chun-jiang, LU Yu-min, LI Tao-shen, DU Heng-bin, TANG Sheng
|
(School of Computer, Electronics and Information, Guangxi University, Nanning, Guangxi, 530004, China) |
Abstract: |
This paper analyzes the Web security mining problem in a distributed real-time network behavior monitoring system. An auto classifier based on Web mining was designed and implemented, and an experimental environment was constructed to test its performance. The classifier uses a feature extraction algorithm to extract the feature vector of each sample and of each document to be classified, and classifies Web pages with the K-Nearest-Neighbor (KNN) classification algorithm. The experimental results show that the classifier can pick out insecure Web pages and that its classification is effective. |
Key words: network behavior monitoring; Web page mining; classifier; KNN classification algorithm; feature extraction |
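
The two stages named in the abstract, feature vector extraction followed by KNN classification, can be illustrated with a minimal Python sketch. The abstract does not specify the feature extraction algorithm, the similarity measure, or the value of k, so everything here is an assumption made for illustration: plain term-frequency vectors, cosine similarity, k = 3, and the hypothetical names `extract_features`, `cosine_similarity`, `knn_classify`, and `SAMPLES` with its toy labels. It is not the authors' implementation.

```python
import math
import re
from collections import Counter


def extract_features(text):
    """Build a term-frequency vector from a text.

    Assumption: simple lowercase word counts stand in for the paper's
    feature extraction algorithm, which the abstract does not describe.
    """
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_classify(text, samples, k=3):
    """Label a text by majority vote among its k most similar samples.

    samples is a list of (text, label) pairs; k=3 is an assumed default.
    """
    query = extract_features(text)
    scored = [
        (cosine_similarity(query, extract_features(sample_text)), label)
        for sample_text, label in samples
    ]
    # Most similar samples first; ties keep their original order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical labeled training samples (toy data, not from the paper).
SAMPLES = [
    ("free casino gambling bonus win money", "unsafe"),
    ("online gambling poker casino jackpot", "unsafe"),
    ("win money fast casino bonus", "unsafe"),
    ("university course schedule and lecture notes", "safe"),
    ("weather forecast for the weekend", "safe"),
    ("lecture notes on data mining course", "safe"),
]

if __name__ == "__main__":
    print(knn_classify("casino bonus jackpot", SAMPLES))
    print(knn_classify("data mining lecture notes", SAMPLES))
```

In a real Web-mining setting the page text would first be stripped of markup and, for Chinese pages, segmented into words before counting terms; those steps are omitted here to keep the sketch short.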