爬取特定元素信息类图2
2016-05-26 00:17:52 0 举报
在这个类图中,我们可以看到一个名为“爬取特定元素信息”的类。这个类的主要功能是从网页中提取特定的信息元素。为了实现这个功能,它依赖于两个主要的组件:一个是“解析器”,负责将HTML或XML文档解析成可以处理的数据结构;另一个是“存储管理器”,负责将提取到的信息元素存储到适当的位置,如数据库或文件系统。此外,这个类还提供了一些辅助方法,如“设置爬取深度”和“设置爬取速度限制”,以帮助用户更好地控制爬取过程。总的来说,这个类图展示了一个用于网络爬虫的基本框架,可以帮助用户高效地从网页中获取所需的信息。
作者其他创作
大纲/内容
Crawls_Aiqiyi_Name_Amount
+ input:String+ num:int+ name_tmp:String[]+ name:String[]+ i:int+ m:int+ httpClient:HttpClient+ url:String url+ httpGet :HttpGet + date:Date + response:HttpResponse+ entity:HttpEntity + html:String + amount:String + name:String
+ searchInJD():void+ exec():void
AMovie_Info_Select
+ jf:JFrame+ cb:1JCheckBox+ cb2:JCheckBox+ panel:JPanel+ jl:JLabel+ jb1:JButton
+ init():void
Movies_Web_Select
+ jf:JFram+ jl:JLabel+ movies_web1:JButton+ movies_web2:JButton
Crawls_Dang_Name
+ input:String+ num:int+ name_tmp:String[]+ name:String[]+ i:int+ m:int+ httpClient:HttpClient+ url:String url+ httpGet :HttpGet + date:Date + response:HttpResponse+ entity:HttpEntity + html:String + name:String
Crawls_Aiqiyi_Amount
+ input:String+ num:int+ name_tmp:String[]+ name:String[]+ i:int+ m:int+ httpClient:HttpClient+ url:String url+ httpGet :HttpGet + date:Date + response:HttpResponse+ entity:HttpEntity + html:String + amount:String
Dang_Info_Select
Crawls_Dang_Name_Price
+ input:String+ num:int+ name_tmp:String[]+ name:String[]+ i:int+ m:int+ httpClient:HttpClient+ url:String url+ httpGet :HttpGet + date:Date + response:HttpResponse+ entity:HttpEntity + html:String + price:String + name:String
Crawls_Aiqiyi_Name
Goods_Web_Select
+ jf:JFram+ jl:JLabel+ goods_web1:JButton+ goods_web2:JButton
Crawls_Dang_Price
+ input:String+ num:int+ name_tmp:String[]+ name:String[]+ i:int+ m:int+ httpClient:HttpClient+ url:String url+ httpGet :HttpGet + date:Date + response:HttpResponse+ entity:HttpEntity + html:String + price:String
Kinds_select
+ jf:JFrame + jl:JLabel
0 条评论
下一页