WebCrawler WebCrawler搜寻服务(著名的搜寻引擎之一)
- Analyze the pages which are crawled by WebCrawler, remove the control order and format on the page and remain the content only;
闭于网络爬虫爬取到的网页加以剖析,去除网页外的控造命令和格局,只保留外容; - One of the most important parts of search engine is WebCrawler which can get the original information from network for the search engine.
搜索引擎一个重要部分是网络爬虫程序,依靠网络爬虫,搜索引擎可以获取用来检索的原材料信息。 - In this article we use the heuristic search to get the specific information. This can reduce the links largely and make the links visited by WebCrawler point to useful information.
在本文中,提出使用人工智能中的启发式搜索来获取特定的信息,这样可以极大地减少遍历的链接数量,使被访问到的链接尽量地指向有用的信息。