A focused crawler is a search-engine program that automatically downloads web pages, and it is key to how search engines discover and index deep web data. This paper describes a focused crawler that uses the PageRank algorithm to analyze the importance of websites and applies website-structure pruning and page-determination algorithms to filter out URLs that are not relevant to the target topic, effectively improving the quality and efficiency of deep web data integration.
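To illustrate the general idea of ranking candidate URLs by PageRank and discarding off-topic ones, here is a minimal sketch. It is not the authors' implementation: the function names `pagerank` and `prioritize`, the damping factor of 0.85, the fixed iteration count, and the `is_on_topic` predicate (a stand-in for the paper's structure pruning and page-determination steps) are all assumptions made for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [outgoing links]}.

    Assumption: `links` is a small in-memory link graph; a real crawler
    would build this incrementally as pages are fetched.
    """
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # Every page starts each round with the "random jump" share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's rank evenly across its out-links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


def prioritize(links, is_on_topic, **kwargs):
    """Return on-topic URLs ordered from highest to lowest PageRank.

    `is_on_topic` is a hypothetical predicate standing in for the
    topic-relevance filtering described in the abstract.
    """
    ranks = pagerank(links, **kwargs)
    kept = [url for url in ranks if is_on_topic(url)]
    return sorted(kept, key=lambda url: ranks[url], reverse=True)


if __name__ == "__main__":
    # Toy link graph; the page names are placeholders.
    graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
    print(prioritize(graph, is_on_topic=lambda url: url != "d"))
```

In this sketch the crawler would fetch URLs in the returned order, so that important, on-topic pages are downloaded first and off-topic URLs are never queued.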