web crawler - Web Crawling with seed URLs from search engine
I need to know if it is worth building a crawler on top of the results given by a search engine.
By this I mean: given a query, grab N URLs from a search engine and feed them to a crawler as seeds to find more pages relevant to the search. Is there any scientific paper or experiment claiming that doing this helps gather more relevant pages than just taking the URLs from the search engine?
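To make the idea concrete, here is a rough sketch of what "feed the search results to a crawler" could look like. It is only an illustration under assumptions: the names `crawl_from_seeds`, `relevance`, and `LinkExtractor` are made up for this example, the relevance score is a crude term-match placeholder rather than a real ranking function, and `fetch` is a caller-supplied function so the sketch does not depend on any particular HTTP library.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Collects the href values of <a> tags in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def relevance(text, query_terms):
    """Fraction of query terms found in the text (placeholder score, an assumption)."""
    if not query_terms:
        return 0.0
    lowered = text.lower()
    hits = sum(1 for term in query_terms if term.lower() in lowered)
    return hits / len(query_terms)


def crawl_from_seeds(seed_urls, query_terms, fetch, max_pages=50, min_score=0.5):
    """Breadth-first crawl that only follows links out of relevant pages.

    fetch: hypothetical caller-supplied function, url -> HTML string or None.
    Returns a list of (url, score) for relevant pages, in visit order.
    """
    unique_seeds = list(dict.fromkeys(seed_urls))  # drop duplicate seeds, keep order
    queue = deque(unique_seeds)
    seen = set(unique_seeds)
    relevant = []
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        fetched += 1
        if html is None:
            continue
        score = relevance(html, query_terms)
        if score < min_score:
            continue  # off-topic page: record nothing and do not expand its links
        relevant.append((url, score))
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            link, _fragment = urldefrag(urljoin(url, href))
            if link.startswith(("http://", "https://")) and link not in seen:
                seen.add(link)
                queue.append(link)
    return relevant
```

The seeds would be the N URLs returned by the search engine. Whether the pages found this way are actually more relevant than the engine's own ranking is exactly the open question above.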
If I understood you right, you would be rebuilding a search engine, because its job is precisely to bring the related/relevant results first for a search. And, although you did not directly mention which search engine, I guess it is Google, so I suggest you use its advanced search options before trying anything else. Google provides an API for performing searches, which you can use in your system. If this approach does not fit you, it is possible to crawl Google's results pages and perform custom searches yourself (for example, filtering results by site, term, etc.), but Google will not be happy and may block your calls. I suggest you give the open API a try...
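As a rough sketch of the API route, assuming the API meant here is Google's Custom Search JSON API (the answer does not name it): the helper names `build_search_url`, `extract_links`, and `search_seed_urls` are made up for this example, and the API key and search engine ID are placeholders you would have to obtain yourself.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint of the Custom Search JSON API (assumed to be the API the answer refers to).
API_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_search_url(query, api_key, engine_id, num=10, site=None):
    """Build the request URL; num is the number of results per request (at most 10)."""
    params = {"key": api_key, "cx": engine_id, "q": query, "num": num}
    if site:
        params["siteSearch"] = site  # restrict results to a single site
    return API_ENDPOINT + "?" + urlencode(params)


def extract_links(response):
    """Pull the result URLs out of a parsed JSON response."""
    return [item["link"] for item in response.get("items", [])]


def search_seed_urls(query, api_key, engine_id, num=10, site=None):
    """Send the request and return result URLs (needs network access and valid credentials)."""
    url = build_search_url(query, api_key, engine_id, num=num, site=site)
    with urlopen(url, timeout=10) as resp:
        return extract_links(json.load(resp))
```

The list returned by `search_seed_urls` could then be used as the seed URLs for a crawler, without scraping the results pages directly.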