web crawler - Web Crawling with seed URLs from search engine

January 15, 2015

I need to know whether it is worth building a crawler on top of the results given by a search engine.

By that I mean: given a query, grab the top N URLs from a search engine and feed them to a crawler to find more pages relevant to the search. Is there any scientific paper or experiment claiming that doing so helps gather more relevant pages than just taking the URLs from the search engine?

If I understood correctly, you would be rebuilding a search engine, because its job is precisely to bring the related/relevant results first for a search. Although you did not name the search engine directly, I guess it is Google, so I suggest you use its advanced search options before trying anything else. Google provides an API for performing searches, which you can use in your system. If that approach does not fit you, it is possible to crawl the Google results and perform custom searches (for example, filtering results by site, term, etc.), but Google will not be happy about it and may block your calls. I suggest you give the open API a try...
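
For illustration, the idea in the question (take N result URLs as seeds, then follow links only from pages that still look relevant) could be sketched roughly as below. This is a minimal sketch, not a tested production crawler: the names `crawl_from_seeds`, `relevance` and `LinkExtractor`, the term-overlap score and the `threshold` default are all assumptions made for the example. The `fetch` callable is also hypothetical; it is passed in so that the seeds can come from whatever search API you use and so that the HTTP client, politeness delays and robots.txt handling stay under your control.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Collects the href values of <a> tags found in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def relevance(text, query_terms):
    """Toy score (an assumption): fraction of query terms present in the text."""
    lowered = text.lower()
    hits = sum(1 for term in query_terms if term.lower() in lowered)
    return hits / len(query_terms) if query_terms else 0.0


def crawl_from_seeds(seeds, query_terms, fetch, max_pages=50, threshold=0.5):
    """Breadth-first crawl that only expands pages scoring >= threshold.

    fetch(url) is a caller-supplied function returning the page HTML as a
    string, or None when the page cannot be retrieved.
    """
    frontier = deque(seeds)
    seen = set(seeds)
    relevant = []
    fetched = 0
    while frontier and fetched < max_pages:
        url = frontier.popleft()
        html = fetch(url)
        fetched += 1
        if html is None:
            continue
        score = relevance(html, query_terms)
        if score < threshold:
            continue  # off-topic page: do not follow its links
        relevant.append((url, score))
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" parts.
            link, _ = urldefrag(urljoin(url, href))
            if link.startswith("http") and link not in seen:
                seen.add(link)
                frontier.append(link)
    return relevant
```

Comparing the list returned here against the plain seed list for the same query is one simple way to check, on your own data, whether the extra crawl actually adds relevant pages.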
