web crawler - How can I get MediaWiki to ignore page views from a Google Search Appliance? -
the page view counter on each mediawiki page seems great way identify popular pages worth putting more effort keeping up-to-date , useful, i've hit problem.
we use google search appliance index our mediawiki installation. problem have gsa increments page view counter each time crawls page. dominates statistics, swamping views made real users.
i know how reset page counters start again. there way configure mediawiki ignore page requests gsa purposes of counting page views?
this can done adding condition in article.php:
includes/article.php:2861:function viewupdates():
if( !$wgdisablecounters && !$wguser->isallowed('bot') && $this->getid() ) {
add:
&& strpos($_server['http_user_agent'], 'gsa-crawler') === false
where gsa-crawler part of default gsa ua...
another way setup forms authentication in gsa, , have login wikimedia user in bot group..
Comments
Post a Comment