web crawler - How can I get MediaWiki to ignore page views from a Google Search Appliance? -


the page view counter on each mediawiki page seems great way identify popular pages worth putting more effort keeping up-to-date , useful, i've hit problem.

we use google search appliance index our mediawiki installation. problem have gsa increments page view counter each time crawls page. dominates statistics, swamping views made real users.

i know how reset page counters start again. there way configure mediawiki ignore page requests gsa purposes of counting page views?

this can done adding condition in article.php:

includes/article.php:2861:function viewupdates():

if( !$wgdisablecounters && !$wguser->isallowed('bot') && $this->getid() ) { 

add:

&& strpos($_server['http_user_agent'], 'gsa-crawler') === false 

where gsa-crawler part of default gsa ua...

another way setup forms authentication in gsa, , have login wikimedia user in bot group..


Comments

Popular posts from this blog

unicode - Are email addresses allowed to contain non-alphanumeric characters? -

c++ - Convert big endian to little endian when reading from a binary file -

C#: Application without a window or taskbar item (background app) that can still use Console.WriteLine() -