
Mailing List Archive: Wikipedia: Mediawiki

Usage stats questions

 

 



Jon.G.Bartlett at gsk

Nov 25, 2009, 1:45 AM

Post #1 of 7 (750 views)
Usage stats questions

All,

I'm trying to gauge the value of my internal company wiki, and I want to produce some usage statistics (primarily around pages being read, rather than edited): typically, things like the number of pages read today or this week.
Can anyone point me to where the database structures are described, so I can create some offline queries myself?

Also, does anyone know of a way to get the statistics page (Special:Statistics) to exclude hits from the various indexers/spiders? I suspect the huge number of daily page reads is currently due to search engines.
I think I asked the latter question before, but got no replies.

Many thanks,
Jon

-----------------------------------------------------------
This e-mail was sent by GlaxoSmithKline Services Unlimited
(registered in England and Wales No. 1047315), which is a
member of the GlaxoSmithKline group of companies. The
registered address of GlaxoSmithKline Services Unlimited
is 980 Great West Road, Brentford, Middlesex TW8 9GS.
-----------------------------------------------------------
_______________________________________________
MediaWiki-l mailing list
MediaWiki-l [at] lists
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l


Platonides at gmail

Nov 25, 2009, 2:23 PM

Post #2 of 7 (705 views)
Re: Usage stats questions [In reply to]

The wiki doesn't store the reads. It can have a per page read count, but
nothing more. What you can do is analyse your server access log to get
the data you want.
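
A minimal sketch of that kind of log analysis, assuming the web server writes Apache's combined log format (the thread does not say which format is in use). It counts successful GET requests per day and skips requests whose user agent looks like a crawler. The BOT_MARKERS list and the function name are illustrative assumptions, and requests for stylesheets, scripts and images are still counted here.

```python
import re
from collections import Counter

# Assumed layout (combined log format):
# host ident user [time] "request" status bytes "referer" "user-agent"
LOG_LINE = re.compile(
    r'^\S+ \S+ \S+ \[(?P<day>[^:]+):[^\]]+\] '
    r'"(?P<method>\S+) (?P<url>\S+)[^"]*" (?P<status>\d{3}) \S+'
    r'(?: "[^"]*" "(?P<agent>[^"]*)")?'
)

# Hypothetical crawler markers; adjust to the indexers that hit your wiki.
BOT_MARKERS = ("bot", "crawler", "spider", "slurp")


def daily_reads(lines):
    """Return a Counter mapping 'dd/Mon/yyyy' to successful non-crawler GETs."""
    counts = Counter()
    for line in lines:
        m = LOG_LINE.match(line)
        if not m or m.group("method") != "GET" or m.group("status") != "200":
            continue
        agent = (m.group("agent") or "").lower()
        if any(marker in agent for marker in BOT_MARKERS):
            continue  # looks like an indexer, not a person
        counts[m.group("day")] += 1
    return counts
```

Feeding it an open log file, for example daily_reads(open("access.log")), gives one total per day; weekly figures are then just sums over seven keys.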




nevinhoalarcao at gmail

Nov 26, 2009, 2:50 AM

Post #3 of 7 (694 views)
Re: Usage stats questions [In reply to]

Hi Platonides, I am also very interested in counting the number of times a
particular page has been viewed on the corporate wiki I administer, even if it
is only a few times. Which lines of code should be added to LocalSettings.php?
Are there any?




--
{+}Nevinho
Venha para o Movimento Colaborativo http://sextapoetica.com.br !!


Sam.Sexton at thomsonreuters

Nov 26, 2009, 3:01 AM

Post #4 of 7 (692 views)
Re: Usage stats questions [In reply to]

Nevio,

Platonides suggested looking at your web server access log. The following works for me:

[2864](root@ncq-wiki0)/usr/local/apache2/logs/Nov: grep 'GET.*a_page_name' 20091126_access.log | wc -l
8

You'd need to script that to generalise it and identify all the pages [you need] - eliminating *.css, *.js, all flavours of images etc. is a start.
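
One way to generalise the grep, as a rough sketch rather than a finished tool: pull the requested URL out of each log line, drop stylesheets, scripts and images, and tally what is left by page name. The "/wiki/" article prefix, the "title" and "action" query parameters, and the function names are assumptions about how the wiki's URLs are set up; adjust them to match your installation.

```python
import re
from collections import Counter
from urllib.parse import parse_qs, unquote, urlsplit

REQUEST = re.compile(r'"GET (?P<url>\S+)')
# Static files to ignore: stylesheets, scripts and common image types.
STATIC = re.compile(r"\.(css|js|png|gif|jpe?g|ico|svg)$", re.IGNORECASE)


def page_name(url, article_prefix="/wiki/"):
    """Return the wiki page a URL refers to, or None for anything else."""
    parts = urlsplit(url)
    if STATIC.search(parts.path):
        return None
    query = parse_qs(parts.query)
    # Only plain views count; edits, history, raw output etc. are skipped.
    if query.get("action", ["view"])[0] != "view":
        return None
    if parts.path.startswith(article_prefix):
        return unquote(parts.path[len(article_prefix):]) or None
    title = query.get("title")
    return title[0] if title else None


def count_page_views(lines):
    """Return a Counter mapping page name to number of GET requests."""
    counts = Counter()
    for line in lines:
        m = REQUEST.search(line)
        name = page_name(m.group("url")) if m else None
        if name:
            counts[name] += 1
    return counts
```

count_page_views(open("20091126_access.log")).most_common(10) would then list the ten most requested pages for that day.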


/Sam



This email was sent to you by Thomson Reuters, the global news and information company.
Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters.





nevinhoalarcao at gmail

Nov 26, 2009, 3:12 AM

Post #5 of 7 (693 views)
Re: Usage stats questions [In reply to]

Hi Sam, thanks for the clarification. Is Thomson Reuters the news agency
you work for? Does it have an internal wiki? Could you send an email to
nevinho [at] bb so we can start a conversation? Thanks for your attention,
Nevio



Jon.G.Bartlett at gsk

Nov 26, 2009, 3:30 AM

Post #6 of 7 (693 views)
Re: Usage stats questions [In reply to]

I can see in the wiki database that there is a page_counter field (in table 'page'); I can only assume this is a count of page 'hits'.
It would be possible to automate queries that run daily, hourly, etc. to record these values and then examine how they change over time. However, I don't understand several of the fields in the MW database. What, for example, are the 'page_latest' and 'page_touched' fields? Is there a description of the database somewhere?
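
A small sketch of the snapshot idea, assuming each scheduled run saves the result of something like SELECT page_id, page_counter FROM page as a dictionary of page id to counter value. The snapshot format and the function name are illustrative; only the page_counter field comes from the database itself.

```python
def reads_between(earlier, later):
    """Return {page_id: reads} for pages whose counter grew between snapshots.

    Both arguments map page_id to the page_counter value seen at that time.
    Pages missing from the earlier snapshot are treated as starting at zero.
    """
    deltas = {}
    for page_id, count in later.items():
        growth = count - earlier.get(page_id, 0)
        if growth > 0:
            deltas[page_id] = growth
    return deltas


# Example: two snapshots taken a day apart.
yesterday = {1: 10, 2: 5}
today = {1: 14, 2: 5, 3: 2}
print(reads_between(yesterday, today))  # {1: 4, 3: 2}
```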

Jon



Platonides at gmail

Nov 26, 2009, 9:43 AM

Post #7 of 7 (686 views)
Re: Usage stats questions [In reply to]

Jon Bartlett wrote:
> I can see in the Wiki db that there is a page_counter (in table 'page')- I can only assume this is a count of number of page 'hits'.
Right. Note however that it will be undercounted if you use a caching
proxy, like squid.

> It would be possible to automate queries that run daily/hourly etc to look at these values, and then interrogate them over time.

Yes. You could. But looking at the server logs is probably easier.


> However, I don't understand several of the fields in the MW database. What, for example, are the 'page_latest' and 'page_touched' fields? Is there a description of the database somewhere?
>
> Jon

page_latest is the id of the page's latest revision. page_touched is the
timestamp of the last time the page cache was invalidated (cached copies
older than that timestamp are outdated).
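
To illustrate the page_touched rule, here is a hedged sketch that assumes the timestamp is stored as a 14-digit YYYYMMDDHHMMSS string in UTC, which is how MediaWiki writes timestamps on a MySQL backend. The function names are made up for the example.

```python
from datetime import datetime, timezone


def parse_mw_timestamp(value):
    """Parse a 14-digit YYYYMMDDHHMMSS string as a UTC datetime."""
    return datetime.strptime(value, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def cached_copy_is_stale(cached_at, page_touched):
    """A cached copy is outdated if it was made before page_touched."""
    return parse_mw_timestamp(cached_at) < parse_mw_timestamp(page_touched)
```

For instance, a copy cached at 20091125120000 is stale for a page whose page_touched is 20091126094300.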

The schema is described at
http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/maintenance/tables.sql?view=markup

There are also descriptions of it at www.mediawiki.org

