Webalizer中文帮助文件名词解释
Webalizer是一款Linux下常用的web日志分析脚本,当然对于nginx也适用。笔者在安装后比较好奇这些数据背后的含义。到底代表着什么,想必您也想知道吧,下面就是一些关于Webalizer名词解释。不到之处,还请高手不吝赐教^_^
Hits (点击数)
Any request made to the server which is logged, is considered a ‘hit’.The requests can be for
在网站日志中记录的任意一次请求,不管是html、图片、音频、CGI等等都被看做一次“点击”。
anything…html pages, graphic images, audiofiles, CGI scripts, etc… Each valid line in the server log is
实际上就是日志文件中的一行对应一次“点击”。
counted as a hit. This number represents the total number of requests that were made to the server during the specified report period.
“点击”统计代表服务器在指定时间段内响应请求的总数。
Files(文件数)
Some requests made to the server, require that the server then send something back to the requesting client, such as
客户端在向服务器发出请求后,服务器会向发出请求的客户段传送数据,例如
a html page or graphic image. When this happens, it is considered a ‘file’ and the files total is incremented.
html页面或图片。这种情形被定义为“文件数”。
The relationship between ‘hits’ and ‘files’ can be thought of as ‘incoming requests’ and ‘outgoing responses’.
“点击数”和“文件数”之间存在着一定区别,可以看做是前者代表“进来的请求”而后者代表“服务器作出的响应”。
Pages(网页数)
Pages are, well, pages! Generally, any HTML document, or anything that generates an HTML document,
网页….就是网页啦!一般来说,一个html文档或者是动态页面(php、asp、jsp等)就是“网页数”统计的目标。
would be considered a page. This does not include the other stuff that goes into a document, such as
“网页数”不包括页面中的图片、音频片断等等…..
graphic images, audio clips, etc… This number represents the number of ‘pages’ requested only, and does
被统计的“网页数”仅仅是网页本身而已,不包括网页中其他如js、css等等。
not include the other ‘stuff’ that is in the page. What actually constitutes a ‘page’ can vary from server to server. The default action is to treat anything with the extension ‘.htm’, ‘.html’ or ‘.cgi’ as a page. A lot of
在默认情况下脚本只识别后缀名是“.htm”“.html”或者“cgi”的网页。
sites will probably define other extensions, such as ‘.phtml’, ‘.php3′ and ‘.pl’ as pages as well. Some people
很多站点可能会有其他默认后缀名网页,如“.phtml”、“php3”、“.pl”等等。
consider this number as the number of ‘pure’ hits… I’m not sure if I totally agree with that viewpoint. Some other programs (and people refer to this as ‘Pageviews’。
其实说白了就是PV(访问量)啦^_^
Sites(站点数)
Each request made to the server comes from a unique ‘site’, which can be referenced by a name or
一般请求是由“站点”向服务器发送的,它可能是域名或IP地址。
ultimately, an IP address. The ‘sites’ number shows how many unique IP addresses made requests to the
“站点数”代表在指定时间段内有多少个独立IP地址向服务器发送了请求。
server during the reporting time period. This DOES NOT mean the number of unique individual users (real
这个数值并不代表独立访客(真实用户而不是机器人)的访问数量。
people) that visited, which is impossible to determine using just logs and the HTTP protocol (however, this
number might be about as close as you will get).
Visits(访客数)
Whenever a request is made to the server from a given IP address (site), the amount of time since a previous request by the address is calculated (if any). If the time difference is greater than a pre-configured ‘visit timeout’ value (or has never made a request before), it is considered a ‘new visit’, and this total is incremented (both for the site, and the IP address). The default timeout value is 30 minutes (can be changed), so if a user visits your site at 1:00 in the afternoon, and then returns at 3:00, two visits would be registered. Note: in the ‘Top Sites’ table, the visits total should be discounted on ‘Grouped’ records, and thought of as the “Minimum number of visits” that came from that grouping instead. Note: Visits only occur on PageType requests, that is, for any request whose URL is one of the ‘page’ types defined with the PageType and PagePrefix option, and not excluded by the OmitPage option. Due to the limitation of the HTTP
protocol, log rotations and other factors, this number should not betaken as absolutely accurate, rather, it should be considered a pretty close “guess”.
(这里的访客数是一个大概的数值,如果统一IP在默认30分钟以内登录则算做一个访客,间隔30分钟以上再次访问则又被计算做一个新访客)笔者个人理解,其实就是包含了一个IP地址某个时间段内到访次数
1 条评论
[...] Webalizer Quick Help Webalizer中文帮助文件名词解释 发布一个webalizer中文UTF-8汉化及图片修正完全补丁 [...]