
GoogleAnalytics has been re-designed to help you learn even more about whereyour visitors come from and how they interact with your site.
Thenew Google Analytics makes it easy to improve your results online.Write better ads, strengthen your marketing initiatives, and createhigher-converting websites. Google Analytics is free to alladvertisers, publishers, and site owners.
Youcan now use your Google Analytics login to access Website Optimizer.Find out which page designs, headlines, and graphics convert the mostvisitors.
Get Started
Web analytics is the study of online behaviour in order to improve it. There are two categories; off-site and on-site web analytics.
Off-site web analytics refers to web measurement and analysisirrespective of whether you own or maintain a website. It includes themeasurement of a website's potential audience (opportunity), share of voice (visibility), and buzz (comments) that is happening on the Internet as a whole.
On-site web analytics measure a visitor's journey once on your website. This includes its drivers and conversions; for example, which landing pages encourage people to make a purchase. On-site web analytics measures theperformance of your website in a commercial context. This data istypically compared against key performance indicators for performance, and used to improve a web site or marketing campaign's audience response.
Historically, web analytics has referred to on-site visitormeasurement. However in recent years this has blurred, mainly becausevendors are producing tools that span both categories.
Many different vendors provide on-site web analytics software and services. There are two main technological approaches to collecting the data. The first method, logfile analysis, reads the logfiles in which the web server records all its transactions. The second method, page tagging, uses Javascript on each page to notify a third-party server when a page is rendered by a web browser. Both collect data that can be processed to produce web traffic reports.
In addition other data sources may also be added to augment thedata. For example; e-mail response rates, direct mail campaign data,sales and lead information, user performance data such as click heat mapping, or other custom metrics as needed.
Web servers have always recorded all their transactions in alogfile. It was soon realised that these logfiles could be read by aprogram to provide data on the popularity of the website. Thus arose web log analysis software.
In the early 1990s, web site statistics consisted primarily of counting the number of client requests (or hits)made to the web server. This was a reasonable method initially, sinceeach web site often consisted of a single HTML file. However, with theintroduction of images in HTML, and web sites that spanned multipleHTML files, this count became less useful. The first true commercialLog Analyzer was released by IPRO in 1994.
Two units of measure were introduced in the mid 1990s to gauge moreaccurately the amount of human activity on web servers. These were page views and visits (or sessions). A page view was defined as a request made to the web server for a page, as opposed to a graphic, while a visitwas defined as a sequence of requests from a uniquely identified clientthat expired after a certain amount of inactivity, usually 30 minutes.The page views and visits are still commonly displayed metrics, but arenow considered rather unsophisticated measurements.
The emergence of search engine spiders and robots in the late 1990s, along with web proxies and dynamically assigned IP addresses for large companies and ISPs, made it more difficult to identify unique human visitors to a website. Log analyzers responded by tracking visits by cookies, and by ignoring requests from known spiders.
The extensive use of web caches also presented a problem for logfile analysis. If a person revisits apage, the second request will often be retrieved from the browser'scache, and so no request will be received by the web server. This meansthat the person's path through the site is lost. Caching can bedefeated by configuring the web server, but this can result in degradedperformance for the visitor to the website.
Concerns about the accuracy of logfile analysis in the presence ofcaching, and the desire to be able to perform web analytics as anoutsourced service, led to the second data collection method, pagetagging or 'Web bugs'.
In the mid 1990s, Web counters were commonly seen — these were images included in a web page thatshowed the number of times the image had been requested, which was anestimate of the number of visits to that page. In the late 1990s thisconcept evolved to include a small invisible image instead of a visibleone, and, by using JavaScript, to pass along with the image requestcertain information about the page and the visitor. This informationcan then be processed remotely by a web analytics company, andextensive statistics generated.
The web analytics service also manages the process of assigning acookie to the user, which can uniquely identify them during their visitand in subsequent visits.
With the increasing popularity of Ajax-basedsolutions, an alternative to the use of an invisible image, is toimplement a call back to the server from the rendered page. In thiscase, when the page is rendered on the web browser, a piece of Ajaxcode would call back to the server and pass information about theclient that can then be aggregated by a web analytics company. This isin some ways flawed by browser restrictions on the servers which can becontacted with XmlHttpRequest objects.
Both logfile analysis programs and page tagging solutions arereadily available to companies that wish to perform web analytics. Insome cases, the same web analytics company will offer both approaches.The question then arises of which method a company should choose. Thereare advantages and disadvantages to each approach.
The main advantages of logfile analysis over page tagging are as follows.
The main advantages of page tagging over logfile analysis are as follows.
Logfile analysis is almost always performed in-house. Page taggingcan be performed in-house, but it is more often provided as athird-party service. The economic difference between these two modelscan also be a consideration for a company deciding which to purchase.
Which solution is cheaper to implement depends on the amount oftechnical expertise within the company, the vendor chosen, the amountof activity seen on the web sites, the depth and type of informationsought, and the number of distinct web sites needing statistics.
Regardless of the vendor solution or data collection methodemployed, the cost of web visitor analysis and interpretation shouldalso be included. That is, the cost of turning raw data into actionableinformation. This can be from the use of third party consultants, thehiring of an experienced web analyst, or the training of a suitablein-house person. A cost-benefit analysis can then be performed. For example, what revenue increase or cost savings can be gained by analysing the web visitor data?
There are no globally agreed definitions within web analytics as theindustry bodies have been trying to agree definitions that are usefuland definitive for some time. The main bodies who have had input inthis area have been Jicwebs(Industry Committee for Web Standards)/ABCe(Auditing Bureau of Circulations electronic, UK and Europe), The WAA(Web Analytics Association, US) and to a lesser extent the IAB(Interactive Advertising Bureau). This does not prevent the followinglist from being a useful guide, suffering only slightly from ambiguity.Both the WAA and the ABCe provide more definitive lists for those whoare declaring their statistics using the metrics defined by either.
The hotel problem is generally the first problem encountered by auser of web analytics. The term was first coined by Rufus Evisonexplaining the problem at one of the Emetrics Summits and has now gained popularity as a simple expression of the problem and its resolution.
The problem is that the unique visitors for each day in a month donot add up to the same total as the unique visitors for that month.This appears to an inexperienced user to be a problem in whateveranalytics software they are using. In fact it is a simple property ofthe metric definitions.
The way to picture the situation is by imagining a hotel. The hotel has two rooms (Room A and Room B).
| Day 1 | Day 2 | Day 3 | Total | |
| Room A | John | John | Jane | 2 Unique Users |
| Room B | Mark | Jane | Mark | 2 Unique Users |
| Total | 2 | 2 | 2 | ? |
As the table shows, the hotel has two unique users each day over threedays. The sum of the totals with respect to the days is therefore six.
During the period each room has had two unique users. The sum of the totals with respect to the rooms is therefore four.
Actually only three visitors have been in the hotel over thisperiod. The problem is that a person who stays in a room for two nightswill get counted twice if you count them once on each day, but is onlycounted once if you are looking at the total for the period. Anysoftware for web analytics will sum these correctly for whatever timeperiod, thus leading to the problem when a user tries to compare thetotals.
Another common misconception in web analytics is that the sum of thenew visitors and the repeat visitors ought to be the total number ofvisitors. Again this becomes clear if the visitors are viewed asindividuals on a small scale, but still causes a large number ofcomplaints that analytics software cannot be working because of afailure to understand the metrics.
Here the culprit is the metric of a new visitor. There is really nosuch thing as a new visitor when you are considering a web site from anongoing perspective. If a visitor makes their first visit on a givenday and then returns to the web site on the same day they are both anew visitor and a repeat visitor for that day. So if we look at them asan individual which are they? The answer has to be both, so thedefinition of the metric is at fault.
A new visitor is not an individual; it is a fact of the webmeasurement. For this reason it is easiest to conceptualise the samefacet as a first visit (or first session). This resolves the conflictand so removes the confusion. Nobody expects the number of first visitsto add to the number of repeat visitors to give the total number ofvisitors. The metric will have the same number as the new visitors, butit is clearer that it will not add in this fashion.
On the day in question there was a first visit made by our chosenindividual. There was also a repeat visit made by the same individual.The number of first visits and the number of repeat visits will add upto the total number of visits for that day.
| | |