How many people use Google as home page?

It has been long suspected that many people set search engines (such as Google.com) as their home pages, and simply type the domain name in the search box rather than in the URL box of the browser when they want to get to a web site. For example, to get to “myspace.com”, they would simply type in “myspace” in Google’s search box and click on the result to get there.

However, it has been difficult to estimate exactly how many people are doing this. AOL search data log provides some hints. I did a search in the AOL log where users typed “XYZ” and clicked a domain with “XYZ.com” or “XYZ.org” etc. The number is quite high: about 20% of all searches fall into this category.

This is a useful data point for a number of reasons:

  • It shows that as much as 20% of searches are used for navigational purposes. This explains why popular domain names such as myspace or ebay are the most search terms
  • The number of searches for a domain name (e.g. ebay) indicates how many times people go to a domain directly (rather than through a referral).

Update: Om has a post about Yahoo is the most popular search on google.

One can not succeed if there is no market

The demise of Kiko generated a lot of traffic in the blogosphere. Kiko’s calendar may be a cool product, but it cannot succeed if there is no market for it.

Online calendar for consumers is a tough market. Kiko may have had a better shot if they focused on businesses and organizations, e.g., providing a hosted calendar serivce for schools, clubs and small business owners. Trumba realized that quite a while ago and changed their focus to the business market.

Interesting user search stats from AOL search lab

AOL research lab had a lot of interesting stats on user search behaviors that I have not seen anywhere else. But their website has been down since the AOL data debacle two weeks ago. Here is a peek at some of the interesting bits and pieces:

  • Top 20% of “heavy users” perform 70% of all searches.
  • 5,000 web domains (among 50M) get almost 50% of all clicks in search results.
  • Winner takes all: the top entry get 45% of all clicks. The second and third get 13% and 9% respectively.
  • 25% of users go to next pages in search results.

Query breakdown based on category (click to enlarge):

aol

The closest thing to reading people’s minds

What would it be like if we can read people’s mind and know their private thoughts, desires, worries and their deepest secrets?

AOL’s data offers us a glimpse of just that. If you look at the 400+ search terms of user 1515830, a sketch of what was on a woman’s mind in Ohio starts to emerge (cancer operation, incest, spying, marriage problem, depression, psychotic drugs, looking for a job or start a home business, plus cheap curtains, Disneyland vacation etc).

The following are some selected search terms from user 1515830. (Warning: viewer discretion is advised).

  • precancer cells found during cyst biopsy
  • wide local excision for vulvar cancer
  • photographs of surgery for vulvar cancer
  • how can i hide a wireless connection
  • free keylogging programs
  • password stealers
  • aftermath of incest
  • how to tell your family you’re a victim of incest
  • vgn for depression
  • surgical help for depression
  • anti psychotic drugs
  • online auction managing tools
  • how to become an insurance underwriter in ohio
  • jobs in denver colorado
  • teaching jobs with the denver school system
  • hannah mullins nursing school
  • marriage counseling tips
  • are divorce laws affected by adultery in ohio
  • divorce laws in ohio
  • diet pills for sale
  • … (the full list can be found at here)

Do we want really all these to be recorded and kept forever?

The email that started the AOL search data firestorm

It all started when Abdur Chowdhury (AOL Chief Architect for Research) posted the following message to the Corpora mailing list:

(Update: three people were fired for their roles in the AOL data debacle, see here and here.)

AOL is embarking on a new direction for its business – making its
content and products freely available to all consumers. To support
those goals, AOL is also embracing the vision of an open research
community, which is creating opportunities for researchers in academia
and industry alike.

We are introducing AOL Research to everyone, with the goal of
facilitating closer collaboration between AOL and anyone with a desire
to work on interesting problems. To get started, we invite you to
visit us at http://research.aol.com, where you will find:

– 20,000 hand labeled, classified queries
– 3.5 million web question/answer queries (who, what, where, when, etc.)
– Query streams for 500,000 users over 3 months (20 million queries)
– Query arrival rates for queuing analysis
– 2 million queries against US Government domains

Also, please feel free to provide feedback on the site, datasets you’d
like to see in the future, and any other comments about our vision.

Thanks,

Abdur Chowdhury

It was simply an introduction of AOL’s research lab to the academic community. Soon, the blogoshere took notice and the rest is history:-)

The AOL research site is offline now but mirrored copies of the database are all over the net. You can also search it online at http://www.aolsearchdatabase.com.

Moving from TypePad to WordPress

After trying both TypePad and WordPress, I decided to go with WordPress – it has a much better user interface. I am republishing some of the blogs from my TypePad site.