Archive for April, 2007:

The State of the Live Web, April 2007

18

Hey, it’s that time again, time to slow down, take a deep breath, and dig into the data!

About this Report, and the Obligatory Plug for Technorati

Technorati is known widely for its quarterly State of the Blogosphere reports, analyzing the trends around blogs and blogging. With this report, we expand on this tradition by introducing information and analysis relating to the broader range of social media on the Web — what we and many others call the Live Web (another good definition). Technorati continues to grow well beyond its roots at the leading blog search engine; increasingly, we are the main aggregation point for all forms of social media on the Web, including blogs, of course, but also video, photos, audio such as podcasts and much more.

What makes this possible is the rise in the use of tags across all forms of social media and the increasing implementation of tags by the publishing platforms supporting each form of media. Increasingly, tags have become a lingua franca of Live Web, helping to categorize social media while also indicating where people’s attention might be at any given moment. But because each form of media is published from unique platforms with their own established communities, the audience found itself hopping from platform to platform to get a sense of what might be hot at any given moment. Which is why our social media aggregation service — made manifest on our tagged media pages — is growing at a torrid pace.

While we still have substantial reporting on the the State of the Blogosphere, we now expanding the report to provide information about the State of Tags. Admittedly, the information we have on this new area of focus for our report isn’t as deep or as expansive as our State of the Blogosphere, and we expect that over time, this and other new sections will expand, but we believe this is a good first step in trying to provide a more comprehensive snapshot of the Live Web.

OK, on to the numbers!

The State of the Blogosphere

The state of the Blogosphere is strong, and is maturing as an influential and important part of the web.

For nearly four years, we’ve been tracking and enabling the growth of this phenomenon and theirs is much in our data to indicate that the medium is “growing up.”

Slide0005

Technorati is now tracking over 70 million weblogs, and we’re seeing about 120,000 new weblogs being created worldwide each day. That’s about 1.4 blogs created every second of every day.

Slide0007

Spam and splogs (spam blogs) continue to be a problem in the blogosphere, and there was a marked increase in splogs that coincided with the holiday season last year. Technorati has been tracking between 3,000 – 7,000 new splogs created each day, but there was a significant spike in splog creation during early December, when we tracked over 11,000 splogs created each day during December – a total of 341,000 splogs that we removed from our indexes during that period.

Fortunately, spam rates have decreased somewhat since then, as blog hosting providers have responded to the issue during the months of January and February. My personal take on the issue of spam is that all healthy ecosystems have parasites – the only question is whether or not the system is structurally vulnerable to being overwhelmed. Thankfully, because of the accountability that is built into the web itself (the URL structure is fundamentally accountable), I believe that while the vulnerability of the live web to spam is real, it is managable.

Slide0006

Since our last State of the Blogosphere report in October 2006, we’ve seen a slowing in the doubling of the size of the blogosphere. This shouldn’t be surprising, as we’re dealing with the law of large numbers – it takes a lot more growth to double from 35 million blogs to 70 million (which took about 320 days) than when it doubled from 5 million to 10 million blogs (which took about 180 days).

We also see a slowing in growth in the rate of posts created per day; while there are spikes in blog posts during times of significant world crisis — for instance, last summer’s conflict between Israel and Hezbollah — the overall trend is that posting volume is growing more slowly, at about 1.5 million postings per day. That’s about 17 posts per second. In October 2006, Technorati was tracking about 1.3 million postings per day, about 15 posts per second.

Slide0008

Popularity of Blogs vs. the MSM

Slide0010

Slide0011

In previous reports, we looked at the popularity of mainstream media compared to blog sites. One interesting item to note in April 2007, the number of blogs in the top 100 most popular sites has risen substantially. During Q3 2006 there were only 12 blogs in the Top 100 most popular sites.

In Q4, however, there were 22 blogs on the list — further evidence of the continuing maturation of the Blogosphere. Blogs continue to become more and more viable news and information outlets. For instance, information not shown in our data but revealed in our own user testing in Q1 2007 indicates that the audience is less and less likely to distinguish a blog from, say, nytimes.com — for a growing base of users, these are all sites for news, information, entertainment, gossip, etc. and not a “blog” or a “MSM site”.

Further, there is a wider diversity of languages represented here, specifically Farsi with TodayLink.ir, Persian Blog Fans Club, and Giliran.com making the Top 100. More on that in a moment, as we discuss the international growth of the Blogosphere.

The Global Blogosphere

Slide0013

In terms of blog posts by language, Japanese retakes the top spot from our last report, with 37% (up from 33%) of the posts followed closely by English at 36% (down from 39%). Additionally there was movement in the middle of the top 10 languages, highlighted by Italian overtaking Spanish for the number four spot.

The newcomer to the top 10 languages is Farsi, just joining the list at #10. It has been very interesting to watch the growth of the blogging world in the middle east, especially in countries like Iran, and it is reflected in the language distribution above.

Slide0015

Slide0014

Slide0016

English, Japanese and Chinese look almost identical to our last report in their posting distribution. With Italian overtaking Spanish, we get to see another language with a different distribution, which contrast both the extreme geographic correlations of the Asian languages and the relative lack of geographic correlations of English. Again it would appear that both English and Spanish are more global languages based on consistency of posting through a 24 hour period, whereas other top languages, specifically Japanese, Chinese, and Italian, are more geographically correlated. It would also appear that a significant number of people who are blogging are doing it during work hours.

The State of Tags

The explosive growth that we see in the Technorati index is mirrored in social media sites throughout the Web, including Flickr, YouTube, and the like. This shared phenomenon allows us to marry the wealth of information in our index with the wealth of that stored on social media sites across the Live Web through the shared construct of tags.

For the uninitiated, a tag is a category or descriptor that someone (often the creator) assigns to it . This descriptor literally hangs off the media that’s published to the Web much in the same way a luggage tag hangs off your suitcase — easily identifying the bag.

The bottom line: we’re seeing explosive growth in the tags index. People are clicking on tags, people are using tags, Google features tagged media in its results pages. Tags adoption has become a phenomenon across the Live Web, and we are seeing a correlative explosive growth at Technorati.

On to the numbers:

Slide0023

Technorati is now tracking over 230 million posts using tags or categories, and the number of people who are using tags is growing:



Slide0022

As of February 2007, About 35% of all posts Technorati tracks use tags.

Slide0024

The number of bloggers that are using tags is also increasing month over month. About 2.5 million blogs posted at least one tagged post in February 2007.

Growth and Maturation

Back in 2002 when Technorati started tracking the blogosphere, social mores and community practices were still forming, and its growth was primarily through the written word. It was a fledgling medium that was initially reviled, then feared, and, now, embraced as mainstream.

The blogosphere started well before Technorati was founded, and its growth was fostered by many people and organizations that brought openness and cooperation to the medium. One of those people, Dave Winer, just celebrated the tenth anniversary of his weblog. Given this auspicious anniversary, I wanted to give my thanks and support to Dave and to all of the other early pioneers in the world of blogging, RSS, and the Live Web. Without Dave’s efforts, the web wouldn’t look the way it does today. His creation and support for systems like weblogs.com and open formats like RSS were critical in building the early infrastructure that Technorati relies upon and helps to support.

Thanks, Dave!

Wrapping it all Up

As a result of this work and the cultural mores of openness, we also have photo sharing, podcasting, online music publishing, online video publishing, user-generated games, and, increasingly, we have structured data-sharing such as upcoming events. All of this seething, lively activity constitutes the Live Web and Technorati is its hub — thanks in large part to the growing use and ubiquity of tags. Through the social constructs of tags, we help people find unique voices and points of view. We also help social media publishers to find the people formerly known as their audience. And they all converge, as a result, on Technorati.

We’re proud of this position, of course, but also humbled by the responsibility it imposes.

As we continue to bring more and more of the Live Web to the fore, and to organize it and present it in ways that are useful, entertaining, and informative to you all, I hope you’ll continue to tell us your opinions (as if I could stop you!) and provide us your guidance. Our credo has been and will always remain: “Be of Service.” Your voice helps us to do this, so please continue to tell us what we can do better.

In summary:

  • 70 million weblogs
  • About 120,000 new weblogs each day, or…
  • 1.4 new blogs every second
  • 3000-7000 new splogs (fake, or spam blogs) created every day
  • Peak of 11,000 splogs per day last December
  • 1.5 million posts per day, or…
  • 17 posts per second
  • Growing from 35 to 75 million blogs took 320 days
  • 22 blogs among the top 100 blogs among the top 100 sources linked to in Q4 2006 – up from 12 in the prior quarter
  • Japanese the #1 blogging language at 37%
  • English second at 33%
  • Chinese third at 8%
  • Italian fourth at 3%
  • Farsi a newcomer in the top 10 at 1%
  • English the most even in postings around-the-clock
  • Tracking 230 million posts with tags or categories
  • 35% of all February 2007 posts used tags
  • 2.5 million blogs posted at least one tagged post in February

Getting All the Reports

You can get all of the State of the Blogosphere and State of the Live Web reports, going back my first report in October 2004 at http://www.sifry.com/stateoftheliveweb/ All of this material is licensed under a creative commons for-attribution license, and all I ask in addition is that you please keep the Technorati logo and links to the original reports in any use of the charts or data.

Technorati Tags: , , , , , , , , , , , , , , , , , , , , , , , ,

Share
Posted in: Uncategorized

Continue Reading

The State of Technorati, April 2007

7

I typically issue the State of the Blogosphere report each quarter to give a glimpse into our data and what that may tell us about the global social media phenomenon. As many of you have pointed out, it’s been nearly six months since the last report. Yow!

Don’t worry, it hasn’t gone away, although there are some changes in store for the widely read report. More on that at the end of this post, including some nice charts, so read on…

It’s been awhile since I’ve given you a report on the State of Technorati. Many people still think of us as the world’s leading blog search company I founded nearly four years ago (yeah, we will hit our 4 year corporate anniversary in May). While that statement is still true, it is only part of our story today. Technorati is now very much a media company – one that is unlike any other you’ve seen to be sure, but a media company nonetheless. What we bring to the fore is social media – the ways in which people are expressing themselves across the Web each and every day, in real-time. It’s informative, entertaining, vexing, touching, enervating and, every once in awhile, utterly unforgettable. It is, truly, the Live Web.

Let me begin by saying the people we serve and their behaviors on our site have shifted remarkably in recent months. In brief, we’ve seen phenomenal growth in the use of our tagged media pages. As the use of tags becomes more ubiquitous across all forms of social media and the publishing platforms that support them, they’ve become the lingua franca of the Live Web – the way in which people all over the world indicate what topics or issues are top of mind and guiding self-expression.

About nine months ago, we began to see a marked increase in the use of those tagged media pages, which back then simply included blog posts and Flickr photos using tags. So, throughout the fall and into December, we introduced a number of improvements and new features to our media pages, including the introduction of a huge range of multiple forms of user-generated content. Today, we include blog posts, photos, videos, podcasts, music, people, and events that share a common tag to give our visitors a view into who’s saying what – who’s doing what – across the Live Web, all in real time.

The Numbers

Slide0002

We’ve seen huge growth in the number of unique visitors to our site. In March, we exceeded 9 million unique visitors, which is a 141% increase in monthly visitors in a single quarter. Moreover, our quarter-over-quarter growth in page views was about 150% at the close of March (23% Growth in January, 24% Growth in February, 53% Growth in March). As I indicated a moment ago, the majority of our page views now are no longer just in real-time keyword or blog search, as would have been the case just six months ago, but also in our tagged media pages.

Slide0001

Nevertheless, I’m also very pleased to note that Technorati is still the most visited blog search engine according to recent reports by Hitwise:

031907

Quantcast adds more recent third-party data validating our growing lead as well:

Quantcast20070403

And our Vice President of Engineering, Adam Hertz, has been hard at work leading initiatives to ensure that our results are more complete, spam-free, and delivered more quickly than any other service on the Web. We’re not perfect – we’re always looking to get better, but I’m heartened by the progress we’ve made over the last six months or so.

UPDATE: Robert Scoble has published a comparison of Technorati to Google Blog Search, and he’s asking for people’s experiences. In his post, he notes that “It does look like Technorati has pulled ahead again in this race. The UI on Technorati is certainly ahead of Google’s, especially with the little chart of how many mentions a term has gotten.” He is also asking people to help him evaluate the two engines.

Another great point of pride for me has been the introduction of new features like WTF (which stands for Where’s the Fire), that empowers users to explain why any given top search or tag is hot right now, and others the power to vote on their favorite WTFs. Since we launched WTF earlier this year, we’ve continued to make enhancements to the service, which I hope you’ve noticed, like our new email-to-a-friend option for when you want to share your favorite WTFs with friends and family (and get them to vote for you, which puts your WTF at the top of search results). Over 2,000 people have created WTFs, and they’ve written over 8,000 blurbs, and those numbers are growing quickly!

We’ve been launching widgets too – building a whole new framework for them to be scalable from the get-go, and tonight Tantek announced the first three widgets in that class: Top Searches, Top Tags, and Technorati Authority.

What has made this phenomenal growth possible, of course, is the terrific team at work here at Technorati, which has also grown remarkably. Today we are 44 people working from our offices in San Francisco and New York, there’s another team of about 10 people working on our sister site Technorati Japan; this time last year we were less than half that size. Our structure has changed as well: we’ve been at work building a sales and marketing function to meet increasing demand from brands and advertisers around how it is they, too, can enter the global conversation. Again, this time last year, we barely had display ad units on our site, much less sales people to fill those units. This new group is hard at work introducing our solutions in New York and on the West Coast, and, as we announced earlier in the quarter, includes a new partnership with one of the world’s largest advertising and marketing firms, Ogilvy.

So, finally, to the State of the Blogosphere. We’re still committed to this landmark report, but, as I indicated at the beginning of this post, we’ve evolved it. Later this week, we’ll release our new “State of the Live Web”, which will include our usual State of the Blogosphere report, but will now also include additional information on areas outside of just blogs. As we continue to build out this report in coming quarters, we’ll do our best to continually add new information – on video, on microformats, and more. Just as Technorati is increasingly the hub of the Live Web, our quarterly report will be as embracing of the entirely of it as our service is.

Look for our new State of the Live Web on this blog by the end of the week and let me know what you think. My hope is that it is helpful to you.

As always, our credo remains to “Be of Service” – my hope is that we’re delivering on that promise better today than ever, and that this time next year it will be still more true than ever before.

UPDATE: Jay Meattle from compete.com has a very interesting post called “Technorati breaking away, leaving Google behind in the dust“. There’s some great data in there, here’s a bit:

 Site Media Upl Img Technorati Uvs Mar2007

Technorati Tags: , , , , , , , , , , , , , , , , , , ,

Share
Posted in: Uncategorized

Continue Reading

Across the Web

  • Facebook
  • Twitter
  • Flickr
  • Youtube

Twitter

Javascript needs to be installed to view the twitterfeed. Get Javacript

Follow Me on Twitter