free html hit counter Why Topix Is Different: Toward A Sustainable Model For Net Media Companies | John Battelle's Search Blog

Why Topix Is Different: Toward A Sustainable Model For Net Media Companies

By - March 29, 2004

richsLast Friday I had a chance to stop by the Palo Alto offices of Topix, in many ways a classic internet start up – Valley-based, run by a serial entrepreneur, good buzz – but it didn’t take long for me to sense that something was different this time. Before I get into that, let me first give you a few thoughts on the service itself, and the broader role it plays in the search business.

Background: Topix was founded by six guys, four of whom went to high school together in Pittsburgh. No, I’m not making that up. Most of them are IT/Valley vets, CEO Rich Skrenta founded NewHoo and sold it to Netscape a mere six months afterwards, then morphed it into the now famous Open Directory Project. But Netscape was sold to AOL, and after a while Rich got bored (I assume) and left with the intent of starting a company he could “work into my 40s on.” I like the sound of that.

Topix is an internet media play. More specifically, it’s a local advertising media play. The service takes a crawl-and-index approach to a vast array of internet news sources, then runs the resultant stew through a metadata engine which tags every news story with location and subject data. Topix then builds more than 150,000 topic- and location-specific pages, pages that live comfortably between the great gunky mass of search results, on the one hand, and the impersonal morass of most news sites on the other.

Skrenta likes to call Topix a “150,000-facet diamond,” at least one facet of which should appeal to most news consumers. But step back a few thousand feet and look at Topix’s approach, and you start to see something else at work, something instructive to anyone interested in next-generation approaches to search.

At the risk of getting mired in academic debate, one could argue that Topix is a proof point in the semantic web. Topix is not interested in every web result that might be rendered for a search “news kentfield,” for example. Instead, it searches a limited set of web pages – in this case thousands of news sites – and then annotates the content of those pages with semantic tags – latitude and longitude, for example. It then machine-generates something of an “encyclopedia page” for Kentfield, cobbling together news, weather, advertising, police blotters – a local newspaper in real time (Skrenta pointed out that Topix utilizes a newspaper-like layout on the site, because… it seems to work for the reader. Imagine that, we don’t have to throw out everything we learned over the past 200 years). Topix also creates pages by subject; Skrenta argues that in fact many of their industry-related pages – Wireless, for example, or Search Engines, are among the best sources of business information in the free web.

If you take this approach to the web – mediating SERPs with subject-related “landing pages” – you could imagine a broad scale search engine which manages a machine-created ontology of subject pages. WebFountain comes to mind, orthogonally. In fact, such an approach has been attempted by any number of companies, I have heard that Excite was working on such a project before it fell apart in 2001. (There are others, readers, can you chime in?).

Skrenta does not shy from the semantic tag, in fact, he is one of many I’ve spoken with over the course of reporting the book who agree that the web is failing to scale, and well-documented “neighborhoods” of semantic order will help bring the web back into focus. “Whole cataegories are dead on Google now,” Skrenta told me, referring to the ongoing arms race between relevance and search spam. He’s right, of course. But here’s where the story turns to Topix’s distinction.

The inevitable next question for Rich, is, “Sure, OK, broad web search engines like Google are starting to fail under the sheer weight and scale of the web. But Google, Yahoo, MSN, AOL – they’re all hard at work on this problem. Even more, all three have news products already, and are very focused on integrating local search. What makes you think a small startup like you can make it in the face of such pressure?”

A Valley entrepreneur will usually respond with one of two answers: 1. Don’t Worry, I’ll Sell While Small, Make A Good Sum, And Get A Good Job at An Established Company (as Rich did with NewHoo; Blogger and Kaltix also come to mind), or 2. I’ll Take VC Money, Take the Execution Risk, Sell Later And Get Silly Rich (the current path of, say Friendster). In other words, when the market is dominated by large, entrenched competitors, your best shot at succeeding as a startup is to sell, either at the beginning of your life (when the LargeCo is basically buying the talent and nascent market opportunity), or after you’ve scaled to the point of inflection on the build/buy curve. It costs money to get to that scale, which is where the VCs come in.

I found it wonderful to hear that Rich didn’t want to take either of these options. Instead, his goal was to simply start a neat service, get to the point of self-sustaining revenue (with six employees and an office over a trophy shop, that won’t take long), and grow it slowly. In other words, no VC money, no dreams of world domination. Rich just wants to build a nice media business, without having to either sell it or sell out. Rich met with the VCs about Topix, he told me. “After five minutes, every one of them would tell me their vision of what I had to do to win in the market,” he said. “If you take their money, you have to buy that vision.” Better to fund it small, with angels and friends, and let it grow to its own place in the sun. Could it be that the post-bubble web, version 2.0, will allow for such companies to succeed?

I certainly hope so. May a thousand such flowers bloom – and I see them springing up, over at Nick’s Gawker Media, for example, or the grandaddy of them all, craigslist. I’ve heard pretty much the exact same philosophy from both Nick and Craig: I want to run my little media company, VCs and exit strategies be dammed. Welcome to the club, Rich. May you work into your forties at Topix.

UPDATE: Rich emailed me as I posted this with this news:

“I wanted to let you know that, as of this morning, Topix.net is
now crawling over 6,000 news sources, up from 3,600. Here is an
approximate breakdown of the kinds of sources we are crawling:

24% Daily newspapers
19% AM & FM news radio stations
15% Weekly newspapers
15% B2B and consumer magazines
12% TV stations
9% College newspapers
5% Government websites
1% Weblogs

We have also enhanced our news crawler to navigate through javascript
and frames, which has increased the yield and coverage of our news
crawling.

We’re also including the news source name on the attribution line,
instead of just the domain name as we were previously.”

Related Posts Plugin for WordPress, Blogger...

8 thoughts on “Why Topix Is Different: Toward A Sustainable Model For Net Media Companies

  1. Call me a skeptic, but the whole semantic web thing seems to come down to phrase tagging on steroids. We have basically two sources of semantic tags: the author/publisher (this is not spam!), or deductions based on parsing the page itself. The latter comes down to generating phrases or word sets and finding pages which match.

  2. Susan Mernit says:

    Nice post, John. I’ve been a Topix fan since they launched–sometimes the sources are a little thin, but it’s a good product that keeps getting better.

  3. Take a look at IPDgroup.com. We’ve been doing the Topix thing since 1995. We’re already profitable and self sustaining.

    We current have three flavors.

    http://www.einnews.com
    This focuses on geo-political news from every country and region. The user can access by geography or topic. Subscription rates: $19.95 a month basic, $29.95 de luxe.

    http://www.inboxrobot.com
    Thousands of subjects, such as individual companies, individual government agencies, etc.
    Plus, create your own newsletter with your own keywords function. $19.95 a month.

    http://www.uspoliticstoday.com
    A specific subject niche-market search derivitive which currently is free. We plan to turn it into paid subscription site soon.

    More derivities to come. With our basic infrastructure we can cost effectively build unlimited niche focused sites as next generation search/media publications.

    Our paid subscribers include most of the world’s major press, most of the Fortune 500, a great many governments, NGOs, global trade companies and universities.

    We believe our products are conveniently bite-sized for users and that they will continue to thrive even with heavy-hitter competition.

  4. I think the problem with this approach is that you are always going to be OK to good as a niche resource but never great. For example, since it is not edited by human their search page will never be as good as your page John, or our http://google.weblogsinc.com, http://socialsoftware.weblogsinc.com, etc.

    So, over time you have a lot of topics, but none are category leaders… in fact they are all the #7, 8 or 9 place to go for information.

    Google News will have this feature in a couple of months I’m told.

    That being said, these guys sound very, very cool and perhaps this is a starting point.

    Ironically, they seem to be playing favorites and being a little heavy handed. NONE of our sites are listed and I heard from a blogger they demanded we link to them in order to be listed. With a program like that you are going to fail.

    Is this true Topix’s guys? Do you hold out links to people who link back to you?!?!

  5. Greg Linden says:

    The Topix approach has been pretty typical of high tech companies since the dot com bust. VC funding is now back to the point it was at before the irrational exhuberance of the late 1990′s; VCs now mostly fund established businesses that need help with marketing or other spending to fund a major expansion. So, yes, startups now need to survive much longer on their seed capital and existing revenue stream.

    But, in many ways, this is just a return to the normal state of affairs. Despite what we saw during the dot com boom, it’s isn’t normal for a company to get millions of VC funding for an idea drawn on a napkin.

  6. Kurt Johnson says:

    Skrenta is a rip-off artist. Not only did he link to a story I wrote that was on the Internet, he picked off the copyrighted photo and placed it directly on his topix site.

    It’s illegal, and I’m thinking about sueing him.

    kej

  7. Susie says:

    Topix completely blew their reputation when the forums became unmoderated. Most of the town forums have become nothing but trash with tons of libel. It is a sad state of affairs when law abiding citizens can have their reputations destroyed or teenagers can easily be cyberbullied because a company that claims they are reporting news is profiting from the destruction of others. Even worse if someone is attacked and they want a topic reviewed in a timely manner, they have to pay 19.99 with no guarantees of removal. Topix had a chance to be a good reputable company, but instead have chosen to be seen by a large group of people as a joke and now several attorney generals are after them. Topix is beyond ridiculous.

  8. Hope Joy says:

    WELCOME TO THE GREAT TEMPLE OF ILLUMINATI.
    Are you a business man or business woman, politician,
    musical, student and
    you want to be very rich, powerful and be famous in life.
    You can achieve
    your dreams by been a member of the Illuminati. With
    this all your dreams
    and heart desire can be fully accomplish, if you really
    want to be a member
    of the great Illuminati then you can contact us for you to
    get what so
    ever you need in life, email us on our direct email at
    lordvoodmoilluminatiworld@gmail.com or call us on+2347063752707

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>