John Battelle's Search Blog Set The Data Free, And Value Will Follow

(NB: Much has been written and said on this topic, and this post is in no way complete. We’ll be exploring this issue and many others related to data at the Web 2 Summit this Fall).

Perhaps the largest problem blocking our industry today is the retardation of consumer-driven data sharing. We’re all familiar with the three-year standoff between Google and Facebook over crawling and social graph data. Given the rise of valuable mobile data streams (and subsequent and rather blinkered hand wringing about samesaid) this issue is getting far worse.

Every major (and even every minor) player realizes that “data is the next Intel inside,” and has, for the most part, taken a hoarder’s approach to the stuff. Apple, for example, ain’t letting data out of the iUniverse to third parties except in very limited circumstances. Same for Facebook and even Google, which has made hay claiming its open philosophy over the years.

And this trend is not limited to the large players. I currently have 302 photos locked up in a service called Twitpic. I’d very much like to export them into my iPhoto library, so I can mange them as part of the rest of my photo library. But the only way to do that is to “right click” on each and every one of those photos, copying them to my desktop. That’s several hours of work that most folks simply won’t do. When an enterprising coder wrote an automated script that exported photos from Twitpic to another service called Posterous, Twitpic blocked the program. That was about the time I stopped using Twitpic.

This trend, I predict, will become the petard upon which our industry will hoist itself over the next couple of years. Very well intentioned projects like DataPortability.org and others are working on this issue, but it’s largely hidden from public view and debate, because that debate has been framed as “Us versus Them”, where the “Them” are presumably evil and profit-driven companies who want to leverage our data for their own gain. (See the entire WSJ series as exhibit A in this debate).

So far, the approach companies seem to be taking boils down to this: The data we have is too valuable to let our customers understand it, manage it, and ultimately, do whatever they want with it. We’ll say soothing things, and we’ll let our users take some actions with their data – Facebook will let you authenticate using Facebook Connect on third party sites, for example – but we won’t let you take the data you’ve created on our services, put it in your own pocket (so to speak), and hand it over to other services and platforms such that those platforms can add value to your daily life.

In other words, if information is truly currency in today’s economy, so far the coins in your pocket are all from different countries, and there’s no global exchange mechanism. They’re only worth something in the nation in which they’ve been minted.

For example, you can’t pass your Facebook identity to a third party site so as to enable that site to serve you a better advertising experience. While Facebook insists that your Facebook data is, in fact, *yours*, it turns out it’s not yours if you want to use it to help a third party make money. In other words, it’s not really yours if it has true value to a third party. Which, in essence, means it’s only yours if it’s not valuable to anyone but you. But value is most often a social concept – something has value because a third person values it.

If the true value of the economy we are building is to be unlocked, that value has to flow unchecked from one party to another. Were this to be true, differentiation of services would migrate to a higher level of the stack, so to speak. Services would be considered valuable for what they did with data given to them by consumers, rather than by their ability to lock consumer’s data into their proprietary platform. New models would emerge to reward those services for adding that value, and those models would be both more robust, and far larger than the “one ring to rule them all” model currently at play.

As things stand today, our industry’s practices are gaining the attention of dead-serious regulators, spurred to potentially early lock down of how data is used based on an incomplete understanding of how value will flow through future economic models yet to be invented. (More on this in another post).

A generation from now our industry’s approach to data collection and control will seem outdated and laughable. The most valuable digital services and companies will be rewarded for what they do with openly shareable data, not by how much data they hoard and control.

Now, I live in the real world, and I understand why companies are doing what they are doing at the moment. Facebook doesn’t want third party services creating advertising networks that leverage Facebook’s social graph – that’s clearly on Facebook’s roadmap to create in the coming year or so (Twitter has taken essentially the same approach). But if you are a publisher (and caveat, I am), I want the right to interpret a data token handed to me by my reader in any way I chose. If my interpretation is poor, that reader will leave. If it adds value, the reader stays, perhaps for a bit longer, and value is created for all. If that token comes from Facebook, Facebook also gets value.

Imagine, for example, if back in the early search days, Google decided to hoard search refer data – the information that tells a site what the search term was which led a visitor to click on a particular URL. Think of how that would have retarded the web’s growth over the past decade.

Scores of new services are emerging that hope to enable a consumer-driven ecosystem of data. Let’s not lock down data early. Let’s trust that what we’re best at doing is adding value, not hoarding it.

More on this in my 2007 post The Data Bill of Rights, not to be confused with the “Commercial Data Privacy Bill of Rights,” introduced last week. While well intentioned, this bill does not consider data ownership and portability.

4 thoughts on “Set The Data Free, And Value Will Follow”

Ed says:

April 28, 2011 at 11:09 am

Agreed John, sharing is better than hoarding. At pachube, we’re working with real-time data a la the Internet of Things. We’re betting that open data is the key to commercial innovation, specifically because of the technical difficulty in managing torrents of time-series datapoints. If everything is buttoned up, this thing will never get off the ground. The data has to be portable and follow the user’s pocketbook, and that’s what we’re trying to facilitate. We believe that if we build on this model from the beginning, we can not only differentiate, but facilitate something really powerful.

Check here:
http://blog.pachube.com/2011/04/open-data-streamroller.html

DNN says:

April 28, 2011 at 12:55 pm

The beauty of the internet is that once we get the useful data and share it globally with others, the universe comes back and opens more doors for us on the world wide web by way of giving us more business. When it comes to sharing information online, all it is basically is recycling knowledge from one person to the next =)

daisy says:

April 29, 2011 at 11:24 am

I can’t wait to pay my mortgage with information tokens.

Jan Simmonds says:

April 29, 2011 at 3:35 pm

Always enjoy your journalistic, pragmatic and measured approach to subjects others report with too much emotion and not enough reality.

Facebook’s social graph is to me, like ice cream in a sieve. The outside air currently cooled by public naivety, but threatening a rapid temperature rise every time Facebook opens the door on the heat of its repeated transgressions and whilst trying to find ways of bullying its way out of the the Catch-22 situation it is in. (If you’ve ever seen the original Italian Job, I have visions of a bus on a precipice and a pile of gold too.)

All it needs is a business to arrive, founded on the premise of ownership of your own digital identity, a trusted mechanism for sharing proportionately in the value of the content you create and the same level of trust you’d expect from a bank or doctor and then the ice cream melts and the social graph is no longer stuck in one place. Rather people would be in charge of their own digital homes and encouraged to dress them with the brands they like and in such a way so as they are well motivated to provide the holy grail for marketeers.

Facebook can’t retrospectively do this because the latent resentment for their abuses of trust runs deep and the fickle public may not have been shown the warm light, but when they do, they will have long memories.

I wrote about it better here – http://blog.famebook.com/famebook/2011/03/social-media-is-not-social-yet.html and I hope you’ll forgive the link!

Share this:

4 thoughts on “Set The Data Free, And Value Will Follow”

Leave a Reply Cancel reply