Recent Comment
Spotlight
- Reader Michael Megalli writes: It is difficult to engage in genuine conversations with the marketplace when you can't change the reality of how a company does business, what it sells, how it works with partners, etc. [go]
Recent Comments
- nmw: " BTW: I love the background image on Fred ..." [go]
- Melissa: " John, I follow you on Twitter, so I saw ..." [go]
- fred wilson: " john i think you have to use twitter to ..." [go]
- Serkan: " Looking a blinking, moving, las vegas st ..." [go]
- Adam: " Maybe I am a little slow here. But the Y ..." [go]
- nmw: " Here's one such example of the engine I ..." [go]
- nmw: " There is a quite simple way Micro ..." [go]
- nmw: " I find htt ..." [go]
- nmw: " I agree, Mark. The fact that language i ..." [go]
- Mark Porter: " I was recently at a conference where I h ..." [go]
- Andrew S: " Ads on Google's home page would not hurt ..." [go]
- John Weir: " Great post John. I'm running some cour ..." [go]
- nmw: " wow -- that almost makes the sohbet spam ..." [go]
- me@home.com: " Hi John, great book. Great blog. Great t ..." [go]
- nmw: " Chris, maybe not on HotBot.COM or Googl ..." [go]
- Chris Kilkes: " An old cliche comes to mind, "what's old ..." [go]
PERFECT FOR THAT PERSON WITH EVERYTHING
Order 'The Search'
Yup, it makes the perfect gift for that officemate or colleague who you thought had everything....including you! If you order here, I promise to sign it, assuming we can figure out the shipping...
You can also buy the audio version here.
Check my book page for more info.
Blogger's Rights
Top Posts
- The Database of Intentions (or how this all got started)
- From Pull to Point(or the first post where I riff on the "Point-To Economy")
- Google As Builder (or the point at which Google stopped being simply a search engine)
- On Google v. Yahoo
- TV and Search Merge
- On Sell Side Advertising
- Battelle Gets Searchstreams
- Search and Immortality
- Toward the Endemic (on endemic advertising)
More coming soon...
Active Topics
- 35 comments: WTF??!!! (04.17)
- 26 comments: Twitter. Oh God. (04.30)
- 15 comments: The Future of Search Series (05.08)
- 14 comments: The Music In Magazines (05.07)
- 13 comments: The Best Minds of the Web... (05.05)
Monthly Archives
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
- November 2006
- October 2006
- September 2006
- August 2006
- July 2006
- June 2006
- May 2006
- April 2006
- March 2006
- February 2006
- January 2006
- December 2005
- November 2005
- October 2005
- September 2005
- August 2005
- July 2005
- June 2005
- May 2005
- April 2005
- March 2005
- February 2005
- January 2005
- December 2004
- November 2004
- October 2004
- September 2004
- August 2004
- July 2004
- June 2004
- May 2004
- April 2004
- March 2004
- February 2004
- January 2004
- December 2003
- November 2003
- October 2003
About John Battelle
Searchblog Newsletter
Enter email to subscribe to Searchblog's newsletter:
Calendar
| Su | Mo | Tu | We | Th | Fr | Sa |
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | ||||
| 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| 11 | 12 | 13 | 14 | 15 | 16 | 17 |
| 18 | 19 | 20 | 21 | 22 | 23 | 24 |
| 25 | 26 | 27 | 28 | 29 | 30 | 31 |
Syndicate
Powered by
December 13, 2004 8:55 PM
Google Library: Talk About a Long Tail...
Mr. Page said yesterday that the project traced to the roots of Google, which he and Mr. Brin founded in 1998 after taking a leave from a graduate computer science program at Stanford where they worked on a "digital libraries" project. "What we first discussed at Stanford is now becoming practical," Mr. Page said.
The details: Google is working with Stanford, the University of Michigan, Harvard, Oxford, and the New York Public Library to make millions of books available in its index. For now the project is in pilot phase, but there are hopes and expectations this will go big in the next few years. A source told me the project was originally named Google Library, but for now it will exist under the Google Print moniker. An example of Google Print is here. The screenshot at left is what I was provided by Google for today's launch.
The implications here are significant. First, the idea that the world's knowledge, as held through books and libraries, is opening up to all via a web browser cannot be understated. It's one thing to have the an original copy of The Origin of Species on the shelves, where students and interested parties have to travel to find it. It's another to have it available to everyone via a search index and your web browser. Second, this move clearly puts Google in the category of innovator when it comes to adding information to their index. But it also raises significant business model questions, one that are both exciting and unanswered. I brought them up in an earlier post:
A very interesting case will be Google Print. As that program expands, and it's rumored that it will, dramatically, a number of questions arise. How will Google monetize out-of-copyright books? If it indeed does bring tens of thousands of out-of-print books onto the web and into its index, will it allow others to access and index that new treasure trove, or will it act more like a traditional media company, which would "own" that resource for itself? How will it choose what it brings into the index - those that might sell? Those that somehow are the most "in demand" by some measurable standard? With regard to books that are in print, will it limit itself to being soley an organizational tool supported by AdWords, or will it start to take a vig for books that are sold via the Google Print service (in fact, maybe it does already and I'm simply unaware of it - any publishers out there, let me know!)? And will the print model scale to television and movies or music?
Google Print already monetizes a selection of in-copyright books via advertising, and shares some of those revenues with the publishers. But it's a very short distance between that and, say, an affiliate link to Amazon or any other booksellers for a cut of an in copyright sale. It's also a very short route to the on demand publishing of an out of print and out of copyright book with a company that is set up to do such a deal, and I am aware of at least one that is about to launch that will provide just such a service. Of course, if you want an ebook, that can be arranged as well. For out of copyright books, the tail is extraordinarily long, and quite possibly very very profitable. In other words, this could well be a step toward diversifying Google's revenue streams away from advertising and into direct sales and/or subscriptions - ie, the content business. As one source who is familiar with the industry tells me, Google is not doing this only out of the kindness of its heart - there is a lot of money to be made in selling books, in particular books with no copyright.
I did ask Adam Smith, a manager of the Print program at Google, how Google will decide which books get scanned first. He said quite forthrightly that he did not have a good answer for me on that yet. I've heard from others that for now it's pretty random, but the question is important. As to whether Google will allow anyone else to index the books they scan, I am pretty sure the answer is no. After all, Amazon is also scanning books, and I am sure they aren't letting others in on their hard work. I'll repost if that turns out to be inaccurate. And of course there are other efforts, including Project Gutenberg and the Internet Archive. But now, we have a commercial giant who has both a mission-based (organize the world's information and make it accessible) as well as a commercially viable reason to bring this information to the world. As David Hayes, a copyright lawyer at Fenwick who worked on this deal and who I've known from my own work with his firm put it: "This will create a revolutionary new information location tool that should be a benefit to the whole world.” I for one applaud the effort - it's an example of enlightened capitalism, and I hope it thrives.
Update: I originally posted the wrong image, new image to come.- Posted by John Battelle on December 13, 2004 8:55 PM
TrackBack
Listed below are links to weblogs that reference Google Library: Talk About a Long Tail...:
» The Google Library from Oliver Thylmann's Blog
Google is close to announcing that they are scanning loads and loads of books in lots of different libraries. In the US at the moment but hey, there are lots more things out there. As always, John Battelle has a [Read More]
- Tracked on December 14, 2004 1:31 AM
» My oh my from Libraryman
A short while back my new friend TiVo caught my old friend Charlie Rose being especially useful. (Episode #10833 originally aired on 11/17/2004). The guests were the respective heads of Yahoo!, Intel, Google and Cisco. Push the geek jokes aside... [Read More]
- Tracked on December 14, 2004 3:21 AM
» Google returns to print from Platinax Internet News
When two Standford students originally took time from computer science to study the building of digital libraries, they created the foundations of what is now the billion-dollar Google brand. Now Google founders, Larry Page and Sergey Brin, have appare... [Read More]
- Tracked on December 14, 2004 5:31 AM
» 21st Century library services from DJ Alchemi
Yesterday was a big day for announcements about online access to the resources and services you would normally get in a library. The one that has got most attention is... [Read More]
- Tracked on December 14, 2004 10:15 AM
» Google returns to print from Platinax Internet News
When two Standford students originally took time from computer science to study the building of digital libraries, they created the foundations of what is now the billion-dollar Google brand. Now Google founders, Larry Page and Sergey Brin, have appare... [Read More]
- Tracked on December 14, 2004 10:32 AM
» The big news from Google: an ocean of books from Tenebris
Google's big announcement about working with some of the world's largest libraries to scan their entire collections (Google page) has received plenty of
[Read More]- Tracked on December 14, 2004 2:10 PM
» Google Print aka Google Library from Kenyan Pundit
Wow! Also check their other latest venture that's still in Beta-testing "Google Suggest" My Intellectual Property monster of an exam is tomorrow, wish me luck. [Read More]
- Tracked on December 14, 2004 6:10 PM
» Print Search from NathanSlaughter.com
There's something about studying in a library. It's not just the smell of the books or the perfect blend of wide open spaces and very narrow ones. Those elements combine with a marvelous space to person ratio and a gentle quiet to make a sublimely soot
[Read More]- Tracked on December 14, 2004 8:06 PM
» MSN Desktop Search, Yellow Pages Launch Local Search from SearchViews
[Read More]
- Tracked on December 15, 2004 8:05 AM
» MSN Launches Desktop Search, Yellow Pages Go Local from SearchViews
[Read More]
- Tracked on December 15, 2004 8:05 AM
» MSN Launches Desktop Search, Yellow Pages Go Local from SearchViews
[Read More]
- Tracked on December 15, 2004 8:07 AM
» Blogosphere response to Google's library deal from Regional: New York
As might be expected, Google's decision to digitize the library holdings of several major research institutions (including the holdings of the New York Public Library) has generated a lot of buzz and speculation in the blogosphere. Props to The Shifted... [Read More]
- Tracked on December 15, 2004 1:16 PM
» Google Beats Geico in Court, Google Spam Filters, MSN Desktop, Google Library from SEO Book.com
Google defeated Geico in the landmark trademark case, in which Geico was trying to sue Google for allowing ads to be triggered by the Geico keyword. SearchEngineWatch forums and Threadwatch have a great thread about Google spam filters. RustyBrick cove... [Read More]
- Tracked on December 15, 2004 4:15 PM
» Google Beats Geico in Court, Google Spam Filters, MSN Desktop, Google Library from SEO Book.com
Google defeated Geico in the landmark trademark case, in which Geico was trying to sue Google for allowing ads to be triggered by the Geico keyword. SearchEngineWatch forums and Threadwatch have a great thread about Google spam filters. RustyBrick cove... [Read More]
- Tracked on December 15, 2004 4:19 PM
» Google Beats Geico in Court, Google Spam Filters, MSN Desktop, Google Library from SEO Book.com
Google defeated Geico in the landmark trademark case, in which Geico was trying to sue Google for allowing ads to be triggered by the Geico keyword. SearchEngineWatch forums and Threadwatch have a great thread about Google spam filters. RustyBrick cove... [Read More]
- Tracked on December 21, 2004 4:48 PM
» MSN Launches Desktop Search, Yellow Pages Go Local from SearchViews
[Read More]
- Tracked on January 5, 2005 1:48 PM
» 21st Century library services from DJ Alchemi
Yesterday was a big day for announcements about online access to the resources and services you would normally get in a library. The one that has got most attention is... [Read More]
- Tracked on May 27, 2005 7:23 AM



Comments
[snip]It's also a very short route to the on demand publishing of an out of print and out of copyright book with a company that is set up to do such a deal, and I am aware of at least one that is about to launch that will provide just such a service.[/snip]
You mean, a company besides Lulu.com? I'm curious! Who? Who?
While the short-term focus on what's happening with this Google announcement may be on the "legacy" books sitting in the stacks and basements in the libraries, it's important to consider what the world may look like, going forward. Virtually every book now being written is being written digitally; what will "publishing" look like five years from now? Will one deliver a complete manuscript to the publisher, which will then "MIRV" it into submissions to the printer, to Amazon for "inside the book" search, and to Google? What *more* could be sent to either of the latter, given that they live with fewer format constraints than print?
There was a fascinating development some ten+ years ago now, in the Intelligence Community, when NSA collection analysts started putting their names and phone numbers on intelligence cables, and we analyst/readers got a "direct line" back to the source... classification "sources and methods" constraints might have meant that we couldn't be told any more than was contained in the cable, but we could provide direct feedback and guidance to collectors. I could see either Google or Amazon becoming services to enhance writer/reader communication, on beyond the obvious utility they provide to Search.
How long will the book survive? Doesn't searching within the book for precise passages tear apart the structure of the book? Good readers paintstakingly pour over the whole text for that gem. Now they only need to write a few good searches. The book is demystified - wripped apart. Reading will be increasingly selected by the readers' previously conceived questions.
The same forces against the structure of the book (as a whole) oppose the print industry. I don't need a printer to send me the 20 page passage. I can handle that in 30 seconds. Readers will demand this incremental deliver. Publishers will deliver. Writers will groan.
This isn't a religious comment. It's just an observation.
I think what Google are undertaking in this program is truly fantastic! To have all that information available through Google will really bring information that would be otherwise unobtainable to the masses.
I have always been frustrated that information held by institutions and certain libraries was only ever available to us by "invitation" or as in some cases where only one copy exists, by traveling halfway around the globe.
I for one hope this sort of venture catches on!
I have read disquieting rumours that the search results when you get them will be text as image and not as plain text - that's the way it is presented in demos apparently. If so it would be terrible. I want to be able to copy and paste the text or download it into my Palm once I find it. It is public domain after all... Like John B I was a little worried Google would ensure you could only access the text via Google but it appears that (in the case of the U of Mich) they are going to make the results of their digitisation available directly to the instutition as well means it should be available via several interfaces and several search engines. Which makes me wonder why they would spend $$$ doing this? It's an awfully expensive bit of good PR... I suppose it may still be *easier* to get at it via Google than an alternative search engine because they integrate things better and include more metadata...
I was thinking on this whole Google/library thing during a long trip back to Michigan, and it occurred to me that having search entities like Google or its competitors arrange for book searching probably stunts the development of open standards and architectures for "blinded" searching (e.g., allowing one to search against a corpus derived from copyrighted works, and receiving pointers/clips sufficient to lead you on to purchase, or otherwise seek information from them). Each of the major search powers will likely create its own proprietary universe of searchability, where what might be better (e.g., more open to allow for other tools, competition, etc.) would be standards for any publisher to build toward.
While it may not be legal for people to scan and make available as text, books that are written more recently, it certainly is easy for them to do so. If it becomes routine for people to search books older than a certain date they will probably begin to wonder why they can't search through books written slightly more recently which are not really recent or contemporary at all. Then we will see large numbers of people posting and downloading books with p2p networks as we are seeing now with music and movies.
The 70+ year copyright laws are just going to have to go. They are dinosaurs from the period of the tyranny of geography, the scarcity of shelf space and the creation of the special interests themselves.
I do not argue that there should be no copyrights, only that their duration should be short, as they were originally.
I think this is a great idea. As I understand it, on copyrighted materials Google will only retain exerpts. It seems to me that, beyond this, some kind of "pay per view" system could be worked out.
I appreciate the issues of ownership and compensation, and these need to be honored.
For myself, time always being at an essence, in Bloomington IN paying some outrageous parking fee to march 6 blocks in the hail and snow and spend 3 hours to find out that a book I MAY want is at some other library is a "No Way". That stuff is for 100 years ago. Getting a fair chance at seeing that it may be what I want, and paying some nominal fee to see more, would be well worth the price. Would it be worth a couple of bucks? You Betcha!
I think there's an answer in here, somewhere, and I think everyone would benefit. This is a chance for libraries, which are such an incredibly valuable resource, to make the step into the next era.
Jim
I've just been hanging out not getting anything done. What can I say? I've basically been doing nothing worth mentioning, but pfft. Not that it matters. Pretty much nothing exciting happening to speak of. I haven't been up to much these days.
I just don't have anything to say. Not that it matters. Eh. I've just been staying at home doing nothing, but I don't care. That's how it is.
I haven't gotten anything done today. I feel like a fog, but what can I say? I've just been letting everything wash over me lately, not that it matters. Shrug.
begin to wonder why they can't search through books written slightly more recently which are not really recent or contemporary at all. Then we will see large numbers of people posting and downloading books with p2p networks as we are seeing now with music and movies.
Leave a comment