Yesterday I met with Christopher Ahlberg, the PhD co-founder of Recorded Future, a company I noted in these pages back in mid-2010. Ahlberg is one of those rare birds you just know is making stuff that matters – a scientist, an entrepreneur, a tinkerer, and an enthusiast all wrapped into one.
He ran me through Recorded Future’s technology and business model, and I found it impressive. In fact, I’m hoping I can employ it somehow into my book research. And that conditional tense of “hoping” is the main problem I have with Ahlberg’s creation – it’s a rather complicated system to use. Then again, what of worth isn’t, I suppose?
Recorded Future is, at its core, a semantic search engine that consumes tens of thousands of structured information feeds as its “crawl.” It then parses this corpus for several core assets: Entities, Events, and Time (or Dates). Recorded Future’s algorithms are particularly adept at identifying and isolating these items, then correlating them at scale. If that sounds simple, it ain’t.
The service then employs a relatively complicated query structure that allows you to project the road ahead for your question. For example, you might choose “Amazon” as your entity, and then set your timeframe for events involving Amazon over the past two months and into the next two months. Recorded Future will analyze its sources (SEC filings, blogs, news sites, etc) and create a timeline-like “map” of things that have happened and are predicted to happen with regard to Amazon over the next eight weeks. You can further refine a search by adding other entities or events (“earnings” or “CEO”, for example).
How does it work? Well, turns out the Internet is rife with whisperings of the future, you just need to learn how to listen. That’s Recordable Future’s specialty. As you might imagine, Wall Street quants and government spooks just love this stuff. I’d imagine journalists would as well, but most of us are too strapped to afford the company’s services. Embedded below is a new feature of the site, a weekly overview of a news-related entity.
Recorded Future’s engine is not limited to the sources it currently consumes. Not only is Ahlberg adding more every month, his customers can add their own corpuses. Imagine throwing Wikileaks into Recorded Future, for example.
Perhaps the coolest aspect of the service is a visualization of how entities relate to each other over time. Ahlberg showed me a search for mobile patents, then toggled into a relationship graph. Guess what entity broke into the center of the graph, connected to nearly everything else? Yup – Motorola.
Did I mention that Google is an investor in Recorded Future?
As I said, I hope to start using the service soon, and perhaps posting my findings here.