Recording the future using social media to predict it (podcast)

tracking future stock prices with social media

This week’s podcast contains a great conversation with Evan Sparks of Recorded Future on this week’s Tradestreaming Radio.

Evan and Recorded Future are working to bring analysis of social media chatter to a whole ‘nother level.  Combining sophisticated linguistical analysis and a solid background in devising investment strategies (Evan worked as a quantitative equity analyst previously), Recorded Future’s platform scans over online sentiment and predicts when specific events should occur in the future.

For example, if I’m sizing up making an investment in Amazon.com ($AMZN) and I know they’re looking to go toe-t0-toe with Netflix’s ($NFLX) DVD rental offering, I could use Recorded Future to handicap the likelihood that $AMZN makes an acquisition in the space and when the Internet expects this to occur (Amazon announced the purchase of a European competitor, LoveFilm, yesterday). Retail investors and institutional money managers are turning to Recorded Future to help them tap the inherent potential value in all the content in the blogosphere.

Check out the podcast below (or if you don’t see it, here).  If you use iTunes, our program can be found here. We’ve also made available all our radio archives, as well.

Read the podcast’s transcript.

Podcast Powered By Podbean

Learn more:

Recorded Future (homepage)

Recorded Future (video channel)

Predictive Signals (Recorded Future’s quant blog)

Transcript of this podcast

Photo credit: <a href=”https://www.flickr.com/photos/53264931@N03/18905239111/”>vikwaters</a> / <a href=”https://visualhunt.com”>Visualhunt</a> / <a href=”https://creativecommons.org/licenses/by-nd/2.0/”>CC BY-ND</a>

Letter to the Google founder dudes

Wassup, Larry and Sergey.

Nice earnings last night.  26% growth, over $8b in the quarter.  Not too shabby.  I’m taking my $GOOG stock to the bank.

Oh, also, cool what you did to the “supposed” adult supervision afforded by (well, ex-) CEO Eric Schmidt.  It’s pretty sick how Larry just wrested control away from him and sidelined him to take back the company.  I’d give all my 24 hr access to fruitloops on the Googleplex to see the look on his face when you dropped that bomb on him.

For me, it didn’t really rock my world.  From the word on the Street (or at least on campus here), he never really had any control anyway. Like when you guys bought Android and didn’t even tell him.  That was awesome.

Eric may of had to go — I mean, with over 90% search dominance in so many markets, yet the inability to really monetize anything else and the makings of a Microsoft-type ($MSFT) European anti-trust inquisition mounting against the search firm, the company needed some fresh blood.

But this peep isn’t exactly sure that a return to its roots is what the firm needs.  With the growth in the company over the past few years (23k f’in Googlers, baby!), we’re kinda all over the place.  Add a lame attempt at buying Groupon — we need new leadership.  I mean, the last thing we want is to pull a Yang or Ballmer and bring back the lifeblood of our startup days — our founders — to steer these ships that have certainly outgrown their own abilities to manage such large organizations.

Oops — is that what just happened?  $YHOO’s Carol Bartz maybe-coulda-woulda been a better choice??  Maybe not.

Rock on,

Your faithful (and rich) Googler

Bond rater, heal thyself. If not, we’re bringing in new goons

The WSJ has an interesting article on a relatively new player to the bond rating game, Kroll.  If you’re asking yourself, is that the same Kroll that ran a global private security organization?  Uh-huh.  I think the world is ready for a little smackdown in sovereign debt ratings.

According to the article,

Jules Kroll created the newcomer, Kroll Bond Ratings Agency Inc., after selling his eponymous corporate investigations firm to insurance broker Marsh & McLennan in 2004.

Mr. Kroll’s new firm has provided financial strength ratings to banks and credit unions, among others, through a boutique ratings company, LACE Financial, that it acquired last August. Wednesday marks its first effort to rate individual securities through Kroll Bond Ratings.

In a move distancing itself from given practices in the bond ratings game, Kroll plans to make use of his extensive expertise in corporate frisking to go beyond what he (and everyone else) feels are too lax standards in an industry that not only missed subprime, but helped pass the buck into our current mess.

There may definitely be a new sherriff in town and his name, Kroll.  Oh yeah, he’ll also make use of K2 Consulting, a firm similar to Kroll’s previous endeavor, run by his son.

We are embracing due diligence, the legacy from my prior life

Kroll 1, S&P/Fitch and any debt issuer 0.

Optimizing purchasing decisions and when to buy

Lifehacker had a great email yesterday entitled The Best Times to Buy Anything in 2011.

Well-researched and totally tweaked by crowdsourcing this information, this post defines the most optimal times to purchase specific items throughout the year based on varying criteria.

For example, for some reason, laptops go cheap in April as do Broadway tickets in the 3rd quarter.

My favorite?  I’ll look to buy a car in September, an off-color car in December, and as one commenter suggested, try buying a car on Super Bowl Sunday — the salespeople are distracted and open to low ball offers.

Much of this information and concepts were described in Buy Ketchup in May and Fly at Noon (affiliate link). You can find the author’s website here.

Source

The Best Times to Buy Anything in 2011 (Lifehacker)

More sources for well-timed buying (Lifehacker)

Hedge fund index performance for December 2010

Final performance for the Dow Jones Credit Suisse Hedge Fund Index (“Broad Index”) is confirmed up 2.90% in December and up 10.95% YTD

Oliver Schupp, President of Credit Suisse Index Co., Inc., said, “The Dow Jones Credit Suisse Hedge Fund Index rose 2.90% in December, with nine out of ten sectors posting positive performance for the month. Managed Futures was the best performing sector, finishing up 5.42%. Positive performance was also seen in the Event Driven sector, which finished up 3.93%, however Global Macro remains the strongest performing sector in 2010 after posting positive performance of 2.67% in December.”

Performance for the Broad Index and its ten sub-strategies is calculated monthly. December, November and year-to-date performance numbers are listed below and are available at www.hedgeindex.com.

Category Dec 2010 Nov 2010 YTD 10
Dow Jones Credit Suisse Hedge Fund Index 2.90% -0.18% 10.95%
Convertible Arbitrage 1.16% 0.04% 10.95%
Dedicated Short Bias -5.86% -2.36% -22.47%
Emerging Markets 1.50% -0.38% 11.34%
Equity Market Neutral 1.74% -2.51% -0.85%
Event Driven 3.93% 0.15% 12.63%
Distressed 2.73% 0.33% 10.26%
Multi-Strategy 4.78% 0.04% 14.36%
Risk Arbitrage 1.14% -1.44% 3.17%
Fixed Income Arbitrage 0.61% 0.74% 12.51%
Global Macro 2.67% -0.52% 13.47%
Long/Short Equity 3.42% 0.46% 9.28%
Managed Futures 5.42% -4.11% 12.22%
Multi-Strategy 1.70% 0.30% 9.29%

Transcript (Podcast 4): Recorded Future and Investing Using Social Media

This transcript was taken off a recent episode of Tradestreaming Radio which can be found here.

******************

Hi! I’m Zack Miller, author of the recent book TradeStream Your Way to Profits: Building a Killer Portfolio in the Age of Social Media, and you’re listening to Tradestreaming Radio, our home in the internet radio space. This is our place to discuss how technology is helping investors to become better, smarter, and more accurate at what they do.

You can find the Tradestreaming podcast on iTunes. You can also find lots of other material relating to this podcast, as well as archives of our programs at my website www.tradestreaming.com There’s lots of other great content there as well, and I recommend you check it out.

We’ve got a great interview for today. We have Evan Sparks, who after several years of developing quantitative equity strategies on the buy side, is a product engineer at Boston-based at Recorded Future. Recorded Future uses linguistic analysis to harness the predictive power of the web for credit and equity research.

I’ll let Evan introduce himself and his firm next.

Sparks: Absolutely. As you know, I went to Dartmouth by way of background before Recorded Future. I spent several years as an analysis at a mid-sized US asset manager as a quantitative analyst building actively managed strategies in the US equity space, portfolios designed to beat their benchmark, which was typically Russell 3000, or in hedge strategies of various kinds…

I joined Recorded Future last summer. Really with the focus of taking this platform that we’ve built and finding really solid uses for it, and building out towards the financial services space, particularly focusing on quant finance, but also to some degree on some more traditional discretionary financial research.

Miller: Evan, let’s first talk about the platform that Recorded Future has developed, and then let’s drill down further to discuss the applications that some of your work and technology has for the investing field.

Sparks: Sure thing. I guess to begin, sort of our central tenet at Recorded Future, our core belief is that we think the content of the web has predictive power. We think that by being able to break down and quantify what’s going on in the web in clever ways we can find some interesting things that we can make predictions about. This might be financial market data, this might also be, we have some customers in the government intelligence space, as well as people doing things like brand management with the product.

Miller: There have been a lot of companies on line that have focused on using semantic analysis for predictive capabilities, particularly in an investment field. This has always been seen as the holy grail of investing. I wanted to see what set Recorded Future apart.

Sparks: Right. We take kind of a different approach. We do linguistic analysis of the text content, this massive repository of unstructured text content that’s on the web, and we try to apply structure to it. We do things like entity extraction, so any reference to a company, or a product, or a person, or a place, we extract the fact that has occurred in a particular document.

We also do event extractions. If two companies are involved in a acquisition, or a potential acquisition, we extract the fact that there was this acquisition event, or discussion of an acquisition and which two companies it was between, for instance.

We have things like capital markets events, as well as product releases, and also natural disasters. We have about 100-150 different event types that we capture at this point.

Miller: Once you capture that data, how do you turn it into useful information? How do you then process it so that you can begin predicting future events?

Sparks: The third piece that we capture from all of that is also any time references within the content of that text. We try to be very sophisticated about how we capture references to time. We capture when was an article published, when was it downloaded by our system, but we also capture within the text itself references to things like, “next week,” or, “on July 22nd,” or, “in 2012, this may happen.”

By being intelligent about how you capture references to future events you can start to say, “Hey, give me back any references that have been to future M&A activity in the pharmaceuticals industry over the last year.” You can start to see patterns emerge from that data.

So, some more concrete examples I guess of things that we’ve done include looking at whether negative sentiment around the S&P 500 index is a leading indicator of next month’s volatility in the S&P 500. Do we see a pattern there? We have seen some strong statically relationships in the space of volatility, dollar volume for trading, as well as in abnormal returns, in some cases.

Miller: Is the next step for Recorded Future to actually create an investment strategy around some of the data that you guys are bubbling up? Or, is your role as sort of a content provider, content mixer, just to sort of crunch the numbers then hand it off to your clients for future processing?

Sparks: For the most part we’ve been working closely with customers at hedge funds and banks, as well as smaller trading shops, to help them take their ideas and implement them with Recorded Future data. We think we have a data source that’s pretty orthogonal to traditional financial data sources. Normally people look at quarterly filings, and analyst estimates, and of course market and price data, but we think that we provide a different channel, a different set of metrics that you can judge a trading strategy based on.

Miller: From what I’ve seen, Evan, the data and information coming out of Recorded Future seems really valuable. I hope as you get the word out you’ll grow your client base, and they will devise profitable trading strategies based off of this type of analysis.

The question always arises in something like this, why not just raise some money, close off this black box and invest in a proprietary basis? Start your own hedge fund based upon some of the analysis that you’re doing?

Sparks: To answer the question, “Why don’t we start our own hedge fund with this stuff?” I think really what we’re building and the key value that we’re delivering is an ability and a flexible platform that allows anybody to answer complicated questions about the world.

Our expertise, while we think we’re pretty good at forming those kinds of questions around this data, certainly people are going to have their own ideas about what they really want to look at and where they think those values are derived. So, what are certain experts saying about this company or that company? We think maybe we can help you identify who the experts are, but it’s up to the user for how they want to interpret the results and make their investment decisions.

I think we provide a great data platform, and a great analytic platform, but the hard work is really in the analysis, I guess.

Miller: In some sense, and this maybe a poor analogy, you’re selling tools in the gold rush that is sort of what’s going on in quantitative research right now. Is that a fair analogy?

Sparks: Yeah, I’m not sure that it’s necessarily a gold rush. Certainly we’re selling tools to all types of investors, not just quants. Via our web-based UI, we’ve certainly had a lot of discretionary researchers show tremendous interest in taming this big massive data store that is the web, and getting the slices that they need out of it. It’s applicable to lots of different areas in finance.

Miller: That’s very interesting. One of the outputs I saw of the research that you produce, and I believe you were the author of the article, some of the findings your firm had around the crowded hedge fund trade. I guess there is no more crowded trade than Apple these days. Can you tell us a little bit about that research, what it means, and maybe sort of elude to some of the directions you’re going to be taking with your research in the future?

Sparks: The idea here was how do we quantify the level of discussion, or the level of crowdedness around a particular trade based on online media? We wanted to get a measure that was completely orthogonal from market data, something that’s not baked into prices, baked into flows, that kind of thing. Something that you can’t get anywhere else but from looking at online media.

What we did was construct pretty simple sort of basket of words based on relative frequencies of the words and phrases in academic and business articles about momentum investing, so momentum trade, sort of the classic papers as well as some of the newer ones. When people talk about it in blog posts, or The Wall Street Journal, or whatever, what are the words they use there that they don’t use in other kinds of articles?

Based on these word counts and frequencies we then look, over time, throughout our entire repository of business and finance articles. Per day we kind of take an average of this metric we’ve developed based on the usage of these words and phrases.

What you see in the chart and the article is we’ve plotted this over time. We took a look at how has chatter around this concept of momentum investing changed over time? What we saw when we plotted the performance of this metric against the performance of a mutual fund that follows a momentum investing strategy, according to its prospectus, was an inverse correlation, particularly over the last year between our metric and the performance’s fund.

This violated our prior. We certainly thought there would be a positive correlation between people talking about the trade and the performance of the trade. Thinking the logic would be people talk about it more, so they’re buying into it more, driving the price up.

But this negative correlation was pretty interesting when you think about it in kind of an ecological context, around this idea of crowded trade; more people fighting for the same pennies, and they get harder and harder to pick up, and maybe it’s tougher to perform in that kind of scenario.

That’s our current intuition around why this particular trade works. Certainly some future steps would be to dig in a little more, maybe look at fund flows in and out of momentum funds, maybe look at other types of investment strategies. Certainly value is one that people talk about. There are other trades that you can get into. If you can find sufficient data online around the discussion of these trades, maybe there are some interesting signals there.

Miller: I guess what struck me about the article was exactly what you pointed out, sort of its counter intuitiveness. I started thinking a little bit deeper, your outtake from sort of the ecological perspective about being harder and harder to get those falling pennies certainly is one way to explain it.

I was thinking, just recently, it could just also be baked into sort of the methodology at that particular fund that you looked at, right? Maybe it’s not truly a momentum fund, or something like that. That certainly seems to me plausible, not necessarily explanatory, but plausible.

Sparks: Yes. We definitely have thought about that a little bit. We have seen this pattern, this was the one mutual fund that I could find with a sufficient history for comparison against our metric, but over the last year other momentum funds show a very similar pattern.

It seemed to be somewhat persistent. We definitely don’t have enough data points there to make a clear statement there. I think definitely a very interesting area worth pursuing.

Miller: I then asked Evan how curious investors could interface with Recorded Future, and get a feel for what they have to offer, and maybe access some of their services.

Sparks: The first thing you can do if you want to get an idea of what we offer for free, we offer free future alerts, which are a way that you can set up a query in our system and it will send you an email alert for any new results that come online. If you’re interested in M&A rumors in the pharmaceutical space, you could set up a futures alert for acquisitions, pharmaceutical industry, any time in the future. You would get references that come up online to just that acquisitions in pharmaceuticals anytime in the future.

If that’s the kind of thing you like, if you’re interested in the results that are coming back, we encourage you to sign up for our premium product, which is $149 a month. It gives you a much, much richer experience. Several visualizations of the data that come back, timeline view, and network view of what are the entities mentioned together typically in online media, and how has that pattern changed over time, as well as a few other views of the data.

This is the platform that lets you really sort of dig into what’s going on in online media, how is it changing, and how is it changing with respect to this crucial dimension of time that we think is so important.

Miller: Evan, is there a plan to- and I know this is a sensitive question- to maybe syndicate some of these tools, some of the findings that you have within the system into other platforms, into other systems- Yahoo Finance, Bloomberg? That investors are using where they can encounter your tools and research there, as opposed to having it so they come to your website?

Sparks: Everything we have via the web platform is embeddable by default. Anything you see, any visualization you generate with the product, you can embed that in your blog, or your website. We’re adding some social media tools, currently. Definitely we want to get people interested in the platform and what this data can bring to whatever their investment process, or research requirements are.

Miller: Evan, thanks so much for participating in this podcast. This has been really educational for me. I hope it’s been instructional to our listeners, and our readers. Recorded Future sounds quite interesting. I’m going to keep an eye out for it in the future. It’s been a pleasure reading some of the findings you’ve produced on your blog. I’ll link to all this material from my blog, so that my readers can have access to it.

Thanks again.

Sparks: Great. My pleasure. Thanks a lot, Zack. Great talking to you.

Miller: That was Evan Sparks, engineer at Recorded Future, resident genius. It was just a really interesting conversation with a company that I think is sort of breaking out and making really usable and accessible some of these linguistic  analytical tools for investors.

We know, clearly, that there’s a tremendous amount of information residing online, both in the micro and macro level. Obviously drilling down, and looking at 13F filings from an insider, from a hedge fund, and mimicking those are some of the things I talk about in my book, and on my blog. Piggyback investing is important.

But, on a macro level there’s a lot of noise going on. Tools like Recorded Future are helping investors sort of provide an analytical layer to try to make sense of some of those things. The next step is to then take those and devise a strategy around them, back test them, and start predicting events into the future. Again, that just opens up a lot of doors for both individual and professional investors going forward.

Thanks again for tuning into the Tradestreaming podcast. I always appreciate your listening. Head to the blog at www.tradestreaming.com I’ll have some additional information there. I hope you turn in again soon. Thanks a lot.

Playing not to lose sometimes means you lose

Everyone has seen pro basketball players commit fouls early in a game. The coach faces a conundrum. Does he

  1. Leave the player in the game: he may play to full potential and contribute
  2. Yank him: scared of fouling out of the game, he may play sub-optimally

This decision making process has always been kind of locker room chatter.  Until recently.  Earlier this month, 2 researchers from goliath asset manager AllianceBernstein and an academic from NYU Poly addressed solving this issue using financial research in a paper entitled, How much trouble is early foul trouble?

The researchers actually came up with a formula that’s so important, failure to heed it — a single incorrect decision — could decide the game.  The research shows that it is optimal to yank starting players on a “Q+1 basis” (when they commit one more foul than the current quarter.

For example

on January 20, 2007, the Cleveland Cavaliers visited the Golden State Warriors. With 4:45 left in the third quarter, Golden State starter Andris Biedrins  committed a personal  foul. After the free throw to complete the
three point play caused by his foul, the Warriors were leading by two points. This was Biedrins’s fourth foul and he was  therefore in “Q+1” foul trouble and should have been yanked, but Don Nelson decided to keep Biedrins in the game for more than four minutes, only substituting him out with 36 seconds left in the quarter.

Was the coach’s decision to leave the foul-laden player in the game correct?

During that time, Biedrins did not pick up another foul. Biedrins  re-entered the game with 8:20 left in the fourth quarter and the Warriors up 5. Then at the 5:46 mark, he picked up his fifth foul of the game. Again, Don Nelson again kept Biedrins in the game. Finally, with 1:06 left in the fourth quarter and a tie game, Biedrins fouled out.

Cleveland eventually won the game in overtime. And the researchers question Nelson’s decision to leave his player in the game:

Nelson had two chances to  yank  Biedrins when he was in foul trouble but chose not to. As we will see, Nelson’s decision was not an aberration for him;  that year, he rarely yanked foul troubled starters even though they were in foul trouble more  than any other team. But perhaps Nelson’s entire strategic approach was wrong.

Parallels to investing

If early foul trouble means that pro ball players should sit things out for the sake of the team, I think you could draw some parallels to investors running money.

  • periodic performance review: like athletes, investors of all sizes can judge performance at any time (returns, to some extent, are the ultimate performance metric)
  • high stakes, high stress: everyone in this game is playing f’real
  • decision making appears to be serially related: investors behave differently when they’re winning versus when they’re losing.  I remember walking into my portfolio manager’s trading floor to interview.  I waited patiently while he was on a call — after he hung up, he said, “&*(@!”.  I asked what had happened and he said dispassionately that he had lost $6M on a trade.  When I asked what he was doing with it, he said simply that he is selling it and moving on.

Clearly some investors — more traders than buy-and-holders — strike certain rhythmic patterns in their investing.  You have to behave differently when suffering big losses than you would when up big on a year.  The great ones know how to optimize this tradeoff and not simply double-down just to salvage a trade-gone-bad.

Has Harbinger’s Phil Falcone lost his touch?  He’s suffered redemptions, poor performance, trader exodus.  Amidst all this, he has made a career-impacting decision to invest heavily in a huge bet on a private wireless firm.  Is he playing with too-many fouls or is this the master’s equivalent of bench time?  History will certainly be the judge.

Source

Maymin, Allan, Maymin, Philip and Shen, Eugene, How Much Trouble is Early Foul Trouble? (January 7, 2011). Available at SSRN: https://ssrn.com/abstract=1736633

New study uncovers the ‘new’ influentials in financial blogosphere

Interesting new study (see below) from Mindful Money.  Investors are certainly turning to non-traditional sources of information as inputs into their investment decisions.  But this appears to be the first time that the interconnections within the financial blogosphere have been studied.  The social media train — and specifically, actionableinvestable content — continues to chug ahead.

There are few surprises in the top rated/linked-to investment and economic blogs and websites.  Many of them are read by hundreds of thousands of investors daily.  What’s interesting here is to see how these blogs act as sources of information and credibility for the rest of us trying our hands at providing just a modicum of thoughtful investment ideas online.