Now that even <a href="http://www.zerohedge.com/news/2013-12-12/worlds-largest-hedge-fund-uses-twitter-real-time">Bridgewater has joined the Twitter craze</a> and is using user-generated content for real-time economic modelling, and who knows what else, the scramble to determine who has the most market-moving, and actionable, Twitter stream is on. With HFT algos already camped out at all the usual newswire sources (Bloomberg, Reuters, Dow Jones, etc.), the demand for a "content edge" in market-moving information has never been higher. However, that opens up a far trickier question: whose information on the fastest growing social network, one which many say may surpass Bloomberg in terms of news propagation and functionality, is credible and, by implication, whose is not? Indeed, that is the $64K question. Luckily, there is an algo for that. In a note by Castillo et al. from Yahoo Research in Spain and Chile, the authors focus on automatic methods for assessing the credibility of a given set of tweets. Specifically, they analyze microblog postings related to “trending” topics and classify them as credible or not credible, based on features extracted from them. Their results show that there are measurable differences in the way messages propagate, and that these differences can be used to classify messages automatically as credible or not credible, with precision and recall in the range of 70% to 80%.
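For readers who want a feel for what "precision and recall in the range of 70% to 80%" means for a credible/not-credible classifier, here is a minimal sketch. It is not the authors' method or code: the function name `precision_recall` and the ten labels below are made up purely for illustration and are not data from the note.

```python
def precision_recall(y_true, y_pred, positive="credible"):
    """Return (precision, recall) for the `positive` class."""
    pairs = list(zip(y_true, y_pred))
    # True positives: flagged credible and actually credible.
    tp = sum(1 for t, p in pairs if p == positive and t == positive)
    # False positives: flagged credible but actually not.
    fp = sum(1 for t, p in pairs if p == positive and t != positive)
    # False negatives: actually credible but flagged as not.
    fn = sum(1 for t, p in pairs if p != positive and t == positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical labels for ten tweets (illustrative only).
y_true = ["credible", "credible", "credible", "credible", "not",
          "not", "not", "not", "credible", "not"]
y_pred = ["credible", "credible", "credible", "not", "credible",
          "not", "not", "not", "credible", "not"]

precision, recall = precision_recall(y_true, y_pred)
print(f"precision={precision:.0%} recall={recall:.0%}")
```

Precision answers "of the tweets the algo called credible, how many really were?", while recall answers "of the truly credible tweets, how many did the algo catch?". For anyone trading on the output, the first number is arguably the one that matters more.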
