Automated Media Analysis

by PaulH 10/14/2008 4:03:00 PM

 

In many industries for many years, automation has replaced old fashioned human labour and brought efficiencies and cost savings.  But while computers can be faster and more accurate than humans at many things, will they ever do what human beings do best?  Will computers ever think?  In 1950, mathematician Alan Turing wrote that “if, during a text-based conversation, a machine is indistinguishable from a human, then it could be said to be thinking”  He devised his famous Turing Test and predicted that by 2000 computer technology would have advanced so much that 30% of human judges would be fooled into thinking that a machine was human.

 

So far the Turing Test has never been passed.  Over last weekend, controversial scientist Kevin Warwick conducted the latest set of tests as part of the 18th Loebner Prize for artificial intelligence.  Unfortunately even the winning entry, a robot called Elbot, managed to convince 25% of the judges and so officially failed the test.  “We really, really have come very close” said Warwick, although it should be noted that the sample size in a field made up of computer experts and journalists was just 12!

 

When it comes to media analysis, while computers have helped enormously,  full automation has proven to be difficult given the complex task of “understanding” language and therefore attributing the correct sentiment, messages and topics.  This has resulted in the emphasis on a human analyst to read every article which means that the cost of an analysis program scales proportionately to the volume of coverage.  When you are interested in measuring your own organisation in the relatively small universe of traditional media this is fine but what if you want to benchmark yourselves against competitors.  What if you want to extend it to social-media - how do you measure the whole of the internet?  Since the cost / benefit equation becomes an issue, maybe it is time to look at automation again.

 

Indeed, many of the social-media specialist companies have gone down the automation route.  In Nathan Gilliatt’s guide to Social Media Analysis 2008, at least 23 out of the 64 companies featured use some form of automated analysis.

 

So just how good have computers got?  It is certainly true to say that there have been significant developments over the past few years, particularly in the areas of natural language processing and machine learning.  Topic identification and entity extraction has been widely researched, while measuring sentiment has also received a fair amount of attention.  In a series of experiments between 2002 and 2004, Bo Pang and Lillian Lee from Cornell University used a variety of machine learning techniques on positive and negative film reviews.  They managed to produce accuracy rates of between 75% and 86%.

 

This of course can be applied to measuring sentiment for organisations in media coverage.  A number of generic tools exist for doing exactly that with similar results.  For example Corpora Software (who were bought by Infonic and are now part of Lexalytics) have claimed a accuracy of 75%.

 

One problem with sentiment is that what is favourable in one domain is not necessarily favourable in another.  For example while the word ‘cancer’ would usually be an indicator of negative sentiment, for a cancer charity this would probably not be the case.  Domain specific machine learning can help with this issue.  We recently conducted an experiment on coverage from a leading computer anti-virus company.  When using a generic sentiment tool we managed to get an accuracy of 79%.  However, many words that would usually be considered negative: ‘virus’, ‘attack’, ‘malware’ are often found in positive articles.  By ‘training’ the sentiment model on more relevant coverage we managed to increase the accuracy to 92%!

 

So we know how good computers are, but how good are humans?  Copora reckoned that there was a 82% chance of two or more human analysts agreeing with each other while in a piece of research from the Natural Language Processing team at Microsoft, human analysts agreed on average just 74% of the time.  To quote Microsoft: “this suggests that the task of sentiment classification is difficult even for people”.  From our own experience, this is pretty low – with a tightly defined brief, a good system that reduces subjectivity backed up with extensive quality control will result in much better accuracy and certainly better than we can get with a computer.

 

US measurement expert Katie Paine has said “We’ve gotten very good at teaching computers to understand words, the problem is that they don’t understand the nuances of conversations. Computers still can’t tell the difference between sarcasm and irony. And throw in slang and you have an even bigger problem. Facebook, HP and Microsoft did extensive research before selecting measurement tools and all three insisted on human analysts. So I ask you if some of the leading players in technology don’t trust computers, why should you?”

 

What about my own thoughts?  Are computers as good as humans for analysis – no.  Would I trust a computer to measure my most important media – no.  However for getting a litmus test on the ‘long tail’ be it social media or competitor benchmarking then there is a definite use for automated analysis.  Finally I feel that this is not a black and white issue – it is not a straight choice of computers vs humans.  Why not get the best of both worlds with a hybrid model.  An emphasis on humans for the important stuff with more automation combined with a certain amount of human checking for the rest.

 

As to the future, as technology advances we may well see the removal of humans from the analysis ‘engine’.  However since even the mighty Alan Turing got his prediction wrong, you might not want to bet on this.

Comments

10/14/2008 11:54:03 PM

I agree with your thoughts on a hybrid approach, Paul. I've been leading a project for HP's IPG group for several years that does exactly that and it seems to be cost effective and highly accurate (I believe the project that Katie Paine references is a separate project within HP). We have a database of over 100,000 print, online, broadcast and blog articles on HP and its competitors.

From my experience, the accuracy rates for computers fall within the range of 70 to 80% for objective areas like content capture (using key word searches), metatagging for predefined values, etc. We haven't tried to use computers on subjective ratings like tonality and we still use humans to validate the computer's assessments but it really cuts down on the overall cost of the system.

I also agree that you need tightly defined briefs (can we get the industry to call these "tighty whities?") to increase human accuracy - even on seemly indisputable topics.

I would also argue that managers are often too focused on reaching 100% accuracy, primarily because they tend to be right brain dominant (which makes them great PR people but poor analysts). For example, our project has been criticized for missing some coverage in an obscure trade journal. I constantly find myself drawing the "census vs. survey" analogy. A omnibus survey looks at 1000 people and draws conclusions across a population of 300 million within 3 or 4 percentage points. If we do a good job capturing 70 to 80% of our universe of coverage, we may not be a complete census but the margin of error in our results is going to be fraction of a percentage point. IF computers can help us do it and save money, even better.

Rob McMurtrie us

10/15/2008 10:04:46 AM

Thanks for your comments, Rob. I definitely agree on the census / survey analogy. There does seem to be a need from PR people to capture everything. I wonder if this is related to the old school culture of measuring quantity rather than quality – that volume, advertising value equivalents and total opportunities to see are more important than whether our target audience has read our messages (and not forgetting what our target audience thinks about us).

Regarding hybrid analysis, we are finding that by combining computers and humans we can actually improve the accuracy. According to one of our head analysts “the most common mistake from human analysts is omitting product and service names when they are mentioned, rather than the more subjective measures such as tone”. These are relatively easy to capture with a computer. By getting a computer to do what we call “pre-analysis” on an article with the analyst following up with the more subjective stuff, we are really getting some significant improvements in the accuracy.

Paul gb

10/15/2008 11:23:24 AM

Interesting piece, thanks Paul - I have blogged a few comments about it here: www.whollysocial.com/.../...-media-evaluation.html

In a (probably doomed) attempt to aid clarity, the complicated Corpora / Infonic / Lexalytics story actually runs something like this:

Online research / monitoring / analysis firm Infonic developed a neat sentiment interpretation technology tool during 2003 / 2004. Software PLC Corpora Software bought Infonic partly to acquire use and development rights for this tool in 2004. Corpora rebranded the whole company under the Infonic name in 2007, and this summer Infonic PLC merged with US sentiment technology firm Lexalytics. Hope this follows sensibly - think I've confused myself now...

Chris Thomas gb

10/15/2008 11:24:12 AM

Thanks for at great post. I'm new to this area and I'm currently looking for the best tools for blog/buzz measurement. Jeremiahs list is a great start, but I could really use some insights and hands-on experience from people who have worked with the tools.

What tools do you see as being most accurate at the moment, and what tools is a safe investment for the future? I have heard a lot of good things about Radian6, but do they differ that much and is it necessary to combine several tool, in order to get the most accurate picture.

morten saxnaes

10/15/2008 3:04:01 PM

Hi Morten, I would strongly recommend that you get Nathan Gilliatt’s guide to social media measurement (http://www.socialtarget.com/research/) which will give you an overview of the most prevalent social-media vendors and how their services compare to each other. You need to ask whether you want a specific blog measurement tool or whether you want to measure blogs and online ‘buzz’ as part of a broader measurement program. If it is the former then Radion6 is a good option. They have a very smart interface and market themselves very well. One thing I like is that you can adjust the various elements used to calculate influence using a graphic-equaliser type tool and so customize it to your needs. However I also feel that social media should not be measured in isolation but should be integrated with mainstream media measurement to give a more holistic view, which is something that the expert media analysis agencies (such as Metrica!) tend to be better at providing

Paul gb

10/15/2008 5:22:45 PM

Check out this Web 2.0 approach to chatbots: http://chatbotgame.com.

Just as Deep Blue brute-forced it in chess with speed, the idea behind the Chatbot Game is to brute-force it with a huge number of user-submitted Google-like chat rules.

amichail

10/15/2008 10:13:21 PM

Interesting post - the answer has to be some combination of artificial, and real, intelligence doesn't it?

Computers to do the leg-work, people to do the softer stuff...

Though I'll stand corrected when a computer can tell the difference between "Computers are great." and "Computers are great. Not".

Chris Reed gb

10/17/2008 10:59:32 PM

Paul,

Thank you for your endorsement of the Guide. Now I don't have to plug it myself! Smile

I posted an attempt to frame the discussion of hybrid approaches last spring. I find it helpful to distinguish between software-aided human analysis and human-aided software analysis--two very different hybrid approaches.

net-savvy.com/.../human-vs-machine-analysis.html

I've had the model quoted back to me, so I suspect that others may find it constructive.

Nathan Gilliatt us

10/21/2008 12:24:42 AM

Thanks for the replies and links.

I have been looking into it, but there is one thing that I miss. I cant seem to find any companies that can track on minor languages as we have in Scandinavia. Have you any insights to this?

morten saxnaes

3/3/2009 7:05:19 PM

Morten, Just wanted to check whether you have contacted Nathan who has also commented on this post - he should be able to help you.

Richard gb

Add comment


 

  Country flag

biuquote
  • Comment
  • Preview
Loading



Powered by BlogEngine.NET 1.4.0.0

About Measurement Matters

A blog about media analysis & evaluation, PR planning, PR measurement and marketing measurement in general.

follow Metrica for media evaluation updates

Our 5 latest tweets:
The remote server returned an error: (401) Unauthorized.
Follow us on Twitter
Add to Technorati Favorites
<

Calendar

<<  September 2010  >>
MoTuWeThFrSaSu
303112345
6789101112
13141516171819
20212223242526
27282930123
45678910

View posts in large calendar

Recent comments

Tags

Login

Sign in

Business
Blogging Fusion
Blog Directory
Public Relations Blogs - BlogCatalog Blog Directory
blog directory
Blog Flux Directory
British Blogs
Wil's Domain Weblog
Dmegs Directory
Blog Directory
Business blogs
BlogDir
blogburst logo
Blog Directory
Top Spots Links
See blogs and businesses for United Kingdom