Social web & social media, consultancy, training and advice from a flâneur of the internets. Blogger, writer, broadcaster and runner of Birmingham: It's Not Shit. I also do the odd bit of art.
December 16th, 2010

How Christmassy are you feeling?

The Christmas-o-meter bounder

Just a little fun seasonal project I’ve made with the layout and design help of Gavin Wray.

It works very much like my other sentiment analysis tools, but with a sprinkling of Santa’s magic. Santa’s magic in this instance being that any tweets with the words ‘Christmas’ or ‘Xmas’ in them are weighted doubly—that is the scores are counted twice for the purpose of producing the mean score.

So, try the Christmas-o-meter and see how Christmassy you’re feeling.

December 2nd, 2010

Sentiment Analysis of a Football Match

(click through for big)

Last night I turned my sentiment analysis tool on two hashtags: #bcfc and #avfc, the most widely used tags to refer to Birmingham City and Aston Villa during their League Cup quarter final game. It was a chance to see if visualising to ‘competing’ tags around the same event would be a useful exercise.

Caveats that would apply to this:

  • Some people use the tags instead of team names, meaning that they might be used by people supporting the other team (or no team at all)—most fans, though seem to tag with just the hashtag representing their team.
  • Some tweeters use both—these tweets could be removed technically, but make no difference to the comparative scores.
  • If there’s a subject that uses more slang or metaphor than football, it’s not often discussed on Twitter.

There was a generally a downward trend throughout the match, tension? Bad football? It could have been both. The first two goals seemed to have a much bigger impact than the third—this I don’t quite understand, but it seems to be more about the tweets themselves than the tool.

I could see how a special subject-set of emotion words could be created for football, which could cope with more nuanced or unusual words. It’s something to consider.

The sentiment scores in a Google spreadsheet, csv files: #avfc tweets (657 of which were during the game), #bcfc tweets (370 during).

The obligatory Wordle:

October 19th, 2010

Sentiment Analysis of the X-Factor

As promised, I turned my Twitter sentiment analysis tool on the big TV/social web phenomenon that is the X-Factor. I started the script running at around 6:30pm and off again at 10:30pm — but the really interesting bit is during the show itself (thankfully watching the results stream in meant I didn’t have to watch the show itself).

It ran every minute and looked at the most recent 1,000 tweets tagged #xfactor.

emThe real reason for using the X-Factor is that I was aware just how violently the emotions can swing on Twitter when watching—and also it is a very defined timeline of events. The Valence (the happy-sad ratio, red line) had greater peaks and troughs in short times than any sentiment graphing project I’ve tried before.

The differences are far more prominent in the graph than any trends over the whole two and a half hours. Arousal (awake-ness, for want of a better word) was relatively constant, as was dominance (the feeling of control), although both jump up and down (within boundaries) along with Valence.

And who was ever-so unpopular around 8:50pm? This chap:

Next, I think I’ll try Question Time.

by Jon Bounds | Posted in my projects | Tags: , , ,
October 7th, 2010

Sentiment Analysis and Twitter ‘wormals’

I’ve tried two experiments with the “is Birmingham happy” algorithm in the last few days, as they’re not based on place it makes more sense to use the popular term ‘sentiment analysis’ to refer to what it’s doing in this instance. As they were both reasonably short uses it was posible to update the reading often (and use a smaller number of tweets as the sample, giving more variation in the average scores) and give the sentiment graphs a live ‘wormal’ feeling, watching the ratings change over time.

First was on the Personal Democracy Forum EU conference in Barcelona, for the length of the two-day conference I monitored the hashtag #pdfeu every five minutes:

(click image for larger view)

The highest rating was 64.4% (at 12:45pm on Tuesday), the lowest 49.6% (Monday at 12:14pm during a short power failure). What was interesting to me was that the “arousal” rating seemed to work well as it stayed pretty steady during the power failure  (or even leaped up a little) even as the happiness of the hashtag users  dived. Post-lunch conference lulls and periods of excitement (the big spikes in day two, at least, corresponded with much applause) were mapped quite accurately.

The overall average was 57.29%. If you would like to explore or graph the data yourself, you can see in all in a Google Spreadsheet here.

Secondly I tried a much shorter and more mainstream application, David Cameron’s speech to the Conservative Party Conference:

cpchappyThe emotion tracking tool graphed here ran every 10 seconds during David Cameron’s speech to the CPC and analysed the last 100 tweets with the hashtag #cpc10 and the word “tories”. I chose two versions as I wasn’t sure that non-Conservative supporters would use the ‘official’ hashtag, I theorised that they would be likely to use the word ‘tories’. As it turned out I think that while there was a more even spread of pro and anti political types using the hashtag than I expected, but the ‘tories’ Tweeters were definitely more hostile. (See the data.) There was greater movement across the graph than on any other test I’ve run.

Conclusions? None so far, other than that I think this might be a very useful tool, and that more interesting data is created the more Tweets you have and the more you can afford (server-wise) to poll for results. I’m itching to try it on another big live event with conflicting opinions, that might mean training it on a reality TV event. Roll on the X-Factor.

October 2nd, 2010

Engaging Visitors Through Social Media

Here are the pertiant bits of the flyer for a short course myself and Chris Unitt have devised and will be delivering as part of Birmingham’s Hello Business event. It’s aimed at those running communications for visitor attractions (museums, theatres, even theme parks) and will be focused on strategies to attract and retain customers.

If that sounds like something that would be of use to you email events@businesslinkwm.co.uk.

by Jon Bounds | Posted in Conferences & Talks, social media work | Tags:
September 29th, 2010

Is Birmingham Happy?

I’ve been running a, very rough, scrape of the Birmingham (UK) based interweb for ‘emotional wellbeing’ since April of 2008. Simply put a script running twice a day read in Tweets, news headlines and (originally) blog posts and compared the words within them to a table I’d drawn up of ‘emotion’ words and fairly arbitrary scores.

It was surprisingly interesting to watch: despite its roughness, the internal consistency let patterns emerge. It broadly followed weather and sports results, with some peaks and dips you could map to specific happenings, or news stories.

graph of emotion scores

It lead to a spin off focussing on Tweets from MPs, which I think influenced some of the developments that Tweetminster produced in the next year or so.

It was the patterns that lead me to keep putting off improving the algorithm, but recent Twitter API developments meant I had to do some work anyway and that (together with another project, of which more soon) gave me the impetus to give the project an overhaul. And here’s how it works now…

Twitter’s geolocation services are now much improved, so I can specify a point (the centre of Victoria Square in Birmingham) and a radius (10 miles) and get a reasonably accurate dump of Tweet data back—the algorithm calls for the most recent 1000.

Twitter is now the sole focus of data, in keeping with the ‘conversational pychogeography‘ aims of the project (in essence, words used without too much pre-meditation are more interesting than those written purely for publication). It also provides much more and more reactive data.

The words contained within these tweets are then compared to data from the University of Florida (The Affective Norms for English Words - PDF link). Within that data set each word covered (there are around a thousand in the set I’ve using) is given a score for Valence (sad to happy on a scale 0-10), Arousal (asleep to awake on a scale of 0-10) and Dominance (feeling lack of control to feeling in control  on a scale of 0-10). The scores are then collated and a mean calculated. The overall emotional wellbeing score here is calculated as a mean of the three individual means, although the scores are revealed individually on the site.

I’m unsure if combining the results in this way is the best, which is why the site reveals the working — the Twitter feed just goes with one value for ease of understanding and adds a rating adjective too:

if ($brumemotion<100){$rating="fantastic";}
if ($brumemotion<90){$rating="superb";}
if ($brumemotion<80){$rating="good";}
if ($brumemotion<70){$rating="okay";}
if ($brumemotion<60){$rating="average";}
if ($brumemotion<50){$rating="quiet";}
if ($brumemotion<40){$rating="subdued";}
if ($brumemotion<30){$rating="low";}
if ($brumemotion<20){$rating="dreadful";}
if ($brumemotion<10){$rating="awful";}

The Twitter feed produces results twice a day, and these scores are being saved to visualise more graphically, but the website updates every ten seconds (and will self-refresh if you stay on the site) and also displays a word cloud of the currently found ‘emotion words’:

is Brum happy right now?

Thoughts on further development

I’ve been experimenting with more local results (here is a version running on just one Birmingham post code — B13) as well as live graphing. I also have a version that will analyse results for a hashtag—something we may use in conjunction with the Civico player to produce ‘wormals’ (graphs of sentiment) during conferences.

But for now, I’m happy to let the new algorithm bed in—wondering about the amount of data and frequency that will be required to see the most detail—and to see what patterns we can spot.

Feedback welcome. Go see for yourself or follow on Twitter.

September 27th, 2010

Birmingham Music Map

With the Birmingham Popular Music Archive I’ve been inviting the public to contribute to an online database of music culture in Birmingham, by placing venues, artists, people or anything they feel relates to music on a map. You can see and add your memories to the map, here.

Jon is a proper artist!

An editioning of the results so far was commissioned in the form of the Birmingham Music Map as part of ‘plug in’ an exhibition at mac curated by Simon Poulter. The exhibition has now finished, and the artwork (printed on toughened glass and around a metre wide by 1.4m high) is looking for a new home.  I’m happy for it to be displayed in any public place as long as they will display an artist’s card next to it and look after it — it would be brilliant if it was mounted somewhere appropriate, but if you’re interested please drop me a line at jon@jonbounds.co.uk.

If you’ve not got the room, poster/print copies are available, screenprinted on gorgeous white archival paper at B1 (707 × 1000mm /27.8 × 39.4in) in a signed and numbered limited edition of 100 copies.

Buy Now (£25 plus £5 postage and packing — recorded delivery):

Price for framed copies on application. Please email for details.

You can explore an online version here.

by Jon Bounds | Posted in art | Tags: , , , ,
August 24th, 2010

The future of publishing

If there’s one thing that fills the web more than cat pictures it’s ruminations on the state, past or future of newspapers and magazines. The truth is old models are failing and no-one really knows. Rupert Murdoch is trying paywalls, which is a possibility for publications with existing audiences and strong brands, but what can a start-up publication do?

In my own small way I’m experimenting — this week sees the launch of a—yes—paper-based magazine that Danny Smith and I have been working on for the best part of six months. This is what it looks like:

Pile of Dirty Bristow magazines

Things we’ve worked out so far:

  • Print is really expensive at small scale, but it’s still much easier to get people excited to work for and to sell than web content.
  • Brand is all important: we’ve gone for wilfully obtuse and arty—we think that’s a sector we can sell to.
  • A clean break between web and print means that you need to create lots of reasons for, and a fair amount of, ‘related but not similar’ content. Content that reaches the same audience, but isn’t seen as either a free or a second-rate version of what you’re asking payment for in print.
  • A new thing needs its networks—we’ve tried to make sure that everyone that can feel ownership of the magazine finds it easy to talk about and share stuff about it with their networks.
  • If you’ve got a brand, related events can make a fair bit of money—but they’re an additional risk. We’re operating at small scale, but publishers have tried this — Wrox Press when I worked for it’s web design offshoot was trying to maximise return on brand by conferences, it didn’t bring in enough money to save the company. It seems easier, however, to sell a specific happening via the social web than it does an ongoing concept.
  • No-one’s going to pay to get past the paywall on a Twitter account—well only about ten people in my experience.

As well as being exhausting and a great hobby, there’s been a fair few opportunities to try out different promotional web-tricks that I’m going to use again. Issue two shouldn’t take so long.

June 22nd, 2010

Hashtag usage — a survey

I’m very interested in the motivation behind uses of hashtags on Twitter — I have a feeling that they are more created than searched.

I would be very interested to see Twitter Search stats — to see how many people actually look at collections hashtagged content rather than just pump them out because it seems part of etiquette. This hypothesis brewed when I saw how hashtag use breaks down of  during real big events (World Cup, election) — as people already know the context, I am thinking that they are used more as a shorthand for context than searched or monitored.

Without much hope of getting that valuable data, I have created a very short questionnaire to get some feeling for use of hashtags. Due to the responses being self-selecting I am assuming that the results will be biased towards experienced Twitter users, but we’ll see. That this may be compounded due to my network containing a lot of social media profesionals is also a worry, so I would appreciate if you would spread it as far as you can.

by Jon Bounds | Posted in microblogging, my projects, twitter | Tags: , ,
May 25th, 2010

New York, New Politics

I’m off to the city that never sleeps next week for the PdF (Personal Democracy Forum) Conference — a conference on how social technology changes how politics operates. Very much looking forward to seeing Clay Shirky, Jimmy Wales et al speak and also to giving the Civico platform it’s first major test.

Civico is an offshoot of Rhubarb Radio, which I’ve been a member of for about 14 months. Rhubarb is an online community radio station, and Civico is an extension of that platform to cover democracy and events. Last year we covered the PdfEU conference in Barcelona, with what was little more than the streaming audio and a whole lot of hard work.

For this conference we hope to be able to use the newly developed Civico player. This has two great developments, one is that it integrates with the Twitter API to capture tweets alongside the audio or video. The second is much more exciting (and proud to say, developed from my original concept).

Once the audio, video, tweets (and more in development) are captured then users can share any fraction (or all) of the coverage — highlighting the best line, the biggest laugh or the most damming miss-speak. In other words it makes it easy to share the bits that you want to share. And share them by link or by embedding wherever they like.

Here’s an example from a recent conference in London, by link and by embed (this is still a beta, excuse any foibles or downtime as the player is worked on):

Read the rest of this entry »














Powered by Wordpress using the theme bbv1 Content © Jon Bounds