In order to model something, you first need to measure it. You need to be able to assign a number to an emotion. Since emotions are already either positive or negative, we already have a natural way to assign them values. Most of us can read a text and agree on whether it has a positive or negative tone. But can we automatically assign a number to emotional content on the basis of this assessment? Frank and his colleagues have adapted the software SentiStrength, which claims to be an algorithm to do exactly this. You feed it in a sentence and out pops a ranking. I put in the last sentence of the previous paragraph in to the online version and got negative score -2 for the words 'skeptical' and 'emotional' along with a positive score of 2 for 'interested'. Good to see I was emotionally neutral going in to the seminar.
|Review sentiment distribution for
'Harry Potter and The Deathly Hollows'
In one study, Garcia & Schweitzer looked at book reviews left on the Amazon website. There was a typical distribution of negative and positive emotions in these reviews (pictured on the right) where positive comments were typically extremely positive and negative criticism was varied. The time pattern of review writing varied, with some books building up a review base over time and others (such as Harry Potter) being hyped from the start.
Things get even more interesting when Frank and his colleagues looked at online chat rooms. They examined how long it took between interactions and whether the reactions were positive or negative. They found a common distribution for the times between posts, which was independent of the topic. They concluded that people were surprisingly positive in their online interactions, even though previous studies had suggested that discussion is generated by negative opposing opinions. When online, people spend a lot of time being nice to each other! Not just pressing the like button, but also in constructive agreement.
|Time series of emotions in
an online chat room.
In another part of their work, Frank and his colleagues argue that “positive words carry less information than negative words”, so I better be a bit critical. In the presentation, and the papers I have looked at since, I think a model comparison to the dynamics of the conversations and exchanges is missing. The model proposed captures the rate at which people comment and the positive/negative content of what they say. But the comparison to data is mainly at the level of time between posts or overall distribution of sentiment, rather than looking at the positive/negative interactions. I would like to see something down the lines of James Murray and co-workers on the Mathematics of Marriage. Here there is a description of how couples get in to negative/positive spirals and predictions to how this bodes for the futures of the relationships of the couples involved. Murray’s work lacks thorough validation against data, and maybe it becomes too complicated once the fast amount of chat room data is put in to a model, but I would like to see more of these interaction dynamics. Maybe this is in some of the work and I have just missed it? Or maybe it is just too difficult at present? But I would be interested to see what can be done.