Tuesday, October 4, 2011
The Arab Spring, Zest Cloudio and Butterflyzer
Labels:
butterflyzer,
eclipse,
java,
science,
visualization,
web
One of the goals that I'm really trying to drive Butterflyzer toward is finding a middle ground between the level of detail that hard-core social network analysts get into, and the incredibly over-simplified executive dashboard approach that so many of the social media analysis startups are providing. A big part of this is providing fluid, interactive visualization. So far a lot of that has been based on graph visualization, but we're also experimenting with integrating other approaches.
One of the cool things that we've got lying around to play with are a bunch of tweets collected during the start of the Arab Spring. We've used a lot of different techniques for visualizing and analyzing this data, but when the Eclipse Zest Cloudio toolset came along recently, I thought it would be an interesting way to quickly see what exactly people were really talking about. I think the results speak for themselves. Thanks to Stephan Schwiebert for this great contribution. One of the downsides of this approach is that unlike in our graph visualizations, you lose concept proximity. So for example, looking at the cloud you can instantly see that Tunisia and Lebanon are frequently mentioned, but what you can't see is how frequently they are used together. That's what our graph visualization tools do now, and I think there are some neat possibilities for marrying the two approaches.
Butterflyzer work has kicked back into high gear, and I'm currently collecting live streaming data from Twitter on the OccupyWallStreet movement. I can't wait to put together a similar word cloud for it. Obviously, a lot has changed in Egypt in the nine months since this data was collected. Who knows what will have changed here nine months from now?
Methodology: The data was collected from 10,000 tweets between Jan 23 and 29th containing the word "egypt". We then extracted word counts, removing common english words and other noise, and then visualized the remaining words with Cloudio, sizing for frequency of use.
One of the cool things that we've got lying around to play with are a bunch of tweets collected during the start of the Arab Spring. We've used a lot of different techniques for visualizing and analyzing this data, but when the Eclipse Zest Cloudio toolset came along recently, I thought it would be an interesting way to quickly see what exactly people were really talking about. I think the results speak for themselves. Thanks to Stephan Schwiebert for this great contribution. One of the downsides of this approach is that unlike in our graph visualizations, you lose concept proximity. So for example, looking at the cloud you can instantly see that Tunisia and Lebanon are frequently mentioned, but what you can't see is how frequently they are used together. That's what our graph visualization tools do now, and I think there are some neat possibilities for marrying the two approaches.
Butterflyzer work has kicked back into high gear, and I'm currently collecting live streaming data from Twitter on the OccupyWallStreet movement. I can't wait to put together a similar word cloud for it. Obviously, a lot has changed in Egypt in the nine months since this data was collected. Who knows what will have changed here nine months from now?
Methodology: The data was collected from 10,000 tweets between Jan 23 and 29th containing the word "egypt". We then extracted word counts, removing common english words and other noise, and then visualized the remaining words with Cloudio, sizing for frequency of use.
Subscribe to:
Post Comments (Atom)
Popular Posts
-
I'm happy to announce that the Virgo Tooling M4 release is available. If you just want to find out more about that, skip ahead to the sectio...
-
Every once in a while I think it makes sense to have a completely on-topic, technical post! One of the (many) things that has been on the ...
-
“So the final lesson of 1918, a simple one yet the one most difficult to execute, is that those who occupy positions of authority must lesse...
-
I've been meaning to start a blog for some time now (years) and the birth of our baby seemed like an especially auspicious time. So welcome ...
-
I've been spending quite a bit of time lately playing with the EEF framework. C'est magnifique! (No, I don't actually know any french, but I...
-
Just a quicky here.. I like to use views instead of editors for most of my GEF viewers. They are a lot of advantages to views -- the most s...
-
Many of us who've spent time traipsing through the Eclipse wilderness are probably familiar with this story... Tis a beautiful, sunlit 'mo...
-
Every once an a while, a new way of thinking comes along that is so powerful, so unique, so practical -- just so damn cool, really -- that i...
-
OK, I am long over-due for an update to the blog, and I have lots to report on, but I've been deep in build hell for the last week, (and the...
-
When Butterflyzer was launched a year ago it was introduced with the following words: "Butterflyzer is in Alpha now. Before we release mor...





Cool stuff! And interesting you mention that you'd be interested in a combination of tag clouds and graphs - I was thinking about that too, I basically imagined tag clouds as nodes in a graph, e.g. for hierarchical clustering...
ReplyDelete