new icn messageflickr-free-ic3d pan white
20120227-NodeXL-Twitter-bigdata network graph | by Marc_Smith
Back to photostream

20120227-NodeXL-Twitter-bigdata network graph



Data set:


These are the connections among the Twitter users who recently tweeted the word bigdata when queried on February 27, 2012, scaled by numbers of followers (with outliers thresholded). Connections created when users reply, mention or follow one another. The data set starts on 2/26/2012 2:56 and ends on 2/27/2012 22:25 UTC. Green lines are "follows" relationships, blue lines are "reply" or "mentions" relationships.


Layout created with the "Group Layout" feature of NodeXL which tiles bounded regions for each cluster. The Harel-Koren layout algorithm positioned each vertex:


Clusters calculated by the Clauset-Newman-Moore algorithm are also encoded by color. Clauset-Newman-Moore algorithm is defined here:


A larger version of the image is here:


Betweenness Centrality is defined here:


Top most between users:












Top word pairs by frequency of mention


#cloudexpo, #cloudcomputing, 559

#cloudcomputing, #bigdata, 511

#cloud, #cloudexpo, 483

#bigdata, @cloudexpo, 343

@ulitzer, #cloud, 342

big, data, 275

@cloudexpo, @bigdataexpo, 170

@ulitzer, #cloudexpo, 87

#cloudexpo, #bigdata, 47

ups, fronts, 44

fronts, working, 44

working, capital, 44

commerce, merchants, 44

adoption, issues, 43

#bigdata, #analytics, 42

big, deal, 40

dimension, data, 39

data, #cloud, 39

#cloud, services, 39

new, york, 38


Graph Metric, Value

Graph Type, Directed

Vertices, 824

Unique Edges, 6978

Edges With Duplicates, 1697

Total Edges, 8675

Self-Loops, 755

Connected Components, 63

Single-Vertex Connected Components, 57

Maximum Vertices in a Connected Component, 757

Maximum Edges in a Connected Component, 8594

Maximum Geodesic Distance (Diameter), 11

Average Geodesic Distance, 3.118791

Graph Density, 0.010373781

Modularity, 0.309689

NodeXL Version,


More NodeXL network visualizations are here: and here:


A gallery of NodeXL network data sets is available here:


NodeXL is free and open and available from


NodeXL is developed by the Social Media Research Foundation ( - which is dedicated to open tools, open data, and open scholarship.


Donations to support NodeXL are welcome through PayPal:


The book, Analyzing social media networks with NodeXL: Insights from a connected world, is available from Morgan Kaufmann and from Amazon.


Marc Smith on Twitter.

1 fave
Uploaded on February 28, 2012