KDnuggets Twitter Social Network
We examine the KDnuggets Twitter social network, mapped with NodeXL, a free, open-source template for Microsoft Excel for social network analysis.
By Gregory Piatetsky, March 14, 2014.
I am grateful to Marc Smith, @marc_smith, for generating the following network graph, which represents the network of Twitter users whose tweets contained "kdnuggets" from Jan 1 to Feb 3, 2014.
NodeXL is a free, open-source template for Microsoft Excel and is supported by donations from users.
The highlights that NodeXL generated automatically for this graph (top tweeters and hashtags) are in the NodeXL SNA gallery: bit.ly/LIPsm8
Top vertices by betweenness centrality:
- @kdnuggets @kirkdborne @yvesmulkers @jackwmson @latentview @bigdatastartups @singularityis @ie_bigdata @informs @biconnections
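For readers unfamiliar with the measure: the betweenness centrality of a vertex counts how many shortest paths between other pairs of vertices pass through it, so a high score marks an account that bridges otherwise distant parts of the network. Here is a minimal Python sketch of Brandes' algorithm for an unweighted, undirected graph. It is not NodeXL's code, and the toy edge list and vertex names (hub, a, b, c, d) are hypothetical, not taken from the KDnuggets data.

```python
from collections import deque


def build_adjacency(edges):
    """Turn an undirected edge list into a dict of neighbor sets."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def betweenness_centrality(adjacency):
    """Unnormalized betweenness for an unweighted, undirected graph (Brandes)."""
    scores = {v: 0.0 for v in adjacency}
    for source in adjacency:
        # Breadth-first search from source, counting shortest paths (sigma).
        order = []
        predecessors = {v: [] for v in adjacency}
        sigma = {v: 0 for v in adjacency}
        distance = {v: -1 for v in adjacency}
        sigma[source] = 1
        distance[source] = 0
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adjacency[v]:
                if distance[w] < 0:  # first time w is reached
                    distance[w] = distance[v] + 1
                    queue.append(w)
                if distance[w] == distance[v] + 1:  # v lies on a shortest path to w
                    sigma[w] += sigma[v]
                    predecessors[w].append(v)
        # Walk back from the farthest vertices, accumulating path dependencies.
        delta = {v: 0.0 for v in adjacency}
        for w in reversed(order):
            for v in predecessors[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != source:
                scores[w] += delta[w]
    # Each undirected pair was counted once from each endpoint.
    return {v: s / 2.0 for v, s in scores.items()}


# Hypothetical toy network: "hub" is mentioned by a, b and c; c also links to d.
toy_edges = [("hub", "a"), ("hub", "b"), ("hub", "c"), ("c", "d")]
scores = betweenness_centrality(build_adjacency(toy_edges))
top_two = sorted(scores, key=scores.get, reverse=True)[:2]
print(top_two)  # ['hub', 'c']
```

In this toy network, hub lies on the shortest path of five pairs and c on three, while the leaf vertices score zero. The same logic explains why @kdnuggets itself tops the list above: almost every path in a network built around "kdnuggets" tweets runs through it.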
Top hashtags overall:
- #BigData #DataScience #Analytics #DataMining #BigDataCo #FF #MachineLearning #Hadoop #IBMWatson
Here are some human-understandable highlights. The graph shows:
- one very large cluster G1, with top words: data kdnuggets science bigdata analytics top scientist find learning,
- a medium-sized connected cluster G2, in which @kirkdborne (Kirk D. Borne, Prof. of Astrophysics at GMU and a leading data scientist) plays a major role,
- cluster G3 which has many unconnected components,
- small clusters G4-G9, each centered on one leading tweeter, including @ibmbigdata, @yvesmulkers, @tomhcanderson, @peteskomoroch,
- and a bunch of micro-clusters.
Top links in Entire Graph:
- Data Science Venn Diagram v 2.0
- Dilbert on Big Data Analysis and Salaries
- Analytics-data-mining KDnuggets content from TweetedTimes
- Data Science Skills vs Roles map
- Learning from Data, Caltech Free Online Course, now with captions in 40 languages
- Free Tutorial: Data Science in Python
- Cartoon: Let's solve this problem by using the Big Data none of us have the slightest idea what to do with
- Hadoop: Elephants in the Cloud
- www.statistics.com/
- MassBigData launched to boost Massachusetts Big Data, Analytics Ecosystem
I am not sure what else this graph means. Your suggestions are welcome!
Overall Graph Metrics:
- Vertices: 1375
- Unique Edges: 1890
- Connected Components: 140
- Maximum Vertices in a Connected Component: 1197
- Maximum Edges in a Connected Component: 3448
- Maximum Geodesic Distance (Diameter): 7
- Average Geodesic Distance: 2.426217
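To make these metrics concrete, here is a rough Python sketch that computes the same quantities for a small, hypothetical edge list using breadth-first search. It is an illustration under stated assumptions rather than NodeXL's implementation: duplicate edges are collapsed, and the average geodesic distance is taken over ordered pairs of distinct, mutually reachable vertices, which may differ from NodeXL's exact conventions. The function names and the toy data are invented for this example.

```python
from collections import deque


def bfs_distances(adjacency, source):
    """Shortest-path (geodesic) distances from source to every reachable vertex."""
    distance = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for w in adjacency[v]:
            if w not in distance:
                distance[w] = distance[v] + 1
                queue.append(w)
    return distance


def graph_metrics(edges):
    """Overall metrics for an undirected graph given as an edge list."""
    adjacency = {}
    unique_edges = set()
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
        unique_edges.add(frozenset((a, b)))  # (a, b) and (b, a) count once

    seen, components = set(), []
    diameter, total_distance, pair_count = 0, 0, 0
    for v in adjacency:
        dist = bfs_distances(adjacency, v)
        if v not in seen:  # v starts a component not visited before
            components.append(set(dist))
            seen.update(dist)
        diameter = max(diameter, max(dist.values()))
        total_distance += sum(dist.values())
        pair_count += len(dist) - 1  # reachable vertices other than v itself

    return {
        "vertices": len(adjacency),
        "unique_edges": len(unique_edges),
        "connected_components": len(components),
        "max_vertices_in_component": max(len(c) for c in components),
        # An edge belongs to a component when both endpoints do.
        "max_edges_in_component": max(
            sum(1 for e in unique_edges if e <= c) for c in components
        ),
        "diameter": diameter,
        "average_geodesic_distance": (
            total_distance / pair_count if pair_count else 0.0
        ),
    }


# Hypothetical toy data: a four-vertex chain (with one duplicate edge) plus a separate pair.
toy_edges = [("a", "b"), ("b", "c"), ("c", "d"), ("b", "a"), ("x", "y")]
metrics = graph_metrics(toy_edges)
for name, value in metrics.items():
    print(name, value)
```

Read this way, the numbers above say that 1197 of the 1375 users sit in one giant connected component, and that any two connected users are at most 7 hops apart and about 2.4 hops apart on average, which is typical of a network organized around a single hub account.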