By Gregory Piatetsky, Apr 14, 2014.
We update our
analysis of Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science
(Dec 2013) and find several interesting trends.
First, we found that growth slowed down in 2013 Q3 but resumed in 2013 Q4 and 2014 Q1.
The Figure 1 (below) shows quarterly growth rates in top 30 groups. Except for two groups:
and SAS & Analytics Users
(not shown in Figure 1) which had big growth in 1 or 2 quarters and none in 2 other quarters, most groups show surprisingly similar pattern of decline in growth in 13Q3, followed by acceleration in 14Q1 and 14Q2.
Fig 1: Top Linked Analytics Groups, Quarterly Growth 2013Q2 to 2014Q1. Thick black line is the overall average growth rate.
Here are the 10 largest groups (by membership as of March 31, 2014).
We note that 7 largest were in the same order as in Nov 2013. The 6 largest grew significantly faster than the next 4 groups.
- Advanced Business Analytics, Data Mining and Predictive Modeling: 121,816 (74% growth in 12 months)
- Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group: 95,638 ( 82% growth)
- Big Data and Analytics: 74,350 (100%)
- Business Analytics: 53,345 (43%)
- Data Mining, Statistics, Big Data, and Data Visualization: 43,761 (116%)
- BIG DATA Professionals - Architects Scientists Analytics Experts: 30,792 (92%)
- Next Gen Market Research (NGMR): 23,368 (15%)
- SAS Analytics & BI (closed): 20,941 (32%)
- Business Intelligence & Analytics Group: 20,000 (4%)
- Global Analytics Network: 19,389 (11%)
However, there seems to be no strong correlation between group size and growth rate among all 30 groups.
Here are 10 groups with the fastest growth in the past 12 months (March 25, 2013 to March 31, 2014)
- RDataMining: 126%
- Data Mining, Statistics, Big Data, and Data Visualization: 116%
- Data Scientists: 114%
- Big Data and Analytics: 100%
- BIG DATA Professionals - Architects Scientists Analytics Experts: 92.5%
- Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group: 82%
- Advanced Business Analytics, Data Mining and Predictive Modeling: 74%
- KDnuggets Analytics, Data Mining, and Data Science: 73%
- Predictive Analytics Network (PAN): 72%
The chart below shows group growth vs group size. Color corresponds to age - redder is younger, bluer is older. Group name abbreviations are in the table below.
Fig 2: Top Linked Analytics, Big Data, Data Science Groups by 2014 size vs growth
There are 2 main measures of group activity: discussions (posts)/week and comments/week.
Since these numbers clearly depend on the group size, we measure them per 1000 members.
We measure overall group activity as (discussions + comments / week) per 1000 members.
For 4 months ending in March 2014, activity level was 2.99/week, about 25% less than 3.97/week measured in Nov 2013.
The chart below shows group activity vs group size. Color corresponds to age - redder is younger, bluer is older. Group name abbreviations are in the table below.
Fig 3: Top Linked Analytics, Big Data, Data Science Groups - 2014 Activity vs Growth
In 4 month ending in March 2014 the average activity level was 2.03 discussion/week per 1K members, and 0.96 comments/week per 1K members, or about 2.1 discussions/comment, well below 2.57 discussions/week per 1K members and 1.40 comments/week per 1K members measured in Nov 2013 (1.8 discussions/comment). This means that the
while activity has slowed down, the gap between discussions and comments has increased.
The chart below shows average comments/week vs average discussions/week for all 30 groups, with a circle size proportional to group size and circle color corresponding to activity change - green meaning increase, red decrease. We also show median lines for each dimension, which can be used to divide the groups in 4 quadrants.
Fig 4: 4 Quadrants of Top Linked Analytics, Big Data, Data Science Groups: Commenting vs Posting
Several groups stand out: KDnuggets has the highest number of discussions/1000 members, while RDM has a highest number of comments. The median line divide the groups in 4 quadrants, which we can characterize as
- "Engaged" (above median on both comments and discussions): KDnuggets, Dscientists, PAN, DM Stat, RDM, Big Data & A, Adv BADM)
- "Posting" (above median on discussions, below median on comments): DSC, Global A
- "Commenting" (below median on discussions, above median on comments): NGMR, RMA/RMDS, PR
- "Passive" (below median on both comments and discussions)
The details are in the table with below, with groups ordered by the number of members.
The link to the raw data is at the end of the post.
The growth, comments, and discussions are in
if that value is 25% above average,
if 25% below average, and in black otherwise.
We note that there are only 4 "triple green" groups, that are significantly above average on growth, comments, and discussions:
- Data Mining, Statistics, Big Data, and Data Visualization
- Data Scientists
- KDnuggets Analytics, Data Mining, and Data Science
(Mar 31, 2014)
|Founded||12 mon Growth
per 1K mbr
per 1K mbr
Advanced Business Analytics, Data Mining and Predictive Modeling (Adv BADM)
Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group (Big Data ASFSSP)
Big Data and Analytics (Big Data & A)
Business Analytics (Biz Analytics)
Data Mining, Statistics, Big Data, and Data Visualization (DM Stat)
BIG DATA Professionals - Architects Scientists Analytics Experts (BD Prof)
Next Gen Market Research (NGMR) (NGMR)
SAS Analytics & BI (closed) (SAS A&BI)
Business Intelligence & Analytics Group (BI&A)
Global Analytics Network (Global A)
Machine Learning Connection (closed) (ML Conn)
Pattern Recognition, Data Data Mining, Machine Intelligence (closed) (PR)
SAS & Analytics Users (SAS Users)
Actuary / Actuarial, Predictive Modeling, Data Mining, and Statistics News / Jobs / Careers Group (Actuary)
Research Methods and Data Science (RMDS, former RMA)
Text Analytics (Text A)
Data Science Central (DSC)
Advanced Analytics (closed) (Adv A)
Visual Analytics (Visual)
Data & Text Analytics Professionals (D&TA Prof)
Predictive Analytics Network (PAN) (PAN)
Advanced Analytics, Predictive Modeling & Statistical Analyses (closed) (Adv AP)
Data Scientists (Dscientists)
Lavastorm Analytics Community Group (Lavastorm)
KDnuggets Analytics, Data Mining, and Data Science (KDnuggets)
Data Mining Technology (closed) (DMT)
Predictive Model Markup Language (PMML) (PMML)
Healthcare Data Mining and Modeling (Healthcare)
Business Intelligence Tools (BI Tools)
Note: You can get actual data from the HTML source code of the LinkedIn group Statistics/Activity page.
Look for dataset seriesName="Comments"
and parse that data. Likewise for Discussions and Members.
Thanks to Anmol Rajpurohit for collecting the membership, comments, and discussions data.
Here is raw data (csv)
for the top 30 LinkedIn groups.
Largest LinkedIn Groups to Join for Big Data, Data Science and Analytics
, by Ted O'Brien, Aug 30, 2013.
Let me know which relevant groups were missed and what other trends you see.