SlideShare a Scribd company logo
1 of 48
Download to read offline
Community structure in complex networks
V.A. Traag
KITLV, Leiden, the Netherlands
e-Humanities, KNAW, Amsterdam, the Netherlands
February 21, 2014
eRoyal Netherlands Academy of Arts and Sciences
Humanities
Overview
1 What are communities in networks? How do we find them?
2 Where are those small communities?
3 When are communities significant?
4 What should I remember? And what’s next?
Part I
What are communities?
How do we find them?
What is a community?
• Everybody has an intuitive idea.
• Yet no single agreed upon definition.
• Common core:
Groups of nodes that are
relatively densely connected within, and
relatively sparsely connected between.
General community detection
• Reward links inside community,
weight aij
• Punish missing links inside
community, weight bij .
• General quality function
H =
ij
(Aij aij −(1−Aij )bij )δ(σi , σj ).
0
1
2
3
4
5
6
7
8
9
10
11
General community detection
• Reward links inside community,
weight aij
• Punish missing links inside
community, weight bij .
• General quality function
H =
ij
(Aij aij −(1−Aij )bij )δ(σi , σj ).
0
1
2
3
4
5
6
7
8
9
10
11
General community detection
• Reward links inside community,
weight aij
• Punish missing links inside
community, weight bij .
• General quality function
H =
ij
(Aij aij −(1−Aij )bij )δ(σi , σj ).
0
1
2
3
4
5
6
7
8
9
10
11
Different weights
No a-priori constraints on weights aij , bij .
Model aij bij
Reichardt & Bornholdt 1 − bij γpij
Arenas, Fern´andez & G´omez 1 − bij pij (γ) − γδij
Ronhovde & Nussinov 1 γ
Constant Potts Model 1 − γ γ
Modularity
• Null-model pij , constraint: ij pij = 2m.
• Popular null-model, configuration model pij =
ki kj
2m .
• With γ = 1, leads to modularity:
Q =
ij
Aij −
ki kj
2m
δ(σi , σj ).
• As sum over communities:
Q =
c
(ec − ec ).
Optimising modularity
Initial communities
0
1
2
3
4
5
6
7
8
9
10
11
Optimising modularity
Initial communities
Move 0
0
1
2
3
4
5
6
7
8
9
10
11
Optimising modularity
Initial communities
Move 0
Move 5
0
1
2
3
4
5
6
7
8
9
10
11
Optimising modularity
Initial communities
Move 0
Move 5
Move 11
0
1
2
3
4
5
6
7
8
9
10
11
Optimising modularity
Initial communities
Move 0
Move 5
Move 11
0
1
2
3
4
5
6
7
8
9
10
11
No more improvement
Optimising modularity
Initial communities
Move 0
Move 5
Move 11
0
1
2
1
1 1
3 6
5
Aggregate graph, and
repeat same procedure.
Optimising modularity
Initial communities
Move 0
Move 5
Move 11
0
1
2
1
1 1
3 6
5
Aggregate graph, and
repeat same procedure.
Louvain algorithm
1 Move node i to best (greedy) community.
2 Repeat (1) until no more improvement.
3 Contract graph (communities → nodes).
4 Repeat (1)-(3) until no more improvement.
Part II
Where are those small
communities?
Resolution limit
• Modularity might miss ‘small’
communities.
• Merge two cliques in ring of cliques
when
γRB <
q
nc(nc − 1) + 2
.
• Number of communities scales as
√
γRBm.
• For general null model, problem
remains since ij pij = 2m.
Resolution limit
• Modularity might miss ‘small’
communities.
• Merge two cliques in ring of cliques
when
γRB <
q
nc(nc − 1) + 2
.
• Number of communities scales as
√
γRBm.
• For general null model, problem
remains since ij pij = 2m.
Resolution-limit-free
• Ronhovde & Nussinov model (aij = 1, bij = γ).
• Claim: resolution-limit-free, as merge depends only on ‘local’
variables
γRN <
1
n2
c − 1
.
• But, take pij = ki kj , we obtain
γRB <
1
2(nc(nc − 1) + 2)2
,
also only ‘local’ variables. Hence, also resolution-limit-free?
• Problems of scale remain.
Resolution limit
Resolution limit
Resolution limit
Resolution limit
Resolution-limit-free
Defining resolution-limit-free
Definition (Resolution-limit-free)
Objective function H is called resolution-limit-free if, whenever
partition C optimal for G, then subpartition D ⊂ C also optimal
for subgraph H(D) ⊂ G induced by D.
Theorem (Swap optimal subpartitions)
If C is optimal, with subpartition D, we can replace D by another
optimal subpartition D .
Defining resolution-limit-free
Definition (Resolution-limit-free)
Objective function H is called resolution-limit-free if, whenever
partition C optimal for G, then subpartition D ⊂ C also optimal
for subgraph H(D) ⊂ G induced by D.
Theorem (Swap optimal subpartitions)
If C is optimal, with subpartition D, we can replace D by another
optimal subpartition D .
What methods are
resolution-limit-free?
Resolution-limit-free methods
• RN and CPM can be easily proven resolution-limit-free.
• What about other weights aij and bij ?
Definition (Local weights)
Weights aij , bij called local whenever for subgraph H ⊂ G, weights
remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H).
Theorem (Local weights ⇒ resolution-limit-free)
Objective function H is resolution-limit-free if weights are local.
Resolution-limit-free methods
• RN and CPM can be easily proven resolution-limit-free.
• What about other weights aij and bij ?
Definition (Local weights)
Weights aij , bij called local whenever for subgraph H ⊂ G, weights
remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H).
Theorem (Local weights ⇒ resolution-limit-free)
Objective function H is resolution-limit-free if weights are local.
Resolution-limit-free methods
• RN and CPM can be easily proven resolution-limit-free.
• What about other weights aij and bij ?
Definition (Local weights)
Weights aij , bij called local whenever for subgraph H ⊂ G, weights
remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H).
Theorem (Local weights ⇒ resolution-limit-free)
Objective function H is resolution-limit-free if weights are local.
Resolution-limit-free methods
• RN and CPM can be easily proven resolution-limit-free.
• What about other weights aij and bij ?
Definition (Local weights)
Weights aij , bij called local whenever for subgraph H ⊂ G, weights
remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H).
Theorem (Local weights ⇒ resolution-limit-free)
Objective function H is resolution-limit-free if weights are local.
Inverse not true: some small perturbation (i.e. non local weight)
will not change optimal partition. But very few exceptions.
Resolution-limit-free methods
• RN and CPM can be easily proven resolution-limit-free.
• What about other weights aij and bij ?
Definition (Local weights)
Weights aij , bij called local whenever for subgraph H ⊂ G, weights
remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H).
Theorem (Local weights ⇒ resolution-limit-free)
Objective function H is resolution-limit-free if weights are local.
Inverse not true: some small perturbation (i.e. non local weight)
will not change optimal partition. But very few exceptions.
Local methods are
resolution-limit-free.
Part II
When are communities
significant?
Modularity in non-modular graphs
Modularity as sign of community structure
• Modularity −1 ≤ Q ≤ 1.
• High modularity ⇒ community structure?
• Modularity higher than 0.3 seen as significant.
Modularity in non-modular graphs
Modularity as sign of community structure
• Modularity −1 ≤ Q ≤ 1.
• High modularity ⇒ community structure?
• Modularity higher than 0.3 seen as significant.
Many graphs have high modularity,
but no community structure.
Modularity without community structure
Q = 0.31
Modularity Q ≈ 0 for random graphs.
Significance
How significant is a partition?
Significance
E = 14
E = 9
Fixed partition
E = 11
Better partition
Significance
E = 14
E = 9
Fixed partition
E = 11
Better partition
• Not: Probability to find E edges in partition.
• But: Probability to find partition with E edges.
Subgraph probability
Decompose partition
• Probability to find partition with E edges.
• Probability to find communities with ec edges.
• Asymptotic estimate
• Probability for subgraph of nc nodes with density pc
Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2
cD(pc p)
Significance
• Probability for all communities Pr(σ) ≈
c
exp −n2
cD(pc p) .
• Significance S(σ) = − log Pr(σ) =
c
n2
cD(pc p).
Subgraph probability
Decompose partition
• Probability to find partition with E edges.
• Probability to find communities with ec edges.
• Asymptotic estimate
• Probability for subgraph of nc nodes with density pc
Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2
cD(pc p)
Significance
• Probability for all communities Pr(σ) ≈
c
exp −n2
cD(pc p) .
• Significance S(σ) = − log Pr(σ) =
c
n2
cD(pc p).
Subgraph probability
Decompose partition
• Probability to find partition with E edges.
• Probability to find communities with ec edges.
• Asymptotic estimate
• Probability for subgraph of nc nodes with density pc
Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2
cD(pc p)
Significance
• Probability for all communities Pr(σ) ≈
c
exp −n2
cD(pc p) .
• Significance S(σ) = − log Pr(σ) =
c
n2
cD(pc p).
Subgraph probability
Decompose partition
• Probability to find partition with E edges.
• Probability to find communities with ec edges.
• Asymptotic estimate
• Probability for subgraph of nc nodes with density pc
Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2
cD(pc p)
Significance
• Probability for all communities Pr(σ) ≈
c
exp −n2
cD(pc p) .
• Significance S(σ) = − log Pr(σ) =
c
n2
cD(pc p).
Subgraph probability
Decompose partition
• Probability to find partition with E edges.
• Probability to find communities with ec edges.
• Asymptotic estimate
• Probability for subgraph of nc nodes with density pc
Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2
cD(pc p)
Significance
• Probability for all communities Pr(σ) ≈
c
exp −n2
cD(pc p) .
• Significance S(σ) = − log Pr(σ) =
c
n2
cD(pc p).
Subgraph probability
Decompose partition
• Probability to find partition with E edges.
• Probability to find communities with ec edges.
• Asymptotic estimate
• Probability for subgraph of nc nodes with density pc
Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2
cD(pc p)
Significance
• Probability for all communities Pr(σ) ≈
c
exp −n2
cD(pc p) .
• Significance S(σ) = − log Pr(σ) =
c
n2
cD(pc p).
Significance
10−3 10−2 10−1 100
103
104
105
106
γ
N E
Significance
10−3 10−2 10−1 100
103
104
105
106
γ
N E S
Final Chapter
What should I remember? And
what’s next?
Conclusions
To remember
• Modularity can hide small communities.
• Local methods avoid this problem (RN, CPM).
• High modularity ⇒ significant: use significance.
What’s next?
• Various measures of significance: what’s the difference?
• Choose “correct” resolution ⇒ resolution limit?
Thank you!
Questions?
Traag, Van Dooren & Nesterov
Narrow scope for resolution-limit-free community detection
Phys Rev E 84, 016114 (2011)
Traag, Krings & Van Dooren
Significant scales in community structure
Sci Rep 3, 2930 (2013)
Reichardt & Bornholdt
Statistical mechanics of community detection.
Phys Rev E 74, 016110 (2006)
www.traag.net vincent@traag.net @vtraag

More Related Content

Viewers also liked

ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...
ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...
ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...Maik' Ckneteli
 
Hello my name is
Hello my name isHello my name is
Hello my name isMmoyer94
 
Marka patent kavrami
Marka patent kavramiMarka patent kavrami
Marka patent kavramiMhmtTprdmz
 
Sameva mobility-india
Sameva mobility-indiaSameva mobility-india
Sameva mobility-indiaNilofar Nigar
 
Cтратегия разработки по в R&D компании
Cтратегия разработки по в R&D компанииCтратегия разработки по в R&D компании
Cтратегия разработки по в R&D компанииRuslan Martimov
 
Bravery#2
Bravery#2Bravery#2
Bravery#2Shiro28
 
Komp Podhod
Komp PodhodKomp Podhod
Komp Podhodmazaeva
 
η εξέλιξη της τεχνολογίας κάμινας θεοδωρίκας
η εξέλιξη της τεχνολογίας κάμινας θεοδωρίκαςη εξέλιξη της τεχνολογίας κάμινας θεοδωρίκας
η εξέλιξη της τεχνολογίας κάμινας θεοδωρίκαςGiannhsMarkos
 
Présentation du RC&D sur le bilan de la COP21
Présentation du RC&D sur le bilan de la COP21Présentation du RC&D sur le bilan de la COP21
Présentation du RC&D sur le bilan de la COP21rac_marion
 
linee guida per la valorizzazione della montagna piemontese
linee guida per la valorizzazione della montagna piemonteselinee guida per la valorizzazione della montagna piemontese
linee guida per la valorizzazione della montagna piemonteseIdeazione
 
2015 0228 OpenStack swift; GMO Internet Services
2015 0228 OpenStack swift; GMO Internet Services2015 0228 OpenStack swift; GMO Internet Services
2015 0228 OpenStack swift; GMO Internet ServicesNaoto Gohko
 

Viewers also liked (19)

Sesija 2 organizovana inovacija unapređivanje konkurentnosti u organizacijama
Sesija 2 organizovana inovacija    unapređivanje konkurentnosti u organizacijamaSesija 2 organizovana inovacija    unapređivanje konkurentnosti u organizacijama
Sesija 2 organizovana inovacija unapređivanje konkurentnosti u organizacijama
 
ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...
ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...
ლტოლვილის სტატუსის მინიჭების შესახებ სექსუალურ ორიენტაციასა და გენდერულ იდენტ...
 
Hello my name is
Hello my name isHello my name is
Hello my name is
 
PDHPE
PDHPEPDHPE
PDHPE
 
Marka patent kavrami
Marka patent kavramiMarka patent kavrami
Marka patent kavrami
 
Sarah and frances vocab
Sarah and frances vocabSarah and frances vocab
Sarah and frances vocab
 
Cmmaao pvt-ltd-pmi
Cmmaao pvt-ltd-pmiCmmaao pvt-ltd-pmi
Cmmaao pvt-ltd-pmi
 
Prezentare electotehnica emaia 2
Prezentare electotehnica emaia 2Prezentare electotehnica emaia 2
Prezentare electotehnica emaia 2
 
Sameva mobility-india
Sameva mobility-indiaSameva mobility-india
Sameva mobility-india
 
Cтратегия разработки по в R&D компании
Cтратегия разработки по в R&D компанииCтратегия разработки по в R&D компании
Cтратегия разработки по в R&D компании
 
Estagio lei-11788
Estagio lei-11788Estagio lei-11788
Estagio lei-11788
 
Bravery#2
Bravery#2Bravery#2
Bravery#2
 
Komp Podhod
Komp PodhodKomp Podhod
Komp Podhod
 
η εξέλιξη της τεχνολογίας κάμινας θεοδωρίκας
η εξέλιξη της τεχνολογίας κάμινας θεοδωρίκαςη εξέλιξη της τεχνολογίας κάμινας θεοδωρίκας
η εξέλιξη της τεχνολογίας κάμινας θεοδωρίκας
 
Présentation du RC&D sur le bilan de la COP21
Présentation du RC&D sur le bilan de la COP21Présentation du RC&D sur le bilan de la COP21
Présentation du RC&D sur le bilan de la COP21
 
IHELP@KSDG
IHELP@KSDG IHELP@KSDG
IHELP@KSDG
 
linee guida per la valorizzazione della montagna piemontese
linee guida per la valorizzazione della montagna piemonteselinee guida per la valorizzazione della montagna piemontese
linee guida per la valorizzazione della montagna piemontese
 
2015 0228 OpenStack swift; GMO Internet Services
2015 0228 OpenStack swift; GMO Internet Services2015 0228 OpenStack swift; GMO Internet Services
2015 0228 OpenStack swift; GMO Internet Services
 
LAQVINTA
LAQVINTALAQVINTA
LAQVINTA
 

Similar to Community structure in complex networks

Resolution-free community detection
Resolution-free community detectionResolution-free community detection
Resolution-free community detectionVincent Traag
 
Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraJason Riedy
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphsNicola Barbieri
 
Clustering of graphs and search of assemblages
Clustering of graphs and search of assemblagesClustering of graphs and search of assemblages
Clustering of graphs and search of assemblagesData-Centric_Alliance
 
Sketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignmentSketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignmentssuser2be88c
 
Introduction to Supervised ML Concepts and Algorithms
Introduction to Supervised ML Concepts and AlgorithmsIntroduction to Supervised ML Concepts and Algorithms
Introduction to Supervised ML Concepts and AlgorithmsNBER
 
Variational inference
Variational inference  Variational inference
Variational inference Natan Katz
 
Randomness conductors
Randomness conductorsRandomness conductors
Randomness conductorswtyru1989
 
Searching Informed Search.pdf
Searching Informed Search.pdfSearching Informed Search.pdf
Searching Informed Search.pdfDrBashirMSaad
 
Data Mining Lecture_10(b).pptx
Data Mining Lecture_10(b).pptxData Mining Lecture_10(b).pptx
Data Mining Lecture_10(b).pptxSubrata Kumer Paul
 
super vector machines algorithms using deep
super vector machines algorithms using deepsuper vector machines algorithms using deep
super vector machines algorithms using deepKNaveenKumarECE
 
machine learning.pptx
machine learning.pptxmachine learning.pptx
machine learning.pptxAbdusSadik
 
ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...
ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...
ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...sleepy_yoshi
 
Threshold network models
Threshold network modelsThreshold network models
Threshold network modelsNaoki Masuda
 
Bayseian decision theory
Bayseian decision theoryBayseian decision theory
Bayseian decision theorysia16
 
columbus15_cattaneo.pdf
columbus15_cattaneo.pdfcolumbus15_cattaneo.pdf
columbus15_cattaneo.pdfAhmadM65
 
Community Detection
Community DetectionCommunity Detection
Community DetectionIlio Catallo
 

Similar to Community structure in complex networks (20)

Resolution-free community detection
Resolution-free community detectionResolution-free community detection
Resolution-free community detection
 
Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear Algebra
 
ilp-nlp-slides.pdf
ilp-nlp-slides.pdfilp-nlp-slides.pdf
ilp-nlp-slides.pdf
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphs
 
Clustering of graphs and search of assemblages
Clustering of graphs and search of assemblagesClustering of graphs and search of assemblages
Clustering of graphs and search of assemblages
 
Sketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignmentSketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignment
 
nber_slides.pdf
nber_slides.pdfnber_slides.pdf
nber_slides.pdf
 
Introduction to Supervised ML Concepts and Algorithms
Introduction to Supervised ML Concepts and AlgorithmsIntroduction to Supervised ML Concepts and Algorithms
Introduction to Supervised ML Concepts and Algorithms
 
Variational inference
Variational inference  Variational inference
Variational inference
 
Randomness conductors
Randomness conductorsRandomness conductors
Randomness conductors
 
Searching Informed Search.pdf
Searching Informed Search.pdfSearching Informed Search.pdf
Searching Informed Search.pdf
 
Klt
KltKlt
Klt
 
Data Mining Lecture_10(b).pptx
Data Mining Lecture_10(b).pptxData Mining Lecture_10(b).pptx
Data Mining Lecture_10(b).pptx
 
super vector machines algorithms using deep
super vector machines algorithms using deepsuper vector machines algorithms using deep
super vector machines algorithms using deep
 
machine learning.pptx
machine learning.pptxmachine learning.pptx
machine learning.pptx
 
ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...
ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...
ICML2012読み会 Scaling Up Coordinate Descent Algorithms for Large L1 regularizat...
 
Threshold network models
Threshold network modelsThreshold network models
Threshold network models
 
Bayseian decision theory
Bayseian decision theoryBayseian decision theory
Bayseian decision theory
 
columbus15_cattaneo.pdf
columbus15_cattaneo.pdfcolumbus15_cattaneo.pdf
columbus15_cattaneo.pdf
 
Community Detection
Community DetectionCommunity Detection
Community Detection
 

More from Vincent Traag

Peer review uncertainty at the institutional level
Peer review uncertainty at the institutional levelPeer review uncertainty at the institutional level
Peer review uncertainty at the institutional levelVincent Traag
 
Replacing peer review by metrics in the UK REF?
Replacing peer review by metrics in the UK REF?Replacing peer review by metrics in the UK REF?
Replacing peer review by metrics in the UK REF?Vincent Traag
 
Use of the journal impact factor for assessing individual articles need not b...
Use of the journal impact factor for assessing individual articles need not b...Use of the journal impact factor for assessing individual articles need not b...
Use of the journal impact factor for assessing individual articles need not b...Vincent Traag
 
Uncovering important intermediate publications
Uncovering important intermediate publicationsUncovering important intermediate publications
Uncovering important intermediate publicationsVincent Traag
 
Complex contagion of campaign donations
Complex contagion of campaign donationsComplex contagion of campaign donations
Complex contagion of campaign donationsVincent Traag
 
Polarization and consensus in citation networks
Polarization and consensus in citation networksPolarization and consensus in citation networks
Polarization and consensus in citation networksVincent Traag
 
Introduction to complex networks
Introduction to complex networksIntroduction to complex networks
Introduction to complex networksVincent Traag
 
Public thesis defence: groups and reputation in social networks
Public thesis defence: groups and reputation in social networksPublic thesis defence: groups and reputation in social networks
Public thesis defence: groups and reputation in social networksVincent Traag
 
Structure of media attention
Structure of media attentionStructure of media attention
Structure of media attentionVincent Traag
 
Dynamics of Media Attention
Dynamics of Media AttentionDynamics of Media Attention
Dynamics of Media AttentionVincent Traag
 
Dynamical Models Explaining Social Balance
Dynamical Models Explaining Social BalanceDynamical Models Explaining Social Balance
Dynamical Models Explaining Social BalanceVincent Traag
 
Significant scales in community structure
Significant scales in community structureSignificant scales in community structure
Significant scales in community structureVincent Traag
 
Reconstructing Third World Elite Rotation Events from Newspapers
Reconstructing Third World Elite Rotation Events from NewspapersReconstructing Third World Elite Rotation Events from Newspapers
Reconstructing Third World Elite Rotation Events from NewspapersVincent Traag
 
Reputation Dynamics Through Gossiping
Reputation Dynamics Through GossipingReputation Dynamics Through Gossiping
Reputation Dynamics Through GossipingVincent Traag
 
Limits of community detection
Limits of community detectionLimits of community detection
Limits of community detectionVincent Traag
 
Cooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingCooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingVincent Traag
 
Cooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingCooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingVincent Traag
 
Exponential Ranking: Taking into account negative links.
Exponential Ranking: Taking into account negative links.Exponential Ranking: Taking into account negative links.
Exponential Ranking: Taking into account negative links.Vincent Traag
 
Social Event Detection
Social Event DetectionSocial Event Detection
Social Event DetectionVincent Traag
 
Cooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingCooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingVincent Traag
 

More from Vincent Traag (20)

Peer review uncertainty at the institutional level
Peer review uncertainty at the institutional levelPeer review uncertainty at the institutional level
Peer review uncertainty at the institutional level
 
Replacing peer review by metrics in the UK REF?
Replacing peer review by metrics in the UK REF?Replacing peer review by metrics in the UK REF?
Replacing peer review by metrics in the UK REF?
 
Use of the journal impact factor for assessing individual articles need not b...
Use of the journal impact factor for assessing individual articles need not b...Use of the journal impact factor for assessing individual articles need not b...
Use of the journal impact factor for assessing individual articles need not b...
 
Uncovering important intermediate publications
Uncovering important intermediate publicationsUncovering important intermediate publications
Uncovering important intermediate publications
 
Complex contagion of campaign donations
Complex contagion of campaign donationsComplex contagion of campaign donations
Complex contagion of campaign donations
 
Polarization and consensus in citation networks
Polarization and consensus in citation networksPolarization and consensus in citation networks
Polarization and consensus in citation networks
 
Introduction to complex networks
Introduction to complex networksIntroduction to complex networks
Introduction to complex networks
 
Public thesis defence: groups and reputation in social networks
Public thesis defence: groups and reputation in social networksPublic thesis defence: groups and reputation in social networks
Public thesis defence: groups and reputation in social networks
 
Structure of media attention
Structure of media attentionStructure of media attention
Structure of media attention
 
Dynamics of Media Attention
Dynamics of Media AttentionDynamics of Media Attention
Dynamics of Media Attention
 
Dynamical Models Explaining Social Balance
Dynamical Models Explaining Social BalanceDynamical Models Explaining Social Balance
Dynamical Models Explaining Social Balance
 
Significant scales in community structure
Significant scales in community structureSignificant scales in community structure
Significant scales in community structure
 
Reconstructing Third World Elite Rotation Events from Newspapers
Reconstructing Third World Elite Rotation Events from NewspapersReconstructing Third World Elite Rotation Events from Newspapers
Reconstructing Third World Elite Rotation Events from Newspapers
 
Reputation Dynamics Through Gossiping
Reputation Dynamics Through GossipingReputation Dynamics Through Gossiping
Reputation Dynamics Through Gossiping
 
Limits of community detection
Limits of community detectionLimits of community detection
Limits of community detection
 
Cooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingCooperation, Reputation & Gossiping
Cooperation, Reputation & Gossiping
 
Cooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingCooperation, Reputation & Gossiping
Cooperation, Reputation & Gossiping
 
Exponential Ranking: Taking into account negative links.
Exponential Ranking: Taking into account negative links.Exponential Ranking: Taking into account negative links.
Exponential Ranking: Taking into account negative links.
 
Social Event Detection
Social Event DetectionSocial Event Detection
Social Event Detection
 
Cooperation, Reputation & Gossiping
Cooperation, Reputation & GossipingCooperation, Reputation & Gossiping
Cooperation, Reputation & Gossiping
 

Recently uploaded

Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...Lokesh Kothari
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 

Recently uploaded (20)

Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 

Community structure in complex networks

  • 1. Community structure in complex networks V.A. Traag KITLV, Leiden, the Netherlands e-Humanities, KNAW, Amsterdam, the Netherlands February 21, 2014 eRoyal Netherlands Academy of Arts and Sciences Humanities
  • 2. Overview 1 What are communities in networks? How do we find them? 2 Where are those small communities? 3 When are communities significant? 4 What should I remember? And what’s next?
  • 3. Part I What are communities? How do we find them?
  • 4. What is a community? • Everybody has an intuitive idea. • Yet no single agreed upon definition. • Common core: Groups of nodes that are relatively densely connected within, and relatively sparsely connected between.
  • 5. General community detection • Reward links inside community, weight aij • Punish missing links inside community, weight bij . • General quality function H = ij (Aij aij −(1−Aij )bij )δ(σi , σj ). 0 1 2 3 4 5 6 7 8 9 10 11
  • 6. General community detection • Reward links inside community, weight aij • Punish missing links inside community, weight bij . • General quality function H = ij (Aij aij −(1−Aij )bij )δ(σi , σj ). 0 1 2 3 4 5 6 7 8 9 10 11
  • 7. General community detection • Reward links inside community, weight aij • Punish missing links inside community, weight bij . • General quality function H = ij (Aij aij −(1−Aij )bij )δ(σi , σj ). 0 1 2 3 4 5 6 7 8 9 10 11
  • 8. Different weights No a-priori constraints on weights aij , bij . Model aij bij Reichardt & Bornholdt 1 − bij γpij Arenas, Fern´andez & G´omez 1 − bij pij (γ) − γδij Ronhovde & Nussinov 1 γ Constant Potts Model 1 − γ γ
  • 9. Modularity • Null-model pij , constraint: ij pij = 2m. • Popular null-model, configuration model pij = ki kj 2m . • With γ = 1, leads to modularity: Q = ij Aij − ki kj 2m δ(σi , σj ). • As sum over communities: Q = c (ec − ec ).
  • 11. Optimising modularity Initial communities Move 0 0 1 2 3 4 5 6 7 8 9 10 11
  • 12. Optimising modularity Initial communities Move 0 Move 5 0 1 2 3 4 5 6 7 8 9 10 11
  • 13. Optimising modularity Initial communities Move 0 Move 5 Move 11 0 1 2 3 4 5 6 7 8 9 10 11
  • 14. Optimising modularity Initial communities Move 0 Move 5 Move 11 0 1 2 3 4 5 6 7 8 9 10 11 No more improvement
  • 15. Optimising modularity Initial communities Move 0 Move 5 Move 11 0 1 2 1 1 1 3 6 5 Aggregate graph, and repeat same procedure.
  • 16. Optimising modularity Initial communities Move 0 Move 5 Move 11 0 1 2 1 1 1 3 6 5 Aggregate graph, and repeat same procedure. Louvain algorithm 1 Move node i to best (greedy) community. 2 Repeat (1) until no more improvement. 3 Contract graph (communities → nodes). 4 Repeat (1)-(3) until no more improvement.
  • 17. Part II Where are those small communities?
  • 18. Resolution limit • Modularity might miss ‘small’ communities. • Merge two cliques in ring of cliques when γRB < q nc(nc − 1) + 2 . • Number of communities scales as √ γRBm. • For general null model, problem remains since ij pij = 2m.
  • 19. Resolution limit • Modularity might miss ‘small’ communities. • Merge two cliques in ring of cliques when γRB < q nc(nc − 1) + 2 . • Number of communities scales as √ γRBm. • For general null model, problem remains since ij pij = 2m.
  • 20. Resolution-limit-free • Ronhovde & Nussinov model (aij = 1, bij = γ). • Claim: resolution-limit-free, as merge depends only on ‘local’ variables γRN < 1 n2 c − 1 . • But, take pij = ki kj , we obtain γRB < 1 2(nc(nc − 1) + 2)2 , also only ‘local’ variables. Hence, also resolution-limit-free? • Problems of scale remain.
  • 24. Defining resolution-limit-free Definition (Resolution-limit-free) Objective function H is called resolution-limit-free if, whenever partition C optimal for G, then subpartition D ⊂ C also optimal for subgraph H(D) ⊂ G induced by D. Theorem (Swap optimal subpartitions) If C is optimal, with subpartition D, we can replace D by another optimal subpartition D .
  • 25. Defining resolution-limit-free Definition (Resolution-limit-free) Objective function H is called resolution-limit-free if, whenever partition C optimal for G, then subpartition D ⊂ C also optimal for subgraph H(D) ⊂ G induced by D. Theorem (Swap optimal subpartitions) If C is optimal, with subpartition D, we can replace D by another optimal subpartition D . What methods are resolution-limit-free?
  • 26. Resolution-limit-free methods • RN and CPM can be easily proven resolution-limit-free. • What about other weights aij and bij ? Definition (Local weights) Weights aij , bij called local whenever for subgraph H ⊂ G, weights remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H). Theorem (Local weights ⇒ resolution-limit-free) Objective function H is resolution-limit-free if weights are local.
  • 27. Resolution-limit-free methods • RN and CPM can be easily proven resolution-limit-free. • What about other weights aij and bij ? Definition (Local weights) Weights aij , bij called local whenever for subgraph H ⊂ G, weights remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H). Theorem (Local weights ⇒ resolution-limit-free) Objective function H is resolution-limit-free if weights are local.
  • 28. Resolution-limit-free methods • RN and CPM can be easily proven resolution-limit-free. • What about other weights aij and bij ? Definition (Local weights) Weights aij , bij called local whenever for subgraph H ⊂ G, weights remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H). Theorem (Local weights ⇒ resolution-limit-free) Objective function H is resolution-limit-free if weights are local.
  • 29. Resolution-limit-free methods • RN and CPM can be easily proven resolution-limit-free. • What about other weights aij and bij ? Definition (Local weights) Weights aij , bij called local whenever for subgraph H ⊂ G, weights remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H). Theorem (Local weights ⇒ resolution-limit-free) Objective function H is resolution-limit-free if weights are local. Inverse not true: some small perturbation (i.e. non local weight) will not change optimal partition. But very few exceptions.
  • 30. Resolution-limit-free methods • RN and CPM can be easily proven resolution-limit-free. • What about other weights aij and bij ? Definition (Local weights) Weights aij , bij called local whenever for subgraph H ⊂ G, weights remain similar, i.e. aij (G) = λ(H)aij (H) and bij (G) = λ(H)bij (H). Theorem (Local weights ⇒ resolution-limit-free) Objective function H is resolution-limit-free if weights are local. Inverse not true: some small perturbation (i.e. non local weight) will not change optimal partition. But very few exceptions. Local methods are resolution-limit-free.
  • 31. Part II When are communities significant?
  • 32. Modularity in non-modular graphs Modularity as sign of community structure • Modularity −1 ≤ Q ≤ 1. • High modularity ⇒ community structure? • Modularity higher than 0.3 seen as significant.
  • 33. Modularity in non-modular graphs Modularity as sign of community structure • Modularity −1 ≤ Q ≤ 1. • High modularity ⇒ community structure? • Modularity higher than 0.3 seen as significant. Many graphs have high modularity, but no community structure.
  • 34. Modularity without community structure Q = 0.31 Modularity Q ≈ 0 for random graphs.
  • 36. Significance E = 14 E = 9 Fixed partition E = 11 Better partition
  • 37. Significance E = 14 E = 9 Fixed partition E = 11 Better partition • Not: Probability to find E edges in partition. • But: Probability to find partition with E edges.
  • 38. Subgraph probability Decompose partition • Probability to find partition with E edges. • Probability to find communities with ec edges. • Asymptotic estimate • Probability for subgraph of nc nodes with density pc Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2 cD(pc p) Significance • Probability for all communities Pr(σ) ≈ c exp −n2 cD(pc p) . • Significance S(σ) = − log Pr(σ) = c n2 cD(pc p).
  • 39. Subgraph probability Decompose partition • Probability to find partition with E edges. • Probability to find communities with ec edges. • Asymptotic estimate • Probability for subgraph of nc nodes with density pc Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2 cD(pc p) Significance • Probability for all communities Pr(σ) ≈ c exp −n2 cD(pc p) . • Significance S(σ) = − log Pr(σ) = c n2 cD(pc p).
  • 40. Subgraph probability Decompose partition • Probability to find partition with E edges. • Probability to find communities with ec edges. • Asymptotic estimate • Probability for subgraph of nc nodes with density pc Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2 cD(pc p) Significance • Probability for all communities Pr(σ) ≈ c exp −n2 cD(pc p) . • Significance S(σ) = − log Pr(σ) = c n2 cD(pc p).
  • 41. Subgraph probability Decompose partition • Probability to find partition with E edges. • Probability to find communities with ec edges. • Asymptotic estimate • Probability for subgraph of nc nodes with density pc Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2 cD(pc p) Significance • Probability for all communities Pr(σ) ≈ c exp −n2 cD(pc p) . • Significance S(σ) = − log Pr(σ) = c n2 cD(pc p).
  • 42. Subgraph probability Decompose partition • Probability to find partition with E edges. • Probability to find communities with ec edges. • Asymptotic estimate • Probability for subgraph of nc nodes with density pc Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2 cD(pc p) Significance • Probability for all communities Pr(σ) ≈ c exp −n2 cD(pc p) . • Significance S(σ) = − log Pr(σ) = c n2 cD(pc p).
  • 43. Subgraph probability Decompose partition • Probability to find partition with E edges. • Probability to find communities with ec edges. • Asymptotic estimate • Probability for subgraph of nc nodes with density pc Pr(S(nc, pc) ⊆ G(n, p)) ≈ exp −n2 cD(pc p) Significance • Probability for all communities Pr(σ) ≈ c exp −n2 cD(pc p) . • Significance S(σ) = − log Pr(σ) = c n2 cD(pc p).
  • 44. Significance 10−3 10−2 10−1 100 103 104 105 106 γ N E
  • 45. Significance 10−3 10−2 10−1 100 103 104 105 106 γ N E S
  • 46. Final Chapter What should I remember? And what’s next?
  • 47. Conclusions To remember • Modularity can hide small communities. • Local methods avoid this problem (RN, CPM). • High modularity ⇒ significant: use significance. What’s next? • Various measures of significance: what’s the difference? • Choose “correct” resolution ⇒ resolution limit?
  • 48. Thank you! Questions? Traag, Van Dooren & Nesterov Narrow scope for resolution-limit-free community detection Phys Rev E 84, 016114 (2011) Traag, Krings & Van Dooren Significant scales in community structure Sci Rep 3, 2930 (2013) Reichardt & Bornholdt Statistical mechanics of community detection. Phys Rev E 74, 016110 (2006) www.traag.net vincent@traag.net @vtraag