首页 关于我 项目 博客 图谱 简历 联系 English
返回列表

2026年5月6日

3152 Lecture 9

Network Analysis

FIT3152Class NoteEnglish

Lecture Slide: FIT3152 Lecture 09.pdf

Previous: 3152 Lecture 8 - Ensemble Model and Artificial Neural Networks

Network

Network is a type of graph that use Vertices (nodes) to represent the entities and Edges (arcs) to represent the connections between each entities. the Network graph can use to represent many things, such as the relation between people, structure of a social network, connection between English words.

Vertices

Vertices represent the entities in the network, such as a person, computer article etc. One vertices can be connect to another vertices to represent the relationship between two vertices.

Edges

Edges is the connections between Vertices in network. It can be undirected which can represent the double way relation, or it can be directed which can represent the one way relation.

Edges may also be weighted to show the strength of the relation, can also represent cost or gain as well like what we do in Dijkstra algorithm.

Network structures

Network.png

Network structure describes how vertices are connected by edges.

TermMeaningExample
WalkA sequence of links, vertices can be repeated(M,A,J,R,J,)(M,A,J,R,J,\cdots)
PathA walk with no repeated vertices(M,A,J,R)(M,A,J,R)
CycleA path that begins and ends at the same vertex(A,S,R,A)(A,S,R,A)
GeodesicThe shortest path between two verticesGeodesic(R,D)=(R,J,D)Geodesic(R,D)=(R,J,D)
LengthThe number of links in a walk or pathdist(M,D)=3dist(M,D)=3
ConnectedThere is a path between each pair of verticesThe example network is connected
Directed graphEdges have direction, so travel is only allowed in that directionA -> B
LoopAn edge from a vertex to itselfEdge(A,A)
Complete graphEvery vertex is joined to every other vertexEvery pair is connected
SubgraphA subset of a graphGraph(M,A)
CliqueA complete subgraphGraph(A,R,J)
Simple graphA graph with no loops or multi-edgesNo repeated edge between same pair

Adjacency Matrix

An adjacency Matrix can be used to represent the network by indicating the connections between individuals

adjacency matrix.png Network.png

Degree of vertex

Degree is the number of edges connected to a vertex, representing the size of the vertex’s neighbourhood.

For directed graphs, the degree can be separated into in-degree and out-degree.

For example, In graph above, degree(R) = 3

Network Analysis

Network analysis can describe the structure of the whole network.

Diameter

Diameter is the longest geodesic between any two vertices.

In other words, it is the longest shortest path in the network.

Diameter=max(dist(u,v))Diameter=max(dist(u,v))

For the network above, one longest shortest path is:

M -> A -> J -> R

So the diameter is 3.

Average Path Length

Average path length is the average geodesic distance between any two vertices in the network.

Average path length=sum of all shortest path distancesnumber of vertex pairsAverage\ path\ length=\frac{\text{sum of all shortest path distances}}{\text{number of vertex pairs}}

For example, if the shortest path distances are:

1, 1, 2, 1, 2, 1

Then:

Average path length=ave(1,1,2,1,2,1)=1.333Average\ path\ length=ave(1,1,2,1,2,1)=1.333

Degree Distribution

Degree distribution describes how many vertices have each degree.

It describes the number of vertices by the degree that vertex has.

Degree Distribution of network:

DegreeN
00
12
21
32
41

Density

Density measures how many edges exist in the network compared with how many edges could possibly exist.

For an undirected simple graph:

den(g)=EgVg(Vg1)/2den(g)=\frac{|E_g|}{|V_g|(|V_g|-1)/2}

Where:

Eg=number of edges in graph gVg=number of vertices in graph gVg(Vg1)/2=maximum possible number of edges\begin{aligned} |E_g| &= \text{number of edges in graph } g \\ |V_g| &= \text{number of vertices in graph } g \\ |V_g|(|V_g|-1)/2 &= \text{maximum possible number of edges} \end{aligned}

A higher density means the network is more connected.

Clustering Coefficient

Clustering coefficient is also called transitivity.

It is the proportion of triangles relative to the number of connected triples.

clt(g)=3τΔ(g)τ3(g)clt(g)=\frac{3\tau_{\Delta}(g)}{\tau_3(g)}

Where:

τΔ(g)=number of triangles in graph gτ3(g)=number of connected triples in graph g\begin{aligned} \tau_{\Delta}(g) &= \text{number of triangles in graph } g \\ \tau_3(g) &= \text{number of connected triples in graph } g \end{aligned}

A higher clustering coefficient means the network has more triangle-like local groups.

Vertex Importance

Vertex importance means how important or central a vertex is in the network.

The importance of a vertex can be based on:

  • Number of connections with other vertices
  • Centrality of vertex within the network

The number of connections can be measured by degree.

Centrality can measure the strategic power of a vertex to control information in the network.

Common centrality measures:

  • Degree
  • Closeness
  • Betweenness
  • Eigenvector

Degree Centrality

Degree centrality is based on the number of edges connected to a vertex.

A vertex with higher degree has more direct connections with other vertices.

It can be interpreted as the size of the vertex’s neighbourhood.

Closeness Centrality

Closeness centrality measures how close a vertex is to all other vertices in the network.

A vertex is close if it has small total shortest distance to all other vertices.

Closeness centrality is the inverse of the total shortest distance between one vertex and all other vertices.

CCl(v)=1uvd(v,u)C_{Cl}(v)=\frac{1}{\sum_{u \neq v}d(v,u)}

Where d(v,u)d(v,u) is the shortest distance between vertex vv and vertex uu.

For example, if the shortest distances from JJ to all other vertices are:

1, 1, 2, 1, 2

Then:

CCl(J)=11+1+2+1+2=17C_{Cl}(J)=\frac{1}{1+1+2+1+2}=\frac{1}{7}

High closeness centrality means the vertex can reach other vertices with shorter paths.

So high closeness centrality means strong local influencer.

Betweenness Centrality

Betweenness centrality measures how often a vertex is between other vertices.

It counts the number of shortest paths between other vertices that pass through this vertex.

CB(v)=svtσst(v)σstC_B(v)=\sum_{s\neq v\neq t}\frac{\sigma_{st}(v)}{\sigma_{st}}

Where:

CB(v)=betweenness centrality of vertex vσst=the number of shortest paths from s to tσst(v)=the number of shortest paths from s to t that pass through v\begin{aligned} C_B(v) &= \text{betweenness centrality of vertex } v \\ \sigma_{st} &= \text{the number of shortest paths from } s \text{ to } t \\ \sigma_{st}(v) &= \text{the number of shortest paths from } s \text{ to } t \text{ that pass through } v \end{aligned}

If there is more than one shortest path, the count may be proportional.

In the other word, betweenness centrality is the sum of the number of shortest paths between other vertices that pass through this vertex.

A vertex with high betweenness centrality can act as a bridge, hub, or relay in the network.

For example, if many shortest paths must go through vertex JJ, then JJ has high betweenness centrality.

If a vertex is not on any shortest path between other vertices, then its betweenness centrality is 0.

Eigenvector Centrality

Eigenvector centrality measures the importance of a vertex based on the importance of its neighbours.

The idea is that connecting to important vertices makes a vertex more important.

It is difficult to calculate by hand, but the concept is useful.

For example, Google PageRank is related to this idea.

Which Centrality Measure

Different centrality measures describe different types of importance.

Degree centrality measures direct connections.

Closeness centrality measures how well a vertex is connected locally.

Betweenness centrality measures hub or bridge potential.

Eigenvector centrality measures the quality of connections.

Hand Calculation Example

For a small graph, common hand calculation questions include:

  • Average path length
  • Diameter
  • Degree of a vertex
  • Degree distribution
  • Betweenness centrality
  • Closeness centrality
  • Cliques

Average path length is the average of all geodesic distances between pairs of vertices.

Diameter is the longest geodesic distance in the network.

Degree of a vertex is the number of edges connected to that vertex.

Clique is a subgraph where every vertex is connected to every other vertex.

For closeness centrality, first calculate the shortest distance from the chosen vertex to all other vertices, then take the inverse of the total distance.

For betweenness centrality, check which shortest paths between other vertices pass through the chosen vertex.

From the class example in slide:

  • Average path length = ave(1,1,2,1,2,1)=1.333ave(1,1,2,1,2,1)=1.333
  • Diameter = 2
  • dY=3d_Y=3
  • Degree distribution = 3, 2, 2, 1
  • CB(Y)=2C_B(Y)=2
  • CCl(Y)=11+1+1=0.333C_{Cl}(Y)=\frac{1}{1+1+1}=0.333
  • Cliques include W-X, W-Y, X-Y, Y-Z, and W-X-Y

反向链接