Solution: Minimize Malware Spread
Let's solve the Minimize Malware Spread problem using the Union Find pattern.
We'll cover the following
Statement
You’re given a network of nodes as an adjacency matrix graph with the node directly connected to the node if graph[i][j] == 1
.
A list of nodes, initial
, is given, which contains nodes initially infected by malware. When two nodes are connected directly and at least one of them is infected by malware, both nodes will be infected by malware. This spread of malware will continue until every node in the connected component of nodes has been infected.
After the infection has stopped spreading, will represent the final number of nodes in the entire network that have been infected with malware.
Return a node from initial
such that, when this node is removed from the graph, is minimized. If multiple nodes can be removed to minimize , return the node with the smallest index.
Note: If a node was removed from the initial list of infected nodes, it might still be infected later on due to the malware’s spread.
Constraints:
graph.length == graph[i].length
- n
graph[i][j]
is or .graph[i][j] == graph[j][i]
graph[i][i] == 1
-
initial.length
-
initial[i]
- All the integers in the
initial
are unique.
Solution
Since we have to check for infected nodes in each connected component, we need to find all the connected components in a graph. To do this, we use the UnionFind
data structure that finds all of the graphs’s connected components.
The steps of the algorithm are given below:
-
Find all of the graph’s connected components using the
UnionFind
data structure. -
Traverse the
initial
array and store all the connected components with at least one infected node in a hash map,infected
. -
Traverse the
initial
array again and calculate the number of infections (from theinfected
hash map) in each connected component containing one or more infected nodes.- If a connected component contains more than one infected node, ignore it and move to the next iteration of the loop.
- Otherwise, calculate the size of the connected component.
-
Find the connected component with the greatest size, containing only one initially infected node from step .
-
If there are multiple components that are the same size that would count as the largest connected component, choose the component with the smallest index.
The slides below illustrate how we would like the algorithm to run: