Independent set problem

Problem definition

In graph theory, an independent set is a set of vertices in a graph, no two of which are adjacent.

In the following, we are going to solve the solution space properties of the independent set problem on the Petersen graph. To start, let us define a Petersen graph instance.

using GenericTensorNetworks, Graphs

graph = Graphs.smallgraph(:petersen)
{10, 15} undirected simple Int64 graph

We can visualize this graph using the show_graph function

# set the vertex locations manually instead of using the default spring layout
rot15(a, b, i::Int) = cos(2i*π/5)*a + sin(2i*π/5)*b, cos(2i*π/5)*b - sin(2i*π/5)*a
locations = [[rot15(0.0, 60.0, i) for i=0:4]..., [rot15(0.0, 30, i) for i=0:4]...]
show_graph(graph, locations; format=:svg)

The graphical display is available in the following editors

Generic tensor network representation

The independent set problem can be constructed with IndependentSet type as

iset = IndependentSet(graph)
IndependentSet{SimpleGraph{Int64}, Int64, UnitWeight}(SimpleGraph{Int64}(15, [[2, 5, 6], [1, 3, 7], [2, 4, 8], [3, 5, 9], [1, 4, 10], [1, 8, 9], [2, 9, 10], [3, 6, 10], [4, 6, 7], [5, 7, 8]]), [1, 1, 1, 1, 1, 1, 1, 1, 1, 1])

The tensor network representation of the independent set problem can be obtained by

problem = GenericTensorNetwork(iset; optimizer=TreeSA())
GenericTensorNetwork{IndependentSet{SimpleGraph{Int64}, Int64, UnitWeight}, OMEinsum.SlicedEinsum{Int64, OMEinsum.DynamicNestedEinsum{Int64}}, Int64}
- open vertices: Int64[]
- fixed vertices: Dict{Int64, Int64}()
- contraction time = 2^7.977, space = 2^4.0, read-write = 2^8.676

where the key word argument optimizer specifies the tensor network contraction order optimizer as a local search based optimizer TreeSA.

Here, the key word argument optimizer specifies the tensor network contraction order optimizer as a local search based optimizer TreeSA. The resulting contraction order optimized tensor network is contained in the code field of problem.

Theory (can skip)

Let $G=(V, E)$ be a graph with each vertex $v\in V$ associated with a weight $w_v$. To reduce the independent set problem on it to a tensor network contraction, we first map a vertex $v\in V$ to a label $s_v \in \{0, 1\}$ of dimension $2$, where we use $0$ ($1$) to denote a vertex absent (present) in the set. For each vertex $v$, we defined a parameterized rank-one tensor indexed by $s_v$ as

\[W(x_v^{w_v}) = \left(\begin{matrix} 1 \\ x_v^{w_v} \end{matrix}\right)\]

where $x_v$ is a variable associated with $v$. Similarly, for each edge $(u, v) \in E$, we define a matrix $B$ indexed by $s_u$ and $s_v$ as

\[B = \left(\begin{matrix} 1 & 1\\ 1 & 0 \end{matrix}\right).\]

Ideally, an optimal contraction order has a space complexity $2^{{\rm tw}(G)}$, where ${\rm tw(G)}$ is the tree-width of $G$ (or graph in the code). We can check the time, space and read-write complexities by typing

contraction_complexity(problem)
Time complexity: 2^7.977279923499915
Space complexity: 2^4.0
Read-write complexity: 2^8.67595703294175

For more information about how to improve the contraction order, please check the Performance Tips.

Solution space properties

Maximum independent set size $\alpha(G)$

We can compute solution space properties with the solve function, which takes two positional arguments, the problem instance and the wanted property.

maximum_independent_set_size = solve(problem, SizeMax())[]
4.0ₜ

Here SizeMax means finding the solution with maximum set size. The return value has Tropical type. We can get its content by typing

read_size(maximum_independent_set_size)
4.0

Counting properties

Count all solutions and best several solutions

We can count all independent sets with the CountingAll property.

count_all_independent_sets = solve(problem, CountingAll())[]
76

The return value has type Float64. The counting of all independent sets is equivalent to the infinite temperature partition function

solve(problem, PartitionFunction(0.0))[]
76.0

We can count the maximum independent sets with CountingMax.

count_maximum_independent_sets = solve(problem, CountingMax())[]
(4.0, 5.0)ₜ

The return value has type CountingTropical, which contains two fields. They are n being the maximum independent set size and c being the number of the maximum independent sets.

read_size_count(count_maximum_independent_sets)
4.0 => 5.0

Similarly, we can count independent sets of sizes $\alpha(G)$ and $\alpha(G)-1$ by feeding an integer positional argument to CountingMax.

count_max2_independent_sets = solve(problem, CountingMax(2))[]
30.0*x^3 + 5.0*x^4

The return value has type TruncatedPoly, which contains two fields. They are maxorder being the maximum independent set size and coeffs being the number of independent sets having sizes $\alpha(G)-1$ and $\alpha(G)$.

read_size_count(count_max2_independent_sets)
2-element Vector{Pair{Float64, Float64}}:
 3.0 => 30.0
 4.0 => 5.0
Find the graph polynomial

We can count the number of independent sets at any size, which is equivalent to finding the coefficients of an independence polynomial that defined as

\[I(G, x) = \sum_{k=0}^{\alpha(G)} a_k x^k,\]

where $\alpha(G)$ is the maximum independent set size, $a_k$ is the number of independent sets of size $k$. The total number of independent sets is thus equal to $I(G, 1)$. There are 3 methods to compute a graph polynomial, :finitefield, :fft and :polynomial. These methods are introduced in the docstring of GraphPolynomial.

independence_polynomial = solve(problem, GraphPolynomial(; method=:finitefield))[]
1 + 10∙x + 30∙x2 + 30∙x3 + 5∙x4

The return type is Polynomial.

read_size_count(independence_polynomial)
5-element Vector{Pair{Int64, BigInt}}:
 0 => 1
 1 => 10
 2 => 30
 3 => 30
 4 => 5

Configuration properties

Find one best solution

We can use the bounded or unbounded SingleConfigMax to find one of the solutions with largest size. The unbounded (default) version uses a joint type of CountingTropical and ConfigSampler in computation, where CountingTropical finds the maximum size and ConfigSampler samples one of the best solutions. The bounded version uses the binary gradient back-propagation (see our paper) to compute the gradients. It requires caching intermediate states, but is often faster (on CPU) because it can use TropicalGEMM (see Performance Tips).

max_config = solve(problem, SingleConfigMax(; bounded=false))[]
(4.0, ConfigSampler{10, 1, 1}(0010111000))ₜ

The return value has type CountingTropical with its counting field having ConfigSampler type. The data field of ConfigSampler is a bit string that corresponds to the solution

single_solution = read_config(max_config)
0010111000

This bit string should be read from left to right, with the i-th bit being 1 (0) to indicate the i-th vertex is present (absent) in the set. We can visualize this MIS with the following function.

show_graph(graph, locations; format=:svg, vertex_colors=
    [iszero(single_solution[i]) ? "white" : "red" for i=1:nv(graph)])
Enumerate all solutions and best several solutions

We can use bounded or unbounded ConfigsMax to find all solutions with largest-K set sizes. In most cases, the bounded (default) version is preferred because it can reduce the memory usage significantly.

all_max_configs = solve(problem, ConfigsMax(; bounded=true))[]
(4.0, {1010000011, 1001001100, 0101010001, 0100100110, 0010111000})ₜ

The return value has type CountingTropical, while its counting field having type ConfigEnumerator. The data field of a ConfigEnumerator instance contains a vector of bit strings.

_, configs_vector = read_size_config(all_max_configs)
4.0 => StaticBitVector{10, 1}[1010000011, 1001001100, 0101010001, 0100100110, 0010111000]

These solutions can be visualized with the show_configs function.

show_configs(graph, locations, reshape(configs_vector, 1, :); padding_left=20)
Example block output

We can use ConfigsAll to enumerate all sets satisfying the independence constraint.

all_independent_sets = solve(problem, ConfigsAll())[]
{0100000110, 1000000110, 0000000110, 0100000011, 0100000010, 1010000011, 0010000011, 1010000010, 0010000010, 1000000011, 0000000011, 1000000010, 0000000010, 0100000100, 1000001100, 0000001100, 1000000100, 0000000100, 0100010001, 0100000001, 0100010000, 0100000000, 1010001000, 0010011000, 0010001000, 1010000001, 0010010001, 0010000001, 1010000000, 0010010000, 0010000000, 1000001000, 0000011000, 0000001000, 1000000001, 0000010001, 0000000001, 1000000000, 0000010000, 0000000000, 0101000100, 1001001100, 0001001100, 1001000100, 0001000100, 0101010001, 0101000001, 0101010000, 0101000000, 1001001000, 0001011000, 0001001000, 1001000001, 0001010001, 0001000001, 1001000000, 0001010000, 0001000000, 0100100110, 0000100110, 0100100010, 0010100010, 0000100010, 0100100100, 0000101100, 0000100100, 0100110000, 0100100000, 0010111000, 0010101000, 0010110000, 0010100000, 0000111000, 0000101000, 0000110000, 0000100000}

The return value has type ConfigEnumerator.

Sample solutions

It is often difficult to store all configurations in a vector. A more clever way to store the data is using the sum product tree format.

all_independent_sets_tree = solve(problem, ConfigsAll(; tree_storage=true))[]
+ (count = 76.0)
├─ + (count = 58.0)
│  ├─ + (count = 40.0)
│  │  ├─ * (count = 13.0)
│  │  │  ├─ * (count = 1.0)
│  │  │  │  ├─ OnehotVec{10, 2}(9, 1)
│  │  │  │  └─ OnehotVec{10, 2}(9, 1)
│  │  │  └─ + (count = 13.0)
│  │  │     ├─ * (count = 1.0)
│  │  │     │  ⋮
│  │  │     │  
│  │  │     └─ + (count = 12.0)
│  │  │        ⋮
│  │  │        
│  │  └─ + (count = 27.0)
│  │     ├─ * (count = 1.0)
│  │     │  ├─ * (count = 1.0)
│  │     │  │  ⋮
│  │     │  │  
│  │     │  └─ * (count = 1.0)
│  │     │     ⋮
│  │     │     
│  │     └─ + (count = 26.0)
│  │        ├─ * (count = 4.0)
│  │        │  ⋮
│  │        │  
│  │        └─ + (count = 22.0)
│  │           ⋮
│  │           
│  └─ * (count = 18.0)
│     ├─ OnehotVec{10, 2}(4, 1)
│     └─ * (count = 18.0)
│        ├─ OnehotVec{10, 2}(4, 1)
│        └─ + (count = 18.0)
│           ├─ * (count = 1.0)
│           │  ⋮
│           │  
│           └─ + (count = 17.0)
│              ⋮
│              
└─ * (count = 18.0)
   ├─ OnehotVec{10, 2}(5, 1)
   └─ + (count = 18.0)
      ├─ * (count = 5.0)
      │  ├─ * (count = 1.0)
      │  │  ├─ OnehotVec{10, 2}(9, 1)
      │  │  └─ OnehotVec{10, 2}(9, 1)
      │  └─ + (count = 5.0)
      │     ├─ * (count = 1.0)
      │     │  ⋮
      │     │  
      │     └─ + (count = 4.0)
      │        ⋮
      │        
      └─ + (count = 13.0)
         ├─ * (count = 1.0)
         │  ├─ * (count = 1.0)
         │  │  ⋮
         │  │  
         │  └─ * (count = 1.0)
         │     ⋮
         │     
         └─ + (count = 12.0)
            ├─ * (count = 2.0)
            │  ⋮
            │  
            └─ + (count = 10.0)
               ⋮
               

The return value has the SumProductTree type. Its length corresponds to the number of configurations.

length(all_independent_sets_tree)
76.0

We can use Base.collect function to create a ConfigEnumerator or use generate_samples to generate samples from it.

collect(all_independent_sets_tree)

generate_samples(all_independent_sets_tree, 10)
10-element Vector{StaticBitVector{10, 1}}:
 0000000100
 0100000000
 0010011000
 0000001000
 0101000100
 0001000100
 0001001100
 1001001000
 0000100100
 0010110000

This page was generated using Literate.jl.