Story about edge coloring of graph

By ko_osaga, history, 20 months ago,

You are given a graph GG, and for each vertex vv you have to assign a positive integer color such that every adjacent pair of vertices (vertices directly connected by edge) have different color assigned. You have to minimize the maximum color assigned: In other words, you have to minimize the number of distinct assigned colors.

But that's graph coloring (vertex coloring). It's hard. Ok, one more time:

You are given a graph GG, and for each edge ee you have to assign a positive integer color such that every adjacent pair of edges (edges sharing same endpoints) have different color assigned. You have to minimize the maximum color assigned: In other words, you have to minimize the number of distinct assigned colors.

This is the edge coloring of graph, and I will talk about this now.

Note that this can be interpreted as "graph coloring (vertex coloring) of line graphs". In that sense, you can consider in a similar spirit to "graph coloring of interval graphs", "graph coloring of permutation graphs", blah-blah.

General graph

Let D=maxv∈V(G)deg(v)D=maxv∈V(G)deg(v) be the maximum degree of a graph. Since every vertex should be incident to edges with different colors, the edges can't be colored with less than DD colors.

So if we somehow find the way to color the edges with DD colors, the case is closed. This is obviously not true: Consider a 3-cycle, then D=2D=2, but you need 3 colors.

Still, Vizing discovered that this is approximately true:

Theorem (Vizing, 1964). Every simple graph can be edge-colored with D+1D+1 colors.
Proof: If you are interested then see here.

Note that:

Theorem doesn't hold if the graph has multiple edges connecting same vertices.
It's NP-hard to determine if the graph can be colored with DD colors, so optimal coloring is still NP-hard.

Wikipedia discusses the O(nm)O(nm) algorithm to find a D+1D+1 edge coloring of simple graph, which you can find the implementation here. Sometimes, people asks you to solve just that exact same problem, and sometimes you can just cheese problems without thinking too much. Anyway, this is not our main point. let's jump to more interesting part.

Bipartite Graphs

To find an optimal edge coloring, we have to prove that the edges can be colored with DD colors. In general graph this was a little (one color!) bit of fail, but as the countercase had an odd cycle, maybe in bipartite graph this is true?

Theorem. Every bipartite graph can be edge-colored with DD colors.
Proof. We use induction on DD.
If D=0D=0, the proposition is trivial.
Otherwise, we will partition the graph GG with a matching MM and a graph G∖MG∖M with maximum degree D−1D−1. We transform the graph GG in such a way that the left/right bipartition have equal number of vertices (just add dummy nodes of degree 0), and every node have degree DD (repeatedly pick two nodes with degree <D<D and add a dummy edge. If this procedure fails, then two bipartition have different sum of degrees, which is impossible).
Let L,RL,R be a bipartition of this new graph. Now we assume every node of GG have degree DD. Suppose there is no perfect matching in GG. Then by Hall's theorem, there exists a subset of vertices S⊆LS⊆L such that |N(S)|<|S||N(S)|<|S|. Then there exists |S|×D|S|×D edges connecting between N(S)N(S) and SS. On the other hand, there can't exist more than |N(S)|×D|N(S)|×D edges incident to N(S)N(S). Contradiction. Thus the perfect matching exists.
Now remove the dummy edges from the perfect matching. Note that the dummy edges don't connect any node which originally had degree DD. So the edges left from the matching still covers all nodes with degree DD. Denote this matching MM. Then G∖MG∖M have maximum degree D−1D−1. Color the edges in MM with color DD, and color G∖MG∖M by inductive hypothesis.
Remark. Theorem still holds when the graph has multiple edges.

This also gives an algorithm to find a edge coloring of bipartite graphs. We just want to find any matching that covers all nodes with maximum degree DD. If all nodes have degree DD, then simple bipartite matching works. In general case, since we have to cover some nodes, we have to model this as flow and enforce some edges to have capacity 1. This can be modeled with maximum flow with demands. In any case, if you use Dinic's algorithm or Hopcroft/Karp, time complexity is O(D×MN0.5)O(D×MN0.5).

So that's great, we can optimally solve edge coloring of bipartite graph in polynomial time. On the other hand, using matching or even flow with demands is pretty tiring and complicated, can we get rid of flows at all?

There is an implementation of edge coloring in O(NM)O(NM) with short code and good constant factor, which doesn't rely on flow argument. You can read about this algorithm here (problem F). Here is the implementation I use, which is copied from waterfalls' submission here.

Faster

Can we do this faster? Let's go back to Dinic's algorithm. Here's a simple yet elegant trick to reduce the time complexity from O(D×MN0.5)O(D×MN0.5) to O(MN0.5logD)O(MN0.5log⁡D).

The idea is, as you can guess from the loglog factor, divide-and-conquer. If DD is odd, you can just use the naive method to reduce the degree to D−1D−1. If DD is even, it partitions the graph into two pieces with maximum degree D2D2. But how?

Let's revisit the following well-known fact related to even degrees:

Lemma (Euler Circuit). For connected graphs where every node has even degree, there is a circuit which visits every edges exactly once.

And also, there exists an algorithm to compute the Euler circuits in linear time.

So now, let's add some dummy edges again to connect the vertices with odd degree. Obviously, there exists even number of vertices with odd degree, so we can add any kind of matchings between them. Now, every connected component has an Euler circuit, so we can compute them for linear time.

Now, note that the Euler circuit obviously have even length in bipartite graphs, let's denote the circuit as C=e1,e2,…,e2kC=e1,e2,…,e2k. Let's color the odd-numbered edges (e1,e3,e5…e1,e3,e5…) as blue, and even-numbered edges as red. Then, for each vertex vv, we can see it is adjacent to deg(v)2deg(v)2 blue nodes and deg(v)2deg(v)2 red nodes! This is because, if the circuit entered the vertex with some color, it leaves the vertex with different colors.

Thus, just take the Euler circuit for each component, and divide the edges into blue and red sets. Remove dummy edges, and you still have max degree D2D2 for each color. So you can recurse from that point! Since every edges are considered at at most O(logD)O(log⁡D) recursion stage, and they contribute to at most N0.5N0.5 overhead, the time complexity is O(MN0.5logD)O(MN0.5log⁡D).

More faster!!

Now let's look for something genuinely fast: Fast enough that it doesn't involve any flow and have O(MlogMlogD)O(Mlog⁡Mlog⁡D) solution!

Going back to the original proof, we know that it is easier to assume that the graph is regular: Every node has degree of DD. Naive strategy of adding edges, as described in proof, adds too much edges and is inefficient. But we can improve it by "merging" small-degree nodes: If two node have sum of degree at most DD, then we can merge(contract) two nodes, and any coloring in this new graph still works in the original one.

So, for each side of biparitition, while there is at least 2 nodes with deg(v)≤D2deg(v)≤D2, find them and contract it. After the end of this procedure, we are left with at most 2 nodes (one for each bipartition) with deg(v)≤D2deg(v)≤D2. Now, even if we apply the naive way of adding dummy nodes and edges, we end up adding at most O(cM)O(cM) edges where c≤3c≤3 in my naive calculation.

Now let's assume the graph is regular. Thanks to regularity, we don't have to find flows anymore but can simply calculate a perfect matching in this regular graph. It doesn't look easy to find this faster than O(M1.5)O(M1.5), but there is an algorithm to compute a perfect matching of regular graph in O(MlogM)O(Mlog⁡M), which was discovered in 2010: https://arxiv.org/pdf/0909.3346.pdf

To briefly explain what's going on, the algorithm is actually not very different from finding a bipartite matching with flows. We construct a usual flow graph with source and sinks, and find an augmenting path. There are just two notable difference:

If the edge is in matching, rather than reversing it, we just contract two endpoint of such edge. You can see this doesn't make the situation different.
We don't use DFS to find path: We use random walk from source, which means we will sample any outgoing edges with random probability, and halt until it reaches the sink.

Intuitively, the second heuristic will run very fast in the initial stage of augmenting path finding. At the first stage the length of path will be length 3, and it will remain so until a certain period. Random walk procedure is much faster than DFS in such case, because it's time complexity is just it's walk length. Now, the paper claims the following:

Lemma 6 (from paper). If we found m<nm<n matchings in a current graph, then the expected number of steps taken by the random walk is at most 2+nn−m2+nn−m.

Then, we can simply do this for all 0≤m≤n−10≤m≤n−1, and get 2n+n∑ni=11i=O(nlogn)2n+n∑i=1n1i=O(nlog⁡n)!

Lastly, note that this problem can be solved in O(MlogM)O(Mlog⁡M) time. Interested readers can find it in the second reference link.

Challenge

The O(mn)O(mn) algorithm is very simple to implement and it usually don't find long Kempe chains. I doubt that is true for any fast algorithm I mentioned. Plus, it won't be cache oblivious. Can anyone implement any o(mn)o(mn) algorithm for bipartite edge coloring which runs faster than O(mn)O(mn) algorithm?

Practice problem

I added some practice problem for this topic. Some of them are directly related to my article, some of them checks your creativity at reducing different problems to bipartite edge coloring.

References

Also special thanks to 300iq for introducing me the paper for perfect matching.

Story about edge coloring of graph

General graph

Bipartite Graphs

Faster

More faster!!

Challenge

Practice problem

References

Recommend

"Convolution" sum of 2 increasing sequences.

It’s time to stop using Python 3.6

Set Up a Customer Loyalty Program for an Ecommerce Business | AWeber

只需要一个域用户即可拿到 DC 权限（CVE-2021-42287 and CVE-2021-42278）

電子紙的螢幕 BOOX Mira Series

Install OpenShift's Web Terminal Operator in any namespace

Why can't see the code of virtual contestant in gym

重庆警方破获惊天大案！年流水超150亿元，特大跨境网络赌博案侦破

Solution to import error of Pbds in windows

Codeforces Round #760 (Div. 3)

About Joyk