# Difference between revisions of "Set covering problem"

Authors: Sherry Liang, Khalid Alanazi, Kumail Al Hamoud

## Introduction

The set covering problem is a significant NP-hard problem in combinatorial optimization. In the set covering problem, two sets are given: a set U of elements and a set S of subsets of the set U. Each subset in S is associated with a predetermined cost, and the union of all the subsets covers the set U. This combinatorial problem then concerns finding the optimal number of subsets whose union covers the universal set while minimizing the total cost.1 The problem has many applications in the airline industry, and it was explored on an industrial scale as early as the 1970s.2

## Methodology

The mathematical formulation of the set covering problem is define as follows. We define U = {u1, ….. um} as the universe of elements and S = {s1, ….sn} as a collection of subsets such that si ${\displaystyle \in }$U and the union of si cover U (i.e. ${\displaystyle \cup }$si = U ). Addionally, each set si must cover at least one element of U and has associated cost c that is larger than zero (i.e. ci > 0). The objective is to find the minimum cost sub-collection of sets X ${\displaystyle \in }$ S that covers all the elements in the universe U.

An integer linear program (ILP) model can be formulated for the minimum set covering problem as follows:

Decision variables

${\displaystyle y_{i}={\begin{cases}1,&{\text{if subset }}i{\text{ is selected}}\\0,&{\text{otherwise }}\end{cases}}}$

Objective function

minimize ${\displaystyle \sum _{i=1}^{n}c_{i}y_{i}}$

Constraints

${\displaystyle \sum _{i=1}^{n}a_{i}jy_{i}>=1,\forall j=1,....,m}$

${\displaystyle y_{i}\in \{0,1\},\forall i=1,....,n}$

The objective function ${\displaystyle \sum _{i=1}^{n}c_{i}y_{i}}$ is defined to minimize the number of subset si that cover all elements in the universe by minimizing its total cost. The first constraint implies that every element j in the universe U must be be covered where aij =1 if j ${\displaystyle \in }$ si and 0 otherwise. The second constraint ${\displaystyle y_{i}\in \{0,1\}}$ indicates that the decision variables are binary.

## Applications

The applications of the set covering problem span a wide range of applications, but its usefulness is evident in industrial and governmental planning. Variations of the set covering problem that are of practical significance include the following.

The optimal location problem
This set covering problems is concerned with maximizing the coverage of some public facility placed at different locations.3 Consider the problem of placing fire stations to serve the towns of some city.4 If each fire station can serve its town and all adjacent towns, we can formulate a set covering problem where each subset consists of a set of adjacent towns. The problem is then solved to minimize the required number of fire stations to serve the whole city.
The optimal route selection problem
Consider the problem of selecting the optimal bus routes to place pothole detectors. Due to scarcity of the physical sensors, the problem does not allow for placing a detector at every road. The task of finding the maximum coverage using a limited number of detectors could be formulated as a set covering problem.5 Specifically, giving a collection of routes U, where each route itself is divided into segments. The segments of two routes can overlap. The goal is then to select the routes that maximize number of covered segments.
The airline crew scheduling problem
An important application of large-scale set covering is the airline crew scheduling problem, which pertains to assigning airline staff to work shifts.2,6 Thinking of the collection of flights as a universal set to be covered, we can formulate a set covering problem to search for the optimal assignment of employees to flights.

## Conclusion

The set covering problem, which aims to find the least number of subsets that cover some universal set, is a widely known NP-hard combinatorial problem. Due to its applicability to route planning and airline crew scheduling, several methods have been proposed to solve it. Its straightforward formulation allows for the use of off-the-shelf optimizers to solve it. Moreover, heuristic techniques and greedy algorithms can be used to solve large-scale set covering problems for industrial applications.

## References

1. Grossman, T., & Wool, A. (1997). Computational experience with approximation algorithms for the set covering problem. European Journal of Operational Research, 101(1), 81-92. doi:10.1016/s0377-2217(96)00161-0
2. RUBIN, J. (1973). A Technique for the Solution of Massive Set Covering Problems, with Application to Airline Crew Scheduling. Transportation Science, 7(1), 34-48. Retrieved November 23, 2020, from http://www.jstor.org/stable/25767684
3. Church, R., ReVelle, C. The maximal covering location problem. Papers of the Regional Science Association 32, 101–118 (1974). https://doi.org/10.1007/BF01942293
4. Aktaş, E., Özaydın, Ö, Bozkaya, B., Ülengin, F., & Önsel, Ş. (2013). Optimizing Fire Station Locations for the Istanbul Metropolitan Municipality. Interfaces, 43(3), 240-255. doi:10.1287/inte.1120.0671
5. Ali, J., & Dyo, V. (2017). Coverage and Mobile Sensor Placement for Vehicles on Predetermined Routes: A Greedy Heuristic Approach. Proceedings of the 14th International Joint Conference on E-Business and Telecommunications. doi:10.5220/0006469800830088
6. Marchiori, E., & Steenbeek, A. (2000). An Evolutionary Algorithm for Large Scale Set Covering Problems with Application to Airline Crew Scheduling. EvoWorkshops.