Sparse Reconstruction with Compressed Sensing

From Cornell University Computational Optimization Open Textbook - Optimization Wiki
Jump to navigation Jump to search

Author: Ngoc Ly (SysEn 5800 Fall 2021)

Introduction

sub module goal

The goal of compressed sensing is to solve the underdetermined linear system in which the number of variables is much greater than the number of observations, resulting in an infinite number of signal coefficient vectors for the same set of compressive measurements . The objective is to reconstruct a vector in a given of measurements and a sensing matrix A. Instead of taking a large number of high-resolution measurements and discarding the majority of them, consider taking far fewer random measurements and reconstructing the original with high probability from its sparse representation.

sub modual

Begin with a linear equation , where is a sensing matrix that must be obtained and will result in either exact or approximated optimum solution depending on how it is chosen, is a signal vector with at most -sparse entries, which means has non-zero entries, be an index set, is a compressed measurement vector, , is a noise vector and assumed to be bounded if it exists, and .


sub module sparsity

A vector is said to be -sparse in if it has at most nonzero coefficients. The support of is , and is a -sparse signal when the cardinality . The set of -sparse vectors is denoted by . Consequently, there are different subsets of -sparse vectors. If a random -sparse is drawn uniformly from , its entropy is approximately equivalent to bits are required for compression of ~cite(Measurements vs Bits).


The idea is to search for the sparsest from the measurement vector and a sensing matrix with . If the number of linear measurements is at least twice as its sparsity , i.e., , then there exists at most one signal that satisfies the constraint and produce the correct result for any [coluccia2015 book7]. Hence, the reconstruction problem can be formulated as an "norm" program.

This optimization problem minimizes the number of nonzero entries of subject to the constraint , that is to find the sparsest element in the affine space [2019 book33]. It turns out to be a combinatorial optimization problem, which is NP-Hard because it includes all possible sets of -sparse out of . Furthermore, if noise is present, the recovery is unstable [Buraniuk "compressed sensing"].

Restricted Isometry Property (RIP)

A matrix is said to satisfy the RIP of order if for all has a . A restricted isometry constant (RIC) of is the smallest satisfying this condition [2019 book38, coluccia2015 book7].

Under projections through matrix , the restricted isometry property allows -sparse vectors to have unique measurement vectors . If meets RIP, then does not send two distinct -sparse to the same measurement vector , indicating that is a unique solution under RIP.


If the matrix satisfies the RIP condition of order and the constant , there are two distinct -sparse vectors in . When they are equal, the restricted isometry property holds. If is a -order RIP matrix, it means that no two -sparse vectors are mapped to the same measurement vector by . In other words, when working with sparse vectors, the RIP ensures that the columns of are nearly orthonormal. Furthermore, is an approximately norm-preserving function, which means that it preserves its distance when mapping for -sparse signals for all or more as approaches zero. [Candes, Romberg, Tao[4]] demonstrate that if is -sparse, and satisfies the RIP of order with RIP-constant , then gives a unique sparse solution. The convex optimization problem is the same as the solution to the program and can solve by the Linear Program [2019 book38, coluccia2015 book7]. Hence, the reconstruction problem is as followed which can be solved by basis pursuit.


Theory

Two main things need to be considered when recovering

  • (1) The design of the sensing matrix
  • (2) The recovery algorithm

Sensing Matrix

Although the sensing matrix satisfies RIP of order in some situations, confirming a given matrix meets RIP's criteria is NP-hard in general. As a result, designing an efficient sensing matrix is critical. These sensing matrices are responsible for signal compression at the encoder end and accurate or approximate reconstruction at the decoder end. For signal compression, different sensing matrices are utilized in compressed sensing. There are random, deterministic, structural, and optimized sensing matrices are used in compressed sensing [2020_book].

  • Random Sensing Matrices

Some classes of random matrices satisfy RIP, specifically those matrices with independent and identically distributed (i.i.d.) entries drawn from a sub-Gaussian distribution. It requires the number of measurements, , to recover with high probability. Other popular random sensing matrices are Gaussian, Bernoulli, or Rademacher distributions. However, those random dense matrices incur tremendous computational and memory costs, making them unsuitable for large-scale applications. Several researchers have turned to sparse measurement matrices such as binary or bi-adjacency matrices rather than random matrices to address this issue. Nonetheless, those sparse matrices are unstructured in the same way that acyclic networks or trees are. Fortunately, the new random Weibull matrices [9] are built with appropriate observations and provide exact sparse signal reconstruction with a greater probability [2020_book].

  • Deterministic Sensing Matrices

Although deterministic sensing matrices are insufficient to satisfy the RIP condition, they can be used to provide instances for novel concepts. The Vandermond matrices are one of the best deterministic matrices for recovering the -sparse signal; however, the reconstruction technique gets unstable as rises. Despite the lack of RIP support in the deterministic sensing matrices, it successfully recovered the original sparse signal for the chirp function-based employing complex-valued deterministic matrices. According to the researchers Amini and Marvasti [13], binary, bipolar, and ternary matrices are deterministic constructions of sensing matrices that satisfy the RIP of order . Because of the cyclic feature, the reconstruction process can be sped up using a fast Fourier transform (FFT) technique. Another type of deterministic CS sensing matrix is one based on mutual coherence. The finite geometry-based sparse binary matrices were constructed with low coherence [18,19]. The sparseness property of matrices aids in reducing storage requirements and improving the reconstruction process [2020_book].

  • Structural Sensing Matrices

Sensing technologies require structured measurement matrices to perform a variety of tasks. Those matrices can be easily created with a small number of parameters. Furthermore, structured matrices can be used to speed the recovery performance of algorithms, making these matrices suitable for big data challenges. Many researchers created Toeplitz and circulant random sensing matrices used in multipath sparse channel estimation and network systems. In terms of estimated accuracy, signal reconstruction speed, and coherence, these matrices perform similar to i.i.d. Gaussian satisfies RIP with high probability. The new sparse block circulant matrix (SBCM) structure greatly reduces computational complexity. Other structural sensing matrices include the Hadamard matrix, which provides near-optimal assurances of recovery while requiring less complexity and so allowing for simple hardware implementation [2020_book].

  • Optimized Sensing Matrices

In order to accomplish high-quality signal reconstruction, the sensing matrices must satisfy RIP. Regardless of what might be expected, the RIP is difficult to verify. Another method for validating the RIP is to compute the mutual coherence between the sensing matrix and the sparse matrix or figure the Gram matrix as . The objective is to improve the sensing matrix to lower coherence using numerous strategies such as the random-detecting framework-based improvement strategy using the shrinkage technique and the irregular estimation of sensing matrix improvement employing a symmetrical strategy. Another technique is to optimize the sensing matrix using a block-sparsified dictionary approach, decreasing the total inter-block and sub-block coherence of the dictionary matrix and thereby significantly increasing the reconstruction [2020_book]


Mutual Coherence

The recovery algorithm often refers to the measurement of quantities that are appropriate to the measurement matrix (sensing matrix), i.e., the coherence. In general, the performance of the recovery algorithm gets better if the coherence is getting smaller. It means the columns of the matrices that have medium size are well-conditioned (governed).


Let and assume -normalized columns of , the mutual coherence is defined by

[1] , where is the th column of


The feasibility of attaining the lower bounds for the coherence of a matrix , with -normalized columns, and the columns of the matrix are equiangular tight frames defined as


In compressed sensing, a small coherence and a sensing matrix , where is much larger than , are the major important requirements.

Let a matrix with -normalized columns and let . Then for all -sparse vectors

Algorithms

Three big groups of algorithms are:[2]

  • Optimization methods: includes minimization i.e. Basis Pursuit, and quadratically constraint

minimization i.e. basis pursuit denoising.

  • Greedy methods: include orthogonal matching pursuit (OMP) and compressive sampling matching pursuit (CoSaMP).
  • Thresholding-based methods: such as iterative hard thresholding (IHT) and iterative soft thresholding (IST), approximate IHT or AM-IHT, and many more.

More cutting-edge methods include dynamic programming.

We will cover one, i.e. IHT. WHY IHT THEN? Basis pursuit, matching pursuit type algorithms belong to a more general class of iterative thresholding algorithms. [3] So IHT seems like the ideal place to start. If everything compliment with RIP, then IHT has fast convergence.

Algorithm IHT

The convex program mentioned in introduction has an equivalent nonconstraint optimization program.

(cite IT for sparse approximations)  ??? [1]. In statistics we call the regularization LASSO with as the regularization parameter. This is the closest convex relaxation to the first program mentioned in the introduction.[The Benefit of Group Sparsity]

Then

sub modual

Define the threashholding operators as: selects the best-k term approximation for some k.

Stopping criterion is [4]

  • Input and
  • output , an -sparse solution to
  • Set
  • While stopping criterion false do
    • end while
  • return:

is an Adjoint matrix i.e. the transpose of it's cofactor.

Numerical Example

sub module iteration 1

calculate

Thresholding

Calculate the Error

Calculate the new y

Check if error is less then

sub module iteration 2

calculate

Thresholding

Calculate the Error

Calculate the new y

Check if error is less then

Stopping condition meat

Applications and Motivations

Low-Rank Matrices

The Netflix Prize was accompanied by low-rank matrix recovery or the matrix completion problem.  The approach then fills in the missing values in the user's ratings for movies that the user hasn't seen. These estimates are based on ratings from other users, who have similar ratings if a matrix is created with all the users as rows and the movie titles as columns. Because some users' interests will be similar and therefore overlap, it is possible to reduce the degrees of freedom significantly. This low-rank structure is frequently assumed for the problem domain of collaborative filtering ~cited citations.

Dictionary Learning

The goal in dictionary learning is to infer the original dictionary as possible. Instead of using a predefined dictionary, researchers have found that learning the dictionary by obtaining "dynamic features" from training data often yields representation. Biometric features can be taken from video clips of each subject in a dataset and used to populate the dictionary's columns. Using random projections and sparse representations for iris detection for noncontact biometrics-based authentication systems from video samples has been proposed ~cited citations.

Single-pixel cameras

Single-pixel cameras or single detector imaging are used in situations when detectors are either prohibitively expensive or difficult to miniaturize. A microarray is made up of a large number of miniature mirrors that can be individually turned on and off. The mechanism behind the random sampling, which results in low coherence between measurements, is the most important component of the single-pixel camera. This microarray reflects the light from the scene, and a lens combines all of the reflected beams into one sensor, which is the single detector of the camera used to capture the image ~cited citations.

Conclusion

The theory was a buildup from what is an inverse problem and sparsity. There it develops into the l0 norm and then concludes l1 norm which are sufficient conditions for basis pursuit. 

Although the sensing matrix fulfills RIP of order k in some cases, establishing that a given matrix satisfies RIP's conditions is NP-hard in general.  In many cases verifying the sensing matrix isn't a reasonable task. So designing the sensing matrix is crucial. It needs to satisfy RIP and be well suited for the problem domain. As well as needs to be computational less of a burden. This takes some creativity and some understanding of the problem domain. It may be concluded the sensing matrix is the most important component and is deffenetaly the most philosophical.

Referencse

[5] [6] [7] [8] [9] [10]


[11]

Cite error: Closing </ref> missing for <ref> tag

Cite error: Closing </ref> missing for <ref> tag

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]


  1. 1.0 1.1 Cite error: Invalid <ref> tag; no text was provided for refs named :1
  2. Cite error: Invalid <ref> tag; no text was provided for refs named :0
  3. Cite error: Invalid <ref> tag; no text was provided for refs named :4
  4. Cite error: Invalid <ref> tag; no text was provided for refs named :2
  5. D. L. Donoho, “Compressed sensing,” vol. 52, pp. 1289–1306, 2006, doi: 10.1109/tit.2006.871582.
  6. E. J. Candès and T. Tao, “Decoding by linear programming,” IEEE Trans. Inf. Theory, vol. 51, no. 12, Art. no. 12, 2005, doi: 10.1109/TIT.2005.858979.
  7. D. L. Donoho, “Compressed sensing,” vol. 52, pp. 1289–1306, 2006, doi: 10.1109/tit.2006.871582.
  8. T. Blumensath and M. E. Davies, “Iterative Hard Thresholding for Compressed Sensing,” May 2008.
  9. S. Foucart and H. Rauhut, A mathematical introduction to compressive sensing. New York [u.a.]: Birkhäuser, 2013.
  10. R. G. Baraniuk, “Compressive Sensing [Lecture Notes],” IEEE Signal Processing Magazine, vol. 24, no. 4, Art. no. 4, 2007, doi: 10.1109/MSP.2007.4286571.
  11. 2. Emmanuel J. Candès and Terence Tao. Decoding by linear programming. IEEE
  12. 12. E. J. Candes, J. Romberg, and T. Tao. Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. 52:489–509, 2006.
  13. 17. Mohammed Rostami. Compressed sensing with side information on feasible re- gion, 2013.
  14. 23. Laska Jason Noah. Rice university regime change: Sampling rate vs. bit-depth in compressive sensing, 2011.
  15. 33. Richard G. Baraniuk. Compressive sensing [lecture notes]. IEEE Signal Processing Magazine, 24(4):118–121, 2007.
  16. 30. Richard Baraniuk, Mark Davenport, Ronald DeVore, and Michael Wakin. A simple proof of the restricted isometry property for random matrices. 28:253–263, 2008.
  17. 40. Emmanuel Candes, Justin Romberg, and Terence Tao. Stable signal recovery from incomplete and inaccurate measurements. March 2005.
  18. 1348. Niklas Koep, Arash Behboodi, and Rudolf Mathar. An introduction to compressed sensing, 2019.
  19. 49. Martin Burger, Janic Föcke, Lukas Nickel, Peter Jung, and Sven Augustin. Recon- struction methods in thz single-pixel imaging, 2019.
  20. 50. J. K. Pillai, V. M. Patel, R. Chellappa, and N. K. Ratha. Secure and robust iris recognition using random projections and sparse representations. 33:1877–1893, 2011.