5.0

5.0 | 1 rating Rate this file 69 Downloads (last 30 days) File Size: 100.74 KB File ID: #32455
image thumbnail

Spectral / Phase-based Visual Saliency

by Boris Schauerte

 

07 Aug 2011 (Updated 15 Dec 2011)

Calculate the saliency of an image using DCT image signatures or the spectral residual.

| Watch this File

File Information
Description

This package provides implementations of several spectral (FFT,DCT)
saliency algorithms for images.

There are several methods (multichannel_method) to calculate the
multichannel saliency:
'fft': by default the same as 'fft:whitening'
'fft:whitening' Uses spectral whitening to calculate the saliency of
                each channel separately and then averages the result.
'fft:residual' Uses the spectral residual to calculate saliency of
                each channel separately and then averages the result.
'dct' Uses DCT-based image signatures to calculate saliency
                of each channel separately and then averages the
                result.
'quat:fft': Converts the image into a quaternion-based
                representation, uses quaternion FFT/IFFT operations.
'quat:dct' Converts the image into a quaternion-based
                representation, uses quaternion DCT/IDCT operations.
'quat:dct:fast' Same as 'quad:dct', but with a fixed image
                resolution of 64x48 and uses optimized .mex files for
                faster calculation.

Usage examples:
- spectral_saliency_multichannel(imread(..image path...))
  or as an example for other color spaces (e.g. ICOPP, Lab, ...)
- spectral_saliency_multichannel(rgb2icopp(imread(..image path...)))

If you use any of this work in scientific research or as part of a
larger software system, you are requested to cite the use in any
related publications or technical documentation. The work is based
upon:

    B. Schauerte, and R. Stiefelhagen, "Predicting Human Gaze using
    Quaternion DCT Image Signature Saliency and Face Detection," in IEEE
    Workshop on the Applications of Computer Vision (WACV), 2012.

Notes:
- The implementation of the quaternion-based approach requires the
  quaternion toolbox for Matlab (QTFM).
- I kept the implementations as focused and simple as possible and
  thus they lack more advanced functionality, e.g. more complex
  normalizations. However, I think that the provided functionality is
  more than sufficient for (a) people who want to get started in the
  field of visual attention (especially students), (b) practitioners
  who have heard about the spectral approach and want to try it, and
  (c) people who just need a fast, reliable, well-established visual
  saliency algorithm (with a simple interface and not too many
  parameters) for their applications.
- GBVS and Itti require the original GBVS Matlab implementation by
  J. Harel (see http://www.klab.caltech.edu/~harel/share/gbvs.php)

For more details on the method see:
[1] X. Hou and L. Zhang, "Saliency Detection: A Spectral Residual
    Approach", in CVPR, 2007.
    (original paper)
[2] C. Guo, Q. Ma, and L. Zhang, "Spatio-temporal saliency detection
    using phase spectrum of quaternion fourier transform," in CVPR,
    2008.
    (extension to quaternions; importance of the residual)
[3] X. Hou, J. Harel, and C. Koch, "Image Signature: Highlighting
    sparse salient regions," in PAMI, 2011.
    (uses DCT-based "image signatures")
[4] B. Schauerte, and R. Stiefelhagen, "Predicting Human Gaze using
    Quaternion DCT Image Signature Saliency and Face Detection," in IEEE
    Workshop on the Applications of Computer Vision (WACV), 2012.
    (extension to quaternions; spectral saliency and face detection;
     evaluation of spectral saliency approaches on eye-tracking data;
     achieved the currently best reported results on the CERF/FIFA
     eye-tracking data set and Toronto/Bruce-Tsotsos data set)

It has been applied quite a lot during the last years, e.g., see:
[5] B. Schauerte, B. Kuehn, K. Kroschel, R. Stiefelhagen, "Multimodal
    Saliency-based Attention for Object-based Scene Analysis," in
    IROS, 2011.
    ("simple" multi-channel and quaternion-based; Isophote-based
     saliency map segmentation)
[6] B. Schauerte, J. Richarz, G. A. Fink,"Saliency-based
    Identification and Recognition of Pointed-at Objects," in IROS,
    2010.
    (uses multi-channel on intensity, blue-yellow/red-green opponent)
[7] B. Schauerte, G. A. Fink, "Focusing Computational Visual
    Attention in Multi-Modal Human-Robot Interaction," in Proc. ICMI,
    2010
    (extended to a multi-scale and neuron-based approach that allows
     to incorporate information about the visual search target
     to facilitate joint attention in spoken human-robot interaction)

However, the underlying principle has been addressed long before:
[9] A. Oppenheim and J. Lim, "The importance of phase in signals,"
    in Proc. IEEE, vol. 69, pp. 529-541, 1981.

MATLAB release MATLAB 7.10 (2010a)
Other requirements - Quaternion Toolbox for Matlab (optional) - GCC (Linux) or MSVC (Windows) (optional)
Tags for This File  
Everyone's Tags
Tags I've Applied
Add New Tags Please login to tag files.
Comments and Ratings (1)
19 Jan 2012 Brahim HAMADICHAREF  
Please login to add a comment or rating.
Updates
10 Aug 2011

- added the quaternion-based multichannel saliency calculation

17 Aug 2011

- temporarily removed the Quaternion support (will be added again, soon)

02 Nov 2011

added the quaternion support again; added quaternion DCT signatures and saliency based on DCT image signatures; added another ICOPP conversion; added a visualization for quaternion images; added a README.txt

02 Nov 2011

fix

02 Nov 2011

- fix

02 Nov 2011

- fix

03 Nov 2011

- changed the default parameters

03 Nov 2011

- added an efficient C/C++ implementation of the QDCT image signatures
- added an example.m file for a simple example (including an example image)

09 Dec 2011

- updated the documentation

13 Dec 2011

- fixed an if-statement

15 Dec 2011

- C/C++ implementation now compiles without problems under Windows (MS Visual Studio Compiler / Windows SDK 7.1)

Tag Activity for this File
Tag Applied By Date/Time
image processing Boris Schauerte 08 Aug 2011 12:28:36
saliency Boris Schauerte 08 Aug 2011 12:28:36
attention Boris Schauerte 08 Aug 2011 12:28:36
spectral residual Boris Schauerte 08 Aug 2011 12:28:36
spetral whitening Boris Schauerte 08 Aug 2011 12:28:36
phasebased saliency Boris Schauerte 08 Aug 2011 12:28:36
object detection Boris Schauerte 08 Aug 2011 12:28:36
quaternion Boris Schauerte 10 Aug 2011 12:55:38
vision Boris Schauerte 10 Aug 2011 13:57:14
visual saliency Boris Schauerte 10 Aug 2011 13:57:14
visual attention Boris Schauerte 10 Aug 2011 13:57:14
dct image signatures Boris Schauerte 02 Nov 2011 08:57:46
image signatures Boris Schauerte 02 Nov 2011 08:57:46

Contact us at files@mathworks.com