Noise reduction

Noise reduction is the process of removing noise from a signal. Noise reduction techniques exist for audio and images. Noise reduction algorithms may distort the signal to some degree.

Performer Peter Alsop and a child demonstrate different ways of dealing with noise. Sound engineers employ a diverse range of strategies to combat noise in both audible and visual environments.

All signal processing devices, both analog and digital, have traits that make them susceptible to noise. Noise can be random or white noise with an even frequency distribution, or frequency-dependent noise introduced by a device's mechanism or signal processing algorithms.

In electronic recording devices, a major type of noise is hiss created by random electron motion due to thermal agitation at all temperatures above absolute zero. These agitated electrons rapidly add and subtract from the voltage of the output signal and thus create detectable noise.

In the case of photographic film and magnetic tape, noise (both visible and audible) is introduced due to the grain structure of the medium. In photographic film, the size of the grains in the film determines the film's sensitivity, more sensitive film having larger sized grains. In magnetic tape, the larger the grains of the magnetic particles (usually ferric oxide or magnetite), the more prone the medium is to noise. To compensate for this, larger areas of film or magnetic tape may be used to lower the noise to an acceptable level.

In general

Noise reduction algorithms tend to alter signals to a greater or lesser degree. The local signal-and-noise orthogonalization algorithm can be used to avoid changes to the signals.[1]

In seismic exploration

Boosting signals in seismic data is especially crucial for seismic imaging,[2][3] inversion,[4][5] and interpretation,[6] thereby greatly improving the success rate in oil & gas exploration.[7][8][9][10] The useful signal that is smeared in the ambient random noise is often neglected and thus may cause fake discontinuity of seismic events and artifacts in the final migrated image. Enhancing the useful signal while preserving edge properties of the seismic profiles by attenuating random noise can help reduce interpretation difficulties and misleading risks for oil and gas detection.

In audio

When using analog tape recording technology, they may exhibit a type of noise known as tape hiss. This is related to the particle size and texture used in the magnetic emulsion that is sprayed on the recording media, and also to the relative tape velocity across the tape heads.

Four types of noise reduction exist: single-ended pre-recording, single-ended hiss reduction, single-ended surface noise reduction, and codec or dual-ended systems. Single-ended pre-recording systems (such as Dolby HX and HX Pro, or Tandberg's Actilinear and Dyneq[11][12][13][14]) work to affect the recording medium at the time of recording. Single-ended hiss reduction systems (such as DNL[15] or DNR) work to reduce noise as it occurs, including both before and after the recording process as well as for live broadcast applications. Single-ended surface noise reduction (such as CEDAR and the earlier SAE 5000A and Burwen TNE 7000) is applied to the playback of phonograph records to attenuate the sound of scratches, pops, and surface non-linearities. Dual-ended systems have a pre-emphasis process applied during recording and then a de-emphasis process applied at playback.

Compander-based noise reduction systems

Dual-ended compander noise reduction systems include the professional systems Dolby A[15] and Dolby SR by Dolby Laboratories, dbx Professional and dbx Type I by dbx, Donald Aldous' EMT NoiseBX,[16] Burwen Laboratories' Model 2000[17][18][19] and Telefunken's telcom c4[15] as well as the consumer systems Dolby NR, Dolby B,[15] Dolby C and Dolby S, dbx Type II,[15] Telefunken's High Com[15] and Nakamichi's High-Com II, Toshiba's (Aurex AD-4) adres,[15] JVC's ANRS[15] and Super ANRS,[15] Fisher/Sanyo's Super D,[15] and the Hungarian/East-German Ex-Ko system.[20] These systems have a pre-emphasis process applied during recording and then a de-emphasis process applied at playback.

The first widely used audio noise reduction technique was developed by Ray Dolby in 1966. Intended for professional use, Dolby Type A was an encode/decode system in which the amplitude of frequencies in four bands was increased during recording (encoding), then decreased proportionately during playback (decoding). The Dolby B system (developed in conjunction with Henry Kloss) was a single band system designed for consumer products. In particular, when recording quiet parts of an audio signal, the frequencies above 1 kHz would be boosted. This had the effect of increasing the signal to noise ratio on tape up to 10 dB depending on the initial signal volume. When it was played back, the decoder reversed the process, in effect reducing the noise level by up to 10 dB. The Dolby B system, while not as effective as Dolby A, had the advantage of remaining listenable on playback systems without a decoder.

The Telefunken High Com integrated circuit U401BR could be utilized to work as a mostly Dolby B–compatible compander as well.[21] In various late-generation HighCom tape decks the Dolby-B emulating "D NR Expander" functionality worked not only for playback, but undocumentedly also during recording.

dbx was a competing analog noise reduction system developed by David E. Blackmer, founder of dbx laboratories.[22] It used a root-mean-squared (RMS) encode/decode algorithm with the noise-prone high frequencies boosted, and the entire signal fed through a 2:1 compander. dbx operated across the entire audible bandwidth and unlike Dolby B was unusable as an open ended system. However it could achieve up to 30 dB of noise reduction.

Since analog video recordings use frequency modulation for the luminance part (composite video signal in direct colour systems), which keeps the tape at saturation level, audio style noise reduction is unnecessary.

Dynamic noise limiter and dynamic noise reduction

Dynamic noise limiter (DNL) is an audio noise reduction system originally introduced by Philips in 1971 for use on cassette decks.[15] Its circuitry is also based on a single chip.[23][24]

It was further developed into dynamic noise reduction (DNR) by National Semiconductor to reduce noise levels on long-distance telephony.[25] First sold in 1981, DNR is frequently confused with the far more common Dolby noise reduction system.[26] However, unlike Dolby and dbx Type I & Type II noise reduction systems, DNL and DNR are playback-only signal processing systems that do not require the source material to first be encoded, and they can be used together with other forms of noise reduction.[27]

Because DNL and DNR are non-complementary, meaning they do not require encoded source material, they can be used to remove background noise from any audio signal, including magnetic tape recordings and FM radio broadcasts, reducing noise by as much as 10 dB.[28] They can be used in conjunction with other noise reduction systems, provided that they are used prior to applying DNR to prevent DNR from causing the other noise reduction system to mistrack.

One of DNR's first widespread applications was in the GM Delco car stereo systems in U.S. GM cars introduced in 1984.[29] It was also used in factory car stereos in Jeep vehicles in the 1980s, such as the Cherokee XJ. Today, DNR, DNL, and similar systems are most commonly encountered as a noise reduction system in microphone systems.[30]

Other approaches

A second class of algorithms work in the time-frequency domain using some linear or non-linear filters that have local characteristics and are often called time-frequency filters.[31] Noise can therefore be also removed by use of spectral editing tools, which work in this time-frequency domain, allowing local modifications without affecting nearby signal energy. This can be done manually by using the mouse with a pen that has a defined time-frequency shape. This is done much like in a paint program drawing pictures. Another way is to define a dynamic threshold for filtering noise, that is derived from the local signal, again with respect to a local time-frequency region. Everything below the threshold will be filtered, everything above the threshold, like partials of a voice or "wanted noise", will be untouched. The region is typically defined by the location of the signal Instantaneous Frequency,[32] as most of the signal energy to be preserved is concentrated about it.

Modern digital sound (and picture) recordings no longer need to worry about tape hiss so analog style noise reduction systems are not necessary. However, an interesting twist is that dither systems actually add noise to a signal to improve its quality.

Software programs

Most general purpose voice editing software will have one or more noise reduction functions (Audacity, WavePad, etc.). Notable special purpose noise reduction software programs include Gnome Wave Cleaner.

In images

Noise reduction by correlation

Images taken with both digital cameras and conventional film cameras will pick up noise from a variety of sources. Further use of these images will often require that the noise be (partially) removed – for aesthetic purposes as in artistic work or marketing, or for practical purposes such as computer vision.

Types

In salt and pepper noise (sparse light and dark disturbances), pixels in the image are very different in color or intensity from their surrounding pixels; the defining characteristic is that the value of a noisy pixel bears no relation to the color of surrounding pixels. Generally this type of noise will only affect a small number of image pixels. When viewed, the image contains dark and white dots, hence the term salt and pepper noise. Typical sources include flecks of dust inside the camera and overheated or faulty CCD elements.

In Gaussian noise, each pixel in the image will be changed from its original value by a (usually) small amount. A histogram, a plot of the amount of distortion of a pixel value against the frequency with which it occurs, shows a normal distribution of noise. While other distributions are possible, the Gaussian (normal) distribution is usually a good model, due to the central limit theorem that says that the sum of different noises tends to approach a Gaussian distribution.

In either case, the noise at different pixels can be either correlated or uncorrelated; in many cases, noise values at different pixels are modeled as being independent and identically distributed, and hence uncorrelated.

Tradeoffs

There are many noise reduction algorithms in image processing.[33] In selecting a noise reduction algorithm, one must weigh several factors:

  • the available computer power and time available: a digital camera must apply noise reduction in a fraction of a second using a tiny onboard CPU, while a desktop computer has much more power and time
  • whether sacrificing some real detail is acceptable if it allows more noise to be removed (how aggressively to decide whether variations in the image are noise or not)
  • the characteristics of the noise and the detail in the image, to better make those decisions

Chroma and luminance noise separation

In real-world photographs, the highest spatial-frequency detail consists mostly of variations in brightness ("luminance detail") rather than variations in hue ("chroma detail"). Since any noise reduction algorithm should attempt to remove noise without sacrificing real detail from the scene photographed, one risks a greater loss of detail from luminance noise reduction than chroma noise reduction simply because most scenes have little high frequency chroma detail to begin with. In addition, most people find chroma noise in images more objectionable than luminance noise; the colored blobs are considered "digital-looking" and unnatural, compared to the grainy appearance of luminance noise that some compare to film grain. For these two reasons, most photographic noise reduction algorithms split the image detail into chroma and luminance components and apply more noise reduction to the former.

Most dedicated noise-reduction computer software allows the user to control chroma and luminance noise reduction separately.

Linear smoothing filters

One method to remove noise is by convolving the original image with a mask that represents a low-pass filter or smoothing operation. For example, the Gaussian mask comprises elements determined by a Gaussian function. This convolution brings the value of each pixel into closer harmony with the values of its neighbors. In general, a smoothing filter sets each pixel to the average value, or a weighted average, of itself and its nearby neighbors; the Gaussian filter is just one possible set of weights.

Smoothing filters tend to blur an image, because pixel intensity values that are significantly higher or lower than the surrounding neighborhood would "smear" across the area. Because of this blurring, linear filters are seldom used in practice for noise reduction; they are, however, often used as the basis for nonlinear noise reduction filters.

Anisotropic diffusion

Another method for removing noise is to evolve the image under a smoothing partial differential equation similar to the heat equation, which is called anisotropic diffusion. With a spatially constant diffusion coefficient, this is equivalent to the heat equation or linear Gaussian filtering, but with a diffusion coefficient designed to detect edges, the noise can be removed without blurring the edges of the image.

Non-local means

Another approach for removing noise is based on non-local averaging of all the pixels in an image. In particular, the amount of weighting for a pixel is based on the degree of similarity between a small patch centered on that pixel and the small patch centered on the pixel being de-noised.

Nonlinear filters

A median filter is an example of a non-linear filter and, if properly designed, is very good at preserving image detail. To run a median filter:

  1. consider each pixel in the image
  2. sort the neighbouring pixels into order based upon their intensities
  3. replace the original value of the pixel with the median value from the list

A median filter is a rank-selection (RS) filter, a particularly harsh member of the family of rank-conditioned rank-selection (RCRS) filters;[34] a much milder member of that family, for example one that selects the closest of the neighboring values when a pixel's value is external in its neighborhood, and leaves it unchanged otherwise, is sometimes preferred, especially in photographic applications.

Median and other RCRS filters are good at removing salt and pepper noise from an image, and also cause relatively little blurring of edges, and hence are often used in computer vision applications.

Wavelet transform

The main aim of an image denoising algorithm is to achieve both noise reduction and feature preservation. In this context, wavelet-based methods are of particular interest. In the wavelet domain, the noise is uniformly spread throughout coefficients while most of the image information is concentrated in a few large ones.[35] Therefore, the first wavelet-based denoising methods were based on thresholding of detail subbands coefficients.[36] However, most of the wavelet thresholding methods suffer from the drawback that the chosen threshold may not match the specific distribution of signal and noise components at different scales and orientations.

To address these disadvantages, non-linear estimators based on Bayesian theory have been developed. In the Bayesian framework, it has been recognized that a successful denoising algorithm can achieve both noise reduction and feature preservation if it employs an accurate statistical description of the signal and noise components.[35]

Statistical methods

Statistical methods for image denoising exist as well, though they are infrequently used as they are computationally demanding. For Gaussian noise, one can model the pixels in a greyscale image as auto-normally distributed, where each pixel's "true" greyscale value is normally distributed with mean equal to the average greyscale value of its neighboring pixels and a given variance.

Let denote the pixels adjacent to the th pixel. Then the conditional distribution of the greyscale intensity (on a scale) at the th node is:

for a chosen parameter and variance . One method of denoising that uses the auto-normal model uses the image data as a Bayesian prior and the auto-normal density as a likelihood function, with the resulting posterior distribution offering a mean or mode as a denoised image.[37] [38]

Block-matching algorithms

A block-matching algorithm can be applied to group similar image fragments into overlapping macroblocks of identical size, stacks of similar macroblocks are then filtered together in the transform domain and each image fragment is finally restored to its original location using a weighted average of the overlapping pixels.[39]

Random field

Shrinkage fields is a random field-based machine learning technique that brings performance comparable to that of Block-matching and 3D filtering yet requires much lower computational overhead (such that it could be performed directly within embedded systems).[40]

Deep learning

Various deep learning approaches have been proposed to solve noise reduction and such image restoration tasks. Deep Image Prior is one such technique which makes use of convolutional neural network and is distinct in that it requires no prior training data.[41]

Software

Most general purpose image and photo editing software will have one or more noise reduction functions (median, blur, despeckle, etc.).

See also

General noise issues

Audio

Images and video

Similar problems

References

  1. Chen, Yangkang; Fomel, Sergey (November–December 2015). "Random noise attenuation using local signal-and-noise orthogonalization". Geophysics. 80 (6): WD1–WD9. Bibcode:2015Geop...80D...1C. doi:10.1190/GEO2014-0227.1. S2CID 120440599.
  2. Xue, Zhiguang; Chen, Yangkang; Fomel, Sergey; Sun, Junzhe (2016). "Seismic imaging of incomplete data and simultaneous-source data using least-squares reverse time migration with shaping regularization". Geophysics. 81 (1): S11–S20. Bibcode:2016Geop...81S..11X. doi:10.1190/geo2014-0524.1.
  3. Chen, Yangkang; Yuan, Jiang; Zu, Shaohuan; Qu, Shan; Gan, Shuwei (2015). "Seismic imaging of simultaneous-source data using constrained least-squares reverse time migration". Journal of Applied Geophysics. 114: 32–35. Bibcode:2015JAG...114...32C. doi:10.1016/j.jappgeo.2015.01.004.
  4. Chen, Yangkang; Chen, Hanming; Xiang, Kui; Chen, Xiaohong (2017). "Geological structure guided well log interpolation for high-fidelity full waveform inversion". Geophysical Journal International. 209 (1): 21–31. Bibcode:2016GeoJI.207.1313C. doi:10.1093/gji/ggw343.
  5. Gan, Shuwei; Wang, Shoudong; Chen, Yangkang; Qu, Shan; Zu, Shaohuan (2016). "Velocity analysis of simultaneous-source data using high-resolution semblance—coping with the strong noise". Geophysical Journal International. 204 (2): 768–779. Bibcode:2016GeoJI.204..768G. doi:10.1093/gji/ggv484.
  6. Chen, Yangkang (2017). "Probing the subsurface karst features using time-frequency decomposition". Interpretation. 4 (4): T533–T542. doi:10.1190/INT-2016-0030.1.
  7. Huang, Weilin; Wang, Runqiu; Chen, Yangkang; Li, Huijian; Gan, Shuwei (2016). "Damped multichannel singular spectrum analysis for 3D random noise attenuation". Geophysics. 81 (4): V261–V270. Bibcode:2016Geop...81V.261H. doi:10.1190/geo2015-0264.1.
  8. Chen, Yangkang (2016). "Dip-separated structural filtering using seislet transform and adaptive empirical mode decomposition based dip filter". Geophysical Journal International. 206 (1): 457–469. Bibcode:2016GeoJI.206..457C. doi:10.1093/gji/ggw165.
  9. Chen, Yangkang; Ma, Jianwei; Fomel, Sergey (2016). "Double-sparsity dictionary for seismic noise attenuation". Geophysics. 81 (4): V261–V270. Bibcode:2016Geop...81V.193C. doi:10.1190/geo2014-0525.1.
  10. Chen, Yangkang (2017). "Fast dictionary learning for noise attenuation of multidimensional seismic data". Geophysical Journal International. 209 (1): 21–31. Bibcode:2017GeoJI.209...21C. doi:10.1093/gji/ggw492.
  11. https://web.archive.org/web/20200702165827/http://www.ant-audio.co.uk/Tape_Recording/Theory/Tandberg_Actilinear_Dyneq.pdf
  12. https://web.archive.org/web/20200702165605/http://sportsbil.com/tandberg/tcd-440a-tech.pdf
  13. Information, Reed Business (20 September 1979). "New Scientist".
  14. https://web.archive.org/web/20200702172103/https://www.nytimes.com/1984/09/02/arts/sound-a-standout-cassette-deck.html
  15. "High Com - the latest noise reduction system / Noise reduction - silence is golden" (PDF). elektor (UK) – up-to-date electronics for lab and leisure. Vol. 1981 no. 70. February 1981. pp. 2-04–2-09. Archived (PDF) from the original on 2020-07-02. Retrieved 2020-07-02. (6 pages)
  16. R., C. (1965). "Kompander verbessert Magnettonkopie". Radio Mentor (in German). 1965 (4): 301–303.
  17. Burwen, Richard S. (February 1971). "A Dynamic Noise Filter". Journal of the Audio Engineering Society. 19 (1).
  18. Burwen, Richard S. (June 1971). "110 dB Dynamic Range For Tape" (PDF). Audio: 49–50. Archived (PDF) from the original on 2017-11-13. Retrieved 2017-11-13.
  19. Burwen, Richard S. (December 1971). "Design of a Noise Eliminator System". Journal of the Audio Engineering Society. 19: 906–911.
  20. "Stereo Automat MK42 R-Player Budapesti Rádiótechnikai Gyár B".
  21. HIGHCOM - The HIGHCOM broadband compander utilizing the U401BR integrated circuit (PDF) (Semiconductor information 2.80). AEG-Telefunken. Archived (PDF) from the original on 2016-04-16. Retrieved 2016-04-16.
  22. Hoffman, Frank W. (2004). Encyclopedia of Recorded Sound. 1 (revised ed.). Taylor & Francis.
  23. "Noise Reduction". Audiotools.com. 2013-11-10.
  24. "Philips' Dynamic Noise Limiter". Archived from the original on 2008-11-05. Retrieved 2009-01-14.
  25. "Dynamic Noise Reduction". ComPol Inc.
  26. "History". Archived from the original on 2007-09-27. Retrieved 2009-01-14.
  27. "Audio Terms". Archived from the original on 2008-12-20. Retrieved 2009-01-14.
  28. "LM1894 Dynamic Noise Reduction System DNR". Archived from the original on 2008-12-20. Retrieved 2009-01-14.
  29. Gunyo, Ed. "Evolution of the Riviera - 1983 the 20th Anniversary". Riviera Owners Association. (NB. Originally published in The Riview, Vol. 21, No. 6, September/October 2005.)
  30. http://www.hellodirect.com/catalog/Product.jhtml?PRODID=11127&CATID=15295%5B%5D
  31. Boashash, B., ed. (2003). Time-Frequency Signal Analysis and Processing – A Comprehensive Reference. Oxford: Elsevier Science. ISBN 978-0-08-044335-5.
  32. Boashash, B. (April 1992). "Estimating and Interpreting the Instantaneous Frequency of a Signal-Part I: Fundamentals". Proceedings of the IEEE. 80 (4): 519–538. doi:10.1109/5.135376.
  33. Mehdi Mafi, Harold Martin, Jean Andrian, Armando Barreto, Mercedes Cabrerizo, Malek Adjouadi, “A Comprehensive Survey on Impulse and Gaussian Denoising Filters for Digital Images,” Signal Processing, vol. 157, pp. 236-260, 2019.
  34. Liu, Puyin; Li, Hongxing (2004). Fuzzy Neural Network Theory and Application. Intelligent Robots and Computer Vision Xiii: Algorithms and Computer Vision. 2353. World Scientific. pp. 303–325. Bibcode:1994SPIE.2353..303G. doi:10.1117/12.188903. ISBN 978-981-238-786-8. S2CID 62705333.
  35. Forouzanfar, M.; Abrishami-Moghaddam, H.; Ghadimi, S. (July 2008). "Locally adaptive multiscale Bayesian method for image denoising based on bivariate normal inverse Gaussian distributions". International Journal of Wavelets, Multiresolution and Information Processing. 6 (4): 653–664. doi:10.1142/S0219691308002562. S2CID 31201648.
  36. Mallat, S. (1998). A Wavelet Tour of Signals Processing. London: Academic Press.
  37. Besag, Julian (1986). "On the Statistical Analysis of Dirty Pictures" (PDF). Journal of the Royal Statistical Society. Series B (Methodological). 48 (3): 259–302. doi:10.1111/j.2517-6161.1986.tb01412.x. JSTOR 2345426.
  38. Seyyedi, Saeed (2018). "Incorporating a Noise Reduction Technique Into X-Ray Tensor Tomography". J IEEE Transactions on Computational Imaging. 4 (1): 137–146. doi:10.1109/TCI.2018.2794740. JSTOR 17574903. S2CID 46793582.
  39. Dabov, Kostadin; Foi, Alessandro; Katkovnik, Vladimir; Egiazarian, Karen (16 July 2007). "Image denoising by sparse 3D transform-domain collaborative filtering". IEEE Transactions on Image Processing. 16 (8): 2080–2095. Bibcode:2007ITIP...16.2080D. CiteSeerX 10.1.1.219.5398. doi:10.1109/TIP.2007.901238. PMID 17688213. S2CID 1475121.
  40. Schmidt, Uwe; Roth, Stefan (2014). Shrinkage Fields for Effective Image Restoration (PDF). Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on. Columbus, OH, USA: IEEE. doi:10.1109/CVPR.2014.349. ISBN 978-1-4799-5118-5.
  41. Ulyanov, Dmitry; Vedaldi, Andrea; Lempitsky, Victor (30 November 2017). "Deep Image Prior". arXiv:1711.10925v2 [Vision and Pattern Recognition Computer Vision and Pattern Recognition].
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.