Open Access
9 May 2024 Perceptual thresholds for differences in CT noise texture
Author Affiliations +
Abstract

Purpose

The average (fav) or peak (fpeak) noise power spectrum (NPS) frequency is often used as a one-parameter descriptor of the CT noise texture. Our study develops a more complete two-parameter model of the CT NPS and investigates the sensitivity of human observers to changes in it.

Approach

A model of CT NPS was created based on its fpeak and a half-Gaussian fit (σ) to the downslope. Two-alternative forced-choice staircase studies were used to determine perceptual thresholds for noise texture, defined as parameter differences with a predetermined level of discrimination performance (80% correct). Five imaging scientist observers performed the forced-choice studies for eight directions in the fpeak/σ-space, for two reference NPSs (corresponding to body and lung kernels). The experiment was repeated with 32 radiologists, each evaluating a single direction in the fpeak/σ-space. NPS differences were quantified by the noise texture contrast (Ctexture), the integral of the absolute NPS difference.

Results

The two-parameter NPS model was found to be a good representation of various clinical CT reconstructions. Perception thresholds for fpeak alone are 0.2 lp/cm for body and 0.4 lp/cm for lung NPSs. For σ, these values are 0.15 and 2 lp/cm, respectively. Thresholds change if the other parameter also changes. Different NPSs with the same fpeak or fav can be discriminated. Nonradiologist observers did not need more Ctexture than radiologists.

Conclusions

fpeak or fav is insufficient to describe noise texture completely. The discrimination of noise texture changes depending on its frequency content. Radiologists do not discriminate noise texture changes better than nonradiologists.

1.

Introduction

The visual appearance of medical images is influenced by both their noise magnitude and noise texture. In fact, it is known that both can affect the detectability of small and low-contrast lesions.1 Although multiple factors affect noise magnitude, the noise texture in CT imaging is mainly influenced by the reconstruction method and reconstruction kernel used. In addition, with certain iterative CT reconstruction methods, the reconstructed images can appear to radiologists as nonnatural, i.e., plasticky or blotchy.25 This appearance is usually associated with a shift of the noise power spectrum (NPS) toward the lower frequencies compared with that of images obtained with filtered-back projection (FBP).6 This shift downward in the noise frequencies is mainly due to these algorithms achieving a reduction in the image noise by increasing the spatial correlation across voxels, especially in low-dose conditions. As expected, this increase in correlation can lead to a lowering of the spatial resolution of the image.7,8

Depending on the contrast, newly developed, deep-learning-based reconstruction (DLR) algorithms seem to be able to decouple this usual relationship between spatial resolution and noise texture from each other to a larger extent than that existing in current iterative reconstruction methods.9 This may allow for new opportunities to manipulate noise texture during reconstruction, improving the detectability of low-contrast lesions.

Therefore, with the increasing use of iterative reconstruction algorithms in CT and especially with the advent of deep-learning-based postprocessing, it is of interest to better understand the phenomenon of noise texture changes during reconstruction and postprocessing in CT. With this knowledge, it could be feasible to tune some of these algorithms to optimize the resulting image noise texture while maintaining the spatial resolution. This can be achieved by studying only the shape of the NPS, independently of its magnitude. However, to make these insights clinically relevant, it is necessary to first determine what changes in noise texture are actually perceptible by a human observer. Given the complexity of the human visual system, it is not immediately clear how sensitive humans are to noise texture differences. Therefore, it is of interest to characterize the minimum changes in the NPS shape that are needed for a human observer to detect a change in the image texture.

To be able to systematically study noise texture changes, a simple and continuous parametric model that describes the NPS change, and therefore noise texture, is needed. It is common to summarize the information of CT NPSs with one parameter, the frequency at which the NPS peaks (fpeak), or alternatively, the average NPS frequency (fav). However, it is clear that one parameter can provide only limited information on the frequency distribution of the noise texture. In other words, multiple different NPSs, all resulting in different noise textures, could have the same fpeak and/or fav. To overcome this, a more complete parametric representation of the CT NPS shape is needed.

Therefore, the purpose of this study is to introduce and validate a more complete parametric model of the NPS in CT and use that model to determine the detectability of changes in noise texture for human observers.

2.

Materials and Methods

To investigate the perceptual thresholds for noise texture changes, we created and evaluated a simple and continuous two-parameter model that describes the shape of the NPS of CT images. This model was then used in forced-choice psychophysical experiments using adaptive staircase methods to estimate the observer thresholds as a function of changes in the two parameters. To understand if these perceptual thresholds might be different for radiologists compared with nonradiologists, a limited version of the study was repeated with radiologists. Finally, to determine if the threshold changes varied based on differences in the reference NPS, these experiments were performed using two different reference noise textures, one for body and the other for lung reconstruction kernels.

2.1.

Modeling the Noise Texture

In CT, an NPS usually has a ramp dominating the lower frequencies and an apodization part dominating the higher frequencies.10 Previously, to model the full NPS, a six-parameter model of NPS was suggested:11

Eq. (1)

NPS(f)=a·fb·e|fcα|d2β2.

In this model, the parameter a controls the magnitude of the noise, and the other parameters primarily determine the shape of the NPS. However, having six parameters that can change alone or together results in a large number of possible changes and is impractical for use in an observer study. Therefore, we propose the simplification of the model to a three-parameter one. By evaluating the resulting NPS fits from one manufacturer, the values of b, c, and d were empirically determined and fixed to 1 for b and c and 2 for d:

Eq. (2)

NPS(f)=a·f·e(fα)22β2.

The applicability of this model for clinically available reconstruction kernels and reconstruction methods in CT for various vendors is tested.

If an NPS is described using Eq. (2), the peak frequency of this NPS is derived analytically as follows:

Eq. (3)

fpeak=α+α2+4β22.

To characterize the NPS independently of the fitting model used, we propose two parameters: one parameter that describes the upslope and one that describes the downslope. For the description of the upslope, we used fpeak because NPSs are supposed to monotonically increase to fpeak and this parameter is already often used to describe the NPS. For the downslope, we used the standard deviation (σ) of a half-Gaussian that is fitted through the downslope of the NPS, i.e., for all frequencies equal to or higher than fpeak, resulting in

Eq. (4)

g(f)=a·e(ffpeak)22σ2ffpeak,
where a determines the magnitude of the Gaussian and σ is its width. So σ was used as a single parameter to describe the NPS downslope. Because we are modeling only the shape of the NPS and not its overall magnitude, all modeled NPSs were set to unit area under the curve. An example NPS and its resulting parameterizations are shown in Fig. 1.

Fig. 1

Example of NPS parameterization. The original acquired NPS (in gray) is fit using the three-parameter model [Eq. (2), in dark blue]. The peak frequency (fpeak) is used to describe the first section of the NPS (in light blue). A half-Gaussian is fit through the section beyond fpeak [Eq. (4), dashed yellow]. The σ of that Gaussian describes the apodization part of the NPS (light gray).

JMI_11_3_035501_f001.png

Information on the testing on the applicability of this model is given in Appendix A.

2.2.

Generation of Patches with Various Noise Textures

Given a specific fpeak and σ, a continuous distribution of NPSs can be generated using Eq. (1) or Eq. (2). For a detailed description of the procedure used, see Appendix A. From the NPS resulting from these equations, a two-dimensional NPS (NPS2D) was created assuming that the NPS2D is radially symmetric. To be able to generate a specific noise texture, the generated NPS2D is applied to white noise as follows:

Eq. (5)

N(μ,σ)=F1{NPS2D·F{n(μ=0,σ=1)}},
where N is the resulting colored-noise image, F is the FFT operator, and n is a realization of white Gaussian noise with a mean value of μ and a standard deviation of σ. For the observer study, noise patches of 256×256  pixels were created.

2.3.

Noise Texture Contrast

If two noise textures and their corresponding NPSs are considered, the noise texture contrast (Ctexture) is calculated based on the contrast that an ideal observer is able to see. For the derivation of the ideal observer, see Appendix B. Effectively, the ideal observer looks at the absolute differences between the two NPSs. Therefore, the noise texture contrast is calculated from the NPS2D as

Eq. (6)

Ctexture=|NPS2D,1(f,ω)NPS2D,2(f,ω)|.

2.4.

Observer Study

To investigate the detectability of differences between two colored noise textures, a two alternative forced choice observer study was performed. For each realization of the experiment, three noise patches, created in real time, were shown to the observer. One was labeled the “reference” noise patch, one patch had another noise realization with the same NPS as the reference patch, and the third noise patch originated from an NPS with a different fpeak and/or a different σ. All noise patches were shown with a window level equal to the mean gray level and a window width of 10 (i.e., 10 times the SD). The task for the observer was to identify the patch that had the same noise texture as the reference noise patch (Fig. 2). After the observer made a choice, the correct patch was highlighted for one second and then the next trial was shown.

Fig. 2

Screenshot of a trial shown to the observer. The task of the observer was to select the alternative noise patch (option 1 or option 2) that has the noise texture most comparable to the one of the reference noise patch.

JMI_11_3_035501_f002.png

To determine the parameter values to use for the noise patches of the next trial, a staircase method was applied with a step size of 15% of the current value. The difference in the parameter values between the different NPSs was decreased after three correct responses and increased after one incorrect response.12 The trials were stopped after 12 reversals, and every series was executed 6 times. Per repetition, the geometrical mean over the trials from the last eight reversals was determined, which is an estimate of the 80% correct point on the psychometric curve.13 The average value of this 80% correct point from the last five repetitions was used as the detectability threshold.

For this study, two reference NPSs were chosen, one from a body kernel and one resulting from a lung kernel, determined from images of a 320 mm water phantom on a clinical wide-area CT system (Aquilion One PRISM edition, Canon Medical Systems Corporation, Otawara, Japan) using the dose determined by the automatic tube current modulation and a hybrid iterative reconstruction (HIR) method (AIDR 3D, Canon Medical Systems Corporation). The reference NPSs are shown in Fig. 3. The fpeak and σ values are 1.89 and 1.28  lp/cm for the body kernel and 4.64 and 1.83  lp/cm for the lung kernel.

Fig. 3

Reference noise power spectra (NPSs). The body NPS was obtained using an HIR (Hybrid-IR) method with body settings, and for the lung NPS, a Hybrid-IR with lung settings was used. The fpeak and σ values are 1.89 and 1.28  lp/cm for the body kernel and 4.64 and 1.83  lp/cm for the lung kernel.

JMI_11_3_035501_f003.png

During the observer study, the two reference NPSs were approached from eight directions, involving a change in fpeak only, a change in σ only, and simultaneous changes in both (all from higher and lower values) (see Fig. 4). The starting test values were determined by what was a clearly visible difference in noise texture for one of the investigators. Initially, five nonradiologist observers (PhD students in imaging science and medical physics trainees) completed studies to evaluate all 8 directions for both reference NPSs in multiple sessions. A maximum of 2 directions was performed in one session to prevent fatigue. All observers were able to complete each two-direction session within 1 h.

Fig. 4

All eight directions of changes in fpeak and/or σ that were investigated. Compared to the reference NPS, four directions change fpeak or σ only, and four directions involve a change in both fpeak and σ.

JMI_11_3_035501_f004.png

To investigate if radiologists are able to detect more subtle differences in noise texture, the experiment was repeated with 27 radiologists at the Medical Imaging Perception Lab at the European Congress of Radiology (ECR) 2023 and afterward with five radiologists from the Radboud University Medical Center. Due to the limitation in available time per radiologist, each radiologist only performed one of the reference NPS and direction combinations. Each series was performed only five times, of which the last four were used for the calculation of the geometrical mean. This led to results from two radiologists for each direction.

All experiments were performed in dimmed lighting conditions, comparable to diagnostic reading room conditions. The noise patches were shown on a DICOM GSDF calibrated diagnostic monitor (for the nonradiologist: Barco MDMC-12133 and for the radiologists: Barco MDNC-3321, Barco, Kortrijk, Belgium).

2.5.

Analysis of the Results

For each reference NPS, the threshold fpeak and σ values per nonradiologist observer and the average threshold over all nonradiologist observers were calculated for each direction. A threshold detectability boundary ellipse was fitted through the eight average threshold values using a least squares method.

The threshold noise texture contrast for each threshold condition, as well as their 95% confidence interval, was calculated per observer. For the two radiologists, only the limiting noise texture contrast was calculated. The radiologists were assumed to perform the same as the other observers if their threshold noise texture contrast was within the 95% confidence interval of the nonradiologist observers.

3.

Results

In Fig. 5, the fpeak and σ limiting values for each nonradiologist observer and the overall average are shown for both reference NPSs. Also the detectability threshold ellipse is shown. For the body NPS (fpeak: 1.89  lp/cm and σ: 1.28  lp/cm), the ellipse has the center close to the reference value, with fpeak=1.86  lp/cm and σ=1.30  lp/cm. The major radius of the ellipse makes an angle of 143 deg with the fpeak axis. Based on this elliptical fit, the detectability threshold fpeak is 0.2  lp/cm. Of course, this value changes if σ is changed simultaneously. For the lung NPS (fpeak: 4.64  lp/cm and σ: 1.83  lp/cm), the ellipse center is at fpeak=4.30  lp/cm and σ=2.46  lp/cm, and the major radius makes an angle of 120 deg. The corresponding threshold fpeak is 0.4  lp/cm. Therefore, the detection threshold for a change in fpeak is higher when using the lung NPS as the reference compared with the body NPS as reference.

Fig. 5

Results from the observer study. The 80% threshold limits for each observer, as well as the average values and the fitted detectability threshold ellipse, are shown. Results from the (a) body reference NPS and (b) lung reference NPS. The color of the background indicates the noise texture contrast.

JMI_11_3_035501_f005.png

The background of the two graphs in Fig. 5 shows the noise texture contrast (Ctexture) compared to the reference NPS. The lighter the color is, the higher the contrast is. For a changing fpeak, less Ctexture is needed to be perceptible to a human observer compared with changing the downslope. To make a change in texture perceptible, the most Ctexture is needed in the direction of lowering fpeak combined with increasing σ, or vice versa. The iso-fav line through the reference NPS shows that the average frequency is a better estimator for the visibility of noise texture changes than fpeak because the iso-fav line is more parallel to the major axis of the threshold ellipse, whereas the iso-fpeak line runs more closely to the minor axis of the ellipse. However, NPSs with the same average frequency can still be distinguishable from each other.

The noise texture contrast thresholds determined with radiologists show that radiologists have a noise texture contrast threshold within the 95% confidence interval of the nonradiologist observer results in 17 of the 32 experiments. In 12 cases, the radiologists had a noise texture contrast threshold above the 95% confidence interval of that from the nonradiologist observers. A detectability threshold could not be determined for two experiments because the observers would have needed a larger difference to be able to detect the correct noise texture than would be possible (σ would become negative). Table 1 and Fig. 6 show the individual radiologist results and the average results of the nonradiologist observers.

Table 1

80% detection thresholds for both body and lung NPS as a reference for the nonradiologist observers. Also the noise texture contrast (Ctexture) for the first and second radiologists is shown. If the Ctexture of the radiologist is in bold, this value is outside the 95% interval of the nonradiologist observers.

DirectionBodyLung
Δfpeak (lp/cm)Δσ (lp/cm)Ctexture (HU2)First radiologist Ctexture (HU2)Second radiologist Ctexture (HU2)Δfpeak (lp/cm)Δσ (lp/cm)Ctexture (HU2)First radiologist Ctexture (HU2)Second radiologist Ctexture (HU2)
10.18 ± 0.07n.a.0.014 ± 0.0060.0340.0180.40 ± 0.14n.a.0.019 ± 0.0060.0250.024
2−0.21 ± 0.11n.a.0.017 ± 0.0090.0230.020−0.40 ± 0.19n.a.0.019 ± 0.0090.0330.022
3n.a.0.16 ± 0.050.020 ± 0.0060.022an.a.0.78 ± 0.400.025 ± 0.0080.0310.045
4n.a.-0.15 ± 0.070.020 ± 0.0100.0450.029n.a.-0.37 ± 0.170.023 ± 0.0130.0720.032
50.06 ± 0.030.08 ± 0.040.015 ± 0.0080.0240.0170.18 ± 0.070.26 ± 0.100.017 ± 0.0050.0320.038
6−0.07 ± 0.04−0.09 ± 0.050.017 ± 0.0100.0350.028−0.15 ± 0.07−0.21 ± 0.100.019 ± 0.0100.0330.032
70.20 ± 0.16−0.26 ± 0.200.026 ± 0.0200.0410.0410.75 ± 0.32−1.10 ± 0.500.047 ± 0.0310.0560.056
8−0.30 ± 0.330.30 ± 0.330.027 ± 0.0310.0360.018−1.23 ± 0.471.73 ± 0.650.026 ± 0.0070.028a
n.a.: not applicable (in this direction, this parameter does not change compared to the reference).

aNo value available for the second radiologist for directions 3 (abdomen) and 8 (lung) because, for these observers, the difference had to be larger than possible for σ (σ would become negative).

Fig. 6

(a), (b) Results from the radiologists shown in conjunction with the average of the nonradiologist observers. In 17 of the 32 radiologist experiments, the results are within the 95% confidence intervals of the nonradiologist results.

JMI_11_3_035501_f006.png

4.

Discussion

Because noise texture influences detectability of lesions and new deep learning-based CT methods can more easily modify noise texture, it is of interest to study the effect of noise texture changes on the detectability of lesions. In this research, we focused on the detectability of noise texture changes itself, hypothesizing that if an observer cannot detect differences in noise texture, then lesion detectability across these noise textures would be unaffected. We found the thresholds for detectability with varying fpeak and σ for two commonly used reconstruction kernels and showed that radiologists do not perform better than nonradiologist observers in detecting these differences. This may suggest that the sensitivity to changes in NPS is related to the human visual system. However, depending on the direction of the change, the intraobserver variability of this detectability threshold can be large. This is especially true in the direction of the major axis of the ellipse.

The change in fpeak and σ needed for the human observer to detect the difference varies for both conditions and for the directions within each condition. However, the average noise-texture contrast needed is roughly equivalent, except for the lung reference NPS in the direction of a higher fpeak and lower σ. In this case, the noise texture differences are concentrated at high spatial frequencies. This result may reflect limitations in human observers at high spatial frequencies. This would be consistent with models of texture discrimination that posit octave bandwidth filters that are broad band at higher spatial frequencies and therefore mix noise effects across a larger band of spatial frequencies.1416 However, we should note that these models have been developed for images that look very different from CT noise textures. Therefore, further validation is needed to determine if generalization to CT noise is applicable.

A slightly higher detectability threshold was found for radiologists compared with the nonradiologist observers. This might be caused by the fact that the evaluation with radiologists was performed in only one change direction, so they were less used to the task compared with the nonradiologists, who did all directions. In addition, radiologists only performed five repetitions, and the nonradiologist observers performed six. However, we did not find that the last repetition of the nonradiologist observers was better than the first five.

This research is a first step in the investigation of the effect of noise texture from nonlinear reconstruction methods on the perception of lesions in clinical CT images. Further research is needed to include the visibility of lesions with different noise textures and, eventually, with the inclusion of anatomical background. The latter is needed not only due to its interference with the detectability of lesions but also because the noise texture from nonlinear reconstructions is probably different from that in homogeneous backgrounds, potentially also breaking the assumption that the noise is radially symmetrical. However, in this first initial study, we aimed to determine what differences in noise texture, as characterized by differences in NPS, are actually detectable by the human visual system, so the follow-up studies could be performed with meaningful noise texture differences.

Our study has several limitations. First, the number of observers was limited. Future research might involve increasing the number of observers to better estimate the average thresholds as well as their variability for the various directions. Also evaluating only eight directions of change is quite limited considering that six parameters are needed to describe an ellipse. Hence, evaluating more directions could provide a better estimation of the limiting ellipse. In addition, we used only two reference NPSs. Although these NPSs are used often for lung and body exams, acquisitions for bone and brain result in NPSs having different fpeak and σ. Also other reconstruction techniques, such as model-based iterative reconstructions (MBIR) or DLR, as well as reconstructions from other vendors, will result in different NPS shapes. Finally, the underlying noise distribution used was Gaussian, although in recent studies, we are seeing that nonnormal CT noise distributions can be discriminable from NPS-matched normal noise distributions.17 To further study this effect, similar studies as this one are needed; however, these should not change the NPS but change the underlying noise distribution. Next, just as a follow-up for this study, the effect on lesion detection should be studied.

5.

Conclusions

Human observers showed different sensitivity to changes in CT noise textures based on peak frequency (fpeak), and the downslope of the NPS (σ) alone and in combination. Radiologists did not detect these textural changes any better than nonradiologist observers. Describing NPS using only the fpeak or the fav alone was insufficient to describe perceived differences in CT noise texture. The presented model using fpeak and σ can serve as a starting point to better describe noise texture and to further study the impact of CT noise texture on human task performance.

6.

Appendix A: Verification of the NPS Model in CT

To obtain a wide representation of CT NPS curves, water phantom images were acquired using CTs from four vendors and different reconstruction techniques. These NPSs were modeled and parameterized, and the goodness-of-fit for the models and appropriateness of the parameters was determined.

6.1.

NPS Acquisition

NPS data were acquired on four CT systems from four different vendors (Canon Medical Systems, GE HealthCare, Philips healthcare, Siemens Healthineers). A 320-mm diameter water phantom was imaged using the settings used clinically for the abdomen protocol at the corresponding site and with a lower dose setting. The acquisitions were reconstructed using the clinically used kernel for abdomen, lung, brain, and bone, for the following reconstruction methods (if available): FBP, HIR, MBIR, and DLR. The slice thicknesses used were 0.5 mm for Canon, 0.625 mm for GE, and 1.0 mm for Philips and Siemens. For all but the FBP reconstruction, three strength settings were used, leading to a maximum of 80 reconstructions per CT system, if all reconstruction methods were available [2 dose levels, 4 kernels, 10 reconstruction methods (1 FBP + 3 strengths × 3 methods)]. Each NPS1d was calculated in a central ROI of 128×128  pixels using the method described by Boedeker et al.18 From each acquisition, a stack of at least 100 slices was used for the NPS calculation. For each slice, the NPS was calculated and averaged over all slices. This average NPS1d was normalized to have a unit area under the curve.

6.2.

NPS Data Analysis

The acquired NPS1d was fitted using the six-parameter model [Eq. (1)] and the three-parameter model [Eqs. (2) and (3)] by a least-squares method. To determine the fpeak, the NPS1d was first filtered using a low-pass filter at 4% of the full bandwidth. This prevented small local peaks from affecting the determination of fpeak. Finally, the three-parameter NPS1d was described with a two-parameter model [with Eq. (4)], and the fav was calculated.

To generate the two-parameter parameterized NPS, a procedure in Python was written using the curve_fit and minimize_scalar functions from the scipy.optimize package. The three parameterized NPSs were compared to the original NPS1d using the relative sum of absolute differences (RSAD):

Eq. (7)

RSAD=f|NPSparam(f)NPS1d(f)|NPS1d(f)×100%.

6.3.

Results

All acquired NPS shapes are shown in Sec. 6.4. Of the 194 acquired NPSs, 152 (78%) NPSs have a shape that has a ramp dominating the low frequencies and an apodization part that dominates the higher frequencies. 14 (7%) NPSs only have a ramp (no downslope), and 28 (14%) NPSs have a different shape altogether (e.g., multiple peaks).

In Tables 2Table 35, all values of fpeak, σ, fav, and the RSADs between the parameterized NPSs and the acquired NPSs are given. For several NPSs, there is no σ value because the NPS has no downslope within the Nyquist frequency. Two NPSs with the same fpeak or the same fav are shown in Fig. 7 together with their corresponding noise textures. As can be seen, the noise textures are clearly discernible, whereas fpeak or fav is the same. The combination of fpeak and σ do differ for these situations.

Table 2

Parameters and appropriateness of fit of the various NPSs acquired on a Canon Aquilion One PRISM Edition.

Canonfpeak (lp/cm)σ (lp/cm)fav (lp/cm)Normal dosefpeak (lp/cm)σ (lp/cm)fav (lp/cm)Low dose
Six-param. modelThree-param. modelModeled NPSSix-param. modelThree-param. modelModeled NPS
R2Diff. (%)R2RSAD (%)Diff. (%)R2RSAD (%)R2RSAD (%)RSAD (%)
FBPAbdomen2.631.522.891.001.231.002.573.552.661.512.921.002.371.002.743.65
Lung7.595.691.001.201.001.3833.797.59-5.711.001.581.001.2334.57
Brain2.382.323.241.001.041.002.243.092.472.293.251.001.071.002.031.97
Bone6.093.895.371.001.971.001.7319.966.812.455.391.001.821.001.5614.02
HIRAbdomen-L12.031.342.321.001.441.002.734.921.561.201.911.001.731.003.363.26
Abdomen-L21.911.302.211.001.631.002.865.051.471.161.821.001.881.003.303.32
Abdomen-L31.661.201.971.001.931.003.095.331.311.001.591.002.171.003.755.58
Lung-L14.191.753.961.001.000.986.3316.922.591.672.871.001.141.003.177.26
Lung-L23.532.053.721.001.210.985.216.452.031.812.651.001.331.002.132.22
Lung-L32.752.363.431.001.680.993.644.941.751.722.411.001.671.002.403.28
Brain-L11.911.492.351.001.271.002.373.671.471.261.921.001.611.002.812.83
Brain-L21.841.442.281.001.301.002.754.401.441.221.871.001.721.003.103.07
Brain-L31.591.322.061.001.731.003.343.491.311.081.681.002.111.003.684.07
Bone-L14.881.504.521.002.280.997.0518.303.561.483.461.001.430.995.9214.70
Bone-L24.561.664.331.001.840.987.5113.503.031.723.261.001.230.995.796.00
Bone-L34.381.734.091.001.130.977.8518.102.751.773.021.001.210.995.057.43
MBIRAbdomen-L11.781.412.291.004.141.005.6010.361.341.251.911.003.851.006.556.75
Abdomen-L21.471.302.041.004.671.006.528.601.131.121.701.003.841.007.767.47
Abdomen-L31.131.181.751.004.911.007.587.310.910.961.451.003.991.009.128.67
Lung-L12.381.802.911.003.960.995.216.782.091.402.401.005.570.995.8313.45
Lung-L21.841.472.311.004.960.995.4311.481.501.362.031.004.001.004.578.22
Lung-L31.311.331.901.003.451.004.268.590.841.251.641.003.651.006.737.91
Brain-L10.840.831.271.004.201.008.087.160.690.651.061.005.291.0010.399.74
Brain-L20.690.651.061.004.830.9910.5310.430.560.510.891.007.550.9912.6516.44
Brain-L30.500.490.841.009.390.9912.5714.960.440.380.701.0011.790.9915.0827.49
Bone-L16.092.925.051.000.851.001.966.076.063.255.061.001.431.001.981.97
Bone-L24.562.844.471.001.681.002.032.943.632.113.770.993.930.994.7910.25
Bone-L32.381.913.011.003.900.994.845.222.131.552.601.004.981.005.419.31
DLRAbdomen-L11.911.722.371.002.320.986.4014.801.251.561.911.003.490.969.979.03
Abdomen-L21.221.852.140.993.550.969.199.361.191.411.731.003.690.9512.7911.80
Abdomen-L30.341.851.640.987.410.8916.8187.070.341.521.350.979.850.8722.2170.89
Lung-L14.031.793.830.993.910.948.9716.522.811.622.971.002.230.978.3510.75
Lung-L23.562.023.550.918.600.7813.7316.412.061.912.600.993.270.9312.6011.17
Lung-L30.314.483.120.899.680.6116.71144.531.251.612.011.002.020.995.425.15
Brain-L11.721.212.021.002.111.003.446.001.381.101.731.002.211.003.623.64
Brain-L21.661.161.951.002.281.003.855.781.381.091.721.002.301.003.934.04
Brain-L31.661.121.911.002.361.004.116.941.381.081.721.002.381.004.154.32
Bone-L14.131.743.751.001.260.9310.2925.233.281.673.161.001.660.9610.3821.86
Bone-L23.501.973.331.001.490.8614.6825.561.162.542.550.993.730.9512.7726.81
Bone-L30.593.332.730.976.810.8319.5091.191.131.942.130.995.470.997.027.39
RSAD: Relative sum of absolute differences between the acquired and the modeled NPS. Modeled NPS: NPS determined based on the two-parameter model, using fpeak and σ from the acquired NPS.

Table 3

Parameters of the various NPSs acquired on a GE Discovery CT750 HD.

GEfpeak (lp/cm)σ (lp/cm)fav (lp/cm)Normal dosefpeak (lp/cm)σ (lp/cm)fav (lp/cm)Low dose
six-param. modelThree-param. modelModeled NPSsix-param. modelThree-param. modelModeled NPS
R2RSAD (%)R2RSAD (%)RSAD (%)R2RSAD (%)R2RSAD (%)RSAD (%)
FBPAbdomen2.591.692.780.877.770.809.7611.102.591.702.780.876.940.819.7011.25
Lung5.001.564.781.001.890.994.289.395.061.524.791.001.880.994.5012.27
Brain3.251.783.240.895.900.848.4615.183.281.773.240.906.050.848.4416.35
Bone7.635.500.987.900.995.5728.907.635.520.988.270.995.7129.96
HIRAbdomen-L12.531.612.590.8110.470.7210.7218.131.911.962.590.8211.000.7310.8115.41
Abdomen-L21.811.622.210.7215.550.5814.0414.751.811.622.210.7215.550.5814.1414.95
Abdomen-L31.781.291.900.6620.300.4817.3023.981.781.281.900.6620.480.4817.6324.42
Lung-L14.811.644.651.002.020.994.107.414.911.584.651.001.980.994.3311.68
Lung-L24.311.824.350.991.940.993.193.974.381.794.360.991.940.993.313.97
Lung-L33.941.864.060.992.920.992.712.883.911.884.050.992.970.992.713.29
Brain-L12.632.013.030.857.760.788.608.772.592.043.030.857.670.788.598.82
Brain-L21.882.032.620.7710.880.6710.7111.791.882.032.610.7711.060.6710.8011.49
Brain-L31.811.652.270.7314.080.6013.3714.601.841.632.270.7414.450.6013.5415.29
Bone-L17.63-5.330.978.300.987.2222.257.635.350.993.740.987.4823.29
Bone-L27.66-4.940.937.240.9110.2711.597.634.960.937.530.9110.6412.19
Bone-L37.66-4.550.787.220.7410.9921.457.664.570.787.360.7411.4018.40
RSAD: Relative sum of absolute differences between the acquired and the modeled NPS. Modeled NPS: NPS determined based on the two-parameter model, using fpeak and σ from the acquired NPS.

Table 4

Parameters of the various NPSs acquired on a Philips iCT 256.

Philipsfpeak (lp/cm)σ (lp/cm)fav (lp/cm)Normal dosefpeak (lp/cm)σ (lp/cm)fav (lp/cm)Low dose
six-param. modelThree-param. modelModeled NPSsix-param. modelThree-param. modelModeled NPS
R2RSAD (%)R2RSAD (%)RSAD (%)R2RSAD (%)R2RSAD (%)RSAD (%)
FBPAbdomen2.562.013.061.002.190.994.304.482.562.043.050.993.110.985.505.54
Lung5.441.414.771.001.870.9710.2932.065.471.404.791.002.260.9610.7131.86
Brain2.341.622.661.001.910.994.255.632.411.602.691.001.860.995.016.17
Bone5.441.414.771.001.870.9710.2932.066.001.235.271.003.740.9810.5934.11
HIRAbdomen-L11.972.192.861.002.370.994.207.241.912.132.751.002.700.994.745.45
Abdomen-L21.972.132.771.002.750.994.294.091.812.082.661.003.110.993.633.50
Abdomen-L31.882.052.631.003.350.994.116.441.781.952.470.994.350.976.2110.43
Lung-L15.281.534.631.001.100.9510.7830.025.341.504.541.001.170.9410.8437.35
Lung-L25.411.444.591.001.240.9312.4838.665.411.464.491.001.430.9212.2143.14
Lung-L35.381.464.591.002.810.9312.8637.205.341.514.480.993.450.9013.0839.77
Brain-L11.941.702.481.002.210.994.114.211.911.622.391.002.750.995.105.15
Brain-L21.841.692.391.002.580.994.234.151.811.602.291.003.120.995.366.62
Brain-L31.751.612.251.003.150.994.647.241.691.532.150.993.800.995.709.93
Bone-L16.001.225.211.002.610.9810.1138.415.971.275.151.002.750.9710.1737.99
Bone-L26.001.225.191.003.490.9711.2938.636.031.225.131.003.570.9611.6943.36
Bone-L36.001.215.211.003.420.9711.9238.246.031.215.151.002.210.9612.1341.97
MBIRAbdomen-L10.532.822.460.969.810.7243.0577.030.531.401.870.9311.950.8646.1449.45
Abdomen-L20.501.151.730.9610.940.9044.4945.560.530.761.340.9125.220.8837.9740.49
Abdomen-L30.470.711.000.9712.600.9621.2523.220.340.730.850.9513.420.9127.3228.04
Lung-L16.031.284.700.939.610.8317.4656.955.841.504.22−2.3718.54−0.0322.8563.32
Lung-L25.911.384.29−1.7721.040.0623.8565.660.448.783.62−0.5520.04−2.4435.01124.60
Lung-L30.441.552.050.968.710.8454.8265.830.310.911.440.979.480.8453.8257.10
Brain-L10.382.141.890.9614.640.7734.7384.700.471.511.580.9711.210.8834.5650.12
Brain-L20.341.291.650.9812.520.8265.5165.860.440.641.300.9612.760.9341.9942.21
Brain-L30.340.751.160.989.270.9144.3540.500.440.540.980.9614.090.9528.5633.53
RSAD: Relative sum of absolute differences between the acquired and the modeled NPS. Modeled NPS: NPS determined based on the two-parameter model, using fpeak and σ from the acquired NPS.

Table 5

Parameters of the various NPSs acquired on a Siemens Somatom Force.

Siemensfpeak (lp/cm)σ (lp/cm)fav (lp/cm)Normal dosefpeak (lp/cm)σ (lp/cm)fav (lp/cm)Low dose
six-param. modelThree-param. modelModeled NPSsix-param. modelThree-param. modelModeled NPS
R2RSAD (%)R2RSAD (%)RSAD (%)R2RSAD (%)R2RSAD (%)RSAD (%)
FBPAbdomen2.371.632.731.001.151.001.613.712.021.602.521.000.891.001.142.79
Lung4.681.664.521.001.971.001.834.264.181.624.181.002.041.002.002.33
Brain1.901.612.471.001.241.002.352.511.781.472.291.001.061.001.962.01
Bone7.13-5.511.001.371.004.0544.997.11-5.081.001.401.001.7221.34
MBIRAbdomen-L11.991.812.661.001.131.001.696.501.961.592.461.001.101.001.541.75
Abdomen-L21.781.772.471.001.631.002.113.381.701.582.271.001.511.002.332.21
Abdomen-L31.321.602.111.002.400.995.485.361.291.391.931.005.731.005.385.91
Lung-L14.391.864.481.002.031.001.998.214.211.604.131.002.151.002.086.49
Lung-L24.301.904.301.002.131.003.174.073.801.823.931.001.901.002.993.39
Lung-L33.922.053.721.001.590.974.8517.242.492.443.311.002.240.993.397.21
Brain-L11.841.572.401.001.131.002.732.581.781.402.221.001.221.002.913.59
Brain-L21.731.422.221.001.261.004.284.471.551.332.061.001.461.004.144.16
Brain-L31.581.211.981.001.761.005.395.781.461.141.861.001.861.004.324.37
Bone-L17.135.471.001.451.004.6143.517.118.345.021.001.121.001.8520.07
Bone-L27.135.300.989.090.996.7437.807.115.824.790.993.610.994.9513.18
Bone-L37.114.640.8416.620.8021.7328.707.084.010.559.450.2223.0730.50
RSAD: relative sum of absolute differences between the acquired and the modeled NPS. Modeled NPS: NPS determined based on the two-parameter model, using fpeak, and σ from the acquired NPS.

Fig. 7

NPSs with clearly different noise textures but with the same (a) peak frequency (1.26  lp/cm) and (b) average peak frequency (1.64  lp/cm). To avoid possible differences in appearance due to differences in higher order statistics in the original reconstructions, the noise textures were generated by applying the NPS to a realization of white noise.

JMI_11_3_035501_f007.png

The six-parameter NPS model can fit the acquired NPSs with a RSAD smaller than 10% for FBP, HIR, MBIR, and DLR in 100%, 92%, 71%, and 100% of the cases, respectively. For the three-parameter model, this drops to 88%, 69%, 45%, and 54%, respectively. For a more elaborate overview, see Table 6. The NPSs modeled by the parameters fpeak and σ using the three-parameter model yielded a modeled NPS that, over all manufacturers, resulted within 20% RSAD for FBP, HIR, MBIR, and DLR in 69%, 78%, 54%, and 67% of all NPSs, respectively (Table 6).

Table 6

Percentages of NPS parameterizations with a relative sum of absolute differences with the acquired NPS below 10%, between 10% and 20%, and above 20%, per manufacturer and reconstruction type. The first number in each cell is the percentage for the six-parameter model and the second number is for the Three-parameter model. Some reconstruction types were not available for some manufacturers.

Reconstruction typeDeviationManufacturer
CanonGEPhilipsSiemensAll
FBP<10%100/100100/100100/50.0100/100100/87.5
>10%, <20%0/00/00/50.00/00/12.5
>20%0/00/00/00/00/0
HIR<10%100/10075.0/41.7100/50.0Not available91.7/68.9
>10%, <20%0/016.7/58.30/50.05.2/28.9
>20%0/08.3/00/03.1/2.1
MBIR<10%100/79.2Not available33.3/091.7/91.771.4/45.2
>10%, <20%0/20.850.0/5.64.2/021.4/14.3
>20%0/016.7/94.44.2/8.37.1/40.5
DLR<10%100/54.2Not availableNot availableNot available100/54.2
>10%, <20%0/41.70/41.7
>20%0/4.20/4.2

6.4.

NPS Shapes

NPS shapes for the images obtained for the various manufacturers, reconstruction techniques, and kernels. The acquired NPS and the fitted three-parameter model are shown below.

JMI_11_3_035501_f008.png
JMI_11_3_035501_f009.png
JMI_11_3_035501_f010.png
JMI_11_3_035501_f011.png
JMI_11_3_035501_f012.png
JMI_11_3_035501_f013.png
JMI_11_3_035501_f014.png
JMI_11_3_035501_f015.png
JMI_11_3_035501_f016.png
JMI_11_3_035501_f017.png
JMI_11_3_035501_f018.png
JMI_11_3_035501_f019.png
JMI_11_3_035501_f020.png
JMI_11_3_035501_f021.png
JMI_11_3_035501_f022.png
JMI_11_3_035501_f023.png

Notes

RSAD: Relative sum of absolute differences

The following reconstruction techniques were not available: Siemens HIR and DLR. GE MBIR and DLR. For Philips MBIR Bone, the hospital uses the same reconstruction kernel as for MBIR Lung.

7.

Appendix B: Ideal Observer

We consider a binary discrimination task to discriminate between two classes of Gaussian-distributed images defined by different power spectra. Let g represent the image pixel values as a column vector. The two hypotheses for the discrimination task are

H1:  gMVN(μ,Σ1),H2:  gMVN(μ,Σ2),
where the textural differences are entirely represented in the image covariance matrix (Σ1 versus Σ2) and there is no difference in the image mean (μ).

7.1.

Ideal Observer

The ideal observer test statistic is based on the log-likelihood ratio. The likelihood of a hypothesis (given an image stimulus) is determined from a multivariate normal distribution:

p(g|Hi)=1(2π)M/2|Σi|1/2e12((gμ)TΣi1(gμ)),
where M is the number of pixels in the image and || represents the determinant of the matrix argument. Calling this a likelihood (instead of a probability density function) means that we consider g to be given (i.e., the independent variable) and Hi to be unknown (the dependent variable). The log-likelihood ratio is then given as
λ(g)=log(p(g|H2)p(g|H1))=12((gμ)T(Σ21Σ11)(gμ)+log(|Σ2|)log(|Σ1|))(gμ)T(Σ11Σ21)(gμ).

The last line is equivalent to the log-likelihood ratio, with removing the terms that do not affect the performance.

To evaluate the ideal observer in this case, we need to be able to compute the inverse of the class covariance matrices. This is where textures defined by an NPS can make the computations much easier.

7.2.

Frequency Domain Computation

If the different image textures may be considered to be realizations of a stationary random process, then their covariance matrices are diagonalized by the Fourier basis:

Σi=F1SiFSi=FΣiF1,
where Si is a diagonal matrix representing the noise power spectrum and F is the finite (usually 2D) Fourier transform matrix. So the product Fg would be the FFT of image g. Because of the properties of the FFT, we have F1=1MF*, where the superscript * means the transpose conjugate (sometimes called the Hermitian or adjoint operator). So we have
Si1=FΣi1F1=1MFΣi1F*.

We can use the spectral decomposition of the covariance matrices in the likelihood ratio to recast the ideal observer formula in the Fourier transform domain. Let the caret represent a Fourier transform (i.e., g^=Fg). We write gμ as F1(g^μ^) to get

λ(g)=(gμ)*(Σ11Σ21)(gμ)=(F1(g^μ^))*(Σ11Σ21)F1(g^μ^))=1M2(gμ)*(FΣ11F*FΣ21F*)(g^μ^).

Note that we rewrote (gμ)T as (gμ)*. This is appropriate because the quantity is real, and therefore the Hermitian is equivalent to the transpose. Based on the formula for the inverse spectrum above, we get

λ(g)=1M(g^u^)*(S11S21)(g^u^).

Because the power-spectrum matrices are diagonal, their inverses are as well, so this quadratic form can be written as a sum

λ(g)=1Mk=1M(1S1[k,k]1S2[k,k])|g^[k]μ^[k]|2.

This sum can be problematic if any of the spectral elements are 0, which usually happen from power spectra that are estimated from samples. It may be advisable to regularize the power-spectrum inversion to get

λReg(g)=1Mk=1M(1S1[k,k]+ε21S2[k,k]+ε2)|g^[k]μ^[k]|2,
where ε represents the variance of discretization “noise.”

Disclosures

Ioannis Sechopoulos has Research Agreements with Siemens Healthcare, Canon Medical, ScreenPoint Medical, Sectra Benelux, Volpara Healthcare, Lunit, and iCAD. He also has a Speaker Agreement with Siemens Healthcare. Craig Abbey is an occasional consultant for Canon Medical Systems USA and Izotropic Corporation LLC, where he also holds stock options. Kirsten Boedeker and Daniel Shin are both Employees of Canon Medical Systems Corporation.

Code and Data Availability

All NPS shapes and fits are available in Appendix A of this article as well as all fpeak and σ values. The numeric values of the NPS shapes are available on request and with permission from the hospitals from which these data are retrieved. The code to calculate the fits and the fpeak and σ from an NPS is available at GitHub ( https://github.com/radboud-axti/NPS_peak_sigma).

Acknowledgments

We would like to thank Marcel Greuter and Jan Braaksma from the University Medical Center Groningen; Frank Bastin, Janneke Jansen, and Inge Röttgers from CWZ Hospital Nijmegen; and Rob Houben, Nathalie Jochems, and Marieke Beelen from Bernhove Hospital Uden for aiding in the acquisition of data from the various CT systems. We also would like to thank the organizers of the Perception Lab event at the 2023 ECR conference and all participating radiologists as well as the initial nonradiologist observers. Part of this work was presented at SPIE Medical Imaging Conference 2022 and published in the SPIE conference proceedings ( https://doi.org/10.1117/12.2610736).

References

1. 

J. T. Bushberg et al., “Noise and noise texture,” The Essential Physics of Medical Imaging., 89 –97 4th ed.Wolters Kluwer( (2020). Google Scholar

2. 

M. J. Willemink et al., “Iterative reconstruction techniques for computed tomography. Part 1: Technical principles,” Eur. Radiol., 23 (6), 1623 –1631 https://doi.org/10.1007/s00330-012-2765-y (2013). Google Scholar

3. 

G. Saiprasad et al., “Evaluation of low-contrast detectability of iterative reconstruction across multiple institutions, CT scanner manufacturers, and radiation exposure levels,” Radiology, 277 (1), 124 –133 https://doi.org/10.1148/radiol.2015141260 RADLAX 0033-8419 (2015). Google Scholar

4. 

W. Stiller, “Basics of iterative reconstruction methods in computed tomography: a vendor-independent overview,” Eur. J. Radiol., 109 (Aug.), 147 –154 https://doi.org/10.1016/j.ejrad.2018.10.025 EJRADR 0720-048X (2018). Google Scholar

5. 

Y. J. Ryu et al., “Knowledge-based iterative model reconstruction: comparative image quality and radiation dose with a pediatric computed tomography phantom,” Pediatr. Radiol., 46 (3), 303 –315 https://doi.org/10.1007/s00247-015-3486-6 PDRYA5 1432-1998 (2016). Google Scholar

6. 

T. P. Szczykutowicz et al., “A review of deep learning CT reconstruction: concepts, limitations, and promise in clinical practice,” Curr. Radiol. Rep., 10 (9), 101 –115 https://doi.org/10.1007/s40134-022-00399-5 (2022). Google Scholar

7. 

C. Park et al., “CT iterative vs deep learning reconstruction: comparison of noise and sharpness,” Eur. Radiol., 31 (5), 3156 –3164 https://doi.org/10.1007/s00330-020-07358-8 (2021). Google Scholar

8. 

J. Greffier et al., “Image quality and dose reduction opportunity of deep learning image reconstruction algorithm for CT: a phantom study,” Eur. Radiol., 30 3951 –3959 https://doi.org/10.1007/s00330-020-06724-w (2020). Google Scholar

9. 

“ECR 2020 book of abstracts,” Insights Imaging, 11 (S1), 34 https://doi.org/10.1186/s13244-020-00851-0 (2020). Google Scholar

10. 

J. M. Boone et al., “Report 87,” (2012). Google Scholar

11. 

R. Bujila, A. Fransson and G. Poludniowski, “Practical approaches to approximating MTF and NPS in CT with an example application to task-based observer studies,” Phys. Med., 33 16 –25 https://doi.org/10.1016/j.ejmp.2016.10.016 (2017). Google Scholar

12. 

T. N. Cornsweet, “The staircase-method in psychophysics,” Am. J. Psychol., 75 (3), 485 https://doi.org/10.2307/1419876 (1962). Google Scholar

13. 

L. G. Brown, “Additional rules for the transformed up-down method in psychophysics,” Percept. Psychophys., 58 (6), 959 –962 https://doi.org/10.3758/BF03205497 PEPSBJ 0031-5117 (1996). Google Scholar

14. 

J. Malik and P. Perona, “Preattentive texture discrimination with early vision mechanisms,” J. Opt. Soc. Am. A, 7 (5), 923 https://doi.org/10.1364/JOSAA.7.000923 JOAOD6 0740-3232 (1990). Google Scholar

15. 

Z. M. Westrick and M. S. Landy, “Pooling of first-order inputs in second-order vision,” Vision Res., 91 108 –117 https://doi.org/10.1016/j.visres.2013.08.005 VISRAM 0042-6989 (2013). Google Scholar

16. 

J. D. Victor, S. M. Rizvi and M. M. Conte, “Two representations of a high-dimensional perceptual space,” Vision Res., 137 1 –23 https://doi.org/10.1016/j.visres.2017.05.003 VISRAM 0042-6989 (2017). Google Scholar

17. 

K. L. Boedeker et al., “The 2022 AAPM annual meeting program—discriminability of non-Gaussian noise properties in CT and the limitation of the noise power spectrum to describe noise texture,” Med. Phys., 49 (6), e149 https://doi.org/10.1002/mp.15766 (2022). Google Scholar

18. 

K. L. Boedeker, V. N. Cooper and M. F. McNitt-Gray, “Application of the noise power spectrum in modern diagnostic MDCT: Part I. Measurement of noise power spectra and noise equivalent quanta,” Phys. Med. Biol., 52 (14), 4027 –4046 https://doi.org/10.1088/0031-9155/52/14/002 PHMBA7 0031-9155 (2007). Google Scholar

Biography

Luuk J. Oostveen has been working as a medical physicist in the Department of Medical Imaging at the Radboud University Medical Center since 2006. Since 2019, he has also been a PhD student in the same department. His research interests include current developments in CT imaging.

Biographies of the other authors are not available.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
Luuk J. Oostveen, Kirsten L. Boedeker, Daniel W. Shin, Craig K. Abbey, and Ioannis Sechopoulos "Perceptual thresholds for differences in CT noise texture," Journal of Medical Imaging 11(3), 035501 (9 May 2024). https://doi.org/10.1117/1.JMI.11.3.035501
Received: 7 November 2023; Accepted: 9 April 2024; Published: 9 May 2024
Advertisement
Advertisement
KEYWORDS
Computed tomography

Lung

Affine motion model

Reconstruction algorithms

Abdomen

Bone

Medical image reconstruction

Back to Top