|
1.IntroductionPhotoplethysmography (PPG) is a noninvasive optical technique for cardiovascular monitoring.1 In its simplest form, a PPG device comprises an illumination source (e.g., LED) and detector (e.g., photodiode). By monitoring the temporal illumination changes, PPG devices measure the pulsatile blood pulse waveform from transient local arterial volume fluctuations. This information can be used to monitor cardiovascular characteristics such as heart rate, heart rate variability, blood pressure, and cardiac output.1 However, existing contact-based systems are limited to single-location monitoring (e.g., finger), can only be used by one individual per device, and motion artifacts inhibit its use in ambulatory scenarios. Photoplethysmographic imaging (PPGI) systems are biophotonic systems that have recently gained interest for noncontact widefield cardiovascular monitoring of cardiac parameters such as heart rate, breathing rate, pulse oxygen saturation, and heart rate variability.2,3 Extending upon PPG theory, PPGI systems decouple the illumination source and detector. A camera is commonly used among PPGI systems as the illumination detector, enabling new types of monitoring such as spatial perfusion analysis4,5 and multi-individual monitoring.6 However, many systems use coarse spatial averaging to estimate cardiovascular perfusion, such as averaging over the facial bounding box6–8 and hardcoded regions.9–11 One study attained increased accuracy by incorporating spatial pulsatility priors;12 however, the method relies on a real-time estimate of the true physiological state based on the aforementioned coarse averaging techniques to achieve accurate prediction. In this paper, we developed a continuous probabilistic pulsatility model for importance-weighted blood pulse waveform extraction. The continuous model can be used by PPGI systems of any resolution through appropriate sampling to extract robust blood pulse waveforms. The model was developed using a data-driven approach over a 23 participant sample with highly varying characteristics (11/12 female/male, age 11 to 60 years, body fat 10.5% to 42.3%, muscle 31.0% to 52.7%, BMI 16.4 to ). Using blood pulse waveform spatial correlation priors, an importance weighting scheme was developed which assigned locations with consistently strong pulsatility a higher weight. Samples were aligned and aggregated in a common Cartesian space, and the continuous probabilistic pulsatility model was computed using a kernel density estimation approach. This method was compared against a whole-area uniform spatial averaging approach used by existing studies.6–8 Results showed that signals extracted using the pulsatility model were statistically significantly stronger in temporal (correlation, ) and spectral (SNR, ) characteristics than uniform averaging, and heart rates were in tight agreement with ground-truth measurements (, error , ). Model visualization elucidated important arterial pathways, including the neck, malar regions, glabella regions, lips, and nose. This work significantly extends a previous case study13 by developing a data-driven multiparticipant model rather than participant-specific weights, and presents more rigorous analysis across a larger participant sample. We discuss how the model can be used and trained for custom applications. 2.MethodsThe goal was to compute a continuous probabilistic spatial pulsatility model for use as a priori information in PPGI systems. By computing a continuous model, it can be used by datasets of any resolution through appropriate discrete spatial sampling. Figure 1 shows the processing pipeline to generate this pulsatility model. This study was approved by a Research Ethics Committee at the University of Waterloo and performed in accordance with the Declaration of Helsinki. Informed consent was obtained from all participants prior to data collection. Additionally, informed consent was obtained from those individuals whose photos were used in this paper. 2.1.Data CollectionA PPGI system, coded hemodynamic imaging,14,15 was used to collect the imaging data. The scene was illuminated with a diffuse uniform broadband tungsten-halogen illumination source using a glass fabric front diffuser (Lowel Rifa eX 44). Images were acquired at 60 fps using a monochrome CMOS camera (Point Grey GS3-U3-41C6NIR-C) with near-infrared (NIR) sensitivity. An NIR bandpass filter (850 to 1000 nm) was mounted in front of the lens to constrain the sensor measurements to deep NIR tissue penetration. Participants () were instructed to remain supine for the duration of the study. The camera was positioned overhead at 1.5 m from the participant’s head. The camera angle and field of view remained fixed across participants. Participants wore a finger PPG cuff, providing a ground-truth blood pulse waveform signal synchronously with the video frames. 2.2.Probabilistic Pulsatility ModelDifferent spatial locations exhibit varying amounts of observed pulsatility. For example, a skin location directly above a superficial artery will exhibit high pulsatility due to transient changes in local tissue optical properties from the arterial pulse,1,16 whereas occluding hair will inhibit the observed pulsatility. Let be the probabilistic pulsatility model such that quantifies the probability of observing arterial pulsatility at location . Using a physiologically motivated data-driven approach, can be computed by determining the locations that consistently exhibited pulsatility across a diverse sample of participants (see Table 1). Such a model can be used as an a priori model for new data when extracting cardiovascular properties. Table 1Sample demographics.
2.2.1.Absorbance mappingLet be a set of frames. The blood pulse waveform signal is typically represented by transient changes in absorbance rather than reflectance, thus was transformed to absorbance Each signal was then temporally detrended17 to normalize the illumination and eliminate respiratory-induced artifacts. 2.2.2.Correlation priorsA transformation was sought to map the set of absorbance frames to a pulsatility strength map describing pixelwise pulsatile components Rather than estimating the pulsatility based on heuristic information, which is participant-independent and may introduce uncontrolled sources of noise, this model can be augmented by incorporating prior information directly into the transformation where is the ground-truth blood pulse waveform. This was measured synchronously with the frames, and Pearson’s linear correlation coefficient was computed between the ground-truth signal and each pixel’s temporal signal where is the covariance between the pixel and ground-truth signals, and are the standard deviations of the pixel and ground-truth signal, respectively, and is the indicator function of positive real numbers. This computation is scale- and offset-independent, suitable for capturing the ratio nature of the blood pulse waveform.2.2.3.Importance-weighted kernel density estimationA physiologically derived importance-weighted scheme was developed to quantify the spatial pulsatility strength. An anatomical location that exhibited strong pulsatility contributed a larger weight than those that contained weak or no pulsing. This system allows for continuous kernel-based probabilistic pulsatility density estimation later. The importance map for participant was computed as To infer pulsatility patterns across the whole sample, the primary anatomical locations must be coaligned. The camera was systematically and consistently set up for all participants; however, differences in anatomy, minor rotation (relative to the camera), and translation (relative to the frame region) were observed. To correct for these relative distortions, the problem was posed as a coordinate mapping problem, where each participant’s weight data were projected into the coaligned pulsatility space . Mathematically where is a coordinate mapping function that maps from participant space to in the coaligned space . Note that this transformation projects points directly into a Cartesian space. There is no need for interpolation, which may have caused local inaccuracies. Implementation details of are discussed later (Sec. 2.3). This aggregate coaligned pulsatility space, , was populated with weighted points from each participant where are the coordinates in the participant space that project to in the aggregate space according to the mapping function .A resolution-agnostic model can be computed by estimating a continuous probability density function, and sampling this density function according to the given system’s resolution. A modified Parzen–Rosenblatt kernel density estimation method18,19 was used to estimate the continuous pulsatility probability density function where is the total number of points, is the spatial window width, , is the window kernel, and is a normalization term such that . The kernel was scaled according to the datum’s pulsatility weight . Using the two-dimensional (2-D) Gaussian kernel, the final probabilistic pulsatility model formulation becomes where is modeled as the spatial standard deviation. This pulsatility model can be used by systems of any resolution through appropriate discrete sampling where is the 2-D Dirac delta function and are the resolution periods in the coordinate space of . A physiologically derived blood pulse waveform can be extracted using the discretely sampled pulsatility mapFigure 2 shows a graphical depiction of this process. 2.3.Implementation DetailsThe model was built using 10 s segments for each participant. To reduce minute interparticipant spatial pulsatility differences, each frame was downsampled such that each pixel represented a area. We empirically found that mm worked well for Eq. (10). We implemented the mapping function as a linear projective transformation20 To solve matrix , a least-squares optimization was applied to fit fiducial markers selected on a set of frames to those same anatomical points on a template participant frame (eyes, nose, lips, chin, top, and side of head, and suprasternal notch; see Fig. 1). Specifically, the matrix was solved via a least squares solution of the following linear system of equations:3.Results3.1.SetupThe signals extracted using the proposed probabilistic pulsatility model were compared against those extracted using the FaceMean method used in existing PPGI studies.6–8,11 Briefly, the pixels within the facial region found using the Viola–Jones face detection method21 were spatially averaged for each frame and concatenated, yielding a one-dimensional temporal signal. A leave-one-out cross-validation scheme was implemented for extracting individual participant signals. That is, participant was processed using the pulsatility density learned with the data from participants , where is the set of all participants and is the set difference operator. The signals were temporally filtered using an ideal bandpass filter with bandwidth [0.5, 5] Hz (30 to 300 bpm). Temporal signal fidelity was evaluated by computing the maximum cross-correlation between the extracted signal and the ground-truth signal from the finger PPG cuff where is the covariance between the true and (shifted) extracted signal, and and are the standard deviations of the true and (shifted) extracted signal, respectively. Cross-correlation was used to account for pulsatility timing differences between the finger and face ().Spectral signal fidelity was evaluated by computing the spectral signal-to-noise ratio (SNR) of the extracted signal where and are the zero-DC normalized frequency magnitudes of the true and extracted signal, respectively, and represents frequency. The Wilcoxon signed rank test22 was used to statistically compare the nonnormally distributed pairwise difference between signals extracted using the proposed and FaceMean methods. Heart rate was estimated by the maximum frequency response of a modified spectral power density to reduce frequency discretization error3.2.Data AnalysisTable 1 provides a summary of the sample demographics measured using bioelectrical impedance analysis. The sample contained a wide range of ages (11 to 60 years), body compositions (BMI 16.4 to ), and fair gender representation (11/13 female/male). Visualizing the final probabilistic pulsatility model elucidated common arterial pathways. Figure 3 shows the (sampled) density across all participants, as well as split by gender. The faces appeared structurally similar to a typical human face, indicating accurate sample alignment based on anatomical anchor points. The forehead, eyebrows, eyes, nose, nostrils, lips, chin, and neck were all visually distinguishable. Characteristic differences existed between the female and male probabilistic faces; however, the primary anatomy was consistent, leading to a cohesive combined pulsatility map. Facial hair in the male data was observed as reduced pulsatility probability in those areas due to occlusion. The strongest observed arterial pathway traveled across both sides of the neck. Its anatomical location was consistent with the common carotid artery, which is the primary arterial pathway to the brain and face. The common carotid artery bifurcates into the external (facial) and internal (cerebral) carotid arteries at the top of the neck. The continued blood flow traveling through the external carotid artery across the jawline could be observed in female pulsatility map [Fig. 3(b)]. This phenomenon was not present in the male data [Fig. 3(c)], due to facial hair occlusion in some participants. Areas around mouth and chin exhibited reduced pulsatility compared to the female data due to facial hair occlusion. Common areas of with high pulsing probability across all participants in both genders included the carotid artery pathways, the lips, malar regions, nose, and glabella regions. Pulsing was additionally observed in females along the jaw line. Blood pulse waveform signals extracted using the probabilistic pulsatility model in a leave-one-out cross-validation scheme exhibited stronger temporal and spectral characteristics when compared to those extracted with the FaceMean method. Figure 4 shows the distributions of temporal correlation and SNR results across the participant sample. Combined, these measures provide signal fidelity information in both the time and frequency domains. Pairwise comparison of the signals extracted using the proposed and FaceMean methods showed that the proposed method yielded statistically significant correlation () and SNR () compared to FaceMean signals. Figure 5 shows the pairwise differences in correlation and SNR for each of the 23 participants. Temporal and spectral fidelity improvements were observed in a large number of participants. These results indicate that the probabilistic pulsatility model identified areas of strong pulsatility. Many PPGI studies investigate heart rate extraction algorithms for remote monitoring. To validate the proposed system’s ability to extract heart rate, correlation and agreement to the ground-truth heart rate were investigated. Figure 6 shows the Bland–Altman plot23 for this heart rate comparison. The proposed system not only attained high correlation to the true heart rate (), but it also achieved strong and tight agreement (error , ). This is consistent with the strong signal fidelity results in Fig. 4, which enable strong inferential properties of summary statistics such as heart rate. 4.DiscussionThe results demonstrated that using a spatial probabilistic pulsatility model can enhance blood pulse waveform extraction in PPGI data. The data-driven pulsatility model, which was constructed using data from a 23 participant sample with widely varying demographics, provides a priori anatomical guidance based on consistently observed pulsatile regions. The data-driven model is built using training data containing spatial pulsatility information. Thus, the new “test” data must sufficiently match the training data’s spatial perspective for proper alignment. Though advanced warping models can adjust for some inconsistencies, large deviations in rotation and perspective relative to the training data may produce erroneous alignment. In particular, different imaging and illumination perspectives may change the light–tissue interaction geometry (e.g., shadows), leading to varied photon migration paths. To address this challenge, the model was designed such that its methodology is agnostic to the type of data with which it is trained. Thus, independent training data sets can be used [ in Eq. (1)] to build custom pulsatility models suitable for the study’s specific test environment. For example, some systems may find that training based on rotated viewpoints or a certain class of demographic (e.g., gender or age) may yield increased results for their specific test environment. Additionally, this model can be trained on anatomical locations other than the head, enabling whole-body cardiovascular monitoring. The model use can be extended to detect abnormal perfusion patterns that may be early markers for disorders such as peripheral vascular disease or arteriosclerosis. The model was designed as a continuous model so that it can be sampled by systems of any resolution. To be used, all that is required is a coordinate mapping function [from Eq. (6)] for spatial alignment to the template model. In offline systems, this can be accomplished by manual or semiautomatic spatial alignment through various methods.24 In real-time systems, automatic alignment is required, and can be accomplished using methods such as automatic face fitting.25 In low-motion scenarios (e.g., sleeping studies and controlled experiments), a single alignment operation may be sufficient. In scenarios with increased motion, one strategy could be to calibrate the first frame to the template, and align all subsequent frames to the calibrated source frame, as it would have more similarities to frames within the same video than the model template. 5.ConclusionsHere, we have developed a continuous probabilistic pulsatility model that describes anatomical locations that consistently exhibited arterial pulsing across a 23 participant sample. This model can be used as a priori information to enhance signal extraction in PPGI systems. Since the model is a continuous model, it can be used by systems of any resolution via appropriate spatial sampling. Results showed that signals extracted using the pulsatility model exhibited statistically significant higher correlation and SNR versus unguided whole-area uniform averaging. Discretely sampled maps identified areas of consistently strong pulsing across the training data in the head, specifically the neck, malar regions, glabella regions, lips, and nose. Discussions demonstrated how the model may be used in custom environments for enhanced blood pulse waveform extraction. AcknowledgmentsThis work was supported by the Natural Sciences and Engineering Research Council (NSERC) of Canada (CGSD3-441805-2013), AGE-WELL NCE Inc. (AW-HQP2015-23), and the Canada Research Chairs program. ReferencesJ. Allen,
“Photoplethysmography and its application in clinical physiological measurement,”
Physiol. Meas., 28
(3), R1
–R39
(2007). http://dx.doi.org/10.1088/0967-3334/28/3/R01 PMEAE3 0967-3334 Google Scholar
Y. Sun and N. Thakor,
“Photoplethysmography revisited: from contact to noncontact, from point to imaging,”
IEEE Trans. Biomed. Eng., 63
(3), 463
–477
(2016). http://dx.doi.org/10.1109/TBME.2015.2476337 IEBEAX 0018-9294 Google Scholar
J. Allen and K. Howell,
“Microvascular imaging: techniques and opportunities for clinical physiological measurements,”
Physiol. Meas., 35
(7), R91
(2014). http://dx.doi.org/10.1088/0967-3334/35/7/R91 PMEAE3 0967-3334 Google Scholar
A. A. Kamshilin et al.,
“Photoplethysmographic imaging of high spatial resolution,”
Biomed. Opt. Express, 2
(4), 996
–1006
(2011). http://dx.doi.org/10.1364/BOE.2.000996 BOEICL 2156-7085 Google Scholar
A. A. Kamshilin et al.,
“Variability of microcirculation detected by blood pulsation imaging,”
PLoS One, 8
(2), e57117
(2013). http://dx.doi.org/10.1371/journal.pone.0057117 POLNCL 1932-6203 Google Scholar
M.-Z. Poh, D. J. McDuff and R. W. Picard,
“Non-contact, automated cardiac pulse measurements using video imaging and blind source separation,”
Opt. Express, 18
(10), 10762
–10774
(2010). http://dx.doi.org/10.1364/OE.18.010762 OPEXFF 1094-4087 Google Scholar
Y. Sun et al.,
“Motion-compensated noncontact imaging photoplethysmography to monitor cardiorespiratory status during exercise,”
J. Biomed. Opt., 16
(7), 077010
(2011). http://dx.doi.org/10.1117/1.3602852 JBOPFO 1083-3668 Google Scholar
Y. Sun et al.,
“Use of ambient light in remote photoplethysmographic systems: comparison between a high-performance camera and a low-cost webcam,”
J. Biomed. Opt., 17
(3), 037005
(2012). http://dx.doi.org/10.1117/1.JBO.17.3.037005 JBOPFO 1083-3668 Google Scholar
L. Kong et al.,
“Non-contact detection of oxygen saturation based on visible light imaging device using ambient light,”
Opt. Express, 21
(15), 17464
–17471
(2013). http://dx.doi.org/10.1364/OE.21.017464 OPEXFF 1094-4087 Google Scholar
D. McDuff, S. Gontarek and R. W. Picard,
“Remote detection of photoplethysmographic systolic and diastolic peaks using a digital camera,”
IEEE Trans. Biomed. Eng., 61
(12), 2948
–2954
(2014). http://dx.doi.org/10.1109/TBME.2014.2340991 IEBEAX 0018-9294 Google Scholar
X. Li et al.,
“Remote heart rate measurement from face videos under realistic situations,”
in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition,
4264
–4271
(2014). http://dx.doi.org/10.1109/CVPR.2014.543 Google Scholar
M. Kumar, A. Veeraraghavan and A. Sabharwal,
“DistancePPG: robust non-contact vital signs monitoring using a camera,”
Biomed. Opt. Express, 6
(5), 1565
–1588
(2015). http://dx.doi.org/10.1364/BOE.6.001565 BOEICL 2156-7085 Google Scholar
R. Amelard, D. A. Clausi and A. Wong,
“Spectral photoplethysmographic imaging sensor fusion for enhanced heart rate detection,”
Proc. SPIE, 9701 970113
(2016). http://dx.doi.org/10.1117/12.2213687 Google Scholar
R. Amelard et al.,
“A portable plug-and-play imaging system for physiological monitoring,”
J. Comput. Vision Imaging Syst., 1
(2015). http://dx.doi.org/10.15353/vsnl.v1i1.57 Google Scholar
R. Amelard et al.,
“Feasibility of long-distance heart rate monitoring using transmittance photoplethysmographic imaging (PPGI),”
Sci. Rep., 5 14637
(2015). http://dx.doi.org/10.1038/srep14637 Google Scholar
A. A. Kamshilin et al.,
“A new look at the essence of the imaging photoplethysmography,”
Sci. Rep., 5 10494
(2015). http://dx.doi.org/10.1038/srep10494 Google Scholar
M. P. Tarvainen, P. O. Ranta-aho and P. A. Karjalainen,
“An advanced detrending method with application to HRV analysis,”
IEEE Trans. Biomed. Eng., 49
(2), 172
–175
(2002). http://dx.doi.org/10.1109/10.979357 IEBEAX 0018-9294 Google Scholar
M. Rosenblatt,
“Remarks on some nonparametric estimates of a density function,”
Ann. Math. Stat., 27
(3), 832
–837
(1956). http://dx.doi.org/10.1214/aoms/1177728190 AASTAD 0003-4851 Google Scholar
E. Parzen,
“On estimation of a probability density function and mode,”
Ann. Math. Stat., 33
(3), 1065
–1076
(1962). http://dx.doi.org/10.1214/aoms/1177704472 AASTAD 0003-4851 Google Scholar
R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, Cambridge, UK
(2003). Google Scholar
P. Viola and M. J. Jones,
“Robust real-time face detection,”
Int. J. Comput. Vision, 57
(2), 137
–154
(2004). http://dx.doi.org/10.1023/B:VISI.0000013087.49260.fb IJCVEQ 0920-5691 Google Scholar
F. Wilcoxon,
“Individual comparisons by ranking methods,”
Biom. Bull., 1
(6), 80
–83
(1945). http://dx.doi.org/10.2307/3001968 Google Scholar
J. M. Bland and D. G. Altman,
“Statistical methods for assessing agreement between two methods of clinical measurement,”
Lancet, 327
(8476), 307
–310
(1986). http://dx.doi.org/10.1016/S0140-6736(86)90837-8 LANCAO 0140-6736 Google Scholar
R. Szeliski,
“Image alignment and stitching: a tutorial,”
FNT Comput. Graphics Vision, 2
(1), 1
–104
(2006). http://dx.doi.org/10.1561/0600000009 Google Scholar
A. Asthana et al.,
“Robust discriminative response map fitting with constrained local models,”
in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition,
3444
–3451
(2013). Google Scholar
BiographyRobert Amelard is a PhD candidate at the University of Waterloo, Canada, and a research associate at the Schlegel-UW Research Institute for Aging. He has published peer reviewed journal articles and six patents in the biomedical optics field. His research interests are in the development of biomedical optics technologies and biomedical image processing methods for spatial-temporal cardiovascular and physiological monitoring. He is a member of SPIE. David A. Clausi has been a professor at the University of Waterloo since 1999. He received his PhD in 1996 and then worked in medical imaging at Mitra Imaging. In 1997, he became an assistant professor at the University of Calgary. His research interests are in computer vision, and he has published extensively in this field. He started and sold a high-tech company and has won numerous awards, including two teaching excellence awards. Alexander Wong is the Canada research chair in medical imaging systems and an associate professor at the University of Waterloo. He received his BASc, MASc, and PhD degrees from the University of Waterloo in 2005, 2007, and 2010, respectively. He is the author of more than 300 journal articles, conference papers, book chapters, books, and patents. His current research interests include computational imaging, artificial intelligence, and computer vision. |