# Low-Power CMOS Laser Doppler Imaging Using Non-CDS Pixel Readout and 13.6-bit SAR ADC

Denis Guangyin Chen, Member, IEEE, Man-Kay Law, Member, IEEE, Yong Lian, Fellow, IEEE, and Amine Bermak, Fellow, IEEE

Abstract-Laser Doppler imaging (LDI) measures particle flows such as blood perfusion by sensing their Doppler shift. This paper is the first of its kind in analyzing the effect of circuit noise on LDI precision which is distinctively different from conventional imaging. Based on this result, it presents a non-correlated-double-sampling (non-CDS) pixel readout scheme along with a high-resolution successive-approximation-register (SAR) analog-to-digital-converter (ADC) with 13.6b effective resolution (ER). Measurement results from the prototype chip in 0.18  $\mu$ m technology confirm the theoretical analysis and show that the two techniques improve LDI sensing precision by 6.9 dB and 4.4 dB (compared to a 10b ADC) respectively without analog pre-amplification. The sensor's ADC occupies 518  $\mu m \times 84 \mu m$ and is suitable for fast column parallel readout. Its differential non-linearity (DNL), integral non-linearity (INL), and input referred noise are +3.0/-2.8 LSB, +24/-17 LSB, and 110  $\mu V_{rms}$ respectively, leading to a Figure-of-Merit (FoM) of 23 fJ/state which makes it one of the most energy efficient image sensor ADCs and an order of magnitude better than the best reported LDI system using commercial high-speed image sensors.

*Index Terms*—CMOS image sensor, correlated double sampling (CDS), flowmetry, laser Doppler imaging (LDI), perfusion, successive-approximation-register analog-to-digital-converter (SAR ADC), time-domain comparator.

## I. INTRODUCTION

T HE laser Doppler (LD) effect describes the difference in frequency—the Doppler Shift—between the incident light and its scattered parts from moving particles. The particle's velocity can be estimated by measuring the heterodyne in its back-scattered light. This technique, known as laser Doppler flowmetry (LDF), is used in a large number of industrial and biomedical instruments purposed to study liquid and gas dynamics [1], [2].

Manuscript received May 07, 2014; revised July 31, 2014 and September 26, 2014; accepted October 06, 2014. Date of publication December 18, 2014; date of current version February 22, 2016. This work was supported by the Hong Kong Research Grant Council under Grant 610412. This paper was recommended by Associate Editor J. Van der Spiegel.

D. G. Chen and A. Bermak are with the Smart Sensory Integrated Systems (S2IS) Lab, Electronic and Computer Engineering Department, Hong Kong University of Science and Technology, Kowloon, Hong Kong (e-mail: denischn@gmail.com; eebermak@ust.hk).

M.-K. Law is with the State Key Laboratory of Analog and Mixed-Signal VLSI, Macau University, Macau, China (e-mail: mklaw@umac.mo).

Y. Lian is with the Department of Electrical Engineering and Computer Science of Lassonde School of Engineering, York University, Toronto, ON M3J 1P3, Canada (e-mail: plian@yorku.ca).

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TBCAS.2014.2365515

LDF in biomedicine can provide accurate, non-invasive, noncontact, and instantaneous measurements of bodily fluid flow including blood perfusion and microcapillaries circulation [3], [4]. When combined with modeling and calibration, LDF can be extended to other monitoring applications such as blood pressure sensing [5]. The basic LDF apparatus consists of a laser source illuminating the blood vessel and a photo-detector to collect the back-scattered photons. This random back-scatter will create a time-varying interference pattern with a bandwidth on the order of 20 kHz. The first moment of this spectrum is an estimator of the flow-rate. Most research efforts on LDF to date including body tissue models, circuit, and device integration [6], [7] have focused on the single photo-detector configuration.

Laser Doppler imaging (LDI) is the 2D extension of LDF parallelized over an array of photo-detectors. It poses formidable new challenges due to its exponential increase in bandwidth [8]. Owing to the lack of suitable hardware platforms other than general-purpose high-speed image sensors [9], [10], most of the research on LDI has been confined to the system and algorithm level [1], [2], [11]. Standard video-rate charge-coupled-device (CCD) or complementary-metal-oxide-semiconductor (CMOS) image sensors can be used in laser speckle contrast analysis (LASCA) where the optical beating is sensed as speckles in the spatial domain instead of the temporal domain [12], its precision and image quality is inferior to LDI unless it is complemented by sophisticated modeling and costly imaging equipment [13].

Much of the current discussion in the literatures on LDI is focused on its system-level implementation. This paper aims to provide a circuit-level analysis on how LDI imposes a distinctively different set of noise requirements compared to general purpose imaging. Quantitative conclusions will be drawn on how ADC resolution and correlated-double-sampling (CDS) impact LDI instrumentation precision. A number of circuit and sensing techniques are described in this paper to enable faster and more precise LDI on a low-cost CMOS device at very low power consumption. A compact body-biased PMOS reset pixel structure is put forward to improve LDI precision by reducing reset-noise without CDS. A compact time-domain noise-averaging comparator is presented to satisfy the ADC resolution requirements of LDI while enjoying the energy efficiency of the SAR architecture. It also eliminates the need for a separate analog pre-amplifier which would otherwise add noise to the readout chain.

The relationship between LDI precision and circuit noise is discussed in Section II. The prototype LDI sensor and its operation is described in Section III. Measurement results are presented in Section IV before final conclusions are made in Section V.

1932-4545 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications standards/publications/rights/index.html for more information.

## 187

# II. LDI CONSIDERATIONS

# A. Principle of Operation

The velocity induced laser Doppler shift of particle flow is calculated from the power spectral density (PSD) of the backscattered light intensity signal

$$S(v) = \left|\mathbb{F}\left(V_{sig}(i)\right)\right|^2 + N_{tot}(v) \tag{1}$$

where  $\mathbb{F}$  is the discrete Fourier transform (DFT) of the sampled light intensity signal  $V_{sig}(i)$  over the time instances  $T, 2T, \ldots, iT, \ldots, NT$ , and  $N_{tot}(v)$  is the noise PSD. The moments of  $S(v), M_n$ , are

$$M_n = \sum_{v>0}^{F_s/2} v^n S(v).$$
 (2)

It is well known [1], [8], [12] that the particle concentration, c, and their perfusion, p, can be calculated from the zeroth,  $M_0$ , and first,  $M_1$ , moments respectively. The particle concentration, c, is proportional to the zeroth moment

$$c \propto M_0 / S(0) \tag{3}$$

and the perfusion, p, is proportional to the first moment

$$p \propto M_1 / S(0). \tag{4}$$

For most biomedical applications, a sampling rate of 40 kSa/s per pixel and N = 256 (256 FFT bins) has been shown to provide sufficient spectral resolution [8].

## B. Noise Behavior

This section provides an analysis on the effect that the noise PSD,  $N_{tot}(v)$ , has on perfusion, p. Only circuit noise is considered here since LDI is always performed under controlled laser illumination. Changes in  $M_1$  only reflects a change in the spectral bandwidth of S(v) and does not shift its base illumination level, so the pixel noise can be characterized from one set of illumination condition consistent with the LDI experiments. Additionally, it allows the noise analysis to be performed in the absence of a noise-free ideal LDI response which would otherwise be difficult to derive in closed form. The standard deviation of p,  $\sigma_p$ , is a function of

$$\overline{v_{rms,tot}^2} = \int_0^\infty N_{tot}(v) \mathrm{d}v = (1-\alpha)\overline{v_{N,w}^2} + \alpha \overline{v_{N,1/f}^2} \qquad (5)$$

where  $v_{rms,tot}^2$  is the total noise at the input of the LDI function,  $\overline{v_{N,1/f}^2}$  is the 1/f noise power weighted by  $\alpha$ , and  $\overline{v_{N,w}^2}$  is the white noise power (flat spectrum up to the Nyquist rate). If  $\alpha = 0$ ,  $\overline{v_{rms,tot}^2}$  is entirely white. It is difficult to directly simulate  $\sigma_p$  because S(0) itself is also a random variable. Fortunately, for large N (N is typically >128 for LDI, see Table III), S(0) can be considered to have very little variation (subject to only 1/Nth of  $\overline{v_{rms,tot}^2}$ 's noise power), and the behavior of  $\sigma_p$  can be estimated from the standard deviation of  $M_1$ ,  $\sigma_{M1}$ . Fig. 1 shows the simulated gain in  $\sigma_{M1}$  as a function of  $\alpha$  [17]. The



Fig. 1. The standard deviation of the first moment,  $\sigma_{M1}$ , as a function of input noise power. The variable  $\alpha$  represents the ratio of 1/f noise to the total noise power,  $P_{N,tot}$ . If  $\alpha = 0$ , the noise PSD is white.

# TABLE I

RMS VOLTAGE NOISE,  $\sqrt{v_{rms,tot}^2}$ , and  $\alpha$  Measured From On-Chip Test-Structures Under 256 Lux Illumination and 22  $\mu$ s Exposure

|                            | $\sqrt{v_{rms,tot}^2}$ (uV) | α    | $\sqrt{\overline{v_{weq}^2}}$ (uV) |
|----------------------------|-----------------------------|------|------------------------------------|
| Pixel pitch = 5.3 $\mu$ m  | 1030                        | 0.62 | 775                                |
| Pixel pitch = 10.6 $\mu$ m | 538                         | 0.66 | 393                                |

range of  $v_{rms,tot}^2$  is chosen based on Table I. There are two important features in Fig. 1. Firstly, the value of  $\sigma_{M1}$  increases at 40 dB/decade because  $\sigma_{M1}^2$  is the 4th moment of the RMS noise voltage. Secondly, white noise has a much bigger impact on  $\sigma_{M1}$  than 1/f noise. To achieve the same  $\sigma_{M1}$ ,  $v_{rms,tot}^2$  must be approximately 3.39 times bigger for  $\alpha = 1$  than for  $\alpha = 0$ . From this, the LDI process' white equivalent (WEQ) noise power,  $v_{weq}^2$ , can be estimated as

$$\overline{v_{weq}^2} = \overline{v_{N,w}^2} + \left(\overline{v_{N,1/f}^2}\right)/w \tag{6}$$

where the WEQ factor, w, is estimated to be 3.39 in this simulation. In other words,  $\sigma_{M1}$  is significantly more sensitive to white noise sources such as thermal noise and ADC quantization noise. Table I shows the measured  $\overline{v_{rms,tot}^2}$  and the estimated  $\overline{v_{weq}^2}$  and  $\alpha$  values from on-chip test-pixels. The 5.3  $\mu$ m pixel is identical to the one described in Section III-A and the 10.6  $\mu$ m pixel is simply a scaled version of the 5.3  $\mu$ m pixel. The values of  $\alpha$  are estimated by taking two noise measurements for each test pixel: one with CDS ( $\overline{v_{N,CDS}^2}$ ) and one without CDS ( $\overline{v_{N nCDS}^2}$ ), and applying

$$\overline{v_{N,w}^2} = \overline{v_{N,CDS}^2}/2\tag{7}$$

$$\overline{v_{N,1/f}^2} = \overline{v_{N,nCDS}^2 - v_{N,w}^2} \tag{8}$$

$$\alpha = \overline{v_{N,1/f}^2} / \overline{v_{rms,tot}^2}.$$
(9)

# C. ADC Resolution

Table II estimates the degradation in LDI precision after quantization noise (modeled as white) is added to the pixel's own noise power (from Table I). For simplicity, LDI input signal is assumed to be evenly distributed and the quantization noise power,  $\overline{v_{N,Q}^2}$ , is modeled as white noise with the equivalent noise power of  $\overline{v_{N,Q}^2} = \delta_v^2/12$  where  $\delta_v$  is the quantization interval. The pixel noise power is the  $\overline{v_{weq}^2}$  value of the 5.3

TABLE II DEGRADATION OF LDI PRECISION AT DIFFERENT ADC RESOLUTIONS FOR THE 5.3  $\mu$ m Pixel. The White Equivalent RMS Noise,  $\sqrt{v_{weq}^{\prime 2}}$ , is the Sum of Both the Pixel's Noise Power and Also the ADC's Quantization Noise Power

| ADC resolution (b) | $\sqrt{v_{weq}^{\prime 2}}$ (uV) | gain on $\sigma_{M1}$ (dB) |
|--------------------|----------------------------------|----------------------------|
| 14                 | 775                              | 0.00                       |
| 13                 | 777                              | 0.04                       |
| 12                 | 785                              | 0.22                       |
| 11                 | 815                              | 0.87                       |
| 10                 | 926                              | 3.09                       |
| 9                  | 1277                             | 8.66                       |
| 8                  | 2173                             | 17.90                      |

 $\mu$ m pixel (pitch chosen to match the ADC layout pitch) from Table I. The gain on  $\sigma_{M1}$  is calculated for increasingly large  $\delta_v$  under lowering ADC resolutions. These data show that an ADC resolution of > 12b should be used in order to avoid any appreciable precision loss. If ADC resolution is reduced from 14b to 10b, the LDI precision is estimated to suffer a loss of more than 3.1 dB in this case. ADCs of even higher resolutions should be used for larger pixels with lower noise power.

# D. Effect of CDS

CDS is routinely used in image sensors with 4T pixels [18]-[20], but it is also used in certain high-speed image sensors that employ 3T-styled pixels [21]. The impact of CDS on the first moment of LDI,  $M_1$ , will be briefly summarized in this section. The detailed analysis can be found in Appendix A. The digital CDS process is depicted in Fig. 2. The input signal from the photo-diode (PD) is sampled at the Nyquist rate,  $F_s$ , and stored on the sample-and-hold (SAH) capacitor. Zeroth order interpolation is then performed on the stored  $X(\omega)$  to double its sampling rate to  $2F_s$ . This signal is modulated by a repeating sequence of  $0.5, -0.5, 0.5, -0.5, \ldots$ , or  $0.5cos(\pi n)$ . The pixel reset-noise (sum of image lag noise and thermal noise),  $v_n^2$ , is modelled in Fig. 2 on a separate path. It is also sampled at the Nyquist rate,  $F_s$ , and stored on the SAH capacitor, but since it is correlated between reset and pixel readout it is simply zeroth order interpolated without going through the modulator. Both the 1/f noise component,  $v_{N,1/f}^2$ , and the white noise component,  $v_{N,w}^2$ , are superimposed in the noise PSD,  $N(\omega)$ , and ADC quantization noise is lumped into  $v_{N,w}^2$ . The signal part of the quantized output of this modulator,  $Y(\omega)$ , is the sequence of CDS signal pairs representing the reset voltage followed by the integration voltage. In this model,  $2F_s$  is the ADC sampling rate. In other words, the correlated samples are only separated by the ADC quantization time, which is the most general case [18]–[21]. The subtraction in CDS is modeled by a single tap finite impulse response (FIR) filter with one delay element followed by decimation by 2. The sampled reset-noise PSD (including image lag noise),  $V_{rst}(\omega)$ , is completely cancelled by the CDS filter. Without CDS, it will essentially appear as an additional white noise component in  $N'(\omega)$  superimposed on the digital pixel value  $Z(\omega)$ . Section III-A will describe an alternative method where a PMOS reset transistor is used to remove this reset-noise without the aid of CDS. Since the



Fig. 2. The pixel CDS process.

pixel circuit in LDI sensors must operate with a much larger bandwidth than compared to conventional video-rate image sensors, its noise power contribution to  $N(\omega)$ , especially the  $v_{Nw}^2$  part, can be comparable or larger than  $v_n^2$ .

Conventional wisdom suggests that the 1/f noise component in  $N(\omega)$  should be largely eliminated by the CDS subtraction, but this is not quite true when considering the spectral weighting of the first moment,  $M_1$ . Numerical evaluation of the closed-form solution in Appendix A shows that when compared to the non-CDS case, CDS has a noise power gain of approximately 0.91 for 1/f noise (compared to 2 for white noise). This is because  $M_1$  places higher weight on the high-frequency part of the spectrum. After repeating the noise simulation in Fig. 1 with the CDS filter, the WEQ factor, w, is evaluated to be 2.72 which is close to 2/0.91 = 2.2 from the analytical results. The total equivalent noise power gain of CDS in determining  $M_1$ should be between 0.91 and 2. Consequently, the worst-case total SNR degradation of the first moment,  $M_1$ , is 2. Since  $M_1$ is calculated directly from the PSD, its variance,  $\sigma_{M1}^2$ , is the 4th moment of the RMS voltage noise. The deterioration of precision in  $M_1$  in terms of  $\sigma_{M1}^2$  should be  $< 2^2$ , or 6 dB. Since CDS does not significantly mitigate the effect of 1/f noise on  $\sigma_{M1}$ but it doubles the white noise contribution which  $\sigma_{M1}$  is sensitive to, it should be avoided in LDI.

# E. Comparison of Current Systems

Table III summarizes the current state-of-the-art LDI systems found in the literatures. It can be seen that most of them are based on off-the-shelf high-speed image sensors which typically resort to using ADCs with limited resolution. Another important observation is that they typically use pixels larger than the 5.3  $\mu m$  pixel pitch in this study (a compromise between prototyping cost and imaging resolution). This is a necessary measure when limited laser source power is spread over a large imaging area. Interpreting this together with the noise trends in Table I and Table II suggests that the ADC resolutions in these LDI sensors are not high enough to avoid losses in sensing precision. The effects of quantization noise due to low ADC resolution can be mitigated by introducing high SNR analog amplification prior to the ADC stage. However, this approach basically shifts the design challenge from the ADC to the analog pre-amplifier, which still needs to pay the necessary costs in circuit area and power consumption to meet the same noise requirements. Another important consideration here is the ADC's sampling speed. It sets a fundamental limit on a LDI sensor's frame-rate, and also to

| Year                            | 2013           | 2011             | 2005             | 2009 2006        |                  | 2014             |
|---------------------------------|----------------|------------------|------------------|------------------|------------------|------------------|
| Ref                             | [14]           | [8]              | [10]             | [15] [16]        |                  | This work        |
| Custom chip                     | Yes            | No               | No               | No               | -                | Yes              |
| Technology (µm)                 | 0.35           | -                | -                | -                | -                | 0.18             |
| Resolution (pixel)              | $64 \times 64$ | $480 \times 480$ | $256 \times 256$ | $128 \times 128$ | $256 \times 64$  | $128 \times 208$ |
| FFT points                      | 1024           | 128              | 256              | 1024             | 256              | 256              |
| Sampling frequency (kSa/s)      | 40             | 12.4-14.9        | 14               | 27               | 45               | 44.6             |
| Frame-rate (fps)                | 1              | 14.6-59.2        | 0.18             | 0.2              | 0.25             | 1.4              |
| ADC area $(\mu m^2)$            | 586 × 586      | -                | -                | -                | -                | 518 × 84         |
| ADC resolution (b)              | 10             | 9                | 8                | 10               | 12               | 13.6             |
| ADC rate (MHz)                  | 1.28           | -                | 40               | -                | -                | 0.714            |
| Pixel throughput (MSa/s)        | 4.2            | 430.6            | 3.1              | 3.4              | 0.3              | 8.9              |
| Pixel pitch ( $\mu$ m)          | 55             | 14               | 7                | 17               | -                | 5.3              |
| Sensor area (mm <sup>2</sup> )  | 12.4           | 45.2             | 3.2              | 4.7              | -                | 0.7              |
| Imaging area (mm <sup>2</sup> ) | $25 \times 25$ | $80 \times 80$   | 55 × 55          | $70 \times 70$   | $200 \times 150$ | $11 \times 18$   |
| Laser wavelength (nm)           | 780            | 808              | 671              | 671              | 780              | 638              |
| Sensor power (mW)               | -              | -                | 39               | -                | -                | 4.75             |
| LDI precision (dB)              | 20.3           | 6-14             | -                | -                | >26              | 21.2             |

TABLE III Comparison of LDI Systems

some extent its sensing accuracy (related to the number of FFT points). It is challenging to design an ADC with both high-speed and high-resolution at the same time. It is even harder to do so within a reasonable power budget. Therefore the paramount need for ADC parallelization favors small ADCs with limited chip area.

In summary, good LDI performance in terms of sensing precision, speed, and accuracy requires the sensor to use the largest possible pixel size (assuming same or better fill-factor) and avoid CDS, and its ADCs must be high-resolution (> 12b), small (for parallelization), and fast. Compared to other entries in Table III, this work is the first attempt to systematically optimize the circuit-level design trade-offs for custom LDI sensors. It has the smallest pixel pitch, the smallest ADC, the highest ADC resolution, and the second highest pixel throughput rate. Yet its power consumption is more than an order of magnitude smaller than the best reported system based on a commercial high-speed image sensor.

## III. IMAGE SENSOR

Recent advances in CMOS fabrication have made it possible to build low power, small, and inexpensive single-chip cameras offering comparable imaging performance to CCD [22]–[26]. In state-of-the-art CMOS image sensors (CIS), column parallel successive-approximation-register (SAR) analog-to-digital-converters (ADC) have become popular for reasons of speed and power consumption [19], [27]–[32], but SAR ADCs face immense challenges for resolutions beyond the 7b to 10b [33]–[40] due to device mismatch, comparator noise, and circuit area. This section describes a number of circuit techniques useful in overcoming these challenges.

The CMOS LDI sensor depicted in Fig. 3(a) is composed of a  $128 \times 128$  pixel array and 13 column parallel high-resolution SAR ADCs. Each ADC is shared between 16 neighboring pixel columns via transfer-gate multiplexers. The source-follower circuit from the pixel column selected by MUX[15:0] is completed by the single-NMOS current source. Each current source



Fig. 3. The (a) block diagram of the LDI sensor chip and (b) its column circuit from pixel to source follower to SAR ADC.

is biased with 10  $\mu$ A to balance between power consumption and speed. The output of this source follower is sampled by the 16b SAR ADC which generates 19 bits of data (including redundancy) for each sample. The ADC's resolution is chosen to be as high as possible in order to study the impact of quantization noise on LDI precision. The sensor can operate in both standard imaging mode where each pixel is exposed once and LDI mode where each pixel is exposed 256 times at a rate of 40 kSa/s. CDS can be enable or disabled for both modes. In CDS mode, the pixel is sampled once immediately after pixel reset and once at the end of exposure. If CDS is disabled, the pixel is only sampled at the end of exposure.

# A. Body-Biased Pixel

The complete column circuit starting from the pixel is shown in Fig. 3(b). Each pixel has a large NWELL-Psub photo-diode (PD) for high sensitivity and Signal-to-Noise Ratio (SNR) photo-detection. The PMOS reset transistor, MP1, is placed



Fig. 4. The (a) schematic and (b) layout of the body-biased PMOS reset pixel.

inside this PD [Fig. 4(a)] to avoid the need for a separate NWELL. It is evident in Fig. 4(b) that this choice leads to a very compact pixel layout. The advantage of using a PMOS reset transistor is twofold: firstly, it eliminates image lag between samples and ensures all PDs are operating at the same bias point with similar charge-to-voltage gains; secondly, it can increase the nodal capacitance during reset and reduce the thermal reset-noise,  $\overline{v_{nt}^2}$ , of each pixel. The total pixel reset-noise ( $\overline{v_n^2}$  in Fig. 2) is the sum of image lag noise and  $\overline{v_{nt}^2}$ .

The thermal noise sources  $(\overline{v_{ns}^2} \text{ and } \overline{v_{nr}^2})$  contributing to the total thermal reset-noise of each individual pixel,  $\overline{v_{nt}^2}$ , from a N pixel array are illustrated in Fig. 5. The ratio of the source impedance,  $R_s$ , to the reset transistor's drain-to-source impedance,  $R_{ds}$ , play a critical role in determining the value of  $\overline{v_{nt}^2}$ . It can be shown that

$$\overline{v_{nt}^2} = \frac{kT}{C_{ph}} \left( \frac{R_s + R_{ds}}{NR_s + R_{ds}} \right).$$
(10)

If  $R_{ds} \ll NR_s$ , the source impedance,  $R_s$ , effectively sees N capacitors ( $C_{tot} = NC_{ph}$ ) connected in parallel and the  $kT/C_{tot}$  term is reduced by a factor of N when compared to the case of  $kT/C_{ph}$  of a single PD. If  $R_{ds} \gg NR_s$  then each pixel sees an independent noise source dominated by  $R_{ds}$ . In conventional pixels with NMOS reset, where MP1 is replaced with a NMOS transistor, the NMOS reset transistor spends most of the reset phase in cut-off region, and the reset-noise can be shown by temporal analysis to be approximately  $kT/2C_{ph}$  [41].



Fig. 5. Model of pixel reset-noise.

The PMOS reset will have an advantage when  $R_{ds} < NR_s$ . A 0.22  $\mu$ m wide and 0.3  $\mu$ m long 3.3 V PMOS transistor in Global Foundries 0.18  $\mu$ m process has a  $R_{ds}$  of 14.6 k $\Omega$ . If  $R_s = 100 \Omega$ , then  $v_{nt}^2 < kT/2C_{ph}$  when N > 146. This is not difficult to achieve in modern mega-pixel CIS. Additional low-frequency noise sources are not considered here because  $M_1$  in (2) inherently discriminates against them [see (6) in Section II-B].

The main drawback of the described topology lies in the parasitic diode between the source and body of the PMOS reset transistor. This parasitic diode becomes forward biased when the PD voltage is discharged beyond its on-voltage. The solution to this problem is shown in Fig. 4(a). A time delay is inserted to ensure that the reset voltage  $V_{pixel}$  is pulled to ground after MP1 is turned off by  $\overline{RST}$ . This is implemented by an inverter delay chain in the row decoder outside of the pixel array.

# B. SAR ADC

The schematic of the 16b SAR ADC is shown in Fig. 6. Its sub-radix-2 digital-to-analog converter (DAC) with 19 weights is split into 3 sections. The MSB array is sized so  $kT/C \approx 1LSB_{rms}$ . The unit capacitor is a 5  $um \times 5 um 28$  fF MIM capacitor. Fig. 7 illustrates the operation of this ADC on a simplified 4b DAC. During the sampling phase [Fig. 7(a)], the bottom-plates of all capacitors in the negative DAC,  $DAC_n$ , except for the MSB capacitor is connected to GND while their top-plates of all capacitors in the positive DAC,  $DAC_n$ , except for the MSB capacitor is connected to GND while their top-plates of all capacitors in the positive DAC,  $DAC_p$ , except for the MSB capacitor is connected to  $V_{ref}$  while their bottom-plates sample the input signal,  $V_{sig}$ .

When the DAC is settled after the sampling phase in Fig. 7(b), the DAC output,  $V_{DAC}$ , as the difference between the  $DAC_n$  output,  $V_x[0]$ , and the  $DAC_p$  output,  $V_x[1]$ , becomes

$$V_{DAC} = V_x[0] - V_x[1]$$
  
=  $(KV_{ref} + V_{sig}) - (KV_{ref} - V_{sig})$   
=  $2V_{sig}$  (11)

where the DAC offset factor, K, is

$$K = \frac{\sum_{i \neq 2, j \neq 8} C_{ij}}{\sum_{i,j} C_{ij}}.$$
(12)

The DAC settling voltage as a function of  $V_{sig}$  is plotted in Fig. 8. The differential DAC output to the SAR comparator,



Fig. 6. Schematic of the sub-radix-2 16b SAR ADC.

 $V_{DAC}$  is  $2V_{sig}$ . There are two main benefits to this sampling scheme. Firstly, the DC bias of the comparator input is determined by K. This simplifies comparator design complexity because it only needs to work for one common-mode input voltage instead of a wide range. Secondly,  $V_{sig}$  enjoys an inherent voltage gain of  $\times 2$  through the sampling process. Since both  $DAC_n$  and  $DAC_p$  are connected to  $V_{sig}$  during sampling, the kT/C thermal noise requirement is satisfied by the sum of  $C_{ij}$  from both DACs. During successive approximation, the capacitance at each comparator input is halved, but because of the DAC's gain in the voltage domain, the comparator's effective input referred noise is still attenuated by 3 dB.

# C. DAC Calibration

The redundant sub-radix-2 DAC in Fig. 6 is calibrated against an off-chip commercial 18b DAC. Because the SAR ADC uses a much lower reference voltage (1.8 V) when compared to the commercial DAC (5 V), the actual usable resolution of the commercial DAC is close to 16b. The commercial DAC claims to have an INL and DNL of 0.2 LSB and 0.15 LSB respectively. This means that given enough noise averaging samples, the DAC's differential steps can be used as a reliable means of calibrating the SAR ADC's weights. The detailed calibration procedure based on matrix projection is detailed in Appendix B. This calibration scheme prioritizes the minimization of the ADC's Differential Non-Linearity (DNL) error in the absence of an accurate reference for INL. It can be integrated on-chip by adding a high-resolution thermal coded single-slope reference to be shared among the ADC array. This can be in the form of a large (2900  $\mu$ m × 2900  $\mu$ m for 18b) capacitive DAC, or a smaller but very slow  $\Delta\Sigma$  DAC. A popular alternative would be self-calibration schemes. They are often faster and much more compact [28], [42].

## D. Low-Noise Time-Domain Comparator

The resolution of a SAR ADC is largely limited by the input referred noise of its comparator, especially from a power consumption's point of view. This SAR ADC overcomes this limitation by using a noise-averaging time-domain comparator. The core of this comparator, as shown in Fig. 9, is formed by two voltage controlled delay lines (VCDL). The transistors M4, M5, M6, and M7 form an inverter chain for the  $V_{CLK}$  pulse to propagate through. During reset,  $V_{CLK}$  is low and the transistors M4 and M7 are turned on to reset all internal nodes of the inverter chain while M5 and M6 are off. The common source node of M6 is reset to the supply voltage by M1, M2, and M3and the common source node of M5 is reset to ground by M8, M9, and M10. During the evaluation phase,  $V_{CLK}$  is asserted high to create a step pulse which turns on M5 and M6 from stage to stage. Their current supplies are starved by M3 and



Fig. 7. Operation of a 4b SAR ADC during (a) sampling phase and (b) MSB evaluation.

M8 (biased in the saturation region). Consequently, the difference in the gate voltages of M3 and M8 between the two VCDL determines the difference of their delay time. The absolute delay time of each VCDL determines their respective noise bandwidth because the noise current of M3 and M8 are integrated over the time it takes for  $V_{CLK}$  to propagate from one end of the delay line to the other. This leads to an averaging effect on the comparator noise. The difference in delay time between the two VCDL is then arbitrated by the arbiter to produce the final comparator output.

The transistors M5 and M6 are sized to be much bigger than M3 and M8 (approximately eight times), so they operate in the linear region and have low gain. The above configuration is fundamentally different from the VCDL in [43] where an independent transistor is used to control the current of each inverter stage. In certain sizing conditions, it can become a multi-stage amplifier while the circuit described here is always a single-stage amplifier. The inverters formed by M4 and M5 see a common noise source, M8, and similarly the inverters formed by M6 and M7 see a common noise source, M3. This ensures that the phase noise,  $\overline{n_T^2}$ , of each inverter-pair stage is added



Fig. 8. DAC post-sample settling voltage for different input voltages,  $V_{sig}$ .



Fig. 9. Schematic of the time-domain noise-averaging comparator.

equally and does not experience the cascaded gain in [43] where the first inverter stage will have the largest noise contribution. The net effect, as the  $V_{CLK}$  pulse propagates through the inverter chain, is that  $N_s$  observations of  $n_T^2$  is added to the total inverter chain delay power  $(T_{delay}^2)$ 

$$T_{delay}^2 = (N_s A_T V_{in})^2 + N_s \overline{n_T^2}$$
(13)

where  $A_T$  is the voltage-to-time gain of a single inverter-pair stage comprising M4, M5, M6, and M7. The first term of (13),  $N_s A_T V_{in}$ , is the voltage-to-time conversion gain of the entire VCDL. Its second term,  $N_s \overline{n_T^2}$ , is the time-domain noise power summed from the  $N_s$  inverter-pair stages. The output SNR is estimated by

$$SNR = \frac{N_s (A_T V_{in})^2}{\overline{n_T^2}}.$$
 (14)

The noise power term,  $\overline{n_T^2}$ , is attenuated by a factor of  $N_s$ , which is the noise-averaging factor of the comparator. Intuitively, this is the result of the output noise current of M3 and M8 being integrated over the time period of  $T_{delay}$ .



Fig. 10. The comparator's (a) input referred noise, energy consumption, (b) speed, and FoM as a function of the number of VCDL inverter-pair stages. The corresponding vertical axis of each curve is indicated by its associated arrow.

The transistors M1, M2, M9, and M10 are biased by the common-mode input voltage. They serve the purpose of tuning the bias conditions of M3 and M8 to minimize  $v_{ni}^2$  and enlarge  $A_T$ . A larger  $A_T$  reduces noise contributions from the arbiter.

The comparator's input referred noise,  $\sqrt{v_{ni}^2}$ , and energy consumption per comparison, E, is plotted in Fig. 10(a) against an increasing number of inverter-pair stages. The supply and reference voltage,  $V_{ref}$ , are set to 1.8 V, and the load capacitance,  $C_L$ , is 15 fF. The lower bound of  $\sqrt{v_{ni}^2}$  observed in Fig. 10(a) is due to the noise contributions from the inverter-pair themselves which follows the conventional noise model of cascaded amplifier chains. In Fig. 10(b), the comparator speed is inversely related to the number of inverter stages. In this design,  $N_s = 8$  is chosen as a conservative choice. For aggressive high-speed designs, 5 or 6 stage inverter-pairs offer good performance trade-offs.

## IV. PROTOTYPE AND EXPERIMENTAL RESULTS

## A. Prototype Chip

The prototype LDI sensor in Fig. 11 is fabricated in Global Foundry 1P6M 0.18  $\mu$ m mixed-signal technology. The imager and ADC specifications are summarized in Table IV. The DNL and INL reported in Table IV are measured from the first ADC channel in the array. The Figure-of-Merit (FoM) calculated for each imaging mode and also the ADC itself (without CDS) is based on the effective resolution, *ER*. This definition is appropriate for image sensors since the output from the pixel array can be considered to be a DC signal [20], [45], [47], [48] as opposed to a full-range sinusoid.

Evident from entries in Table III, only a limited number of LDI sensors use custom chips, and an even smaller number of them disclose their circuit details. In order to better assess



Fig. 11. Prototype LDI sensor (MONAVALE).

 TABLE IV

 Summary of the Prototype Chip

|                                      | Standard Image    |       | LDI Image |      |  |
|--------------------------------------|-------------------|-------|-----------|------|--|
|                                      | no CDS            | CDS   | no CDS    | CDS  |  |
| Process                              | 0.18 µm 1P6M CMOS |       |           |      |  |
| Supply voltage (V)                   |                   | 3.3,  | 1.8       |      |  |
| ADC Area $(\mu m^2)$                 |                   | 518 : | × 84      |      |  |
| Clock frequency (MHz)                |                   | 2     | 0         |      |  |
| Pixel resolution                     |                   | 128 × | < 208     |      |  |
| Frame rate (fps)                     | 340               | 167   | 1.4       | 0.67 |  |
| Pixel rate (kSa/s)                   | 0.340             | 0.167 | 44.6      | 43.1 |  |
| Pixel size $(\mu m^2)$               | 5.3× 5.3          |       |           |      |  |
| Fill factor                          | 45%               |       |           |      |  |
| Dark current ( $e^{-}$ /sec)         | <4688             |       |           |      |  |
| Sensitivity $(e^{-}/Lux.sec)$        | 269000 @ 201 Lux  |       |           |      |  |
| Dynamic range (dB)                   |                   | 7.    | 4         |      |  |
| ADC rate (kSa/s)                     | 683               | 335   | 714       | 345  |  |
| ADC noise $(\mu V_{rms})$            | 110 (4 LSB)       |       |           |      |  |
| DNL (LSB)                            | +3.0/-2.8         |       |           |      |  |
| INL (LSB)                            |                   | +24/  | /-17      |      |  |
| column FPN ( $\mu$ V)                | 22700             | 138   | -         | -    |  |
| Pixel noise@256Lux ( $\mu V_{rms}$ ) | 509               | 449   | -         | -    |  |
| Pixel noise@dark ( $\mu V_{rms}$ )   | 477               | 427   | -         | -    |  |
| SF noise $(\mu V_{rms})$             | 812               | 873   | 812       | 873  |  |
| ER (b)                               | 13.6              |       |           |      |  |
| Imager power (mW)                    | 4.46              | 4.75  | 4.75      | 4.73 |  |
| Energy (nJ/pixel)                    | 0.50              | 1.09  | 131       | 270  |  |
| Imager FoM (fJ/step) <sup>a</sup>    | 31                | 66    | 31        | 64   |  |
| ADC FoM (fJ/step) <sup>a</sup>       | 23                |       |           |      |  |

<sup>a</sup> The Figure-of-Merit (FoM) is defined as  $FoM = P/(F_s \times 2^{ER})$ . Here, P is the power consumption,  $F_s$  is the sampling frequency, and ER is the effective resolution ( $ER = log_2(V_{ADC}/n)$  where  $V_{ADC}$  is the ADC's input range and n is the average input referred noise of the ADC).

the performance merit of this design, a comparison is made in Table V against general-purpose image sensors especially in terms of energy efficiency. The FoM of the prototype chip is calculated from its effective resolution, ER, and it includes energy contributions from digital CDS (two ADC samples), pixel source-follower, and additional shutter-related digital control. For the other entries in Table V, because image sensors typically do not quote their standalone ADC noise, their FoMs are calculated directly from their nominal resolution assuming zero

| Year                       | 2009              | 2009     | 2011           | 2011        | 2009      | 2012      | 2012            | 2013                            |
|----------------------------|-------------------|----------|----------------|-------------|-----------|-----------|-----------------|---------------------------------|
| Ref                        | [44]              | [18]     | [20]           | [45]        | [19]      | [27]      | [46]            | This work                       |
| ADC architecture           | 2-step SS         | Cyclic   | $\Delta\Sigma$ | Cyclic      | SAR       | SAR       | SAR             | SAR                             |
| Technology (µm)            | 0.35              | 0.18     | 0.13           | 0.13        | 0.18      | 0.13      | 0.18            | 0.18                            |
| Supply (V)                 | 2.8               | 1.8, 3.3 | 2.8, 1.2       | 2.8, 1.5    | 3.3, 1.8  | 2.8, 1.5  | -               | 1.8, 3.3                        |
| Column area $(\mu m^2)$    | $11.2 \times 484$ | 5.6×1112 | 4.5×600        | 18×240      | 8.4×1320  | 9×425     | $15 \times 700$ | $518 \times 84$                 |
| Resolution                 | 320×240           | 640×428  | 1692×1212      | 1696×1212   | 4112×2168 | 1600×1200 | 256×256         | $128 \times 208$                |
| Pixel pitch ( $\mu$ m)     | 5.6               | 5.6      | 2.25           | 2.25        | 4.2       | 2.25      | 15              | 5.3                             |
| Pixel structure            | 4T APS            | 4T APS   | 2.5T APS       | 2.5T APS    | 4T APS    | 2.5T APS  | CTIA            | BB 3T APS                       |
| Sensitivity (V/lux.s)      | 0.52              | 7.1      | -              | 0.11        | -         | 1.6       | 68.5            | 2.0                             |
| Conv. gain $(\mu V/e^-)$   | 46                | 61       | 80             | 80          | 45        | -         | -               | 7.6                             |
| Saturation level $(e^-)$   | 18500             | -        | 11000          | 11000       | 27800     | -         | -               | 68115                           |
| Read noise $(\mu V_{rms})$ | 490               | 336      | 152            | 1000        | 17.6      | 2400      | 3202            | 909 ( <b>110</b> <sup>a</sup> ) |
| Dynamic Range (dB)         | 64.8              | 71       | 75             | 59          | 84        | 52        | 56.5            | 74                              |
| Clock (MHz)                | 25                | -        | 48             | 25          | 41.25     | 7         | 25              | 20                              |
| Frame rate (fps)           | 700               | 390      | 120            | 250         | 50        | 150       | 1500            | 167                             |
| Analog gain                | No                | No       | No             | No          | Yes       | No        | Yes             | No                              |
| CDS                        | Analog            | Digital  | Digital        | Analog      | Digital   | Analog    | Analog          | Digital                         |
| ADC resolution (b)         | 11                | 13       | 12             | 10          | 14        | 10        | 10              | 13.6                            |
| DNL(LSB)                   | +0.53/-0.78       | 0.5      | +0.55/-0.63    | +0.59/-0.83 | 1         | -         | 0.34            | +3.0/-2.8                       |
| INL(LSB)                   | +1.42/-1.61       | 3.2      | +3.7/-0.8      | +2.8/-3.6   | +2/-15    | -         | 0.6             | +24/-17                         |
| column FPN                 | 0.10%             | 0.06 LSB | 0.01%          | 0.10%       | 0.38e-    | 0.4 LSB   | 0.52%           | 0.026 %                         |
| Power (mW)                 | 36                | 297      | 180            | 300         | 1085      | 220       | 390             | 4.46                            |
| Energy (nJ/pixel)          | 0.670             | 2.780    | 0.731          | 0.584       | 2.434     | 0.764     | 3.967           | 1.09                            |
| FoM (fJ/step)              | 327               | 339      | 179            | 570         | 149       | 746       | 3874            | 66                              |

TABLE V COMPARISON OF ADCS FOR IMAGE SENSORS

<sup>a</sup> noise contribution from ADC.

noise and non-linearity. Despite this, the SAR ADC in this paper still has the lowest FoM among the published works.

## B. Non-Linearity and FPN

The ADC's differential non-linearity (DNL) and integral nonlinearity (INL) are characterized from the calibration data generated by the method in Section III-C. The measured DNL of the first 4 ADC channels in the ADC array is plotted in Fig. 12(a). Out of the 13 ADCs on-chip, 11 have a maximum DNL value [half of the peak-to-peak span in Fig. 12(a)] less than 4 LSBs. This indicates that the sub-radix-2 DAC has provided enough redundancy to accommodate process variation within the die. The INL in Fig. 12(b) has a large common-mode component across all channels. This INL error profile changes from measurement to measurement. It may be the result of limitations in the test setup where the ADC's reference voltage is directed connected to an Agilent E3646A power supply causing low-frequency drifts in the reference. Fortunately, in most imaging applications INL is less critical than DNL because large INL does not lead to missing codes; linearity is either not critical or can be corrected during post-processing. The LDI perfusion calculated by (4) inherently discriminates against low-frequency components in the signal.

# C. Noise

The ADC input referred noise in Table IV is measured by calculating the power spectrum of the ADC output when the ADC input is clamped to a heavily low-pass filtered DC input. Similarly, the source-follower (SF) noise is measured from the power spectrum of the ADC output when all the photo-diodes (PD) are biased at a constant voltage by turning on the in-pixel



Fig. 12. (a) DNL and (b) INL (normalized to the LSB) of the first four columns of 16b SAR ADC array.

reset PMOS transistor. The PD noise can be calculated by illuminating the sensor array with constant and uniform illumination from an integrating sphere and subtracting from its ADC output power spectrum the SF noise power characterized earlier. Two sets of pixel noise measurements are reported in Table IV: one under 256 Lux illumination and one in dark conditions. The





Pump

Fig. 13. The (a) photo and (b) schematic of LDI flowmetry experiment apparatus.

pixel reset-noise without CDS is back calculated to be  $212 \ \mu V$ which is smaller than the 355  $\mu V$  noise level expected from  $kT/C_{ph}$  of a single pixel where  $C_{ph} = 33$  fF. Among dark shot noise, optical shot noise and reset-noise, dark shot noise is rendered negligible here due to the short exposure time, and reset-noise is alleviated by the PMOS reset switch. So the PD noise is mainly contributed by optical shot noise, and a constant illumination source is a valid means to characterize this. The results in Table IV confirm that the PD and its SF are the dominant noise sources. The SF here has the same noise bandwidth in standard imaging mode as it does in LDI mode. Its noise power when compared to the PD's own contribution can therefore be higher than certain video-rate image sensors whose SF is low-pass filtered to much smaller bandwidths.

## D. Power Consumption

The total sensor power consumption reported in Table IV is composed of 12% from the ADC reference, 60% from the analog circuits (predominantly ADC comparators and pixel source-followers), 4% from digital circuits, and 24% from pixel circuits. These ratios are consistent across all imaging modes. The contribution of digital circuits is low because image processing is performed off-chip.

# E. Laser Doppler Imaging

The LDI algorithm ((4)) is implemented off-chip using Matlab's fast Fourier transform (FFT) function. For future on-chip LDI processing,  $M_0$  and  $M_1$  can be approximated by simple digital filters [49]. The LDI experimental set-up is shown in Fig. 13. Skimmed milk with 3.4% protein content (human blood has a 7% protein content) is used as the phantom serum because there were no blood samples or alternative optical phantoms available at the time of this experiment. Milk has been used as the test phantom in a number of studies due to



Fig. 14. Spectral response of the pixel's NWELL photo-diode.

its optical properties [50]–[53]. In this experiment, it is pumped through a clear 1.6 mm PVC tube (1.6 mm wall thickness) by a 3 ml syringe pump from KD Scientific. A section of the PVC tube and its content is illuminated by a 1 mW 638 nm laser. This wavelength is chosen to match the spectral response of the pixel's NWELL photo-diode as shown in Fig. 14. The illuminated section of the PVC tube is imaged by the LDI sensor chip in Fig. 15. While a number of earlier LDI experiments [8]–[11], [14] on biological tissues have provided important insights into LDI's biomedical application, the above setup is useful for subjectively measuring LDI precision against a known flow reference.

The effects of CDS and quantization noise on LDI quality are compared in Fig. 15. Fig. 15(a) shows the light intensity image (with CDS) captured by the prototype camera. Fig. 15(b) shows the flow-image [first moment,  $M_1$ , (2)] without CDS. Fig. 15(c) shows the same flow-image with CDS, and finally Fig. 15(d) recalculates Fig. 15(b) with 10b quantization error. The laser dots in Fig. 15(b), Fig. 15(c), and Fig. 15(d) is much smaller than that in Fig. 15(a) because LDI only generates a response in areas where there is particle flow. In Fig. 15(c), the image quality is significantly deteriorated by the noise gain of the CDS process. Similarly, the background noise in Fig. 15(d) is increased by 2.3 dB because of higher quantization noise as shown by the corresponding reduction in its PSNR (maximum LDI response divided by background RMS noise).

Similar noise performance trends are also observed in perfusion (4) measurements. In Fig. 16, Fig. 17, and Fig. 18, the measured LDI response in arbitrary units (AU) is plotted over a range of flow-rates set by the syringe pump. Ten measurements are repeated over the same LDI pixel for each chosen flow-rate. The confidence interval deteriorates from 16b without CDS to 10b without CDS to 16b with CDS. Due to the large noise in the single pixel measurements, the underlying linearity of the LDI response is more easily seen by averaging across 49 neighboring pixels centered around the laser spot. This is appended to Fig. 16 and Fig. 18.

The LDI sensing precision,  $\Psi$ , is defined as

$$\Psi = \frac{\overline{p}}{\sigma_p} \tag{15}$$

where  $\overline{p}$  is the average perfusion as defined by (4) over the 10 measurements and  $\sigma_p$  is its standard deviation. This  $\Psi$  is plotted in Fig. 19 using data from Fig. 16, Fig. 17, and Fig. 18. Averaging across the 49 pixel patch increased the  $\Psi$  by 15 dB when



Fig. 15. The (a) light intensity image (PSNR = 40.6 dB) against LDI images (12 mm/s flow-rate) in (b) 16b resolution without CDS (PSNR = 40.5 dB), (c) 16b resolution with CDS (PSNR = 27.8 dB), and (d) 10b resolution without CDS (PSNR = 38.2 dB).



Fig. 16. Box plot and mean plot of 10 samples of a single pixel versus the group average of 49 ( $7 \times 7$ ) pixels under 16b ADC resolution without CDS for different flow-rates.



Fig. 17. Box plot and mean (line plot) of 10 samples of a single pixel under 16b ADC resolution with CDS for different flow-rates.

compared to single pixel values. This indicates that the perfusion noise has very little correlation between pixels. On average, 4.4 dB of improvement is observed when the ADC resolution is



Fig. 18. Box plot and mean plot of 10 samples of a single pixel versus the group average of 49 ( $7 \times 7$ ) pixels under 10b ADC resolution without CDS for different flow-rates.



Fig. 19. SNR comparison across the various sensing configurations used in Figs.  $16{-}18$ .

increased from 10b to 16b (ER = 13.6b). This falls in well with the 3.1 dB lower-bound estimated in Table II (10b case) from Section II-C because the pixel circuit makes up for the

dominant share of noise power. On average, CDS deteriorates  $\Psi$  by 6.9 dB. This is close to the 6 dB upper-bound estimated in Section II-D.

#### V. CONCLUSION

LDI sensing precision in relation to pixel size, ADC resolution, and CDS is analyzed. A prototype sensor with non-CDS PMOS reset pixel and low-noise SAR ADC is presented to improve this precision. The phantom serum measurement results match well with the expected theoretical values. The SAR ADC is able to achieve high effective resolution without sacrificing energy efficiency thanks to its low-noise time-domain comparator. It has the lowest FoM amongst published works making it suitable for future mobile LDI applications.

#### APPENDIX A

#### THE EFFECT OF DIGITAL CDS ON THE FIRST MOMENT OF LDI

The digital CDS process is depicted in Fig. 2. Let  $X(\omega)$  be the pixel output PSD (discrete-time Fourier transform version of S(v)) with a response up to  $F_s/2$  ( $F_s$  is the sampling frequency) or  $\pi$ . The analog input to the ADC,  $Y(\omega)$ , is an up-sampled version of  $X(\omega)$  modulated with 0.5, -0.5, 0.5..., or  $0.5cos(\pi n)$ . The quantized  $Y(\omega)$  is passed though the CDS function

$$|H_{CDS}(\omega)| = |e^{-j\omega\tau} - 1| = 2sin(\omega\tau/2).$$
 (A.1)

Both the 1/f noise component,  $N_{1/f}/\omega$ , and the white noise component,  $N_w$ , are superimposed in the noise PSD,  $N(\omega)$ , and ADC quantization noise is lumped into  $N_w$ .

$$N(\omega) = \frac{N_{1/f}}{\omega} + N_w \tag{A.2}$$

The 1/f noise component and the white noise component are analyzed separately. Applying the first moment,  $M_1$ , from (2) to the 1/f noise power and taking into account that both the signal and its modulated image (centered at 0 and  $2\pi$  respectively) fold into the post-CDS Nyquist band yields

$$P_{1/f,CDS} = \frac{N_{1/f}}{2} \int_{0}^{\hat{n}} \frac{2}{\omega} 4sin^{2} \left(\frac{\omega}{4}\right) \omega d\omega + \frac{N_{1/f}}{2} \int_{0}^{\pi} \frac{2}{2\pi - \omega} 4cos^{2} \left(\frac{\omega}{4}\right) \omega d\omega. \quad (A.3)$$

The non-CDS case is simply  $P_{1/f,non-CDS} = 2\pi N_{1/f}$ , and

$$\frac{P_{1/f,CDS}}{P_{1/f,non-CDS}} \approx 0.91. \tag{A.4}$$

CDS simply doubles the white noise power

$$\frac{P_{w,CDS}}{P_{w,non-CDS}} = 2.$$
 (A.5)  
Appendix B  
DAC CALIBRATION

This calibration scheme prioritizes the minimization of the ADC's Differential Non-Linearity (DNL) error. It is used in the absence of an accurate reference for estimating INL. For each ADC input,  $V_{sig}(i)$ , generated by the commercial DAC

at step *i*, the SAR ADC quantizes it to a *N* bit binary word,  $a_{ij}|_{j=1,2,...,N}$ , by a combination of its weights  $w_j$ 

$$V_{sig}(i) = \sum_{j=1}^{N} a_{ij} w_j \tag{B.1}$$

where  $w_1, w_2, \ldots, w_N$  represent the capacitor weights from LSB to MSB. Fig. 6 shows the case of N = 19 where  $w_1$  and  $w_N$  corresponds to  $C_{01}$  and  $C_{28}$ . Let N' be the non-redundant resolution of the ADC (N' = 16 in this case). If a ramp of S  $\approx 2^{N'}$  uniform steps is generated by the commercial DAC, the corresponding ADC output is described by

$$\mathbf{D} = \mathbf{A}\mathbf{W} \tag{B.2}$$

where the matrices D, A, and W are defined as

$$\mathbf{D} = \begin{bmatrix} V_{sig}(2) \\ V_{sig}(3) \\ \vdots \\ V_{sig}(S) \end{bmatrix} - \begin{bmatrix} V_{sig}(1) \\ V_{sig}(2) \\ \vdots \\ V_{sig}(S-1) \end{bmatrix}$$
(B.3)  
$$\mathbf{A} = \begin{bmatrix} a_{2(N)} & a_{2(N-1)} & \dots & a_{21} \\ a_{3(N)} & a_{3(N-1)} & \dots & a_{31} \\ \vdots & \vdots & \ddots & \vdots \\ a_{S(N)} & a_{S(N-1)} & \dots & a_{51} \end{bmatrix}$$
$$- \begin{bmatrix} a_{1(N)} & a_{1(N-1)} & \dots & a_{11} \\ a_{2(N)} & a_{2(N-1)} & \dots & a_{21} \\ \vdots & \vdots & \ddots & \vdots \\ a_{(S-1)(N)} & a_{(S-1)(N-1)} & \dots & a_{(S-1)1} \end{bmatrix}$$
(B.4)  
$$\mathbf{W} = \begin{bmatrix} w_N & w_{N-1} & \dots & w_1 \end{bmatrix}^T.$$
(B.5)

In practice, each row of A is averaged across a large number of consecutive ramps. Each sample,  $V_{sig}(i)$ , within a ramp represents one DAC step. When A is long enough to cover all ADC code changes in the ramp, it will have sufficient eigenvalues to regress all entries in W. If  $S < 2^{N'}$ , A may be too sparse to give reliable estimates of all the LSB weights in W. In that case, the LSB weights can be linearly extrapolated from the last reliable MSB weight [28]. While the DNL can be accurately estimated from this calibration data, INL measurements are still subject to fluctuations in the reference and supply.

From (B.2), the weight matrix, W, can be projected by

$$\mathbf{W} = \left(\mathbf{A}^{\mathrm{T}}\mathbf{A}\right)^{-1}\mathbf{A}^{\mathrm{T}}\mathbf{D}.$$
 (B.6)

#### REFERENCES

- J. D. Briers, "Laser Doppler, speckle and related techniques for blood perfusion mapping and imaging," *Phys. Meas.*, vol. 22, no. 4, p. R35.
- [2] G. Golpayegani and K. Maghooli, "Laser Doppler and laser speckle techniques for blood flow measurement," in *Proc. 2nd Int. Conf. Bioinformatics and Biomedical Engineering*, May 2008, pp. 1555–1560.
- [3] M. F. Swiontkowski, "Laser Doppler flowmetry-development and clinical application," *Iowa Orthopaedic J.*, vol. 11, pp. 119–126, Jun. 1991.
- [4] M. Wang and J. Chen, "Volumetric flow measurement using an implantable CMUT array," *IEEE Trans. Biomed. Circuits Syst.*, vol. 5, no. 3, pp. 214–222, Jun. 2011.
- [5] S. Shaw, "A device for measuring blood pressure," N. S. Tarilian Consulting Ltd., Patent WO/2007/017 661, 02 15, 2007.

- [6] Y. Kimura, M. Goma, A. Onoe, E. Higurashi, and R. Sawada, "Integrated laser Doppler blood flowmeter designed to enable wafer-level packaging," *IEEE Trans. Biomed. Eng.*, vol. 57, no. 8, pp. 2026–2033, Aug. 2010.
- [7] A. Serov, J. Nieland, S. Oosterbaan, F. de Mul, H. van Kranenburg, H. Bekman, and W. Steenbergen, "Integrated optoelectronic probe including a vertical cavity surface emitting laser for laser Doppler perfusion monitoring," *IEEE Trans. Biomed. Eng.*, vol. 53, no. 10, pp. 2067–2074, Oct. 2006.
- [8] M. Leutenegger, E. Martin-Williams, P. Harbi, T. Thacher, W. Raffoul, M. André, A. Lopez, P. Lasser, and T. Lasser, "Real-time full field laser Doppler imaging," *Biomed. Opt. Express*, vol. 2, no. 6, pp. 1470–1477, Jun. 2011.
- [9] A. Serov, B. Steinmacher, and T. Lasser, "Full-field laser Doppler perfusion imaging and monitoring with an intelligent cmos camera," *Opt. Express*, vol. 13, no. 10, pp. 3681–3689, May 2005.
- [10] A. Serov and T. Lasser, "High-speed laser Doppler perfusion imaging using an integrating cmos image sensor," *Opt. Express*, vol. 13, no. 17, pp. 6416–6428, Aug. 2005.
- [11] K. Wardell, A. Jakabsson, and G. E. Nilsson, "LaserDoppler perfusion imaging by dynamic light scattering," *IEEE Trans. Biomed. Eng.*, vol. 40, no. 4, pp. 309–316, Apr. 1993.
- [12] A. Rege, K. Murari, N. Li, and N. Thakor, "Imaging microvascular flow characteristics using laser speckle contrast imaging," in *Proc. Annu. Int. Conf. IEEE Engineering in Medicine and Biology Soc.*, Sep. 4, 2010, pp. 1978–1981.
- [13] K. Forrester, J. Tulip, C. Leonard, C. Stewart, and R. Bray, "A laser speckle imaging technique for measuring tissue perfusion," *IEEE Trans. Biomed. Eng.*, vol. 51, no. 11, pp. 2074–2084, Nov. 2004.
- [14] D. He, H. C. Nguyen, B. R. Hayes-Gill, Y. Zhu, J. A. Crowe, C. Gill, G. F. Clough, and S. P. Morgan, "Laser Doppler blood flow imaging using a CMOS imaging sensor with on-chip signal processing," *Sensors*, vol. 13, no. 9, pp. 12632–12647, Sep. 2013.
- [15] M. Draijer, E. Hondebrink, T. van Leeuwen, and W. Steenbergen, "Twente optical perfusion camera: System overview and performance for video rate laser Doppler perfusion imaging," *Opt. Express*, vol. 17, no. 5, pp. 3211–3225, Mar. 2009.
- [16] Moor Instruments, moorLDLS-BI [Online]. Available: http://us.moorclinical.com/product/LDLS-BI/1, Jul. 10, 2014
- [17] "Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection," *Biomed. Eng. Online*, vol. 6, no. 1, Jun. 2007.
- [18] J.-H. Park, S. Aoyama, T. Watanabe, K. Isobe, and S. Kawahito, "A high-speed low-noise CMOS image sensor with 13-b column-parallel single-ended cyclic ADCs," *IEEE Trans. Electron Devices*, vol. 56, no. 11, pp. 2414–2422, Nov. 2009.
- [19] S. Matsuo, T. Bales, M. Shoda, S. Osawa, K. Kawamura, A. Andersson, M. Haque, H. Honda, B. Almond, Y. Mo, J. Gleason, T. Chow, and I. Takayanagi, "8.9-megapixel video image sensor with 14-b columnparallel SA-ADC," *IEEE Trans. Electron Devices*, vol. 56, no. 11, pp. 2380–2389, Nov. 2009.
- [20] Y. Chae, J. Cheon, S. Lim, M. Kwon, K. Yoo, W. Jung, D.-H. Lee, S. Ham, and G. Han, "A 2.1 MPixels, 120 frame/s CMOS image sensor with column-parallel  $\Delta\Sigma$  ADC architecture," *IEEE J. Solid-State Circuits*, vol. 46, no. 1, pp. 236–247, Jan. 2011.
- [21] J. Dubois, D. Ginhac, M. Paindavoine, and B. Heyrman, "A 10 000 fps CMOS sensor with massively parallel image processing," *IEEE J. Solid-State Circuits*, vol. 43, no. 3, pp. 706–717, Mar. 2008.
- [22] E. Fossum, "Active pixel sensors: Are CCDs dinosaurs?," in Proc. SPIE Charged-Coupled Devices and Solid State Optical Sensors III, Feb. 1993, vol. 1900, pp. 30–39.
- [23] A. El Gamal, "Trends in CMOS image sensor technology and design," in Proc. Int. Electron Devices Meeting Dig., Dec. 2002, pp. 805–808.
- [24] R. Guidash, T.-H. Lee, P. Lee, D. Sackett, C. Drowley, M. Swenson, L. Arbaugh, R. Hollstein, F. Shapiro, and S. Domer, "A 0.6 μm CMOS pinned photodiode color imager technology," in *Proc. Int. Electron Devices Meeting, Tech. Dig.*, Dec. 1997, pp. 927–929.
- [25] A. Theuwissen, "CMOS image sensors: State-of-the-art and future perspectives," in *Proc. Eur. 37th Solid State Device Research Conf.*, Sep. 2007, pp. 21–27.
- [26] D. G. Chen, D. Matolin, A. Bermak, and C. Posch, "Pulse-modulation imaging-review and performance analysis," *IEEE Trans. Biomed. Circuits Syst.*, vol. 5, no. 1, pp. 64–82, Feb. 2011.
- [27] M.-S. Shin, J.-B. Kim, M.-K. Kim, Y.-R. Jo, and O.-K. Kwon, "A 1.92megapixel CMOS image sensor with column-parallel low-power and area-efficient SA-ADCs," *IEEE Trans. Electron Devices*, vol. 59, no. 6, pp. 1693–1700, Jun. 2012.

- [28] R. Xu, B. Liu, and J. Yuan, "Digitally calibrated 768-kS/s 10-b minimum-size SAR ADC array with dithering," *IEEE J. Solid-State Circuits*, vol. 47, no. 9, pp. 2129–2140, Sep. 2012.
- [29] D. G. Chen, F. Tang, and A. Bermak, "A low-power pilot-DAC based column parallel 8b SAR ADC with forward error correction for CMOS image sensors," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 60, no. 10, pp. 2572–2583, Sep. 2013.
- [30] D. G. Chen, F. Tang, M.-K. Law, and A. Bermak, "A 12 pJ/pixel analog-to-information converter based 816 × 640 pixel CMOS image sensor," *IEEE J. Solid-State Circuits*, vol. 49, no. 5, pp. 1210–1222, May 2014.
- [31] F. Tang, D. G. Chen, B. Wang, and A. Bermak, "Low-power CMOS image sensor based on column-parallel single-slope/SAR quantization scheme," *IEEE Trans. Electron Devices*, vol. 60, no. 8, pp. 2561–2566, Jun. 2013.
- [32] D. G. Chen, F. Tang, M.-K. Law, X. Zhong, and A. Bermak, "A 64 fJ/step 9-bit SAR ADC array with forward error correction and mixedsignal CDS for CMOS image sensors," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 61, no. 11, pp. 3085–3093, Nov. 2014.
- [33] P. Harpe, C. Zhou, Y. Bi., N. van der Meijs, X. Wang, K. Philips, G. Domans, and H. de Groot, "A 26 μW 8 bit 10 MS/s asynchronous SAR ADC for low energy radios," *IEEE J. Solid-State Circuits*, vol. 46, no. 7, pp. 1585–1595, Jul. 2011.
- [34] M. van Elzakker, E. van Tuijl, P. Geraedts, D. Schinkel, E. Klumperink, and B. Nauta, "A 10-bit charge-redistribution ADC consuming 1.9 μW at 1 MS/s," *IEEE J. Solid-State Circuits*, vol. 45, no. 5, pp. 1007–1015, May 2010.
- [35] M. Yoshioka, K. Ishikawa, T. Takayama, and S. Tsukamoto, "A 10-b 50-MS/s 820-  $\mu$ W SAR ADC with on-chip digital calibration," *IEEE Trans. Biomed. Circuits Syst.*, vol. 4, no. 6, pp. 410–416, Dec. 2010.
- [36] C.-C. Liu, S.-J. Chang, G.-Y. Huang, and Y.-Z. Lin, "A 10-bit 50-MS/s SAR ADC with a monotonic capacitor switching procedure," *IEEE J. Solid-State Circuits*, vol. 45, no. 4, pp. 731–740, Apr. 2010.
- [37] Y. Zhu, C.-H. Chan, U.-F Chio, S.-W. Sin, S.-P.U. R. Martins, and F. Maloberti, "A 10-bit 100-MS/s reference-free SAR ADC in 90 nm CMOS," *IEEE J. Solid-State Circuits*, vol. 45, no. 6, pp. 1111–1121, Jun. 2010.
- [38] C.-C. Liu, S.-J. Chang, G.-Y. Huang, Y.-Z. Lin, C.-M. Huang, C.-H. Huang, L. Bu, and C.-C. Tsai, "A 10b 100 MS/s 1.13 mW SAR ADC with binary-scaled error compensation," in *Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers*, Feb. 2010, pp. 386–387.
- [39] D. G. Chen and A. Bermak, "A low-power dynamic comparator with digital calibration for reduced offset mismatch," in *Proc. IEEE Int. Symp. Circuits and Systems*, May 2012, pp. 2825–2828.
- [40] P.-H. Lee, D. G. Chen, A. Bermak, and M.-K. Law, "A high voltage zero-static current voltage scaling ADC interface circuit for microstimulator," in *Proc. IEEE Int. Symp. Circuits and Systems*, Jun. 2014, pp. 1380–1383.
- [41] H. Tian, B. Fowler, and A. Gamal, "Analysis of temporal noise in CMOS photodiode active pixel sensor," *IEEE J. Solid-State Circuits*, vol. 36, no. 1, pp. 92–101, Jan. 2001.
- [42] W. Liu, P. Huang, and Y. Chiu, "A 12-bit, 45-MS/s, 3-mW redundant successive-approximation-register analog-to-digital converter with digital calibration," *IEEE J. Solid-State Circuits*, vol. 46, no. 11, pp. 2661–2672, Nov. 2011.
- [43] S.-K. Lee, S.-J. Park, H.-J. Park, and J.-Y. Sim, "A 21 fJ/conversionstep 100 kS/s 10-bit ADC with a low-noise time-domain comparator for low-power sensor interface," *IEEE J. Solid-State Circuits*, vol. 46, no. 3, pp. 651–659, Jan. 2011.
- [44] S. Lim, J. Lee, D. Kim, and G. Han, "A high-speed CMOS image sensor with column-parallel two-step single-slope ADCs," *IEEE Trans. Electron Devices*, vol. 56, no. 3, pp. 393–398, Mar. 2009.
- [45] S. Lim, J. Cheon, Y. Chae, W. Jung, D.-H. Lee, M. Kwon, K. Yoo, S. Ham, and G. Han, "A 240-frames/s 2.1-mpixel CMOS image sensor with column-shared cyclic ADCs," *IEEE J. Solid-State Circuits*, vol. 46, no. 9, pp. 2073–2083, Sep. 2011.
- [46] R. Xu, B. Liu, and J. Yuan, "A 1500 fps highly sensitive 256 × 256 CMOS imaging sensor with in-pixel calibration," *IEEE J. Solid-State Circuits*, vol. 47, no. 6, pp. 1408–1418, Jun. 2012.
- [47] K. Kagawau, S. Shishido, M. Nunoshita, and J. Ohta, "A 3.6 pW/frame/ pixel 1.35 V PWM CMOS imager with dynamic pixel readout and no static bias current," in *Proc. IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers*, Feb. 2008, pp. 54–595.
- [48] K.-B. Cho, A. Krymski, and E. Fossum, "A 1.5 V 550 μw 176 × 144 autonomous CMOS active pixel image sensor," *IEEE Trans. Electron Devices*, vol. 50, no. 1, pp. 96–105, Jan. 2003.

- [49] "Low resource processing algorithms for laser Doppler blood flow imaging," *Med. Eng. Phys.*, vol. 33, no. 6, pp. 720–729, 2011.
- [50] B. Wilson, Y. Park, Y. Hefetz, M. Patterson, S. Madsen, and S. Jacques, "The potential of time-resolved reflectance measurements for the noninvasive determination of tissue optical properties," *Proc. SPIE*, vol. 1064, pp. 97–106, Aug. 1989.
- [51] A. B. Pravdin, S. R. Utz, and V. I. Kochubey, "Physical modeling of human skin optical properties using milk and erythrocytes mixturesa," *Proc. SPIE*, vol. 2627, pp. 221–226, Dec. 1995.
- [52] H. Z. Yeganeh, V. Toronov, J. T. Elliott, M. Diop, T.-Y. Lee, and K. S. Lawrence, "Broadband continuous-wave technique to measure baseline values and changes in the tissue chromophore concentrations," *Biomed. Opt. Express*, vol. 3, no. 11, pp. 2761–2770, Nov. 2012.
- [53] K. M. Yoo, B. B. Das, and R. R. Alfano, "Imaging of a translucent object hidden in a highly scattering medium from the early portion of the diffuse component of a transmitted ultrafast laser pulse," *Opt. Lett.*, vol. 17, no. 13, pp. 958–960, Jul. 1992.



**Denis Guangyin** Chen (S'07–M'13) received the B.Eng. (1st Class Hons.) degree in electrical and electronic engineering from the University of Canterbury, Christchurch, New Zealand, and the Ph.D. degree in electronic and computer engineering from the Hong Kong University of Science and Technology (HKUST), Kowloon, Hong Kong, in 2008 and 2013, respectively.

Currently, he is a Sensing Systems Hardware Engineer at Apple Inc. From 2013 to 2014, he was a Research Associate at the Smart Sensory Integrated

System (S2IS) Laboratory at HKUST, where he worked on CMOS image sensors, compressive imaging, low-power SAR ADC with error-correction, biomedical implants, smart infrastructures, and laser Doppler imaging (LDI).

IEEE TRANSACTIONS ON MULTIMEDIA, the 2011 IES Prestigious Engineering Achievement Award, and the 2012 Faculty Research Award. As an educator, he received the University Annual Teaching Excellent Award in two consecutive academic years from 2008 to 2010 and many other teaching awards from the Faculty of Engineering. Under his guidance, his students received many awards including the Best Student Paper Award in ICME 2007, winner of the 47th DAC/ISSCC Student Design Contest in 2010, Best Design Award in A-SSCC 2013 Student Design Contest. He serves as Vice President for Publications of the IEEE Circuits and Systems Society, Steering Committee Member of the IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS and IEEE TRANSACTIONS ON MULTIMEDIA, and the Past Chair of the DSP Technical Committee of the IEEE Circuits and Systems (CAS) Society. He was the Editor-in-Chief of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS PART II: EXPRESS BRIEFS for two terms from 2010 to 2013. He also served as Associate Editor for IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-PART I: REGULAR PAPERS, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-PART II: EXPRESS BRIEFS, IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, and Journal of Circuits, Systems Signal Processing in the past 10 years, and was the Guest Editor for eight special issues in IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-PART I: REGULAR PAPERS, IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, and Journal of Circuits, Systems Signal Processing. He was the Vice President of the Asia Pacific Region of the IEEE CAS Society from 2007 to 2008, AdComm Member of the IEEE Biometrics Council from 2008 to 2009, CAS Society Representative to the BioTechnology Council from 2007 to 2009, Chair of the BioCAS Technical Committee of the IEEE CAS Society from 2007 to 2009, Member of the IEEE Medal for Innovations in Healthcare Technology Committee from 2010 to 2012, and a Distinguished Lecturer of the IEEE CAS Society from 2004 to 2005. He is the Founder of the International Conference on Green Circuits and Systems, the Asia Pacific Conference on Postgraduate Research in Microelectronics and Electronics, and the IEEE Biomedical Circuits and Systems Conference. He is a Fellow of the Academy of Engineering Singapore.



**Man-Kay Law** (M'11) received the B.Sc. degree in computer engineering and the Ph.D. degree in electronic and computer engineering from the Hong Kong University of Science and Technology (HKUST), Kowloon, Hong Kong, in 2006 and 2011, respectively.

During his doctoral study, he performed research on ultra-low power/energy harvesting CMOS sensor designs for wireless sensing platforms. He joined HKUST as a Visiting Assistant Professor in February 2011. Currently, he is an Assistant Professor in the

State Key Laboratory of Analog and Mixed-Signal VLSI, University of Macau, Macao, China. His research interests are in the area of ultra-low power energy harvesting and sensing circuits for wireless sensing and biomedical systems, with special emphasis on smart CMOS temperature sensors, CMOS image sensors, ultra-low power analog design techniques, and integrated energy harvesting techniques.

Dr. Law serves as a member of the IEEE Circuits and Systems Society (CASS) Technical Committee on Sensory Systems as well as Biomedical Circuits and Systems.



Yong Lian (M'90–SM'99–F'09) received the B.Sc. degree from the College of Economics and Management, Shanghai Jiao Tong University, Shanghai, China, and the Ph.D. degree from the Department of Electrical Engineering, National University of Singapore (NUS), Singapore, in 1984 and 1994, respectively.

He spent nine years in industry and joined NUS in 1996. He was appointed as the first Provost's Chair Professor in the Department of Electrical and Computer Engineering of NUS in 2011. He is also the

Founder of ClearBridge VitalSigns Pte. Ltd., a startup company for wireless wearable biomedical devices. His research interests include biomedical circuits and systems and signal processing.

Dr. Lian has received many awards including the 1996 IEEE CAS Society's Guillemin–Cauer Award for the best paper published in IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II, the 2008 Multimedia Communications Best Paper Award from the IEEE Communications Society for the paper published in



Amine Bermak (M'99–SM'04–F'13) received the M.Sc. and Ph.D. degrees in electronic engineering from Paul Sabatier University, Toulouse, France, in 1994 and 1998, respectively.

During his doctoral work, he was part of the Microsystems and Microstructures Research Group at the French National Research Center LAAS-CNRS, where he developed a 3D VLSI chip for artificial neural network classification and detection applications in a project funded by Motorola. While completing his Ph.D. degree, he joined the Advanced

Computer Architecture research group at York University, York, U.K., where he was doing postdoctoral work on VLSI implementation of CMM neural network for vision applications in a project funded by British Aerospace. In 1998, he joined Edith Cowan University, Perth, Australia, first as a Research Fellow working on smart vision sensors, then as a Lecturer and a Senior Lecturer in the School of Engineering and Mathematics. In 2002, he moved to the Hong Kong University of Science and Technology (HKUST), Kowloon, Hong Kong, where he is currently a Full Professor leading the Smart Sensory Integrated Systems (S2IS) Lab and also serving as the Director of the M.Sc. degree in Integrated Circuit Design (ICDE). His research interests are related to VLSI circuits and systems for signal, image processing, sensors and microsystems applications. He has authored more than 230 papers on the mentioned topics in various journals, book chapters, and conferences.

Dr. Bermak has received many distinguished awards, including the 2004 IEEE Chester Sall Award; the Best Paper Award at the 2005 International Workshop on System-On-Chip for Real-Time Applications; the student Best Paper Award at IEEE ISCAS 2010; Michael Gale Medal in 2011, and the Engineering Teaching Excellence Award from the School of Engineering HKUST in 2004 and 2009. He held a number of technical program committees' memberships at a number of international conferences, including the IEEE Custom Integrated Circuit Conference and Design Automation and Test in Europe. He is the General Cochair of the IEEE International Symposium on Electronic Design Test and applications, Hong Kong 2008, and the General Cochair of the IEEE Conference on Biomedical Circuits and Systems, Beijing, 2009. He has served and currently serves on many international journal editorial boards, including IEEE TRANSACTIONS ON VLSI SYSTEMS, IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, and IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II. He is a member of the IEEE CAS Committee on Sensory Systems and the CAS Committee on Biomedical Circuits and Systems as well as an IEEE distinguished lecturer.