# A GENERALIZED TIMING-SKEW-FREE, MULTI-PHASE CLOCK GENERATION PLATFORM FOR PARALLEL SAMPLED-DATA SYSTEMS

Sai-Weng Sin, Seng-Pan U<sup>1</sup>, R.P.Martins<sup>2</sup>

Analog and Mixed Signal VLSI Laboratory,

Faculty of Science and Technology, University of Macau, Av. Padre Tomás Pereira S.J., Macao, China

E-mail - terryssw@ieee.org

(1 - Chipidea Microelectronics, Macao, on leave from University of Macau, E-mail - benspu@umac.mo)

(2 - on leave from IST, E-mail - rmartins@umac.mo)

# ABSTRACT

This paper presents a comprehensive analysis of mismatchinsensitive clock generation techniques for general parallel sampled-data systems. Two jitter-insensitive clock generation schemes are described, and a novel class of multi-purpose, lowjitter, multi-phase clock generator platform will be proposed. The platform can provide clock phases for the two pre-described clock generation schemes and has the advantages of being insensitive to timing mismatches, having a simple and highly robust architecture such that the clock generator can be generalized not only for an arbitrary number N of timeinterleaved (TI) paths, but also can be applied to general TI sampled data systems including ADCs, DACs and N-path filters.

# **1. INTRODUCTION**

High-speed sampled-data systems are experiencing an increasingly demand from the present telecommunication systems, as well as, from other signal processing applications, like for example, the single-chip CMOS receiver [1], the digital oscilloscope and RGB-LCD display conversion [2]. Time-Interleaved (TI) architectures are one of the effective solutions to the high-speed sampled-data systems, but they suffer from periodic timing skew that produces modulation images at frequency locations of multiples of  $f_s/M$  ( $f_s$ -overall sampling rate, M-period of timing skew), which are caused by timing mismatches in different time-interleaved paths [3]. These images will severely affect, especially, the dynamic range of TI-sampled data systems in very high-speed applications. As the sampling frequency increases beyond 100MHz, the design of a low-jitter, non-overlapping clock generator becomes a critical task to ensure a satisfactory performance of such high-speed systems.

Various techniques had been reported in the literature to produce low-jitter sampling clocks. One of such techniques uses delay lock loop (DLL) to compensate the timing-jitter [4], but it suffers from complex control and delay sensing circuitry, and the clock de-skew performance is also limited by the speed of the DLL. A more accurate technique comprises the utilization of precise clock edges, such as those from master clock [2-7], to reduce the timing skew, which can be classified in two main categories according to the nature of the assignment, and designated as edge-driven clocking or edge-driven switching. In this paper, both schemes of low-jitter clock generation techniques are described, and a novel class of multi-purpose, low-jitter multi-phase non-overlapping clock generation platform will be proposed. This new platform is generalized for different applications, including TI ADCs, DACs and N-path sampleddata filtering, and it also provides the clock signals for both the clocking and switching schemes. It presents significant advantages due to the simple circuit architecture and the high robustness that allows its extended operation to multi-phase with a low hardware cost.

## 2. CLOCK EDGES REQUIREMENTS

The sampling instants of sampled-data systems are defined either at the rising or falling edges of the sampling clock, thus implying that the clock edges inaccuracy would be directly related to the timing jitter. Considering, for example, an SC TI sample-and-hold (S/H) circuit from an A/D conversion path [8], as shown in Fig. 1(a), that is a typical example of a TI sampleddata system with N paths. As it is common in SC circuits, they usually require multi-phase non-overlapping clock signals  $\phi_1 \ldots \phi_N$  and  $\phi_{1'} \ldots \phi_{N'}$  to control the analog switches. Thus, in order to minimize the effect of signal-dependent charge injection and clock feedthrough errors [2] in the sampling capacitors  $C_1...C_N$ , the switches controlled by pre-phases  $\phi_{1p}...\phi_{Np}$  will be opened slightly earlier than the post-phases  $\phi_1 \dots \phi_N$ , which means that the falling edges of the pre-phases should appear slightly earlier than that of the post-phases as shown in Fig. 2(a). On the other hand, for certain applications with higher accuracy, the rising edges of the pre-phases could be placed also slightly earlier than that of post-phases to further suppress the charge injection errors (with small loss on the available settling time of the opamp) [9]. However, for better settling of the opamps, both phases can rise together as shown in all edges, in gray-color, in Fig. 2(a). Inaccurate sampling clock edges produce periodic timing skew with period M=N and, consequently through this arrangement, the falling edges of the pre-phases would become the critical edges to be controlled for avoiding timing skew in A/D conversion paths [2].

Similarly, for the TI D/A conversion path an output multiplexer is typically utilized, as shown in Fig. 1(b), to produce high-speed output [6]. Since the charges in the capacitors are transferred to the outputs once the switches  $\phi_1...\phi_N$  are turned on, the critical edges are the **rising edges** of the **post-phases** (or both-phases if they rise together) in the **D/A conversion path**, as presented in Fig. 2(b) [6].



Fig. 1: SC sampled-and-hold and multiplexer circuits used in time-interleaved sampled-data systems (modifications to edge-driven switching scheme are also shown): (a) Input S/H circuit; (b) Output multiplexer circuit; (c) Timing-diagram of both circuits Dashed lines point out critical edges



Fig. 2: Timing diagram to illustrate the critical clock edges: (a) A/D, (b) D/A conversion & (c) N-path sampled-data filtering

Finally, for general sampled data systems such as *N*-path filters, both S/H and multiplexer are utilized for input sampling and output play-out, implying that **both edges** described previously will become critical edges. Fig. 2 summarizes the clock waveforms and critical sampling edges for these three different cases.

## 3. JITTER-INSENSITIVE CLOCK GENERATION TECHNIQUES

The timing skew effects can be greatly reduced by using a method called "Clock Edge Reassignment", in which all the critical sampling edges are assigned by only one of the accurate master clock edges (either falling or rising). According to the nature of the assignment of clock edges, this technique can be classified into two categories: edge-driven clocking or edge-driven switching.

# A. Edge-Driven Clocking

This scheme imposes the reduction of the effect of timing skew by accurate control of the clock edge timing. Previously [2-6], such concept has been utilized, and the critical sampling edges of the clock are "reassigned" by another clock signal that possesses an accurate edging time. Such accurate clock sampling will considerably reduce the timing skew and it allows the use (without any modifications) of traditional S/H or multiplexer circuits, as in Fig. 1.

## **B. Edge-Driven Switching**

In addition to the generation of low-jitter multi-phase clock signals by controlling the accuracy of the clock edges, an alternative approach that uses common sampling switches to avoid the timing skew was described in [7], and designated as "Edge-Driven Switching".

The method utilizes a common sampling switch between various paths that is controlled by the common sampling clock  $\phi_S$  operated at full sampling rate, as shown in the modifications

presented in Fig. 1. The clock  $\phi_S$  possesses a series of short negative pulses, controlling the common sampling switch to be closed slightly earlier than the regular clock phases  $\phi_1 \dots \phi_N$ , thus the sampling instants are determined by  $\phi_S$  for A/D conversion which implies that the sampling circuit in Fig. 1(a), even without a set of pre-phases, is free from timing skew influence. For D/A conversion and N-path filters, the multiplexer circuits can be modified with an additional switch S added in the output (Fig. 1(b)). The switch S in the sampling part of the multiplexer is still required to avoid charge injection errors. In this scheme, all clock phases of the multiplexer are identical to those of Fig. 1(a).

## 4. A NOVEL GENERALIZED LOW-JITTER CLOCK GENERATION PLATFORM

Using the previous mentioned techniques a novel, multipurpose platform for generation of low-jitter, multi-phase nonoverlapping clock phases has been developed. This platform is generalized for applications of A/D Conversion (including decimators), D/A Conversion (including interpolators) and Npath sampled-data filtering. Fig. 3 shows the block and timing diagrams of the three proposed clock generators, whereas in Figs. 3(a) and 3(b) Edge-Driven Clocking is presented (relying on accurate control of application-specific clock edges to reduce timing-skew errors) for A/D conversion and D/A conversion or N-path filtering, respectively, and in Fig. 3(c) Edge-Driven Switching is addressed. On the other hand, Fig. 4 exhibits the clock generator platform containing the following building blocks:

#### A. Master-slave D-flip-flop with master output

Fig. 4(a) shows a master-slave D-flip-flop (DFF) with an additional master output that is extracted from the master latch. With such arrangement the master (M) output signal always leads the slave (Q) quadraturely.



(b) Fig. 4: Building blocks of the proposed clock generator platform: (a) Master-slave DFF with master output; (b) Self-starting mod-4 ring counter; (c) Edge Decision Block (EDB)

Master 4

Slave 4

Master 3 Slave 3

# B. Self-starting mod-N ring counter

The self-starting mod-N ring counter is designed to provide shifted negative pulses, which serve as envelopes of various multi-phase clock signals to ensure a non-overlapping operation. Fig. 4(b) shows an example of self-starting mod-4 ring counter, where dummy gates are used to balance the loading conditions of Q and M outputs.

#### C. Edge Decision Block (EDB)

Since for A/D converters (or decimators) the critical sampling edges are on the falling edges of the pre-phases, these falling edges will be "assigned" by the rising edges of an accurate clock (pre/post-clk, delay version of master clock by  $d_0$  and  $d_0+d_{pre}$  respectively), as it is presented in Fig. 3(a) together with the resulting clock phase outputs  $\phi_{mp}/\phi_m$  (m=1...N). However, this assignment is not possible, as it can be verified for example in the shaded area of the figure, where the post-clk signal is equal to 1 and the slave (Q) is 0 in both areas while the output  $\phi_m$  is dissimilar. In order to identify separately these two regions a master output M from the DFF serves as a boundary between the shaded areas. According to the assignment of alphabetic letters shown in the figure, the logic function that implements the desired output is given by

$$out = \overline{C} \cdot \overline{AB} \tag{1}$$

and the circuit diagram shown in Fig. 4(c), called "Edge Decision

Block" (EDB), can be used to implement such a logic function. Also, the function of all shaded delay block  $d_a$  in Fig. 3 is used to provide a delay between the rising edges of pre- and post-phases to further reduce charge injection errors. The amount of nonoverlapping time delay is also shown in Fig. 3.

Master 1

Slave 1

(c)

Master 2 Slave 2

In the second circuit used with D/A converters (or interpolators) or *N*-path filters, both the rising edges of the postphases (or both phases) and the falling edges of the pre-phases become vital. These two critical edges can be controlled simultaneously by the use of identical EDB blocks as previously mentioned, but with different signals routing as shown in Fig. 3(b). The rising edges of both phases are determined by the falling edges of the post-DFF-clk, while the falling edges of the pre-phases are assigned by the rising edges of pre-clk. Since both the pre-clk and post-DFF-clk are delayed versions (by  $d_0$  and  $d_2$  respectively) of the master clock, the *N*-phases produced are accurate in the rising edges of both phases and in the falling edges of the pre-phases, thus suitable in DAC applications, interpolating filters, and *N*-path sampled-data filters. The bubble on the delay  $d_2$  indicates a logic inversion of the delay output.

Finally for the Edge-Driven Switching of Fig. 3(c), since all the pre-phase functions are replaced by a common sampling clock  $\phi_S$ , the corresponding blocks that are responsible for generation of the pre-phases have been removed. Moreover, the blocks that generate various multi-phase non-overlapping clocks



Fig. 5: Hspice FFT simulation results for the various clock generators: (a) conventional; (b) proposed circuits for A/D conversion; (c) for D/A and N-path filters (Edge-Driven Clocking) and (d) for Edge-Driven Switching

 $\phi_1,...\phi_N$  are exactly identical to their corresponding parts in the D/A conversion circuit of Fig. 3(b), and only the common sampling clock  $\phi_S$  needs to be determined. The falling and rising edges of phase  $\phi_S$  are determined by the rising edges of "pre-clk" and the falling edges of "post-DFF-clk", respectively, producing  $\phi_S$  that has the same period  $T_{clk}$  of the master clock. As shown in the timing diagram of Fig. 3(c), the falling edges of  $\phi_S$  due to the various delays presented in Fig. 3(c), thus all the clock signals required by the Edge-Driven Switching scheme can be produced. Such assignment of  $\phi_S$  can be achieved by a simple NAND function. However, to maximize the delay matching, the EDB with the C terminal connected to logic 0 is used instead.

## 5. SIMULATION RESULTS

In order to verify the effectiveness of the proposed clock generation platform, a 4-phase version of the clock generators were simulated with Hspice with the master clock frequency set at 160 MHz (also the overall sampling rate).

The designed clock generators are used to drive the switches in the TI S/H or multiplexer circuits, to sample or play out a 75 MHz sinusoidal signal with the overall sampling frequency of 160 MHz. Fig. 5(a) shows the FFT output spectrum for the S/H circuits that are driven by the 4-phase version of the conventional clock generator from [10] that has no control over clock edge accuracies. The spectrum shows 3 mismatch-induced image tones appearing at frequency 5, 35, 45 MHz, respectively, and only 46.75 dB SNR is achieved, corresponding to 13 ps timing mismatches by the formula derived in [3]. Fig. 5(b) and (c) shows the corresponding spectrum with the proposed clock generator (used in A/D and D/A conversion) and the image tones are effectively reduced with an SNR of 64.15 dB and 66.6 dB respectively (less than 2 ps timing mismatches), corresponding to 17~19 dB improvement. Finally, the third proposed circuit from Fig. 3(c) (Edge-Driven Switching) by its nature doesn't produce any mismatch-induced image tones in the spectrum of Fig. 5(d).

# 6. CONCLUSIONS

A thorough and generalized analysis on multi-phase clock generation techniques to reduce timing skew effects in parallel or time-interleaved sampled-data systems has been presented, and two approaches are described to reduce the effects of timing skew: Edge-Driven Clocking and Edge-Driven Switching. Then, a novel multi-purpose, low-jitter, multi-phase non-overlapping clock generation platform is proposed. The platform is generalized to provide clocking phases for the two pre-described schemes, and also to all three classes of TI sampled-data systems including A/D conversion, D/A conversion and a complete sampled data system (N path filters). Simulations with Hspice are performed to verify the effectiveness of the proposed clock generation techniques. Hspice simulation results shows 17 dB improvement in SNR compared with the conventional clock generator as verified by the FFT analysis of the simulated results.

### **ACKNOWLEDGMENTS**

This work was financially supported by *University of Macau* under the Research Grant with Ref No: RG069/02-03S/MR/FST.

#### REFERENCES

- [1] J.C. Rudell, et al., "1.9GHz wide-band IF double conversion CMOS receiver for cordless telephone application," in *IEEE J. Solid-State Circuits*, vol. 32, pp. 2071-2088, December 1997.
- [2] Y.T.Wang, An 8-Bit 150-MHz CMOS A/D Converter, Ph.D. Dissertation, University of California, Los Angeles, USA, 1999.
- [3] Sai-Weng Sin, Seng-Pan U, R.P.Martins, and J.E.Franca, "Timing-mismatch analysis in high-speed analog front-end with nonuniformly holding output," in in *Proc. ISCAS'03*, vol. 1, pp. 129 – 132, May 2003.
- [4] Lin Wu and W.C. Jr. Black, "A low-jitter skew-calibrated multiphase clock generator for time-interleaved applications," in *ISSCC, Digest of Technical Paper*, vol. 44, no.1, pp. 396 -397, 470, February 2001.
- [5] G..Manganaro, "Feed-forward approach for timing skew in interleaved and double-sampled circuits," in *IEE Electronics Letters*, vol. 37, no.9,26, pp. 552-554, April 2001.
- [6] Seng-Pan U, R.P.Martins, J.E.Franca, "A 2.5V, 57MHz, 15-Tap SC Bandpass Interpolating Filter with 320MHz Output Sampling Rate in 0.35µm CMOS," in *ISSCC Dig. Tech. Papers*, pp.380-382, February 2002.
- [7] L.Sumanen, M.Waltari and K.Halonen, "A 10-bit 200MS/s CMOS parallel pipeline A/D converter," in *IEEE J. Solid-State Circuits*, vol.36, no.7, pp.1048-1055, July 2001.
- [8] C.S.G.Conroy, D.W.Cline and P.R.Gray, "An 8-b 85-MS/s parallel pipeline A/D converter in 1-µm CMOS," in *IEEE J. Solid-State Circuits*, vol.28, no.4, pp. 447-454, April 1993.
- [9]J.Krupar, et al, "Minimizing charge injection errors in highprecision, high-speed SC-circuits," in Proc. ISCAS'01, vol. 1, pp. 727-730, May 2001.
- [10] Yong-In Park, et al, "A low power 10 bit, 80 MS/s CMOS pipelined ADC at 1.8 V power supply," in Proc. ISCAS'01, vol. 1, pp. 580-583, May 2001.