# 國立臺灣大學電機資訊學院電子工程學研究所 

碩士論文Graduate Institute of Electronics Engineering College of Electrical Engineering \＆Computer Science National Taiwan University

Master Thesis

一個高速採用時間交錯式的連續漸進式類比至數位轉換器

## A High－Speed Time－Interleaved SAR ADC

黄維家

Wei－Chia Huang

指導教授：陳信樹博士
Advisor：Hsin－Shu Chen，Ph．D．

中華民國105年7月，
July， 2016

## 國立臺灣大學碩士學位論文口試委員會審定書

## 一個高速採用時間交錯式的連續漸進式類比至數位轉換器

## A High－Speed Time－Interleaved SAR ADC

本論文係 黄維家 君（R02943026）在國立臺灣大學電子工程學研究所完成之碩士學位論文，於民國105年7月13日承下列考試委員審查通過及口試及格，特此證明

口試委員：


所長


## A High-Speed Time-Interleaved SAR ADC

By

Wei-Chia Huang

## THESIS

Submitted in partial fulfillment of the requirement for the degree of Master of Science in Electronics Engineering at National Taiwan University Taipei, Taiwan, R.O.C.

$$
\text { July. } 2016
$$

Approved by :

Advised by:

Approved by Director : $\qquad$

## 致谢

感謝母親李明影，父親黄耀南與兄長黄維清在升學上面的支持。

感謝指導教授陳信樹博士從大學專題研究開始一直以來的指導。

感謝口試委員蔡宗亨教授，劉深淵教授和陳怡然教授的論文建議。

感謝林清凉老師與許文僊老師從大學以來的鼓勵。

感謝張藝薰同學的共同努力。

感謝戴宏彥，蔡博仰，蔡承學，胡耀升，蔡宗翰，施紀懷，李秉玠，陳成銘，莊郁暐，郭文杰，廖柏詩，陳玠瑋，翁志齊，嚴俊軒，金邦諺，吳世偉，林楷越，黄柏超，楊蜜迪，林瑞民，林昭輝，林鼎國，郭懷正，陳韋安，黄立宇等同學們的幫助與課業上的討論。

感謝臺大電機系棒，臺大基層棒球服務社，臺大研究生協會，慈林基金會，福智青年，台灣高等教育產業工會，慕哲政經塾，臺大學代會以及社會上各個崗位的朋友給予我研究以外的視野。

感謝這三年來所有幫助過我的朋友，謝謝。

## 摘要

在次世代無線通訊系統中，類比至數位轉換器必須操作在極高速的取樣頻率及低中解析度下。

本論文提出一個以 40 奈米 CMOS 一般製程，實現出六位元每秒四十五億次取樣的時間交錯連續漸進式類比至數位轉换器，藉由結合前端輸入緩衝器的架構及分組式技巧，有效地降低時間交錯式架構的輸入電容負載，並增加輸入訊號的可用安定時間，提出的 16 通道的時間交錯連續漸進式類比至數位轉换器得以達到高取樣速度及高輸入頻寬的表現。透過使用零交越（zero－crossing）偵測技巧，可以補償多通道間時間偏移的誤差，並透過數位校正處理多通道間的增益電壓及偏移電壓不匹配。在單通道方面，採用連續漸進式架構，並使用非同步處理及單向電容切換技巧使單通道具有高速度及高能量效率的特性。除此之外，任意選擇權重之電容陣列解決了單通道中比較器的動態偏移電壓的問題。

量測結果顯示，在每秒四十億的轉換，DNL 和 INL 分別為＋0．17／－0．29 LSB 和＋0．20／－0．18 LSB。在每秒四十五億的轉換及輸入頻率為一億赫茲下，SNDR 和 SFDR 分別為 32.15 dB 和 41.04 dB ，在 1.2 V 的供應電壓下，功率消耗為 24.9 毫瓦，品質因數（FoM）為 $159 \mathrm{fJ} / \mathrm{c} .-\mathrm{s}$ 。全部的晶片面積大小為 $1.275 \mathrm{~mm}^{2}$ ，核心電路的面積是 0.195 平方毫米。

關鍵詞：類比至數位轉換器，時間交錯式，連續漸進式，分組技巧，高輸入頻寬

Analog-to-digital converter (ADC) has to operate at ultra-high speed with low to medium resolution in the next-generation wireless communication systems.

A 6-bit 4.5 GS/s time-interleaved SAR ADC in 40 nm CMOS general-process (GP) technology is proposed. By combining the front-end input buffers and the grouping technique into the timeinterleaved architecture, the input capacitance effectively decreases and the available settling of input buffers increases. The proposed 16-channel time-interleaved SAR ADC achieves the performance of high-speed sampling rate and high input bandwidth. A zero-crossing detection technique is employed to correct timing skew among sub-ADCs. Gain and offset mismatches between sub-ADCs are calibrated in the digital domain. Asynchronous processing and monotonic capacitor switching technique used in the single-channel SAR ADC make the sub-ADC high speed and power-efficiency. Furthermore, AWCA technique solves the dynamic offset problem of the comparator in the sub-ADC.

The measurement results show that ADC exhibits DNL of $+0.17 /-0.29$ LSB and INL of $+0.20 /-$ 0.18 LSB at $4 \mathrm{GS} / \mathrm{s}$ with $\mathrm{F}_{\text {in }}$ of 50 MHz . SNDR and SFDR are 32.15 dB and 41.04 dB at $4.5 \mathrm{GS} / \mathrm{s}$ with $\mathrm{F}_{\text {in }}$ of 1 GHz . The power consumption is 24.9 mW at 1.2 V supply voltage. As a result, the FoM (Power/2 $2^{\mathrm{ENOB}} / \mathrm{Fs}_{\mathrm{S}}$ ) is $159 \mathrm{fJ} /$ conversion-step. The whole chip including pads occupies $1.275 \mathrm{~mm}^{2}$ while area of core circuit is $0.195 \mathrm{~mm}^{2}$.

Keywords: analog to digital converter, time-interleaved, SAR, grouping technique, high input bandwidth

## Contents

致謝 ..... III
摘要 ..... IV
Abstract ..... V
Contents ..... VI
List of Figures ..... X
List of Tables ..... XIV
Chapter 1 Introduction ..... 1
1．1 Motivation ..... 1
1．2 Thesis Organization ..... 2
Chapter 2 Fundamentals of Analog to Digital Converter ..... 4
2．1 Introduction ..... 4
2．2 Performance Metrics ..... 4
2．2．1 Static Performance ..... 4
2．2．1．1 Offset and Gain Error ..... 4
2．2．1．2 Differential and Integral Nonlinearities（DNL，and INL） ..... 5
2．2．2 Dynamic Performance ..... 6
2．2．2．1 Signal－to－Noise Ratio（SNR） .....  .7
2．2．2．2 Total Harmonic Distortion（THD） ..... ． 7
2．2．2．3 Spurious Free Dynamic Range（SFDR） .....  .7
2．2．2．4 Signal－to－Noise and Distortion Ratio（SNDR） ..... 8
2．2．2．5 Effective Number of Bits（ENOB） ..... 8
2．2．2．6 Figure of Merit（FoM） ..... 8
2．3 Architectures ..... 9
2.3.1 Flash Architecture ..... 9
2.3.2 Successive-Approximation-Register Architecture ..... 10
2.3.1 Time-Interleaved Architecture ..... 12
2.4 Error Sources and Calibrations in Time-Interleaved Architecture ..... 14
2.4.1 Offset Mismatch and Calibration ..... 14
2.4.2 Gain Mismatch and Calibration ..... 17
2.4.1 Timing Skew Mismatch and Calibration ..... 20
2.5 Summary ..... 23
Chapter 3 Proposed Time-Interleaved SAR ADC ..... 24
3.1 Introduction ..... 24
3.2 Time-Interleaved ADC Architecture ..... 24
3.2.1 One Rank Time-Interleaved Architecture ..... 25
3.2.2 Two Rank Time-Interleaved Architecture ..... 28
3.2.3 Grouping Technique ..... 30
3.2.3.1 Prior Work 1: The 64-Channel Time-Interleaved 10-Bit SAR ADC [9] ..... ]...32
3.2.3.2 Prior Work 2: The 4-Channel Time-Interleaved 6-Bit Flash ADC [21]. ..... 34
3.2.3.3 Prior Work 3: The 32-Channel Hierarchical Sampling Time-Interleaved
7-Bit SAR ADC [23] ..... 36
3.3 Proposed Time-Interleaved SAR ADC ..... 39
3.3.1 Design Considerations ..... 44
3.3.1.1 Input Bandwidth and Settling Time Requirement. ..... 44
3.3.1.2 Offset and Gain Mismatch Calibration ..... 50
3.3.1.3 Timing Skew Mismatch Calibration ..... 51
3.3.1.4 Single-Channel SAR ADC ..... 54
3.3.1.4.1 Asynchronous Processing ..... 54
3.3.1.4.2 Monotonic Capacitor Switching ..... 55
3.3.1.4.3 Dynamic Offset ..... 56
3.3.1.4.4 Arbitrary Weight Capacitor Array (AWCA) Technique ..... 57
3.3.1.4.5 Thermal Noise of AWCA ..... 59
3.4 Summary ..... 59
Chapter 4 Circuit Implementation and Simulation Results ..... 61
4.1 Introduction ..... 61
4.2 Building Blocks and Circuit Implementation ..... 62
4.2.1 Front-End Circuit ..... 62
4.2.2 Bias Circuit ..... 65
4.2.3 Single-Channel SAR ADC ..... 66
4.2.3.1 Sampling Switch ..... 66
4.2.3.2 Arbitrary Weight Capacitor Array (AWCA) ..... 68
4.2.3.3 Comparator. ..... 70
4.2.3.4 SAR Logic ..... 72
4.2.3 Clock Generator ..... 75
4.2.4 Delay Line ..... 79
4.2.5 Clock Divider and Multiplexer ..... 80
4.3 Overall ADC Simulation Results ..... 82
4.4 Summary ..... 85
Chapter 5 Measurement Results ..... 86
5.1 Introduction ..... 86
5.2 Floor Plan and Layout Design ..... 86
5.3 PCB Design ..... 89
5.4 Test Setup ..... 94
5.5 Measurement Results ..... 97
5.5.1 Static Performance ..... 97
5.5.2 Dynamic Performance ..... 99
5.6 Discussion ..... 103
5.6.1 Dynamic performance with different RF transformers ..... 103
5.6.2 Analysis of timing skew for simulation and measurement ..... 107
5.7 Summary ..... 113
Chapter 6 Conclusions ..... 115
Chapter 7 Future Work. ..... 116
Bibliography ..... 117

## List of Figures

## Chapter 2

Fig. 2.1 Transfer curve with (a) offset and (b) gain error .....  5
Fig. 2.2 Transfer curve with DNL and INL .....  6
Fig. 2.3 Flash architecture ..... 10
Fig. 2.4 Successive-approximation-register architecture ..... 11
Fig. 2.5 Time-interleaved architecture ..... 13
Fig. 2.6 Time-interleaved timing diagram ..... 13
Fig. 2.7 A two-channel time-interleaved ADC with offset mismatch ..... 15
Fig. 2.8 Offset mismatch (a) ADC output and error signal in time domain (b) ADC output power
spectrum ..... 16
Fig. 2.9 Offset mismatch calibration in analog domain ..... 17
Fig. 2.10 A two-channel time-interleaved ADC with gain mismatch ..... 18
Fig. 2.11 Gain mismatch (a) ADC output and error signal in time domain (b) ADC output power
spectrum ..... 19
Fig. 2.12 A two-channel time-interleaved ADC with timing skew mismatch ..... 21
Fig. 2.13 Timing skew mismatch (a) ADC output and error signal in time domain (b) ADC output
power spectrum ..... 21
Fig. 2.14 Timing-skew detection method ..... 23
Chapter 3
Fig. 3.1 One rank time-interleaved architecture ..... 27
Fig. 3.2 Timing diagram of (a) track-time of $50 \%$ duty cycle (b) track-time of $\frac{1}{\mathrm{M}} \%$ duty cycle ..... 27
Fig. 3.3 Two rank time-interleaved architecture ..... 29
Fig. 3.4 Timing diagram of the two rank time-interleaved architecture ..... 29
Fig. 3.5 Time-interleaved architecture using grouping technique and the time diagram ..... 31
Fig. 3.6 ADC architecture and the timing diagram of prior work 1 ..... 33
Fig. 3.7 Charge sharing problem between T/H switches ..... 35
Fig. 3.8 Block diagram of prior work 2 ..... 36
Fig. 3.9 Architecture and timing diagram of prior work 3 ..... 38
Fig. 3.10 Architecture and timing diagram of the proposed time-interleaved SAR ADC ..... 43
Fig. 3.11 Architecture of sub-ADC ..... 44
Fig. 3.12 (a) Source follower with ideal bias current, its (b) small-signal model, and its (c) high-frequency equivalent model.46
Fig. 3.13 The RC model of the frond-end circuit ..... 48
Fig. 3.14 The block diagram of offset and gain mismatch calibration ..... 51
Fig. 3.15 (a) Sampling waveform of TI ADC (b) block diagram of the timing skew mismatch detection54
Fig. 3.16 Synchronous and asynchronous processing ..... 55
Fig. 3.17 Monotonic switching method ..... 56
Fig. 3.18 Dynamic Offset ..... 57
Chapter 4
Fig. 4.1 (a) Front-end circuit (b) biasing circuit of the source follower ..... 62
Fig. 4.2 Simulated 3-dB frequency versus IREF. ..... 64
Fig. 4.3 Simulated SFDR versus $\mathrm{I}_{\text {BIAS }}$ with 3.78 GHz input frequency ..... 64
Fig. 4.4 Simulated SFDR versus input common mode voltage with 3.78 GHz input frequency ..... 64
Fig. 4.5 Simulated SFDR versus input amplitude with 3.78 GHz input frequency ..... 65
Fig. 4.6 Bias circuit with current routing and voltage routing ..... 66
Fig. 4.7 Sampling switch ..... 67
Fig. 4.8 Turn-on resistance $\left(\mathrm{R}_{\text {on }}\right)$ of the sampling switch versus $\mathrm{V}_{\text {IN }}\left(\mathrm{V}_{\text {op }}, \mathrm{V}_{\text {on }}\right)$ ..... 67
Fig. 4.9 Arbitrary weight capacitor array ..... 69
Fig. 4.10 ENOB versus offset using binary-weighted capacitor array and AWCA ..... 70
Fig. 4.11 Schematic of the dynamic comparator ..... 72
Fig. 4.12 (a) Block diagrams and (b) waveforms of asynchronous SAR logic ..... 74
Fig. 4.13 (a) Conventional DFF control logic (b) DCL ..... 75
Fig. 4.14 (a) Block diagrams and (b) waveforms of clock generator ..... 77
Fig. 4.15 Diagram of $\div 4$ divider ..... 78
Fig. 4.16 Schematic of TSPC flip-flop preloaded (a) with high (b) with low ..... 79
Fig. 4.17 Schematic of delay line ..... 80
Fig. 4.18 Circuit of the clock divider ..... 81
Fig. 4.19 Diagram of Multiplexer ..... 82
Fig. 4.20 Timing diagram of clock divider ..... 82
Fig. 4.21 FFT spectrum with low input frequency ..... 83
Fig. 4.22 FFT spectrum with Nyquist rate input frequency before timing skew calibration ..... 84
Fig. 4.23 FFT spectrum with Nyquist rate input frequency after timing skew calibration. ..... 84
Chapter 5
Fig. 5.1 Floor plan of proposed time-interleaved SAR ADC ..... 87
Fig. 5.2 Floor plan of single-channel SAR ADC ..... 87
Fig. 5.3 Layout of proposed time-interleaved SAR ADC ..... 88
Fig. 5.4 Schematic of analog input on PCB ..... 90
Fig. 5.5 Schemtaic of reference voltage on PCB ..... 91
Fig. 5.6 Schematic of power supplies on PCB ..... 92
Fig. 5.7 Ground and signal of configuration on PCB ..... 93
Fig. 5.8 Photo of whole PCB with time-interleaved ADC chip ..... 93
Fig. 5.9 Test setup ..... 95
Fig. 5.10 Photos of (a) E8257D, (b) SMA100, and (c) PPT-3615 ..... 96
Fig. 5.12 DNL and INL of the single-channel ADC ..... 98
Fig. 5.13 DNL and INL of the time-interleaved ADC ..... 98
Fig. 5.14 FFT spectrum for $\mathrm{F}_{\text {in }}=50.78 \mathrm{MHz}$ (decimated by a factor of 125) ..... 100
Fig. 5.15 FFT spectrum for $\mathrm{F}_{\text {in }}=998 \mathrm{MHz}$ (a) before, and (b) after offset and gain mismatch
calibration (decimated by a factor of 125) ..... 101
Fig. 5.16 SNDR, SFDR and THD versus input frequency at $\mathrm{F}_{\mathrm{s}}=4.5 \mathrm{GS} / \mathrm{s}$ ..... 102
Fig. 5.17 SNDR and SFDR versus sampling frequency with $F_{\text {in }}$ of 50 MHz ..... 102
Fig. 5.18 FFT spectrum for $\mathrm{F}_{\text {in }}=1 \mathrm{GHz}$ with RF transformers (TCM4-14+) ..... 104
Fig. 5.19 SNDR, SFDR and THD versus input frequency range from 1 to 1.4 GHz ..... 104
Fig. 5.20 FFT spectrum at $\mathrm{F}_{\text {in }}=707 \mathrm{MHz}$ (a) time-interleaved result (b) single-channel ..... 106
Fig. 5.21 Floor plan of the clock generator ..... 112
Fig. 5.22 New floor plan of the clock generator ..... 112
Fig. 5.23 Shielding a clock phases $\Phi_{\mathrm{Gi}}(\mathrm{i}=2 \sim 3)$ ..... 112
Fig. 5.24 Power consuming proportion of the proposed ADC ..... 114

## List of Tables

## Chapter 3

Table 3-1 Comparison of different ADC architecture when $\mathrm{M}=16$ ..... 41
Table 3-2 Comparison of the required input bandwidth for time-interleaved architecture. ..... 50
Table 3-3 Error tolerance of corresponding bit ..... 59
Chapter 4
Table 4-1 Performance summary of post-layout simulation for the proposed ADC ..... 85
Chapter 5
Table 5-1 Functions of 55 pins in proposed ADC ..... 88
Table 5-2 Specification of RF transformers ..... 90
Table 5-3 List of materials ..... 94
Table 5-4 C+CC post-layout simulation of timing skew from adjacent channels ..... 109
Table 5-5 R+C+CC post-layout simulation of timing skew from adjacent channels ..... 109
Table 5-6 Variation of timing skew spurs with different arrangement of timing skew ..... 110
Table 5-7 $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ post-layout simulation of timing skew of the new version of layout ..... 113
Table 5-8 Time-interleaved ADC Comparison Table. ..... 113

## Chapter 1

## Introduction

### 1.1 Motivation

Analog-to-digital converters (ADCs) are the essential interface between the analog world and the digital system. ADCs acquire and quantize the analog signal for digital signal processing (DSP). The process of signals in the digital domain has an advantage of high accuracy. It also acquires higher data rate and great power efficiency with the advanced CMOS technology.

High-speed ADCs with multi-GS/s and 4-8 bit resolutions have become an important trend for next-generation communication system such as $60-\mathrm{GHz}$ wireless transceivers, high-speed serial link transceivers, ultra-wideband systems, and millimeter-wave standards system [1], [2]. In the past, ADCs of the flash architecture has been dominantly used for the communication network. However, the flash architecture requires many parallel preamplifiers and comparators to quantize the analog signal at the same time, resulting larger power consumption. Although another ADCs of the pipeline architecture also achieve these requirements, the design of high-performance operational amplifier (OPA) is more challenging with the advanced CMOS process. Moreover, OPAs consume large power in pipelined ADCs.

In recent years, ADCs of the successive approximation register (SAR) architecture have become a mainstream architecture. The SAR ADC is a digital-friendly architecture and the comparator-only operation makes the SAR ADC gain higher conversion rate and area- and power-efficiency with the
advanced CMOS process [3]-[6]. However, the SAR ADC requires several comparison cycles to complete one conversion and limits the ADC in much higher speed applications. On the other hand, when the speed of these single-channel ADCs increases to some limitation, the power consumption grows exponentially.

The time-interleaved (TI) architecture is adopted to fulfill much higher sampling rate by using multiple sub-ADCs in parallel. Sub-ADCs can be any type of ADC architectures, such as flash, pipelined, and SAR architectures. SAR architecture is suitable for TI architecture because it only needs to operate in moderate sampling rate and thus the entire TI ADC maintain good power efficiency [7]-[10]. However, TI ADC suffers from mismatches between sub-ADCs, such as offset, gain, and timing skew [11]. These mismatches degrade the overall performance of ADC. Furthermore, the input capacitance increases proportionally with the number of sub-ADCs, and hence it limits the input bandwidth of the overall ADC.

In this thesis, the proposed frond-end architecture is proposed to overcome the input bandwidth of TI ADC, thus achieving a 16 -channel TI SAR ADC of high-speed sampling rate.

### 1.2 Thesis Organization

This thesis contains seven chapters. In chapter one, it briefly introduces the motivation and the thesis organization. Chapter two presents the basic ADC knowledge and error sources and several methods of calibration in TI ADCs. Chapter three discusses prior works of TI ADCs and the proposed

TI SAR ADC. Details of proposed building blocks with analyses, circuit implementations, and simulation results are described in chapter four. Next, measurement results are presented in chapter five. Finally, chapter six concludes this thesis and chapter seven point out the improvement of this work.

## Chapter 2

## Fundamentals of Analog to Digital Converter

### 2.1 Introduction

Several important performance metrics about static and dynamic performance of ADC are defined [16]. Three types of commonly used ADC architectures are described [16]. Finally, the error sources and related methods of calibration in time-interleaved architecture are discussed.

### 2.2 Performance Metrics

The performance of ADC requires a number of parameters. These parameters can be divided into static and dynamic performance.

### 2.2.1 Static Performance

### 2.2.1.1 Offset and Gain Error

Offset error is the shift of actual transfer curve from ideal transfer curve shown in Fig. 2.1(a). Gain error is the difference of the slopes of actual transfer curve and ideal transfer curve when offset error is zero shown in Fig. 2.1(b).


Fig. 2.1 Transfer curve with (a) offset and (b) gain error

### 2.2.1.2 Differential and Integral Nonlinearities (DNL, and INL)

Differential nonlinearity (DNL) is defined as the difference of each actual transition level width and ideal transition level width without offset and gain error. DNL at code n is given as:

$$
\begin{equation*}
D N L=\frac{V_{n+1}-V_{n}}{V_{L S B}}-1,0 \leq n \leq\left(2^{N}-1\right) \tag{2.1}
\end{equation*}
$$

where $\mathrm{V}_{\mathrm{n}}$ is an actual transition point of digital output code n , and N is the resolution of ADC. $\mathrm{V}_{\text {LSB }}$ is an amplitude voltage of a Least significant bit (LSB), so DNL is expressed by LSB.

Integral nonlinearity (INL) is defined as the deviation of actual transfer curve from an ideal straight line and is also expressed by LSB. INL at code n is said to be the width difference of actual transition point n and ideal transition point n . Summing up DNL from code 0 to code n is an another way to evaluate INL:

$$
\begin{equation*}
\operatorname{INL}(n)=\sum_{i=1}^{n} D N L(i) \tag{2.2}
\end{equation*}
$$

DNL and INL are shown in Fig. 2.2. DNL or INL is often expressed like $+0.5 /-0.5 \mathrm{LSB}$, which are maximum and minimum value from corresponding DNL or INL sequence.


Fig. 2.2 Transfer curve with DNL and INL

### 2.2.2 Dynamic Performance

Dynamic performance is measured by Fast Fourier Transformer (FFT) analyzer, which is a hardware device or mathematic software such as MATLAB. Signal-to-Noise Ratio (SNR), Total Harmonic Distortion (THD), Spurious Free Dynamic Range (SFDR), Signal-to-Noise and Distortion Ratio (SNDR) are the important dynamic performances to evaluate an ADC.

### 2.2.2.1 Signal-to-Noise Ratio (SNR)

Signal to noise ratio (SNR) is the ratio of signal power to noise power. SNR is written as:

$$
\begin{equation*}
\left.\mathrm{SNR}\right|_{\mathrm{dB}}=10 \log \left(\frac{\text { Signal Power }}{\text { Noise Power }}\right) \tag{2.3}
\end{equation*}
$$

### 2.2.2.2 Total Harmonic Distortion (THD)

Total harmonic distortion (THD) is the ratio of harmonics power to signal power. Harmonics power is defined as the root mean square (RMS) of all harmonics in the FFT spectrum. However, in the most cases, the sum of first five harmonics power to represent harmonics power is enough because the rest is insignificant and is submerged in noise flow. THD is expressed as:

$$
\begin{equation*}
\left.\mathrm{THD}\right|_{\mathrm{dB}}=10 \log \left(\frac{\text { Harmonics Power }}{\text { Signal Power }}\right) \tag{2.4}
\end{equation*}
$$

### 2.2.2.3 Spurious Free Dynamic Range (SFDR)

Spurious free dynamic range (SFDR) is the ratio of signal power to the largest power of distortion. For single-channel ADC, the largest distortive power is mainly caused by harmonic distortion. However, for time-interleaved ADC, the largest distortive power may be from mismatch among sub-ADCs. SFDR is mentioned as:

$$
\begin{equation*}
\left.\operatorname{SFDR}\right|_{\mathrm{dB}}=10 \log \left(\frac{\text { Signal Power }}{\text { Largest Distortion Power }}\right) \tag{2.5}
\end{equation*}
$$

### 2.2.2.4 Signal-to-Noise and Distortion Ratio (SNDR)

Signal to noise and distortion ratio (SNDR) is the ratio of signal power to sum of noise power and harmonics power. SNDR is described as:

$$
\begin{equation*}
\left.\operatorname{SNDR}\right|_{\mathrm{dB}}=10 \log \left(\frac{\text { Signal Power }}{\text { Noise Power }+ \text { Harmonics Power }}\right) \tag{2.6}
\end{equation*}
$$

### 2.2.2.5 Effective Number of Bits (ENOB)

Effective number of bits (ENOB) is a measure of the quality of a digitized signal. ENOB is derived from SNDR:

$$
\begin{equation*}
\left.\mathrm{ENOB}\right|_{\mathrm{bit}}=\frac{\left.\mathrm{SNDR}\right|_{\mathrm{dB}}-1.76}{6.02} \tag{2.7}
\end{equation*}
$$

### 2.2.2.6 Figure of Merit (FoM)

Figure of Merit (FoM) is an important performance metric of ADC in terms of power, speed, and accuracy. It is a measure that various types of ADCs can be put together and compared together.

FoM is defined as:

$$
\begin{equation*}
\text { FoM }=\frac{\text { Power }}{2^{\text {ENOB }} \times F_{S}} \text { or } \frac{\text { Power }}{2^{\mathrm{ENOB}} \times E R B W} \tag{2.8}
\end{equation*}
$$

where $\mathrm{F}_{S}$ is the conversion rate of ADC and ERBW is the signal frequency at which SNDR drops by 3 dB compared to low frequency value.

### 2.3 Architectures

### 2.3.1 Flash Architecture

Flash architecture is shown in Fig. 2.3. It contains several $2^{N}-1$ comparators where N is the resolution of ADC and a decoder. Operation of flash ADC is as follows. Analog input of flash ADC is compared with reference voltages generated by resistor-ladder. Output of a comparator is digital " 1 " if analog input is larger than corresponding reference voltage. On the other hand, output of a comparator is digital " 0 " if analog input is smaller than corresponding reference voltage. By collecting digital output of the comparators in order, thermometer code is generated. A decoder converts the thermometer code to the binary code. Besides comparators, preamplifiers are often added in front of comparators to decrease input-referred offset. Flash ADC achieves the fastest conversion rate because of the parallel processing. However, when the resolution of flash ADC increases, power and area consumption increase exponentially. As a result, flash ADC is usually implemented in low resolution and high-speed design.


Fig. 2.3 Flash architecture

### 2.3.2 Successive-Approximation-Register Architecture

Successive-approximation-register (SAR) ADC architecture is shown in Fig. 2.4. It is composed of a front-end track-and-hold circuit (T/H), a comparator, a successive-approximation (SA) control logic, and a digital-to-analog converter (DAC). Binary search algorithm is applied to approach analog input and to simultaneously determine the digital output. Take a 6 -bit resolution SAR ADC as an example. First, T/H acquires analog input voltage while the registers of the SA control logic are set " 100000 ". Through DAC, $\mathrm{V}_{\mathrm{DAC}}$ is set $\frac{\mathrm{V}_{\mathrm{REF}}}{2}$. Next, the comparator compares $\mathrm{V}_{\mathrm{T} / \mathrm{H}}$ with $\mathrm{V}_{\mathrm{DAC}}$, and the result of comparison is send to the $S A$ control logic. If $\mathrm{V}_{\mathrm{T} / \mathrm{H}}$ is larger than $\mathrm{V}_{\mathrm{DAC}}$, most significant bit (MSB) is still " 1 ", and SA control logic is set " 110000 ". Then, $\mathrm{V}_{\mathrm{DAC}}$ becomes $\frac{3 \mathrm{~V}_{\text {REF }}}{4}$ for next
comparison. Otherwise, most significant bit (MSB) is changed to 0 , and SA control logic is set " 010000 " which makes $V_{\text {DAC }}$ equal to $\frac{\mathrm{V}_{\text {REF }}}{4}$ for next comparison. The procedure repeats until six digital code is determined.

Because only one comparator is used in SAR ADC, power consumption is therefore much lower than other ADC. The comparator offset is independent of the linearity since it just shift the entire transfer characteristic. Furthermore, SAR ADC benefits from the advanced CMOS technology and the lower supply voltage due to its digital-friendly architecture. Therefore, SAR ADC recently becomes a popular ADC architecture in medium-to-high resolution and low-power design.


Fig. 2.4 Successive-approximation-register architecture

### 2.3.1 Time-Interleaved Architecture

The time-interleaved ADC is an effective way to achieve a high-speed conversion rate. The architecture of time-interleaved ADC is shown in Fig. 2.5. It is established by paralleling M ADCs where M is the number of ADC channels (sub-ADCs). The time-interleaved ADC can be implemented by various type of ADC architecture such as flash, SAR or pipelined architecture. It works as follow: In a time-interleaved ADC with M channels, the sampling clock enters clock generator and divided into multiple clock phases (clk1~clkM), and each clock phase triggers the corresponding sub-ADC to sample $\mathrm{V}_{\text {in }}$ analog signal and then the sub-ADC starts to digitize $\mathrm{V}_{\text {in }}$ analog signal. When conversion is done, digital output of channels are realigned through multiplexer (MUX) circuit. The timing diagram of TI ADC is shown in Fig. 2.6. The time interval between subsequent channels is one sampling period $\frac{1}{\mathrm{~F}_{\mathrm{s}}}$. By interleaving M channels, the time-interleaved ADC operate M-times faster than the single-channel ADC. This architecture relieves the harsh requirement that avoids designing a single-channel ADC to work at high conversion rate, and hence lowers the power consumption. However, the time-interleaved ADC suffers from mismatch between channels, such as offset, gain, and timing skew mismatches. These mismatch degrade the performance heavily and the ADC needs to be calibrated [12]-[15]. Furthermore, the Nyquist frequency of the time-interleaved ADC is M times faster than a single-channel ADC. It makes the front-end circuit of ADC such as input buffers and $T / H$ circuits need sample $M$ times faster frequency of input signal. Therefore, the front-end circuit becomes a challenging issue in the time-interleaved architecture.


Fig. 2.5 Time-interleaved architecture


Fig. 2.6 Time-interleaved timing diagram

### 2.4 Error Sources and Calibrations in Time-Interleaved Architecture

Although the time-interleaved ADC has high sampling rate and high energy-efficiency, mismatches among sub-ADCs cause undesired spur over the signal spectrum. These phenomena are described and related techniques of calibration are also introduced in this section. Generally, calibration is processed in two steps: mismatch detection and error correction. Two steps perform in either digital or analog domain. In addition, the calibration is implemented in either foreground or background. Foreground calibration means that the ADC is calibrated during start-up and ADC starts normal operation after calibration. On the contrary, background calibration enables continuous calibration while ADC maintains the normal operation.

### 2.4.1 Offset Mismatch and Calibration

A two-channel time-interleaved ADC having offset mismatch is shown in Fig. 2.7, and assuming that other characteristics are ideal. If the input is a sinusoidal signal $\mathrm{V}_{\mathrm{in}}=\cos \left(2 \pi \mathrm{f}_{\mathrm{in}} \mathrm{t}+\theta\right)$, the output of two-channel ADC is written as

$$
\begin{array}{ll}
\mathrm{y}_{1}[n]=\cos \left(2 \pi f_{\text {in }} n T_{s}+\theta\right)+V_{\text {os } 1} & n=\text { even }  \tag{2.9}\\
\mathrm{y}_{2}[n]=\cos \left(2 \pi f_{\text {in }} n T_{s}+\theta\right)+V_{\text {os } 2} & n=\text { odd }
\end{array}
$$

where $\mathrm{f}_{\mathrm{in}}$ is input frequency; n is sampling point; $\mathrm{T}_{\mathrm{s}}$ is sampling period; and $\mathrm{V}_{\text {osi }}$ is channel's offset $(\mathrm{i}=1,2)$.

Considering (2.9), the output of the ADC is expressed as

$$
\begin{equation*}
\mathrm{y}[n]=\cos \left(2 \pi f_{i n} n T_{s}+\theta\right)+V_{o s}+\frac{\Delta V_{o s}}{2} \cos \left(2 \pi \frac{f_{s}}{2} n T_{s}\right) \tag{2.10}
\end{equation*}
$$

where $\mathrm{V}_{\text {os }}=0.5\left(\mathrm{~V}_{\text {os } 1+} \mathrm{V}_{\text {os } 2}\right), \Delta \mathrm{V}_{\text {os }}=\mathrm{V}_{\text {os } 1}-\mathrm{V}_{\text {os } 2}$; and $\mathrm{f}_{\mathrm{s}}$ is sampling frequency.
In (2.10), the third term shows that offset mismatch causes a tone at $\frac{\mathrm{f}_{\mathrm{s}}}{2}$ and it is independent of the amplitude or frequency of the input. As shown in Fig.2.8, the error due to offset mismatch does not depend on the input signal and the noise tone is observed at $\frac{\mathrm{f}_{\mathrm{s}}}{2}$ in the output spectrum.

In more channels than two-channel tome-interleaved ADC, the distortions caused by offset mismatch appear at the frequencies:

$$
\begin{equation*}
\frac{k}{M} f_{s} \quad k=1, \ldots, M-1 \tag{2.11}
\end{equation*}
$$

where $M$ is the number of channels in a time-interleaved ADC.


Fig. 2.7 A two-channel time-interleaved ADC with offset mismatch

(a)

(b)

Fig. 2.8 Offset mismatch (a) ADC output and error signal in time domain (b) ADC output power spectrum

To solve offset mismatch is relatively simple as hardware and algorithm complexity. Fig. 2.9 shows a common calibration technique [12], [17] in analog domain. First, the analog input nodes are shorted and then the output would correspond to the sigh of the input-refereed offset during detection. Next, the calibration logic such as DAC starts to adjust the comparator's current sources or variable capacitors and compensate the offset voltage. Finally, the digital output begins to oscillate and the offset converges to the smallest correction step at the end of calibration. Moreover, if ADC has frontend buffers, these buffers may cause offset mismatch. The problem is solved by the same technique. Other calibration technique in digital domain is extensively used [15], [18]-[19]. Each sub-ADC's offset is detected by calculating long term mean of its output with a quasi-stationary input signal such
as sinusoidal wave. Then offset is calibrated by subtracting the offset value fromits respective output.


Fig. 2.9 Offset mismatch calibration in analog domain

### 2.4.2 Gain Mismatch and Calibration

Fig. 2.10 shows a two-channel time-interleaved ADC with gain mismatch. Similarly, suppose that other characteristics are ideal and the input is a sinusoidal signal $\mathrm{V}_{\mathrm{in}}=\cos \left(2 \pi \mathrm{f}_{\mathrm{in}} \mathrm{t}+\theta\right)$, the output of two-channel ADC is written as

$$
\begin{array}{ll}
\mathrm{y}_{1}[n]=G_{1} \cos \left(2 \pi f_{\text {in }} n T_{s}+\theta\right) & n=\text { even } \\
\mathrm{y}_{2}[n]=G_{2} \cos \left(2 \pi f_{\text {in }} n T_{s}+\theta\right) & n=\text { odd } \tag{2.12}
\end{array}
$$

where $f_{i n}$ is input frequency; $n$ is sampling point; $G_{i}(i=1,2)$ is gain of the ADC; and $T_{s}$ is sampling period.

Considering (2.12), the output of the ADC is expressed as

$$
\begin{equation*}
\mathrm{y}[n]=G \cos \left(2 \pi f_{i n} n T_{s}+\theta\right)+\frac{\Delta G}{2} \cos \left[2 \pi\left(f_{i n}-\frac{f_{s}}{2}\right) n T_{s}+\theta\right] \tag{2.13}
\end{equation*}
$$

where $G=0.5\left(\mathrm{G}_{1+} \mathrm{G}_{2}\right) ; \Delta \mathrm{G}=\mathrm{G}_{1}-\mathrm{G}_{2} ; \mathrm{f}_{\mathrm{s}}$ is sampling frequency.

In (2.13), the second term shows that gain mismatch causes a noise tone at $\mathrm{f}_{\mathrm{in}}-\frac{\mathrm{f}_{\mathrm{s}}}{2}$. Consequently, it is dependent on the input frequency but independent of the input amplitude. As shown in Fig.2.11, the noise tone is at $f_{i n}-\frac{f_{s}}{2}$ in the output spectrum. In time domain, the largest error occurs at the peak of the sinusoid wave. In more channels than two-channel tome-interleaved ADC, the distortions caused by gain mismatch appear at the frequencies:

$$
\begin{equation*}
\pm f_{i n}+\frac{k}{M} f_{s} \quad k=1, \ldots, M-1 \tag{2.14}
\end{equation*}
$$

where $M$ is the number of channels in a time-interleaved ADC.


Fig. 2.10 A two-channel time-interleaved ADC with gain mismatch



Fig. 2.11 Gain mismatch (a) ADC output and error signal in time domain (b) ADC output power spectrum

Gain mismatch calibration is similar to that in offset mismatch and they are generally calibrated together [15], [18]-[19]. Each sub-ADC's gain is detected by long term average of each absolute output with sinusoidal wave and then gain mismatch can be corrected by dividing each sub-ADC output's estimated gain in digital domain. Using calibration DAC to correct sub-ADCs' or front-end buffers' gain is also a correction method in analog domain [9], [20], [21].

### 2.4.1 Timing Skew Mismatch and Calibration

As shown in Fig. 2.12, the two-channel time-interleaved ADC is considered again with timing mismatch. It is supposed that $\mathrm{clk}_{1}$ is sampling ideally but $\mathrm{clk}_{2}$ has timing skew dt . In this situation, the output of two-channel ADC can be written as

$$
\begin{array}{ll}
\mathrm{y}_{1}[n]=\cos \left(2 \pi f_{\text {in }} n T_{s}+\theta\right) & n=\text { even }  \tag{2.15}\\
\mathrm{y}_{2}[n]=\cos \left[2 \pi f_{\text {in }} n\left(T_{s}+d t\right)+\theta\right] & n=\text { odd }
\end{array}
$$

where $f_{i n}$ is input frequency; $n$ is sampling point; $T_{s}$ is sampling period; dt is $\mathrm{ADC}_{2}$ 's timing skew.

By combing two sub-ADCs' output through some computation simplification, the output of the ADC is expressed as

$$
\begin{equation*}
\mathrm{y}[n]=\cos \left(2 \pi f_{i n} n T+\pi f_{i n} d t+\theta\right)-\frac{\omega d t}{2} \sin \left[2 \pi\left(\frac{f_{s}}{2}-f_{i n}\right) n T-2 \pi f_{i n} \frac{d t}{2}-\theta\right] \tag{2.16}
\end{equation*}
$$

In (2.13), the first term is the actual input sample which has a little amplitude modulation because of the timing skew. The second term causes the noise tone of the timing skew at $\frac{f_{s}}{2}-f_{i n}$. It is shifted by 90 degree compared to the gain mismatch while the location of noise tones is the same as that of gain mismatch in the output spectrum. As shown in Fig. 2.13, the largest error occurs when the input signal has the largest slope in the time domain. Furthermore, in more channels than two-channel tomeinterleaved ADC, the distortions caused by timing skew mismatch appear at the frequencies:

$$
\begin{equation*}
\pm f_{\text {in }}+\frac{k}{M} f_{s} \quad k=1, \ldots, M-1 \tag{2.17}
\end{equation*}
$$

where $M$ is the number of channels in a time-interleaved ADC.


Fig. 2.12 A two-channel time-interleaved ADC with timing skew mismatch


Fig. 2.13 Timing skew mismatch (a) ADC output and error signal in time domain (b) ADC output power spectrum

Unlike offset mismatch or gain mismatch, it is much harder to detect and correct timing skew mismatch. Although there are several techniques proposed to calibrate timing-skew mismatch, each technique has its own disadvantage and this issue is still being researched. In the beginning, digital calibration methods related digital signal processing (DSP) approaches is presented. It needs FIR
filters or FFT processors and causes large chip-area and high power consumption. Hence, this method is seldom implemented in real chips. In [12], a correlation-based algorithm is used between sub-ADCs to estimated timing skew in background. In [10], the derivative of the input signal was estimated and thus timing skew is measured by combing the input derivative and error in sub-ADC output. However, both works [10] and [12] require additional reference ADCs for calibration. Another method in [13] detects timing skew by computing the probability of a zero crossing among additional samples. Nevertheless, these additional samples result in extra timing error. Fig. 2.14 shows a common detection method used recently [22]. The average value of $\left|x_{2}-x_{1}\right|-\left|x_{3}-x_{2}\right|$ is proportional to timing skew value if analog input signal is wide-sense stationary (WSS) such as sinusoidal wave. In this situation, the autocorrelation of input signal depends only on time difference rather than time absolute value. Then, timing skew can be detected due to this property. After timing skew is detected, the correction is done in digital or analog domain. However, the convergence time of these techniques of calibration is another problem in real application. In digital domain, numerous digital filters are needed, which costs high power and large area. On the other hand, variable delay line are added at the sampling clock to adjust timing skew in analog domain, while it increases the noise and jitter of the sampling clock.


Fig. 2.14 Timing-skew detection method

### 2.5 Summary

In Chapter 2, how to quantify the performance of ADC is introduced and various types of ADC such as flash, SAR, and time-interleaved architecture mentioned in this thesis are also described.

Detailed discussion about mismatch and calibration in time-interleaved architecture help to understand the proposed time-interleaved SAR ADC in Chapter 3.

## Chapter 3

## Proposed Time-Interleaved SAR ADC

### 3.1 Introduction

As mentioned in Section 2.3, the time-interleaved architecture is popular for multi-GS/s conversion rate because it maintains excellent power efficiency. However, the input bandwidth of the time-interleaved architecture is a challenge because the Nyquist frequency is N times higher than that of a single-channel ADC.

On the other hand, as a single-channel architecture, SAR architecture shows great powerefficient in advanced CMOS technology. Therefore, the SAR ADC with time-interleaved technique is proposed in this thesis. In this chapter, the time-interleaved ADC architecture and an issue of bandwidth are introduced in Section 3.2. Section 3.3 presents the proposed time-interleaved SAR ADC and discusses the design consideration. Finally, Section 3.4 summarizes this work.

### 3.2 Time-Interleaved ADC Architecture

In this section, two conventional time-interleaved architectures are described. However, these architectures have limitation of the input bandwidth when the number of channel becomes large. Recently, grouping technique is a solution to overcome this issue. Three related prior works [9], [21], [23] are introduced to draw forth the proposed time-interleaved SAR ADC.

### 3.2.1 One Rank Time-Interleaved Architecture

Fig. 3.1 shows the first conventional time-interleaved architecture and it is also called one rank time-interleaved architecture [32]. The sampling rate of a time-interleaved ADC is M times faster than that of sub-ADC where M is the number of sub-ADCs. Each sub-ADC has its track-and-hold (T/H) circuit which is triggered by corresponding clock phases (clki, $\mathrm{i}=1,2 \ldots, \mathrm{M}-1, \mathrm{M}$ ). This architecture achieves high sampling rate, but the problem of timing skew which is described in Section 2.4.1 degrades performance of ADC and needed to be solved.

The input bandwidth is an important consideration of time-interleaved ADC because it determines the operation speed and the allowable number of sub-ADCs. Two main factors which affect the input bandwidth are the input capacitance and the available settling time. According to [31], an approximation for the total input capacitance is

$$
\begin{equation*}
C_{I N}=N \cdot C_{\text {sample }}+M \cdot C_{\text {switch }}+M^{2} \cdot C_{\text {wire }} \tag{3.1}
\end{equation*}
$$

where $\mathrm{C}_{\text {sample }}$ is sampling capacitance of a single-channel $\mathrm{ADC}, \mathrm{C}_{\text {switch }}$ is the capacitance of $\mathrm{T} / \mathrm{H}$ circuit and $\mathrm{C}_{\text {switch }}$ consists of gata capacitances, $\mathrm{C}_{\text {wire }}$ is the capacitance of wire of a definite length, N is the number of $\mathrm{T} / \mathrm{H}$ circuits which turn on at the same time, and M is the number of $\mathrm{T} / \mathrm{H}$ circuits which connect the input. $\mathrm{C}_{\text {sample }}$ usually dominates the input capacitance. The input capacitance is proportional to N and the input bandwidth decreases as the input capacitances increases. Besides the input capacitance, the available settling time affects the requirement of input bandwidth. The available settling time is determined by the track-time of T/H circuits and duty cycle of clock phases.

As shown in Fig. 3.2, the different time diagrams of clock phases cause different influences on the input bandwidth, where M is the number of sub-ADCs, and Fs is the sampling rate. Take an 8 GS/s 16-channel ( $\mathrm{M}=16$ ) TI ADC as an example, and two different time diagrams of clock phases are described. In Fig. 3.2(a), a number of $8 \mathrm{~T} / \mathrm{H}$ circuits are in the sampling mode at the moment due to $50 \%$ duty cycle of clock phases. An approximation for the total input capacitance is

$$
\begin{equation*}
C_{I N}=8 \cdot C_{\text {sample }}+16 \cdot C_{\text {switch }}+16^{2} \cdot C_{\text {wire }} \tag{3.2}
\end{equation*}
$$

The third team is ignored for simplicity, and hence the input capacitance is 8 times larger than that of a single channel ADC. Due to $50 \%$ duty cycle of clock phases, the available settling time is 1 ns . The number of channels is limited due to the insufficient bandwidth. Moreover, this time diagram causes charge sharing between T/H switches and degrades the accuracy of the sampling signals. Therefore, this architecture is seldom used in recent years.

Another timing diagram is shown in in Fig. 3.2 (b). The duty cycle of clock phases is less than $6.25 \%$. An available settling time of one period is less than 125 ps . This timing diagram ensures that only one input capacitance of sub-ADC is connect to the input at a time, lowering the input capacitance and enabling the higher number of channels. An approximation for the total input capacitance is

$$
\begin{equation*}
C_{I N}=1 \cdot C_{\text {sample }}+16 \cdot C_{\text {switch }}+16^{2} \cdot C_{\text {wire }} \tag{3.3}
\end{equation*}
$$

However, as the number of channels increases, the duty cycle of clock phases decreases. The available time for sampling capacitance to settle is reduced. $\mathrm{C}_{\text {switch }}$ and $\mathrm{C}_{\text {wire }}$ still increase when the number of
channels increases. The improvement of the bandwidth is still limited.

In conclusion, both timing diagrams of the conventional time-interleaved architecture have the problem of limited bandwidth if the number of channels becomes large.


Fig. 3.1 One rank time-interleaved architecture


Fig. 3.2 Timing diagram of (a) track-time of $50 \%$ duty cycle (b) track-time of $\frac{1}{\mathrm{M}} \%$ duty cycle

### 3.2.2 Two Rank Time-Interleaved Architecture

For one rank time-interleaved architecture, timing skew mismatch between $\mathrm{T} / \mathrm{H}$ switches results in error of sampling instants and degrades the SNDR of the ADC. To solve the timing skew issue, a two rank architecture also called the frontend sampler time-interleaved architecture [17]-[19], [33] is shown in Fig. 3.3. Fig. 3.4 shows the timing diagram. The frontend sampler operates at maximum sampling rate $\mathrm{F}_{\mathrm{s}}$ with the same clock phase, sampling all the input signal. Then T/H circuits of subADCs sample the constant DC signal from the frontend sampler. Thus, timing skew issue does not affect performance even if timing skew mismatch between sub-ADCs occurs. Because there are two rank of T/H, the concept of hierarchy is introduced and spread by other wok recently. It is discussed in Section 3.2.3.3.

The disadvantage is that the frontend sampler need to operate at the full sampling rate and the track time is $\frac{1}{2 \mathrm{M}} \%$ duty cycle. The track time is half shorter than the one rank architecture shown in Fig. 3.2 (b). An approximation for the total input capacitance is

$$
\begin{equation*}
C_{I N}=1 \cdot C_{\text {sample }}+M \cdot C_{\text {swich }}+M^{2} \cdot C_{\text {wire }} \tag{3.4}
\end{equation*}
$$

$\mathrm{C}_{\text {switch }}$ and $\mathrm{C}_{\text {wire }}$ also limit the bandwidth of fronted sampler when the channel increases.

Therefore, in this architecture, the frontend sampler limits the sampling rate of the ADC and the speed of the ADC cannot operate as fast as the one rank time-interleaved architecture.


Fig. 3.3 Two rank time-interleaved architecture


Fig. 3.4 Timing diagram of the two rank time-interleaved architecture

### 3.2.3 Grouping Technique

Grouping technique is used to solve the input bandwidth of time-interleaved architecture when the number of channels becomes extensive. Fig. 3.5 is the basic time-interleaved architecture using grouping technique. There are totally $\mathrm{M} \times \mathrm{N}$ sub-ADCs which is divided into M groups. Each group usually has a frond-end sampler forwarding the input signal to the sampling capacitor of the subADC. The parasitic capacitance at the output of Frond-end sampler is thus reduced drastically because a small number of parallel T/H circuits in a group. Like the timing diagram in Fig. 3.5, if only one T/H turns on at a time, the input capacitance is roughly the same as a single channel ADC. An approximation for the total input capacitance is

$$
\begin{equation*}
C_{I N}=1 \cdot C_{\text {sample }}+M \cdot C_{\text {switch }}+M^{2} \cdot C_{\text {wire }} \tag{3.5}
\end{equation*}
$$

$\mathrm{C}_{\text {switch }}$ and $\mathrm{C}_{\text {wire }}$ are N times smaller than architecture without grouping technique.

Moreover, combining grouping technique with hierarchical concept makes time-interleaved architecture have a variety of architectures. Three related prior works [9], [21], [23] are introduced to draw forth the proposed time-interleaved SAR ADC.


Fig. 3.5 Time-interleaved architecture using grouping technique and the time diagram

### 3.2.3.1 Prior Work 1: The 64-Channel Time-Interleaved 10-Bit SAR ADC [9]

Prior work 1 aims at a high resolution ADC for GS/s rates. As resolution increases, the sampling capacitor also inevitably increases to solve thermal noise due to $\frac{\mathrm{C}}{\mathrm{kT}}$. This phenomenon causes the limitation of the number of channels when one rank or two rank time-interleaved architecture are used. Therefore, this prior work use a grouping concept to successfully interleave 64 SAR ADCs.

This prior work implemented a 64-channel 10-bit $2.6 \mathrm{GS} / \mathrm{s}$ time-interleaved SAR ADC in 65 nm CMOS. Fig. 3.6 shows ADC architecture and the timing diagram. 64 ADC units are arranged in four groups. Each group is partitioned into two domain: T/H and Quarter SAR ADC arrays. Each T/H drives a Quarter ADC array (QADC) which consists of 16 SAR ADCs. An on-chip calibration corrects gain and offset mismatches.

The operation using the time diagram is described as follows. A clock generator receives 2.6 GHz CLK and each Local clk generates 650 MHz clocks with $50 \%$ duty cycle for the operation of $\mathrm{T} / \mathrm{H}$. In addition, four QADCs receive 650 MHz clocks with $0^{\circ}, 90^{\circ}, 180^{\circ}, 270^{\circ}$, which are used to generate local clocks for the operation of each SAR ADC. Each SAR ADC produces 10-bit data after 12 cycles, one for resampling input signal and 11 for quantization.

Partitioning ADC into 4 groups makes T/H and SAR ADC arrays separate interconnect from each group. It reduces parasitic capacitance of T/H from routing of connecting ADC arrays and clock circuit, enabling wide input bandwidth and allows the use of 64-channel ADCs. However, using one buffer to drive a QADC which has 16 ADC units is still difficult due to large output load. Accordingly,
one SAR ADC is driven by one buffer to reduce load and 1-16 demultiplexer (demux) resamples the signal on the main sampling capacitor Cs to one buffer. The disadvantage is that large power is consumed due to a total of 64 buffers.


Fig. 3.6 ADC architecture and the timing diagram of prior work 1

### 3.2.3.2 Prior Work 2: The 4-Channel Time-Interleaved 6-Bit Flash ADC [21]

Prior work 2 proposed the interleaved architecture to improve the input bandwidth limitation of the TI ADC. As shown in Fig. 3.7, T/H switches of 4-channel sub-ADCs are driven by one buffer $\mathrm{G}_{1}$. In architecture 1 , two clock phases such as clk1and clk2 overlap each other at the same time, corrupting the sampling input signal because charge sharing between two T/H switches causes an error in the sampling signal. For architecture 2 in Fig.3.7, this problem is solved by dividing T/H switches in two groups. And clock phases are exchanged. For example, clk1 and clk3 are in one group now and they do not overlap each other. As a result, each group is driven by one buffer $\mathrm{G}_{1}$ and charge sharing does not occur. Another advantage is that the track-time of T/H switches becomes $50 \%$ duty cycle by grouping sub-ADCs. The available settling time is prolonged. The concept of architecture 2 is used in this work.

Fig. 3.8 shows the architecture of TI ADC in prior work 2. This prior work implemented a 4channel 6-bit 10.3 GS/s time-interleaved flash ADC in 40 nm CMOS. The ADC comprises of two frontend variable gain amplifiers VGAs (G1) and 4-channel sub-ADCs with their own VGAs (G2). A G1 provides coarse gain control at high bandwidth, while a slower G2 provides fine gain control. Each G1 drives a pair of T/H switches that are triggered by complementary $2.575-\mathrm{GHz}$ clock phases. Since only small capacitance is used in T/H switch's output, G1 achieves high tracking bandwidth allowably. Then, each G2 connects a T/H circuit and drives the sub-ADC's input load. However, both buffer G1 and G2 have programmable resistor array that vary gain to calibrate gain mismatch. G2
also uses a 5-bit DAC to calibrate offset. In addition, G1 uses shunt peaking to achieve a nominal bandwidth of 6 GHz . The inductances occupy a huge area.

In summary, from prior work 2, we know that grouping of sub-ADCs can mitigate charge sharing between T/H switches and improve the input bandwidth of the TI ADC. But VGAs need large area and calibration.


Architecture 1


Architecture 2


Fig. 3.7 Charge sharing problem between T/H switches


Fig. 3.8 Block diagram of prior work 2

### 3.2.3.3 Prior Work 3: The 32-Channel Hierarchical Sampling Time-Interleaved 7-Bit SAR ADC [23]

As discussed in Section 3.2.2, as the number of sub-ADCs becomes large, it is challenging to drive the overall sub-ADCs at high frequency because the enormous sampling and parasitic capacitances limit the bandwidth. A broadband input buffer may be added to increase bandwidth. Nevertheless, this buffer has to consume excessive power to meet high demand [24].

Prior work 3 proposed a hierarchical sampling architecture combined with the grouping technique to overcome these issues if 32 sub-ADCs are used. This prior work implemented a 32channel 7-bit 12.8 GS/s time-interleaved SAR ADC in 65 nm CMOS. The architecture and timing diagram are shown in Fig. 3.9. The hierarchical sampling architecture consists of three rank of samplers (Rank 1-3). Each rank of samplers includes the buffer, T/H switches, and sampling
capacitance. First, Rank-1 sampler is adopted to sample the input signal when phase $\Phi_{1}$ triggers the T/H circuit. As described in Section 3.2.2, the output voltage of Rank-1 sampler is constant during the hold time. As a result, the timing mismatch caused in the Rank-2 and Rank-3 samplers does not degrade the performance. However, Rank-1 sampler still needs a lot of effort such as power and area to achieve high bandwidth.

The input bandwidth requirements in the Rank-2 and Rank-3 samplers are also alleviated due to the constant voltage of signal. Next, the Rank-2 sampler is triggered by clock phase $\Phi_{2 \mathrm{~A}<1: 4>}$. In this stage, through proper arrangement of clock phases, the buffer only need to drive one sampling capacitance at a time. Finally, in a Rank-3 stage, 32-channel sub-ADCs are divided into 4 groups. Each group has 8 -channel sub-ADCs which are driven by one buffer. The previous held sampled signal is then buffered and re-tracked by the T/H circuit of the sub-ADC. Like the Rank-2 sampler, Rank-3 sampler also drives one sampling capacitance at a time. Therefore, in Rank-2 and Rank-3 stage, parasitic and sampling capacitances are greatly reduced. The input bandwidth of the ADC is not limited.

In summary, this work use three rank architecture combining with grouping technique to lower the requirement of input bandwidth if the large number of sub-ADCs is required. In addition, unlike the prior work 1, the number of buffers is reduced to six with the aid of three rank. However, the input bandwidth of Rank-1 sampler consist of a buffer and T/H is still a problem and $\mathrm{G}_{1}$ consumes huge power.


Rank-3


Fig. 3.9 Architecture and timing diagram of prior work 3

### 3.3 Proposed Time-Interleaved SAR ADC

According to the discussion of time-interleaved architecture in Section 3.2, we summarize some techniques briefly. One rank architecture is the simplest way to implement a time-interleaved technique. However, it has both problems of the input bandwidth and timing skew. Next, two rank architecture which pluses a front-end sampler to solve the timing skew issue. The disadvantage is that the front-end sampler should operate at the full sampling rate, and its track-time is worse than one rank architecture. Both architectures have limitation of the number of interleaved channels when ADC extends to multi-GS/s operation. On the other hand, recently, grouping technique is proposed to overcome this issue.

Prior work 1 and work 2 both have the technique that groups sub-ADCs into several section. The input load is thus reduced and become unrelated to the number of whole channels. Accordingly, accomplishing the demand of high input bandwidth is much more effortless than the conventional architecture. Nevertheless, how to grouping sub-ADCs depends on the number of channels and the specifications of ADC. For example, Prior work 1 still needs 64 buffers to drive sub-ADCs because there are still 16 sub-ADCs in a group. The number of sub-ADCs in a group is still large. Prior work 2 only has 4-channel ADCs, so grouping ADC is simple. But the design of VGA is complicated and VGA consumes huge area due to calibration and the inductances. The architecture in prior work 3 combines the hierarchy with grouping technique. There rank of samplers and four group of sub-ADCs are determined to confront a large load from 32 sub-ADCs. Nevertheless, the operation of Rank-1
stage is still like the conventional two rank architecture. Acquiring enough input bandwidth should pay for excessive power consumption.

Look back to our goal, we aim to implement a low power 6-bit ADC with high sampling rate of greater than multi-GS/s. The SAR architecture is chosen as a sub-ADC for its high power efficiency in the advanced CMOS process. Furthermore, time-interleaved architecture extends this advantage properly. On the basis of the specification, we interleave 16-channel SAR ADCs and achieve the goal by combining the front-end input buffers and the grouping technique for time-interleaved architecture.

With a 16-channel time-interleaved ADC, different hierarchical architecture and grouping technique affect performance. The relationship is discussed as follows. It may provide the best optimum architecture for the 16 -channel time-interleaved ADC. Table. 3-1 summarizes this relationship. We define $M$ is the number of sub-ADCs, $H$ is the number of hierarchy and $G$ is the number of groups. $\mathrm{t}_{\text {settle }}$ is the worst time for the input buffer to settle; Duty Cycle is the ratio of the sampling time and the conversion time for a sub-ADC; and Buffer is the number of the needed buffers. For example, if $\mathrm{H}=2$, the architecture is like the two rank architecture and $\mathrm{t}_{\text {settle }}$ is too short to meet the settling requirement. Thus, $\mathrm{H}=2$ is not considered as a chosen architecture. Because the ADC only has 16-channel, $\mathrm{H}=3$ or 3-rank architecture like prior work 3 is unnecessary in this architecture. When $\mathrm{H}=1$ and $\mathrm{G}=8, \mathrm{t}_{\text {settle }}$ is longest but it needs 8 buffers and suffers from higher power consumption. On the other hand, if G is 2 , the duty cycle for the sub-ADC is unbalanced because the track-time is still short and the conversion time is too long for the 6-bit SAR ADC. It may has idle time for conversion.

Therefore, the most favorable architecture is $\mathrm{H}=1$ and $\mathrm{G}=4$. In this choice, $\mathrm{t}_{\text {settle }}$ is available for input buffers to settle and $25 \%$ duty cycle is balanced for the operation of sub-ADC.

Table 3-1 Comparison of different ADC architecture when $\mathrm{M}=16$

| M | H | G | $\mathrm{t}_{\text {settle }}$ | Duty Cycle | Buffer |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 16 | 1 | 2 | $2 \mathrm{~T}_{\mathrm{s}}$ | $12.5 \%$ | 2 |
| $\mathbf{1 6}$ | $\mathbf{1}$ | $\mathbf{4}$ | $\mathbf{4 T}_{\mathrm{s}}$ | $\mathbf{2 5 \%}$ | $\mathbf{4}$ |
| 16 | 1 | 8 | 8 T | $50 \%$ | 8 |
| 16 | 2 | 2 | $0.5 \mathrm{~T}_{\mathrm{s}}$ | $12.5 \%$ | 3 |
| 16 | 2 | 4 | $0.5 \mathrm{~T}_{\mathrm{s}}$ | $25 \%$ | 5 |

The proposed time-interleaved SAR ADC and timing diagram are shown in Fig. 3.10. The architecture is composed of 16 -channel SAR ADCs, 4 groups of front-end buffers, a clock generator and a multiplexer (MUX) circuit. In this design, the input signal is sampled by 4 groups of source followers (SF). Each SF drives one group of ADCs which consist of 4 SAR ADCs. Each T/H circuit of the sub-ADC samples the signal forwarded by the SF with the $25 \%$ duty cycle clock. Then, SAR ADCs of each group convert the input signal to 7-bit output data with 1-bit redundancy. Output data from each ADC are combined by the multiplexer (MUX) and decoded to six binary bits. Calibrations of the offset and gain mismatch between channels are performed off-chip. The timing-skew mismatch is also detected off-chip and then the delay cells of the clock generator is adjusted to suppress the timing skew.

A timing diagram is shown in Fig. 3.10. In GROUP 1, the SAR ARC 1 samples the input signal
when $\Phi_{1}$ is high, starting conversion when $\Phi_{1}$ is low. Similarly, other SAR ADCs operates sequentially. Although sampling phases overlap between groups, front-end SFs separate the path of the charge sharing between T/H circuits. In other words, by grouping sub-ADCs, only one T/H circuit of the SAR ADC samples the input signal at the same time in a group. Thus, it avoids the problem of the charge sharing between sub-ADCs with the proper duty cycle of clock phase. It solves the distortion of the sampling signal. An approximation for the total input capacitance is

$$
\begin{equation*}
C_{I N}=1 \cdot C_{\text {sample }}+4 \cdot C_{\text {switch }}+4^{2} \cdot C_{\text {wire }} \tag{3.6}
\end{equation*}
$$

The input capacitance is nearly the same as a single-channel ADC. Consequently, buffers' load greatly decreased even 16 -channel is adopted. Another benefit is that the $25 \%$ duty cycle clock used also fits the operation of the SAR ADC.



Fig. 3.10 Architecture and timing diagram of the proposed time-interleaved SAR ADC

In sub-ADC, the 6-bit SAR ADC is shown in Fig. 3.11. It is composed of transmission gate switches, arbitrary weight capacitor array (AWCA), a comparator, and SAR logic. Top-plate sampling and the monotonic switching are used for its simple digital logic and low power consumption. Although the monotonic switching may cause a signal-dependent dynamic offset, the AWCA [25] technique adopted solves this problem. Since the AWCA is based on the non-binary concept, one additional comparison is performed in 6-bit SAR ADC.


Fig. 3.11 Architecture of sub-ADC

### 3.3.1 Design Considerations

In this section, we discuss the design consideration of the proposed time-interleaved SAR ADC.

First, we analyze the input bandwidth of the ADC in Section 3.3.1.1. Second, mismatches in this design is considered in Section 3.3.1.2 and 3.3.1.3. Finally, several techniques in single-channel SAR ADC are described in Section 3.3.1.4.

### 3.3.1.1 Input Bandwidth and Settling Time Requirement

As described in Section 3.3, front-end buffers are used to drive sub-ADCs and become a key point whether the ADC have enough bandwidth.

This work chooses the source follower, rather than the operational amplifier (OPAMP) for
several reasons although the OPAMP based closed-loop configuration achieves good linearity and gain desensitization. First, the open-loop buffer that accomplishes the input signal of 6-bit accuracy is enough in this design. Second, the buffer's gain only changes the signal's range and does not affect linearity severely so the unity-gain buffer based closed-loop configuration is unnecessary. Finally, it is very hard to design an OPAMP based closed-loop residue amplifier that conforms to the require bandwidth in the gigahertz range because voltage headroom and output resistance are small in 40 nm CMOS technology. Instead, the open-loop source follower offers a higher bandwidth and faster settling time easily when compared to the OPAMP based closed-loop residue amplifier.

The PMOS source follower depicted in Fig. 3.12(a), where $C_{L}$ is the equivalent capacitance load contributed mainly by the AWCA of the SAR ADC. The bulk of the PMOST is tied to the source, which eliminates the nonlinearity due to body effect. Using the ideal current source replaces the bias circuit of the source follower to simplify analysis. As the small-signal model is shown in Fig. 3.12(b), the output resistance of $\mathrm{M}_{1}$ is $\mathrm{r}_{\mathrm{O} 1}$. The output resistance of current source is $\mathrm{r}_{\mathrm{O} 2}$. It is implemented by cascode PMOS current source in practice. .


Fig. 3.12 (a) Source follower with ideal bias current, its (b) small-signal model, and its (c) high-

## frequency equivalent model

The $\mathrm{V}_{\text {out }}$ of source follower is expressed as:

$$
\begin{equation*}
v_{\text {out }}=-g_{m}\left(v_{\text {out }}-v_{\text {in }}\right)\left(r_{O 1} / / r_{O 2}\right) \tag{3.7}
\end{equation*}
$$

Since $\mathrm{r}_{\mathrm{O} 2}$ is larger than $\mathrm{r}_{\mathrm{O} 1}$, it follows that

$$
\begin{equation*}
\frac{v_{\text {out }}}{v_{\text {in }}}=\frac{g_{m}\left(r_{O 1} / / r_{O 2}\right)}{1+g_{m}\left(r_{O 1} / / r_{O 2}\right)} \approx \frac{g_{m} r_{O 1}}{1+g_{m} r_{O 1}}=1-\frac{1}{1+g_{m} r_{O 1}} \tag{3.8}
\end{equation*}
$$

where $g_{m} r_{O 1}$ is the intrinsic gain of transistor $M_{1}$. Hence, the gain of the source follower is roughly smaller than 1, but this situation does not cause influence on the conversion of the ADC.

The input bandwidth of the source follower dominantly determines the performance of the ADC at high input frequency. It is obtained through high-frequency equivalent model shown in Fig. 3.12 (c). By applying Kirchhoff's current law at output node ( $\mathrm{V}_{\text {out }}$ ), we add up all of the current:

$$
\begin{equation*}
v_{s g} C_{G S} s+g_{m} v_{s g}+v_{\text {out }}\left(s C_{L}+\frac{1}{r_{O 1}}\right)=0 \tag{3.9}
\end{equation*}
$$

It follows that

$$
\begin{equation*}
v_{s g}=-\frac{s C_{L}+\frac{1}{r_{O}}}{g_{m}+C_{G S} s} v_{\text {out }} \tag{3.10}
\end{equation*}
$$

And

$$
\begin{equation*}
v_{\text {in }}=v_{\text {out }}-v_{s g} \tag{3.11}
\end{equation*}
$$

Substituting equation (3.10) into equation (3.11), we have

$$
\begin{equation*}
\frac{v_{o u t}}{v_{\text {in }}}=\frac{g_{m}+C_{G S} s}{s\left(C_{L}+C_{G S}\right)+g_{m}+\frac{1}{r_{O 1}}} \tag{3.12}
\end{equation*}
$$

Consequently, the unity-gain bandwidth is

$$
\begin{equation*}
\omega_{u}=\omega_{3 d B} \approx \omega_{p}=\frac{g_{m}+\frac{1}{r_{O 1}}}{C_{L}+C_{G S}} \approx \frac{g_{m}}{C_{L}+C_{G S}} \tag{3.13}
\end{equation*}
$$

Next, we calculate the requirement of the frond-end circuit on the basic of the input bandwidth and the settling time. Fig. 3.13 shows a first-order RC model of the frond-end circuit and the bandwidth at $\mathrm{V}_{\text {out }}$ is

$$
\begin{equation*}
\omega_{\text {vout }}=\frac{1}{\left(R_{O}+R_{\text {on }}\right) C_{I N}} \tag{3.14}
\end{equation*}
$$

where $\mathrm{R}_{\mathrm{O}}$ is the output impedance of the source follower and $\mathrm{R}_{\mathrm{O}}$ is $\frac{1}{g_{m}}, \mathrm{R}_{\mathrm{on}}$ is an equivalent onresistance of the $\mathrm{T} / \mathrm{H}$ switch, and $\mathrm{C}_{\text {IN }}$ is the total input capacitance.


Fig. 3.13 The RC model of the frond-end circuit

The setting error should less than $\frac{1}{2} \mathrm{~V}_{\text {LSB }}$ to attain the required accuracy. Hence, the settling time of the frond-end circuit is defined as the required time that $\mathrm{V}_{\text {out }}$ reaches to within $\frac{1}{2} \mathrm{~V}_{\text {LSB }}$ of its final value and it is written as:

$$
\begin{equation*}
V_{\text {out }}\left(t=t_{\text {sertle }}\right)=V_{\text {final }}\left(1-e^{t_{\text {setele }} / \tau}\right) \geq V_{\text {final }}\left(1-\frac{1}{2^{n+1}}\right) \tag{3.15}
\end{equation*}
$$

where $\mathrm{t}_{\text {settle }}$ is the required settling time, $\mathrm{V}_{\text {final }}$ is the final voltage of $\mathrm{V}_{\text {out }}, \tau$ is the RC time constant, and $n$ is the resolution of the ADC .

Rearrange (3.15), we have

$$
\begin{equation*}
t_{\text {settle }} \geq(n+1) \cdot \tau \cdot \ln (2) \tag{3.16}
\end{equation*}
$$

In fact, $\mathrm{t}_{\text {settle }}$ is determined by the track-time of clock phases and it is fixed. According to the value of $\mathrm{t}_{\text {settle }}$, we know the required input bandwidth of the frond-end circuit and ensure the goal of design. By rearranging (3.16) and substituting $\tau=\frac{1}{\omega_{\text {vout }}}$, the required input bandwidth is

$$
\begin{equation*}
\omega_{\text {vout }}=2 \pi f_{\text {vout }} \geq \frac{1}{t_{\text {settle }}} \cdot(n+1) \cdot \ln (2) \tag{3.17}
\end{equation*}
$$

For example, in this work, the maximum settling time is $\mathrm{T}_{\mathrm{s}} \times 16 \times \frac{1}{4}$ due to 16 -channels and duty cycle clock is $25 \%$ where $\mathrm{T}_{\mathrm{s}}$ is the sampling period. Assuming that the ADC operate at $8 \mathrm{GS} / \mathrm{s}$ sampling rate, $\mathrm{T}_{\mathrm{s}}$ is 125 ps and the maximum settling time is 500 ps . By substituting $\mathrm{n}=6$ and $\mathrm{t}_{\text {settle }}$ $=500 \mathrm{p}$ into (3.14), the required bandwidth of the front-end circuit is approximately 1.55 GHz .

Table 3-2 compares the input bandwidth requirement of time-interleaved architecture mentioned in Section 3.2 and 3.3. In these architectures, we assume a 6-bit 16-channel TI ADC under the sampling rate of $8 \mathrm{GS} / \mathrm{s}$ and clock phases do not overlap each other. Consequently, one rank timeinterleaved architecture in Fig. 3.2(b) only has $6.25 \%$ duty cycle and the available settling time is 125 ps. According to (3.14), the required input bandwidth is 6.2 GHz . Two rank time-interleaved architecture has the shorter available settling time due to its frontend sampler and the required input bandwidth is 12.4 GHz . On the other hand, the proposed architecture has the lowest requirement of input bandwidth. The available settling time is 500 ps and the requirement of input bandwidth is only 1.55 GHz . This requirement of the input bandwidth in this work is 4 time lower than that of one rank time-interleaved architecture and 8 time lower than that of two rank time-interleaved architecture. It effectively makes the design of the front-end circuit easier than other architectures.

Table 3-2 Comparison of the required input bandwidth for time-interleaved
architecture

| Architecture | Available settling time (ps) | Required input bandwidth (GHz) |
| :---: | :---: | :---: |
| Proposed | 500 | 1.55 |
| One rank (Fig. 3.2b) | 125 | 6.2 |
| Two rank | 62.5 | 12.4 |

### 3.3.1.2 Offset and Gain Mismatch Calibration

As mentioned in Section 2.4, in time-interleaved architecture, the problem of mismatch among sub-ADCs should be considered and eliminated. In our work, offset mismatch is attributed to 4 groups of source followers and the comparator of the SAR ADC among 16-channels. The sources of gain mismatch are mainly the same as that offset mismatch. Moreover, other major sources of gain mismatch are from reference voltage mismatch and capacitive DACs. Therefore, these mismatches need to be calibrated.

The effort of the calibration is simple. We use the method described in Section 2.4.1. It follows that each sub-ADC's offset is detected by calculating long term mean of its output with a quasistationary input signal such as sinusoidal wave. Then the offset value is subtracted from its respective output [15], [18]-[19]. Likewise, the gain calibration method we use is described in Section 2.4.2. Each sub-ADC's gain is detected by long term average of each absolute output with sinusoidal wave and then gain mismatch is corrected by dividing each sub-ADC output's estimated gain in digital domain [15], [18]-[19].

Fig. 3.14 shows 2-channel TI ADC as an example. According to the above algorithm, offset ${ }_{2}$ is
the estimated offset mismatch of $\mathrm{SAR}_{\mathrm{ADC}_{2}}$ relative to $\mathrm{SAR} \mathrm{ADC}_{1}$. Then, $\mathrm{y}_{2}$ call represents the offsetcalibrated digital output of $\operatorname{SAR} \mathrm{ADC}_{2}$. Next, by normalizing $\mathrm{y}_{2}$ call with respect to $\mathrm{y}_{1}$, the final calibrated digital output $\mathrm{y}_{2}$ cal2 is acquired. This calibration is realized via MATLAB for its simplicity.


Fig. 3.14 The block diagram of offset and gain mismatch calibration

### 3.3.1.3 Timing Skew Mismatch Calibration

After offset and gain mismatches are calibrated, the timing skew should be detected and corrected. We use the approach based on the correlation between the probability density of zerocrossing and the difference of the ADC output [26].

Fig. 3.15 is an example how to calculate the difference of the correlation $\varepsilon$. The difference of the correlation $\varepsilon$ is a function of the timing mismatch between $\operatorname{SAR~ADC~}_{1}$ and $\operatorname{SAR} \mathrm{ADC}_{2}$. Assuming that the input is a stationary signal with zero mean such as sinusoidal wave and $\operatorname{SAR} \mathrm{ADC}_{2}$ samples the input signal $\mathrm{X}_{2}$ at $\mathrm{t}_{2}$, which is skewed by $\Delta \mathrm{T}$. Now, we calculate the cross-correlation between the difference $\mathrm{y}_{2}-\mathrm{y}_{1}$ and the sign signal $\mathrm{S}_{2}$, obtaining $\mathrm{C}_{1}$ which is proportional to timing skew $\Delta \mathrm{T}$.

Similarly, $\mathrm{C}_{2}$ is acquired by the same computation. The difference of the correlations $\mathrm{C}_{1}$ and $\mathrm{C}_{2}$ is $\varepsilon$. The long term accumulation of $\varepsilon$ would represents the relative skew. Then, we can adjust the delay cells of the clock phase $\Phi_{1}$ to correct the timing skew $\Delta \mathrm{T}$.

The detailed explanation is as follows. The difference of digital output of ADCs between adjacent channels is

$$
\begin{equation*}
a_{i}=y_{i}-y_{i+1} \tag{3.15}
\end{equation*}
$$

where i refers the channel number.

Then, the sign signal $\mathrm{S}_{\mathrm{i}}$ belongs to $\{-1,0,+1\}$, which is obtained by zero-crossing detection (ZCD). When zero-crossing (ZC) occurs, $\mathrm{S}_{\mathrm{i}}$ produces either -1 or +1 . When ZC does not occur, $\mathrm{S}_{\mathrm{i}}$ is 0 . It is written as

$$
\begin{align*}
S_{i+1} & =\operatorname{sign}\left(y_{i}-y_{i+1}\right) \\
& =\frac{y_{i}-y_{i+1}}{\left|y_{i}-y_{i+1}\right|} \tag{3.16}
\end{align*}
$$

Next, the cross-correlation between $\mathrm{S}_{\mathrm{i}+1}$ and $\mathrm{a}_{\mathrm{i}}$ according to the definition is

$$
\begin{align*}
C_{i} & =E\left[S_{i+1} \times a_{i}\right]  \tag{3.17}\\
& =E\left[S_{i+1} \times\left(y_{i}-y_{i+1}\right)\right]
\end{align*}
$$

Substitute (3.16) into (3.17), then

$$
\begin{align*}
C_{i} & =E\left[\left(\frac{y_{i}-y_{i+1}}{\left|y_{i}-y_{i+1}\right|}\right) \times\left(y_{i}-y_{i+1}\right)\right]  \tag{3.18}\\
& =E\left[\left|y_{i}-y_{i+1}\right|\right]
\end{align*}
$$

In accordance with the property of the ZC [13], the ZC probability between $\mathrm{y}_{1}$ and $\mathrm{y}_{2}$ is proportional to the sampling interval $\mathrm{T}_{\mathrm{s}}$. If timing skew $\Delta \mathrm{T}$ occurs and then sampling interval changes
into $\mathrm{T}_{\mathrm{s}}+\Delta \mathrm{T}$, the $\mathrm{C}_{\mathrm{i}}$ becomes

$$
\begin{equation*}
C_{i}=E\left[\left|y_{i}-y_{i+1}\right|\right] \propto \sum\left|y_{i}-y_{i+1}\right| \propto\left[2 f_{i} \times\left(T_{s}+\Delta T\right)\right] \tag{3.19}
\end{equation*}
$$

where $2 f_{i} \times\left(T_{s}+\Delta T\right)$ is the probability if zero-crossing.

Eventually,

$$
\begin{align*}
\varepsilon & =C_{i-1}-C_{i} \\
& =E\left[\left|y_{i-1}-y_{i}\right|\right]-E\left[\left|y_{i}-y_{i+1}\right|\right] \tag{3.20}
\end{align*}
$$

$\varepsilon$ becomes the relative timing skew between adjacent channels.

(a)

(b)

Fig. 3.15 (a) Sampling waveform of TI ADC (b) block diagram of the timing skew mismatch detection

### 3.3.1.4 Single-Channel SAR ADC

In this section, design consideration of the single-channel SAR ADC for time-interleaved ADC is discussed.

### 3.3.1.4.1 Asynchronous Processing

Fig. 3.16 demonstrates the concept of synchronous and asynchronous processing of SAR ADC.

The SAR ADC has two main phases: sampling phase and conversion phases. In synchronous processing, the clock phase are divided into $\mathrm{N}+1$ equal cycle in N -bit resolution. One cycle is for sampling, while other N cycles are for the bits conversion. Each bit is processed in each cycle. In this technique, one disadvantages is that a clock signal which is $\mathrm{N}+1$ times higher than the conversion rate. Such clock generator consumes huge power to operate at high frequency. Moreover, synchronous processing wastes a lot of time because the time of each cycle is determined by maximum DAC settling time. To eliminate these disadvantages, asynchronous clock control is adopted [3]. In asynchronous process, when the comparator finishes comparison, the comparator starts the next comparison through the control of SAR logic. Therefore, time of every cycle is optimized. In this design, asynchronous processing exert its advantage with $25 \%$ of the clock phase.


Fig. 3.16 Synchronous and asynchronous processing

### 3.3.1.4.2 Monotonic Capacitor Switching

For SAR ADC, switching methods of capacitive DAC is a developing filed. Several powerefficient switching methods have been proposed to reduce the power consumption such as splitcapacitor switching [27], $\mathrm{V}_{\text {сом-based }}$ switching [28], and monotonic capacitor switching [29]. However, the power consumption in high-speed and low-to-medium resolution ADC is only roughly $15 \%$ or less in the overall power consumption of SAR ADC. Therefore, we choose the switching method based on other considerations.

Monotonic switching method is used because it has an advantage in speed when compared with other switching methods. Fig. 3.17 shows the monotonic switching method in a 3-bit ADC. The operation is as follows. All bottom plates of the capacitors are connected to $\mathrm{V}_{\text {REF }}$ during the sampling
phase, and next they are switched to ground step by step during the each conversion phase. $\mathrm{V}_{\text {inp }}$ or $\mathrm{V}_{\text {inn }}$ in each phase decreases and $\mathrm{V}_{\mathrm{cm}}$ gradually decreases from $\frac{1}{2} \mathrm{~V}_{\text {ref }}$ to 0 . When a PMOS type dynamic comparator is adopted, its comparison time becomes faster due to the falling $\mathrm{V}_{\mathrm{cm}}$. However, this property also causes the problem of the dynamic offset. It needs to be solved and is discussed in Section 3.3.1.4.3 and Section 3.3.1.4.4.


Fig. 3.17 Monotonic switching method

### 3.3.1.4.3 Dynamic Offset

The offset voltage of one comparator is expressed by

$$
\begin{equation*}
V_{O S}=\Delta V_{T H}+\frac{\left(V_{G S}-V_{T H}\right)}{2}\left(\frac{\Delta S}{S}+\frac{\Delta R}{R}\right) \tag{3.21}
\end{equation*}
$$

where $\Delta \mathrm{V}_{\mathrm{TH}}$ and $\mathrm{V}_{\mathrm{GS}}$ are the threshold voltage and the gate-to-source voltage of the input differential pair; $\Delta_{\mathrm{S}}$ and $\Delta_{\mathrm{R}}$ are the differential pair and their load resistance mismatch.
$\Delta \mathrm{V}_{\text {TH }}$ is a static offset which does not affect the linearity because all digital codes are shifted by
the same offset. However, the second term is the dynamic offset for its variable value $\left(\mathrm{V}_{\mathrm{GS}}-\mathrm{V}_{\mathrm{TH}}\right)$ when the monotonic switching method is used.

As shown in Fig. 3.17, ( $\left.\mathrm{V}_{\mathrm{GS}}-\mathrm{V}_{\mathrm{TH}}\right)$ stands for the common-mode voltage and descends along with the switching procedure. The dynamic offset affect the linearity heavily. Fig. 3.17 shows a 6 -bit ADC with dynamic offset. It is assumed that $\mathrm{V}_{\mathrm{os}}(\mathrm{p}+1)=\frac{\mathrm{V}_{\mathrm{os}}(\mathrm{p})}{2}$, where p is the comparison cycle, starting from 1 to 5 and $\mathrm{V}_{\mathrm{os}}(1)$ is 5 LSB . Input range is 0.4 V . There are many missing code resulting in poor linearity as shown in Fig. 3.18.


Fig. 3.18 Dynamic Offset

### 3.3.1.4.4 Arbitrary Weight Capacitor Array (AWCA) Technique

As described in Section 3.3.1.4.3, AWCA technique [25] is adopted to solve dynamic offset.

AWCA is the concept of non-binary technique. We choose arbitrary radix which is less or equal to 2 between adjacent capacitors. The capacitor weight from MSB to LSB are [1584221] in this design. The following equations which represent successive-approximation algorithm in conventional binary-weight DAC and AWCA in 6-bit resolution.

$$
\begin{align*}
& V_{\text {IN }}-\left(b_{1} \frac{32}{64} V_{\text {REF }}+b_{2} \frac{16}{64} V_{\text {REF }}+b_{3} \frac{8}{64} V_{\text {REF }}+b_{4} \frac{4}{64} V_{\text {REF }}+b_{5} \frac{2}{64} V_{\text {REF }}+b_{6} \frac{1}{64} V_{\text {REF }}\right)=V_{Q}  \tag{3.22}\\
& V_{\text {IN }}-\left(b_{1} \frac{30}{64} V_{\text {REF }}+b_{2} \frac{16}{64} V_{\text {REF }}+b_{3} \frac{8}{64} V_{\text {REF }}+b_{4} \frac{4}{64} V_{\text {REF }}+b_{5} \frac{4}{64} V_{\text {REF }}+b_{6} \frac{2}{64} V_{\text {REF }}+b_{7} \frac{1}{64} V_{\text {REF }}\right)=V_{Q} \tag{3.23}
\end{align*}
$$

where $\mathrm{V}_{\mathrm{Q}}$ is quantization error.

As a result, $\mathrm{V}_{\mathrm{Q}}$ should match the following condition.

$$
\begin{equation*}
0 \leq V_{Q}<V_{L S B} \tag{3.24}
\end{equation*}
$$

where $V_{L S B}=\frac{1}{64} V_{\text {REF }}$.
In (3.22), if $\mathrm{b}_{1}$ is falsely determined by the comparator due to the dynamic offset, $\mathrm{V}_{\mathrm{Q}}$ cannot meet the condition in (3.24). Digital output code is thus incorrect. On the contrary, if $b_{1}$ is also wrongly determined in (3.23), following bits still make $\mathrm{V}_{\mathrm{Q}}$ follow (3.24). It means that different digital codes represent the same input voltage. Since $b_{2} \sim b_{7}$ represent $\frac{35}{64} V_{\text {REF }}$ in total, the error tolerance interval of $\mathrm{V}_{\text {IN }}$ at MSB cycle is

$$
\begin{equation*}
\frac{30}{64} V_{R E F} \leq V_{I N}<\frac{35}{64} V_{\text {REF }}+V_{Q}=\frac{36}{64} V_{R E F} \tag{3.25}
\end{equation*}
$$

In (3.25), it is the range of $\mathrm{V}_{\text {IN }}$ at MSB cycle that is covered up even though $b_{1}$, or MSB, is wrongly determined. Calculating error tolerance interval at other conversion cycles is similar if a wrong decision occurs in that cycle. Table 3-2 summarizes error tolerance of corresponding bit in this
design. If dynamic offset of the comparator is designed within $\pm 3 \mathrm{LSB}$, the conversion result is still right. Although the last two cycle do not have redundancy, the variation of $\mathrm{V}_{\mathrm{cm}}$ as well as offset is small. The problem of dynamic offset is insignificant at last conversion cycles.

Table 3-3 Error tolerance of corresponding bit

| Capacitor size | 15 C | 8 C | 4 C | 2 C | 2 C | 1 C |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Weight | 30 | 16 | 8 | 4 | 4 | 2 | 1 |
| Error tolerance | $\pm 3$ | $\pm 2$ | $\pm 2$ | $\pm 2$ | 0 | 0 |  |

### 3.3.1.4.5 Thermal Noise of AWCA

In this design, total capacitance of 3 fF or unit capacitance of 0.05 fF is adequate for a 6 -bit design if RMS noise voltage is less or equal to one-tenth of $\mathrm{V}_{\text {LSB }}\left(\sqrt{\frac{k T}{C}}<\frac{1}{10} V_{L S B}\right)$. However, such a small unit capacitance is hard to use in practice and cannot meet accuracy requirement of DAC due to process limitation. Therefore, unit capacitance of 1.25 fF is chosen to achieve AWCA. RMS noise voltage is really small to be neglected.

### 3.4 Summary

This thesis presents a 16 -channel time-interleaved SAR ADC. First, by combining the front-end input buffers and the grouping technique, the proposed ADC achieves the performance of high input bandwidth. Second, design consideration of the input bandwidth and the settling time are discussed. Next, mismatch including gain mismatch, offset mismatch, and timing skew among sun-ADCs are
considered. Finally, the SAR ADC is used as a sub-ADC for its great power efficiencies.

Asynchronous processing, monotonic capacitor switching, and AWCA technique aim at making sub-

ADC become faster and robust.

## Chapter 4

## Circuit Implementation and Simulation Results

### 4.1 Introduction

In this chapter, circuit implementation and simulation results will be described in detail. Fig. 4.1 presents the building blocks of the proposed time-interleaved SAR ADC. The architecture is composed of 16 -channel SAR ADCs, 4 groups of front-end buffers, a clock generator and a multiplexer (MUX) circuit.

First, the circuit blocks and their simulation are described, including front-end circuit, bias circuit, single-channel SAR ADC, clock generator, divider and multiplexer. Second, the overall ADC simulation result is shown. Finally, performance metrics of the proposed ADC are listed in table 4-1.

4.1 Building blocks of the proposed time-interleaved SAR ADC

### 4.2 Building Blocks and Circuit Implementation

### 4.2.1 Front-End Circuit

As descried in Section 3.3.1, the front-end circuit is regard as source followers (SF), T/H switches, and AWCA of the SAR ADC. Fig. 4.2 shows this front-end circuit. The source follower is biased by cascode PMOS current source in order to suppress the effect of channel-modulation. Consequently, the linearity of the SF is less affected by the variation of the output voltage. The transmission gate is used as a T/H switch for its high speed operation. Although its linearity is worse than the bootstrapped switch, the linearity is sufficient in this design.

In Fig.4.2 (b), the high-swing current mirror with two input branch, $\mathrm{V}_{\mathrm{b} 1}$ and $\mathrm{V}_{\mathrm{b} 2}$, is adopted as the biasing circuit of the source follower and $\mathrm{I}_{\text {REF }}$ is determined outside the chip and can be tuned by manual. The ratio of $\mathrm{I}_{\text {Bias }}$ and $\mathrm{I}_{\text {Ref }}$ is 5 .


Fig. 4.2 (a) Front-end circuit (b) biasing circuit of the source follower

As discussed in Section 3.4.1, the settling time and the required input bandwidth are calculated to meet the specification of ADC. Fig. 4.3 shows the $3-\mathrm{dB}$ frequency of the front-end circuit versus Ibias. It simulates the order of $\mathrm{I}_{\text {bias }}$ which is properly chosen. The Nyquist frequency is 4 GHz in this design. The suitable range of $3-\mathrm{dB}$ frequency is from 8 GHz to 10 GHz to ensure the input bandwidth two times higher than the Nyquist frequency. The enough input bandwidth avoids the problem of bandwidth mismatch [11].

However, the large-signal behavior of the front-end circuit fits the operation of the ADC in reality. More precisely, the dynamic linearity of the front-end circuit should be ensured to achieve the precision of this SAR ADC. Therefore, through FFT simulation, Fig. 4.4 presents the simulated SFDR versus $\mathrm{I}_{\text {BIAS }}$ when input $\mathrm{V}_{\text {pp,diff }}$ is 800 mv and $\mathrm{I}_{\text {BIAs }}$ is chosen to be around 0.2 mA . The input frequency is 4 GHz . Moreover, as shown in Fig. 4.5 and Fig 4.6, the input amplitude and common mode have to be analyzed to find the most proper performance. A common mode voltage around 200 mV and input around $800 \mathrm{~m} \mathrm{~V}_{\mathrm{pp} \text {,diff }}$ are used to satisfy the requirement of this work.

As mentioned in Section 2.4.1, offset mismatch is an important issue in time-interleaved architecture and SF contributes to offset mismatch. A number of 5000 Monte Carlo simulations is used to calculate offset voltage. The result shows that the standard deviation of offset is around 2 mv (0.08 LSB).


Fig. 4.3 Simulated 3-dB frequency versus $\mathrm{I}_{\text {REF }}$


Fig. 4.4 Simulated SFDR versus $\mathrm{I}_{\mathrm{BIAS}}$ with 3.78 GHz input frequency


Fig. 4.5 Simulated SFDR versus input common mode voltage with 3.78 GHz input frequency


Fig. 4.6 Simulated SFDR versus input amplitude with 3.78 GHz input frequency

### 4.2.2 Bias Circuit

In this design, because 8 groups of source followers (SF) circuit including differential circuits are implemented, the current mismatch among SFs should be reduced in case they exceed the offset and gain mismatch calibrated range. In layout, the distance among each group of source follower is inevitably long in order to maintain the input signal systematic matching. This layout style would cause considerable current mismatch if current mirror are built through the voltage routing of bias signals.

To overcome this problem, we use a combination of voltage and current routing technique. For example, Fig. 4.7 shows the two group of SFs with their current sources. $\mathrm{M}_{1}-\mathrm{M}_{3}, \mathrm{M}_{11}-\mathrm{M}_{12}$ and other current mirrors are close together to reduce the effect of interconnect resistance and mismatch of threshold voltage. The current $\mathrm{I}_{\text {REF1 }}-\mathrm{I}_{\text {REF4 }}$ are routed to the vicinity of the building blocks for their long length. On the other hand, the cascade current source with two input branches such as $\mathrm{M}_{4}-\mathrm{M}_{10}$
and $\mathrm{M}_{13}-\mathrm{M}_{19}$ uses voltage routing locally. Accordingly, current sources of source followers do not have large parasitic capacitance and maintain agreeable high-frequency process.


Fig. 4.7 Bias circuit with current routing and voltage routing

### 4.2.3 Single-Channel SAR ADC

In Section 4.2.3, the circuit block and simulation of the single-channel SAR ADC are described, including sampling switch, Arbitrary weight capacitor array (AWCA), comparator, SAR logic.

### 4.2.3.1 Sampling Switch

Transmission gates are chosen as sampling switches for their advantage of high-speed. The detailed simulation is described in Section 4.2.1. Dummy switches and cross-coupled path are used
to cancel the effect of the signal feedthrough as shown in Fig.4.8 [29]. The effect of channel charge injection and clock feedthrough do not excessively degrade linearity for the target 6-bit design. Fig.
4.9 shows the turn-on resistance $\left(\mathrm{R}_{\mathrm{on}}\right)$ of the sampling switch versus $\mathrm{V}_{\text {IN }}\left(\mathrm{V}_{\text {op }}, \mathrm{V}_{\text {on }}\right)$. The range of $\mathrm{V}_{\text {IN }}$ used keeps from 0.4 V to 0.6 V due to linearity of SF . $\mathrm{R}_{\text {on }}$ maintains around $100 \Omega$ to have enough input bandwidth. Therefore, the sampling switch maintain acceptable linearity at high input frequency.


Fig. 4.8 Sampling switch


Fig. 4.9 Turn-on resistance ( $\mathrm{R}_{\text {on }}$ ) of the sampling switch versus $\mathrm{V}_{\text {IN }}\left(\mathrm{V}_{\mathrm{op}}, \mathrm{V}_{\text {on }}\right)$

### 4.2.3.2 Arbitrary Weight Capacitor Array (AWCA)

As mentioned from Section 3.3.1.4.4 to Section 3.3.1.4.5, AWCA technique is adopted to deal with the problem of dynamic offset. Fig. 4.10 shows AWCA in this design. The conventional capacitor array with monotonic switching method is [16 84211$]$. For AWCA technique, $1_{u}$ from MSB capacitor is taken and added into LSB capacitor, where $\mathrm{C}_{\mathrm{u}}$ is unit capacitor. Moreover, additional 1 $\mathrm{C}_{\mathrm{u}}$ is added as a new LSB capacitor while original LSB capacitor become $2 \mathrm{C}_{\mathrm{u}}$. The capacitive DAC becomes [15 84221 1], and the last $\mathrm{C}_{\mathrm{u}}$ in the conventional capacitor array is unnecessary because its function is just likes parasitic capacitance. The max error tolerance of AWCA is about $3 \mathrm{~V}_{\text {LSB }}$ ( $\sim 37.5$ mV )

The switching operation of AWCA is described in Section 3.3.1.4.2. In particular, the NAND gates and signal RESET are used to reset all the bottom plates of the capacitors. If RESET coincides with next sampling conversion, the sampling signal is disturbed by the bottom-plate switching disturbance. Alternatively, signal RESET is triggered immediately when the last bit conversion is complete in this design. This manner improves that disturbance and increases the sampling speed.


Fig. 4.10 Arbitrary weight capacitor array

Fig. 4.11 depicts behavior simulation of ENOB versus dynamic offset when ADC uses binaryweighted capacitor array and AWCA. To maintain ENOB higher than 5.5-bit, dynamic offset $\mathrm{V}_{\text {OS }}$ must be confined to within $25 \mathrm{mV}\left(2 \mathrm{~V}_{\mathrm{LSB}}\right)$ in the architecture of binary capacitor arrays. In contrast, AWCA technique broadens the tolerable $\mathrm{V}_{\text {OS }}$ range which becomes around 65 mV ( $5 \mathrm{~V}_{\mathrm{LSB}}$ ).


Fig. 4.11 ENOB versus offset using binary-weighted capacitor array and AWCA

### 4.2.3.3 Comparator

For ADC, the comparator is classified into two types: It has a static preamplifier or not. In general, the comparator with static preamplifier is often applied in high resolution due to effect of noise. On the other hand, the dynamic comparator is usually used in high-speed low resolution ADC for its high speed and low power. The design consideration of the comparator are speed, offset, and noise in high-speed SAR ADC.

Fig. 4.12 shows the schematic of the comparator in this work. It uses PMOS as input differential. Only NMOS type latch is adopted to decrease the number of transistors in low supply voltage. The operation is described as follows. When LA signal is high, the comparator is in the reset mode, and SN and SP are also reset to ground. When the comparator enters into comparison, LA signal goes to
low and SN and SP are in positive feedback loop to pull themselves to high and ground.

The key parameter in latch-type comparator designs is the regeneration time-constant [30]

$$
\begin{equation*}
\tau=\frac{C_{L}}{g_{m}} \tag{4.1}
\end{equation*}
$$

where $\mathrm{C}_{\mathrm{L}}$ is the total capacitance at the latch output and $\mathrm{g}_{\mathrm{m}}$ is the transconductance of the latch transistors. By enlarging $\frac{\mathrm{W}}{\mathrm{L}}$ ration of input transistors, $\tau$ becomes smaller and the comparator can increase speed to meet the requirement. As single-channel ADC operate at $500 \mathrm{MS} / \mathrm{s}$ and duty cycle of clock is $25 \%$, the total time that the operation of comparator do seven times of comparison and reset is 1500 ps . Average comparison time is around 100 ps .

As mentioned in Section 2.4.1, offset mismatch is an important issue in time-interleaved architecture and comparators contribute to offset mismatch. A number of 5000 Monte Carlo simulations is used to calculate offset voltage. The result shows that the standard deviation of offset is around $0.7 \mathrm{~V}_{\mathrm{LSB}}$ (about 9 mv ).

Finally, the kickback noise is ignored because the input signal ruined by regeneration mode is returned to original one due to the following reset mode. On the other hand, through a number of 5000 Monte Carlo noise analysis, the standard deviation of transient noise is less than $\frac{1}{100} \mathrm{~V}_{\text {LSB }}$ (about 0.1 mv ).


Fig. 4.12 Schematic of the dynamic comparator

### 4.2.3.4 SAR Logic

The asynchronous processing is adopted to increase conversion rate as mentioned in Section 3.3.1.4.1. Fig. 4.13 (a) shows the schematic of the asynchronous control loop and shift registers. The operation is explained as shown in Fig. 4.13 (b). When sampling phases is converted to conversion, signal Start rises to high and triggers this control loop. First, signal LA becomes low to initiate the comparator. Then, one of the comparator output Outpi or Outn ${ }_{i}$ would fall to low due to the comparison of the comparator, and signal Done rises to high from low. Next, LA goes high to reset the comparator. As the comparator has been reset, Done begins to go from high to low. Finally, LA becomes low again, initiating the following comparison. This process repeats until signal Stop is triggered from low to high. The period of signal LA can be calculated as

$$
\begin{equation*}
T_{\text {cycle }}=T_{\text {cmp }}+T_{\text {reset }}+2 \times T_{\text {gate }} \tag{4.2}
\end{equation*}
$$

where $\mathrm{T}_{\mathrm{cmp}}$ is the comparison time of the comparator, $\mathrm{T}_{\text {reset }}$ is the reset time of the comparator, and $\mathrm{T}_{\text {gate }}$ is the gate delay of logic circuit in this loop. The gate delay includes two inverters, one XNOR gate, one NOR gate, and one NAND gate.

At the same time, the shift register generates $\mathrm{SR}<1>$ to $\mathrm{SR}<7>$ to control AWCA switching and counts the number of comparisons. When ADC enters the next sampling, Start reset all shift registers to initial state.


(a)

(b)

Fig. 4.13 (a) Block diagrams and (b) waveforms of asynchronous SAR logic

The conventional DFF control logic is used to connect comparator and the capacitive DAC, as depicted in Fig. 4.14 (a). However, it needs much transistors due to differential structure.

On the contrary, in Fig. 4.14 (b), the differential control logic (DCL) replaces DFFs when the
input signal is digital code. Initially, LA is low, resetting Q and Qb to ground while D and Db are low. Then, $\mathrm{SR}\langle 1: 7\rangle$ are fed into LA of the corresponding DCL to trigger it. When D or Db becomes high, the regeneration latch reacts quickly to pull Q or Qb to high. Once Q and Qb are determined, there are two positive feedback loops to lock the DCL. Hence, the DCL output remains same no matter how D and Db vary due to the comparison of the comparator.



(a)

(b)

Fig. 4.14 (a) Conventional DFF control logic (b) DCL

### 4.2.3 Clock Generator

The clock generator creates clocks which determine the sampling and conversion phases of a SAR ADC. A duty-cycle $25 \%$ clock is used and longer conversion time provides high conversion rate
for asynchronous SAR SDC.

Fig. 4.15 shows the clock generator and corresponding waveform. The external signal CLK $_{\text {in }}$ is fed into the $\div 4$ divider of the first stage to generate clock phases $\Phi_{\mathrm{Gl}} \sim \Phi_{\mathrm{G} 4}$ and each phase generates 4 clock phases through the $\div 4$ divider of the second stage. Finally, totally 16 different clock phases $\Phi_{1} \sim \Phi_{16}$ would be generated and are distributed to each sub-ADC. For example, during the reset mode, $\Phi_{\mathrm{G} 1}$ is preloaded with high, while the other phases $\Phi_{\mathrm{G} 2} \sim \Phi_{\mathrm{G} 4}$ are low. In this way, during the normal operation mode, $25 \%$ duty clock phases $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$ are generated. Likewise, non-overlapping clock phases $\Phi_{1} \sim \Phi_{16}$ are generated to achieve proposed front-end sampling architecture and only one external clock signal $\mathrm{CLK}_{\mathrm{in}}$ is needed.

(a)

(b)

Fig. 4.15 (a) Block diagrams and (b) waveforms of clock generator

As shown in Fig. 4.16, a $\div 4$ divider consists of a ring of 4 flip-flops, one of which is preset to high, while the others are preset to low. The flip-flop adopts true single-phase clocking (TSPC) architecture for its high speed and low power dissipation. Fig. 4.17 shows a schematic of TSPC flipflop. One type of schematic is preset to high, while the other is preset to low. Dummy transistors $\mathrm{M}_{4} \sim \mathrm{M}_{5}, \mathrm{M}_{9} \sim \mathrm{M}_{10}, \mathrm{M}_{15} \sim \mathrm{M}_{16}$ are added to make two types of TSPC symmetrical. Consequently, parasitic capacitance of two type of TSPCs are nearly equal and timing skew is reduced. Moreover, TSPC have a floating net on the drain of transistor $M_{2}$ and this net has leakage current if $M_{2}$ and $M_{3}$
cannot entirely cut off. Hence, $\mathrm{M}_{2}$ and $\mathrm{M}_{2}$ are types of high voltage threshold (hvt) PMOS transistors to decrease the leakage current.

The circuit operates as follows. When CLK is low, the first stage operates as an inverter, an opposite signal of D is at A and the signal at B is high. When CLK goes high, the first stage turn off and the second and third stage become active. The signal is opposite of A at B . The signal Qb is opposite of B . Accordingly, Qb is equal to A and Q is equivalent to D .


Fig. 4.16 Diagram of $\div 4$ divider

(a)

(b)

Fig. 4.17 Schematic of TSPC flip-flop preloaded (a) with high (b) with low

### 4.2.4 Delay Line

Since different clock phases would cause timing skew, delay line is used to adjust clock phases and correct timing skew. A tolerate timing skew can be estimated as in (4.3) and we know how to design delay line [12]

$$
\begin{equation*}
\sigma_{t}^{2} \leq\left(\frac{N}{N-1}\right) \cdot\left(\frac{2}{3 \cdot 2^{2 m}}\right) \cdot\left(\frac{1}{\left(2 \pi f_{i n}\right)^{2}}\right) \tag{4.3}
\end{equation*}
$$

where $\sigma_{t}$ is the standard deviation of the timing skew among sub-ADCs; N is the number of subADCs; $m$ is the resolution of $A D C$; and $f_{i n}$ is the sinusoidal input frequency. In this design, $N=16, m$ $=6, \mathrm{f}_{\mathrm{in}}=4 \mathrm{GHz}$ are substituted into (4.3) and $\sigma_{t}$ should be less than 0.6 ps .

Delay line is shown in Fig. 4.18. It consists inverters and delay cells. Delay cells have coarse and fine delay control to provide different step size. Delay cells are MOS-capacitors and we change their capacitive load by control CTRL<1:7>. Then, different capacitive load makes clock phases have
different delay. The total correction range is based on the C-CC post-layout simulation which shows that the standard deviation of the timing skew is around 1.6 ps . Thus, the total range of delay line is 1.8 ps . The coarse delay and fine delay are designed to have 0.6 ps and 0.12 ps with a step size. The residual timing skew after correction is less than 0.6 ps .


Fig. 4.18 Schematic of delay line

### 4.2.5 Clock Divider and Multiplexer

The sampling frequency is extremely high in this design. A mix signal oscilloscope MSO 4034 in test setup cannot operate as high as 5 GHz sampling rate. The frequency of outputs' clock needs to be decimated.

In this design, the output data is decimated by 125 , which is relatively prime to the number of channels, 16. Fig. 4.19 shows the circuit of the clock divider, three $\div 5$ dividers are in series connection to generate divisor of 125 . Then, this clock signal is fed to D flip-flop to trigger digital output in each
single-channel ADC. Next, digital output are combined by Multiplexer (MUX) to compose one group of output data of time-interleaved ADC. The diagram of MUX is presented in Fig. 4.20. It is a 16-to1 MUX which consists of simple transmission gates and inverters. The selection signals of MUX, $\operatorname{Div}_{1} \sim \operatorname{Div}_{2}$ are produced a ring of 16 flip-flops. The sequence of the selection signal in MUX is related to the number of decimation, 125 . In this work, the sequence is $\left\{\right.$ channel $_{1}$, channel ${ }_{14}$, channel ${ }_{11}$, channel $_{8}$, channel $_{5}$, channel $_{2}$, channel $_{15}$, channel $_{12}$, channel $_{9}$, channel $_{6}$, channel $_{3}$, channel $_{16}$, channel $_{13}$, channel $_{10}$, channel $_{7}$, channel $\left._{4}\right\}$. Fig. 4.21 shows the timing diagram of the clock divider. For example, if the 5 GHz output data OUTPUT through this clock divider, the output signal is Final Data with 40 MHz . As a result, MSO 4034 successfully retrieves 40 MHz digital output.


Fig. 4.19 Circuit of the clock divider


Fig. 4.20 Diagram of Multiplexer


Fig. 4.21 Timing diagram of clock divider

### 4.3 Overall ADC Simulation Results

The transistor level with C-CC post-layout simulation is done by HSPICE. The dynamic performance of this ADC is characterized by FFT analysis. Fig. 4.22 shows 256 -point digital output
spectrum with sampling rate of 8GS/s and input frequency of 93.75 MHz . The SFDR and SNDR are 48.39 dB and 37.48 dB respectively. For Nyquist input frequency of $3.78 \mathrm{GHz}, 256$-point digital output spectrums are shown in Fig. 4.23 and Fig. 4.24. Fig. 4.23 shows digital output spectrums without timing skew calibration, and the SFDR and SNDR are 48.39 dB and 37.48 dB . The performance degrades because the power of noise spurs caused by timing skew increases. The worst spur is around 35 dB . After timing skew calibration, the SFDR and SNDR are 47.15 dB and 36.13 dB , as shown in Fig. 4.24. The noise spurs of timing skew are below 45 dB .

FFT PLOT


Fig. 4.22 FFT spectrum with low input frequency

FFT PLOT


Fig. 4.23 FFT spectrum with Nyquist rate input frequency before timing skew calibration


Fig. 4.24 FFT spectrum with Nyquist rate input frequency after timing skew calibration

### 4.4 Summary

This chapter provides the detailed content of circuit implementation and simulation. And the performance metrics of this work in C-CC post-layout simulation are listed in Table 4-1.

Table 4-1 Performance summary of post-layout simulation for the proposed ADC

|  | This Work |
| :---: | :---: |
|  | Post-simulation |
| Technology | TSMC 40nm CMOS GP |
| Voltage | 1.2 V |
| Resolution | 6 -bit |
| Area | $780 \times 250 ~ \mu \mathrm{~m}^{2}$ |
| Input Swing | 0.8 V pp |
| Sampling Rate | $8 \mathrm{GS} / \mathrm{s}$ |
| Input Frequency | $3.78 \mathrm{GHz}(\mathrm{Nyquist} \mathrm{rate)}$ |
| Power | $39.92 \mathrm{~mW}($ Exclude the clock buffer $)$ |
| SFDR | 47.15 dB |
| SNDR | 36.13 dB |
| ENOB | 5.71 -bit |
| Figure of Merit | $95.33 \mathrm{fJ} /$ conversion-step |

## Chapter 5

## Measurement Results

### 5.1 Introduction

In this section, several topics related to the measurement of this work are discussed. First, floor plan and layout are described. Second, the design of print circuit board (PCB) is demonstrated. Third, test setup is introduced. Fourth, we show and discuss measurement results. Finally, summary table is presented.

### 5.2 Floor Plan and Layout Design

The floor plan of time-interleaved SAR ADC is shown in Fig.5.1, and the floor plan of signalchannel is in Fig.5.2. The path of analog input signal is from top of the chip and it is tree structure for equal delay. On the other hand, the clock signal goes from the bottom of the chip to separate the disturbance from the input signal, and it is also the same layout style as the input signal. The separation of digital and analog power supply is to avoid transient noise generated by digital signals. Besides, the power supply of clock generator and out buffers are segregated to prevent the large transient current. Each supply voltage has their own decoupling capacitors to stabilize supply voltage.


Fig. 5.1 Floor plan of proposed time-interleaved SAR ADC


Fig. 5.2 Floor plan of single-channel SAR ADC

The layout of ADC is shown in Fig. 5.3. It is implemented in TSMC 40nm CMOS generalprocess (GP). Table 5-1 describes functions of the 55 pins.


Fig. 5.3 Layout of proposed time-interleaved SAR ADC

Table 5-1 Functions of 55 pins in proposed ADC

| Name | Description |
| :---: | :---: |
| VINP | Positive input signal |
| VINN | Negative input signal |
| V REFP | Positive reference voltage |
| V REFN | Negative reference voltage |
| VDD_A | Analog supply |
| VSS_A | Analog ground |
| I $_{\text {BIAS }}$ | Input current for SF's biasing circuit |
| VDD_SF | SF supply |
| VSS_SF | SF ground |
| VDD_D | Digital supply |
| VSS_D | Digital ground |
| VDD_DUTY | Other digital CKT supply |
| VSS_DUTY | Other digital CKT ground |
| VDD_POST | POST CKT supply |


| VSS_POST | POST CKT ground |
| :---: | :---: |
| VDD_CLK | CLK GEN supply |
| VSS_CLK | CLK GEN ground |
| VDD_PAD_A | Analog pad supply |
| VSS_PAD_A | Analog pad ground |
| VDD_PAD_D | Digital pad supply |
| $\mathrm{V}_{\text {ss_pad_d }}$ | Digital pad ground |
| $\mathrm{CLK}_{\text {in }}$ | Clock signal |
| $\mathrm{CLK}_{\text {div }}$ | Clock for output sampling |
| $\mathrm{CLK}_{\text {div1 }}$ | Clock of $\mathrm{CH}_{1}$ for output sampling |
| Bit<7:1> | Digital output bit |
| $\mathrm{P}\langle 1: 3>$ _de | Decoder signal |
| <A:C>_de | Decoder signal |
| RN_de | Decoder signal |
| CLK_de | Decoder signal |
| CLK_mux | Timing skew control signal |
| CLK_ctr | Timing skew control signal |
| vctr | Timing skew control signal |
| V ${ }_{\text {DD_RAMP }}$ | Timing skew control supply |
| VSS_RAMP | Timing skew control ground |
| IBIAS_Ramp | Timing skew control supply |

### 5.3 PCB Design

The goal of PCB design is to provide a platform which offers good test environment for ADC.

Power and ground noises are generated by current which flows through power and ground lines which have equivalent series resistance (ESR) and equivalent series inductance (ESL). This noise will degrade performance. Hence, careful design of the AC current feedback path is very important. Fourlayer PCB is employed to provide a power and a ground plate. Both plates have large area, so the low
resistance and inductance are achieved to lower power and ground noise. Every set of power and ground is decoupled by some $10 \mathrm{uF}, 0.1 \mathrm{uF}$, and 0.01 uF capacitors. Three different capacitor values are used to filter out noise in different frequency band.

Fig. 5.4 shows the schematic of an analog input on PCB. Two types of configuration of RF transformer are used for different requirement of input bandwidth. A single-ended analog input signal is converted to differential one by a RF transformer. Table 5-2 shows the specification of RF transformers which are used in our measurement [38]-[40]. However, different types of RF transformers impact differently on ADC. This issue is discussed in Section 5.6.

Table 5-2 Specification of RF transformers

| RF transformer | Frequency Range (MHz) |
| :--- | :---: |
| ADT4-1WT | $2-775$ |
| JTX-4-10T | $50-1000$ |
| TCM4-14+ | $200-1400$ |
| TCM4-452X+ | $20-4500$ |



Fig. 5.4 Schematic of analog input on PCB

Fig. 5.5 shows the schematic of reference voltages on PCB. A master reference voltage is produced by a Zener diode (LM385) to provide stable 2.5 V . The master reference voltage is connected to four variable resistors and generates four reference voltages including $\mathrm{V}_{\text {refp, }} \mathrm{V}_{\text {refn }}$, $\mathrm{V}_{\text {IN_Com }}, \mathrm{V}_{\mathrm{DD} \_ \text {SF }}$, and $\mathrm{V}_{\mathrm{DD} \_}$. The five reference voltages are buffered by OPAMP (AD8031) which is utilized as a unit gain buffer. Three capacitors, which values are $10 \mathrm{uF}, 0.1 \mathrm{uF}$ and 10 nF , are also used to decouple the reference voltages. The master reference voltage also generates current reference I ${ }_{\text {BIAS }}$ by connecting additional variable resistor between output terminal of AD8031 and input terminal of the chip.


Fig. 5.5 Schemtaic of reference voltage on PCB

Fig. 5.6 shows the schematic of power supplies on PCB. The power supplies VDD_D $^{\text {, }}$ VDD_POST, $V_{\text {DD_DUTY }}, V_{D D \_C L K, ~} V_{\text {DD_PAD_A }}$ and $V_{\text {DD_PAD_D }}$ are generated by voltage regulators (LT3020). Three capacitors, which values are $10 \mathrm{uF}, 0.1 \mathrm{uF}$ and 10 nF , are also used to decouple the power supply as well.


Fig. 5.6 Schematic of power supplies on PCB

The ground and signal of PCB is configured as Fig. 5.7. The mother board is used to provide the master reference voltage and power supply generation. VSS_A, VSS_D, VSS_PAD_A, VSS_PAD_D, VSs_POST, Vss_duty, and Vss_cle are not connected to each other in the chip, but on PCB by the ferrite beads and copper lines to achieve better performance. On the other hand, the daughter board is mainly used to connect analog input signal and clock signal. In this way, the daughter board can separate noise of power supply from mother board. It also decrease bond-wire effect from solder and thus the highspeed signal can be imported to the chip. Fig. 5.8 shows the photo of whole PCB with time-interleaved

ADC chip and all the components used on the PCB platform are listed in Table 5-3.


Fig. 5.7 Ground and signal of configuration on PCB


Fig. 5.8 Photo of whole PCB with time-interleaved ADC chip

Table 5-3 List of materials

| Item | Description | Manufacturer |
| :---: | :---: | :---: |
| LT3020 | Low Dropout Linear <br> Regulator | Linear Technology |
| AD8031 | Operational Amplifier | Analog Device |
| LM385 | Adjustable Micro-power <br> Voltage Reference | National <br> Semiconductor |
| ADT4-1WT / JTX-4-10T/ <br> TCM4-14+ / TCM4-452X+ | Transformer | Mini-Circuits |
| $10 \mu \mathrm{~F} / 0.1 \mu \mathrm{~F} / 10 \mathrm{nF}$ | Tantalum Capacitor |  |
| $1 \mathrm{k} \Omega / 5 \mathrm{k} \Omega / 10 \mathrm{k} \Omega$ | Variable Resistor |  |
| Ferrite Bead |  |  |
| Jumper |  |  |
| SMA |  |  |
| switch |  |  |

### 5.4 Test Setup

Test setup is shown in Fig. 5.9. Two signal generators (Agilent E8257D and SMA100) are applied to generate a clock signal and an analog input signal. Both signals are filtered by passive band-pass-filter (BPF) and separately connected by SMA cables. To convert the single-ended analog input signal to differential one, a RF transformer is chosen. The power supplies of design-under-test (DUT) are generated by voltage regulators (LT3020). A 7-bit digital output is captured by a mix signal oscilloscope (MSO4034). The digital output is downloaded to a personal computer (PC) and processed by MATLAB to obtain static and dynamic performance. The equipment photos of Agilent E8257D, SMA100, and PPT-3615 are shown in Fig. 5.10.


Fig. 5.9 Test setup

(a)

(b)

(c)

Fig. 5.10 Photos of (a) E8257D, (b) SMA100, and (c) PPT-3615

### 5.5 Measurement Results

Fig. 5.11 shows the die phot of the ADC fabricated in TSMC 40 nm CMOS 1P10M process. The chip area is $1500 * 850 \mu \mathrm{~m}^{2}=1.275 \mathrm{~mm}^{2}$. The core circuit occupies an area of $780 * 250 \mu \mathrm{~m}^{2}=0.195$ $\mathrm{mm}^{2}$.


Fig. 5.11 Die photo of the time-interleaved ADC

### 5.5.1 Static Performance

Static performance of ADC is measured at 1.2 V supply voltage while sampling rate is $4 \mathrm{GS} / \mathrm{s}$ and input frequency is 50.78 MHz . Fig. 5.12 shows DNL and INL of the single-channel ADC. The DNL and INL are $+0.15 /-0.13$ LSB and $+0.14 /-0.11$ LSB. On the other hand, DNL and INL of the time-interleaved

ADC are shown in Fig. 5.13. The DNL and INL of the time-interleaved ADC are $+0.17 /-0.29$ LSB and $+0.20 /-0.18$ LSB.



Fig. 5.12 DNL and INL of the single-channel ADC


Fig. 5.13 DNL and INL of the time-interleaved ADC

### 5.5.2 Dynamic Performance

Fig. 5.14 shows the 1024 -point FFT spectrums with sampling rate, $\mathrm{F}_{\mathrm{s}}$ of $4 \mathrm{GS} / \mathrm{s}$ and input frequency, $F_{\text {in }}$ of 50.78 MHz . The SFDR and SNDR are 51.33 dB and 35.33 dB respectively. The maximal input frequency is limited by types of RF transformers. A RF transformer (JTX-4-10T+) is used to measure dynamic performance of ADC. Its input frequency range is from 50 MHz to 1 GHz [38]. On the other hand, RF transformers (TCM4-14+) and (TCM4-452X+) whose input frequency range are higher than 1 GHz [39], [40]. It is discussed in Section 5.6 because RF transformers (TCM4$14+$ ) and (TCM4-452X+) cause harmonic distortion on the ADC.

With $F_{\text {in }}$ near 1 GHz and $\mathrm{F}_{\mathrm{s}}$ of $4.5 \mathrm{GS} / \mathrm{s}$, the 1024-point FFT spectrums is shown in Fig. 5.15. Before offset and gain calibrations, SFDR and SNDR are limited to 29.37 dB and 24.50 dB . Tones caused by offset mismatch are around -30 dB to -40 dB and degrade performance. Offset and gain mismatches are corrected by the method in Section 3.3.1.2. After offset and gain calibrations, the SFDR and SNDR are improved to 41.04 dB and 32.15 dB . The output of ADC is decimated by 125 .


Fig. 5.14 FFT spectrum for $\mathrm{F}_{\text {in }}=50.78 \mathrm{MHz}$ (decimated by a factor of 125)

(a)

(b)

Fig. 5.15 FFT spectrum for $\mathrm{F}_{\text {in }}=998 \mathrm{MHz}$ (a) before, and (b) after offset and gain mismatch calibration (decimated by a factor of 125)

Fig. 5.16 illustrates the dynamic performance versus input frequency when $\mathrm{F}_{\mathrm{s}}$ is $4.5 \mathrm{GS} / \mathrm{s}$. SNDR stays above 32 dB up to $\mathrm{F}_{\text {in }}$ of 1 GHz . The definition of THD in Chapter 5 is a reciprocal of (2.4) to compare with SNDR and SFDR. Both SFDR and THD stay above 40 dB up to $\mathrm{F}_{\text {in }}$ of 1 GHz . SNDR and SFDR versus sampling frequency with $\mathrm{F}_{\text {in }}$ of 50 MHz is shown in Fig. 5.17. The SNDR maintains above 35 dB up to $\mathrm{F}_{\mathrm{s}}$ of 5 GHz with $\mathrm{F}_{\text {in }}$ of 50 MHz .


Fig. 5.16 SNDR, SFDR and THD versus input frequency at $\mathrm{F}_{\mathrm{s}}=4.5 \mathrm{GS} / \mathrm{s}$


Fig. 5.17 SNDR and SFDR versus sampling frequency with $\mathrm{F}_{\text {in }}$ of 50 MHz

### 5.6 Discussion

In this section, first, we discuss the effect on the dynamic performance of $A D C$ with different RF transformers. Second, analysis of timing skew for simulation and measurement is discussed.

### 5.6.1 Dynamic performance with different RF transformers

Fig. 5.18 and Fig. 5.19 are the measurement when a RF transformer (TCM4-14+) is used. The input frequency range of a RF transformer (TCM4-14+) is from 200 MHz to 1400 MHz [39].

Fig. 5.18 shows the 1024 -point FFT spectrums with sampling rate, $\mathrm{F}_{\mathrm{s}}$ of $4 \mathrm{GS} / \mathrm{s}$ and input frequency, $\mathrm{F}_{\text {in }}$ of 1 GHz . SFDR and SNDR are 37.18 dB and 29.98 dB . THD is -36.9 dB . As described in Section 5.5.2, a RF transformer (JTX-4-10T+) whose input frequency range is from 50 MHz to 1 GHz [38]. When FFT is compared with that using a RF transformer (JTX-4-10T) at $\mathrm{F}_{\text {in }}$ of 1 GHz in Fig. 5.15 (b), the $2^{\text {nd }}$ harmonic tone increases 18 dB and becomes 37.18 dB , while THD becomes worse from 44.26 dB to 36.9 dB . It means that RF transformers (TCM4-14+) causes more harmonic distortion power and performance of ADC is limited. Fig. 5.19 shows SNDR, SFDR and THD when input frequency range is from 1 to 1.4 GHz . Although harmonic distortion degrades the dynamic performance, SNDR still maintains above 27 dB up to $\mathrm{F}_{\text {in }}$ of 1.4 GHz .


Fig. 5.18 FFT spectrum for $\mathrm{F}_{\mathrm{in}}=1 \mathrm{GHz}$ with RF transformers (TCM4-14+)


Fig. 5.19 SNDR, SFDR and THD versus input frequency range from 1 to 1.4 GHz

The dynamic performance of ADC is presented in Fig. 5.20 when another RF transformer (TCM4-452X+) is used. The frequency range of a RF transformer (TCM4-452X+) is from 20 MHz to 4500 MHz [40].

As shown in Fig. 5.20 (a), in the time-interleaved ADC, when $F_{\text {in }}$ is 707 MHz at $\mathrm{F}_{\mathrm{s}}$ being $4 \mathrm{GS} / \mathrm{s}$, the $2^{\text {nd }}$ harmonic tone is -33.58 db and limits the SNDR to 29.26 dB and SFDR to 33.58 dB . In addition, THD is 33 dB , being the worst performance in three RF transformers. Fig. 5.20 (b) shows the FFT of single-channel at $F_{s}$ being $4 \mathrm{GS} / \mathrm{s}$ and $\mathrm{F}_{\text {in }}$ being 707 MHz . The $2^{\text {nd }}$ harmonic tone is -31.63 dB and also limits the SNDR to 28.91 dB and SFDR to 31.63 dB . When $\mathrm{F}_{\text {in }}$ is above 700 MHz , performance is even worse and SNDR is below 20 dB . Thus, $\mathrm{F}_{\text {in }}$ of 1 GHz is not discussed here.

Finally, we also use a double transformer providing differential input signal [37] to test whether harmonic tone can be improved. However, the result is nearly the same as the single transformer. A broadband power splitter PSPL5320B BALUN [41] provides the same function as a RF transformer. It may have lower harmonic distortion than RF transformers.

Therefore, we choose the measurement of ADC using a RF transformer (JTX-4-10T) under $\mathrm{F}_{\text {in }}$ of 1 GHz as a summary.


Fig. 5.20 FFT spectrum at $\mathrm{F}_{\text {in }}=707 \mathrm{MHz}$ (a) time-interleaved result (b) single-channel

### 5.6.2 Analysis of timing skew for simulation and measurement

Timing skew for simulation and measurement is discussed as follows. First, we analyze how to obtain standard deviation of equal timing skew from FFT spectrum in measurement. There are two formulas of SNR $_{\text {skew }}$ we use. One formula is from the definition of SNR in (2.3) plus the timing skew power. It is expressed as:

$$
\begin{equation*}
\mathrm{SNR}_{\text {skew }(1)}=\frac{\mathrm{P}_{\mathrm{s}}}{\mathrm{P}_{\mathrm{q}}+\mathrm{P}_{\text {skew }}} \tag{5.1}
\end{equation*}
$$

where $P_{s}$ is signal power, $P_{q}$ is quantization noise power, and $P_{\text {skew }}$ is timing skew power. $P_{s}$ is $2^{2 \mathrm{~m}-3} \mathrm{LSB}^{2}$ and $\mathrm{P}_{\mathrm{q}}$ is $\frac{1}{12} \mathrm{LSB}^{2}$. As described in [8], $\mathrm{P}_{\text {skew }}$ is $\frac{\left(2^{m-1} \mathrm{LSB} \times 2 \pi \mathrm{f}_{\text {in }} \sigma_{\mathrm{t}}\right)^{2}}{2}$. This, (5.1) is written in another expression.

$$
\begin{equation*}
\mathrm{SNR}_{\text {skew }(1)}=\frac{2^{2 \mathrm{~m}-3}}{\frac{1}{12}+2^{2 \mathrm{~m}-1} \times \pi^{2} \mathrm{f}_{\mathrm{in}}{ }^{2} \sigma_{\mathrm{t}}{ }^{2}} \tag{5.2}
\end{equation*}
$$

where $m$ is the resolution, $f_{\text {in }}$ is input frequency, and $\sigma_{t}$ is the standard deviation of timing skew.

According to [22], the other formula is expressed as

$$
\begin{equation*}
\mathrm{SNR}_{- \text {skew(2) }}=\frac{\mathrm{P}_{\mathrm{s}}}{\mathrm{P}_{\mathrm{q}}+2^{2 \mathrm{~m}-3} \times\left(\frac{9 \pi^{2} \sigma_{\mathrm{t}}{ }^{2} \mathrm{f}_{\mathrm{in}}{ }^{2}}{4}\right)}=\frac{2^{2 \mathrm{~m}-3}}{\frac{1}{12}+2^{2 \mathrm{~m}-3} \times\left(\frac{9 \pi^{2} \sigma_{\mathrm{t}}{ }^{2} \mathrm{f}_{\mathrm{in}}{ }^{2}}{4}\right)} \tag{5.3}
\end{equation*}
$$

From FFT spectrum of the measurement, we get values represents $\mathrm{P}_{\mathrm{s},} \mathrm{P}_{\mathrm{q}}, \mathrm{P}_{\text {skew }}$ and these values are proportional to $\mathrm{SNR}_{\text {skew }}$. If we find the ratio between formula and the measurement, $\sigma_{t}$ is derived. For example, $P_{s}=2.22 \times 10^{8}, \quad P_{q}=80909$ and $P_{\text {skew }}=47394$ are from the FFT of measurement when $F_{i n}$ is 1 GHz . We obtain the following relationship between (5.2) and the measurement:

$$
\begin{equation*}
\frac{2^{2 \mathrm{~m}-3}}{\frac{1}{12}+2^{2 \mathrm{~m}-1} \times \pi^{2} \mathrm{f}_{\mathrm{in}}{ }^{2} \sigma_{\mathrm{t}}{ }^{2}}=\frac{512}{\frac{1}{12}+11369 \times 10^{18} \sigma_{t}^{2}}=\frac{2.22 \times 10^{8}}{80909+47394} \tag{5.4}
\end{equation*}
$$

where $\mathrm{m}=6$ and $\mathrm{f}_{\mathrm{in}}=10^{9}$. Note that $\frac{1}{12}$ is the ideal quantization noise power of 6 -bit resolution.
However, $\mathrm{P}_{\mathrm{q}}$ from the measurement is the quantization noise power adding other noise power. Thus, we take it into consideration and $\mathrm{P}_{\mathrm{q}}$ is equal to $0.21(\mathrm{ENOB}=5.4 \mathrm{~b})$ from the measurement with low input frequency. Ideal $\mathrm{P}_{\mathrm{q}}, \frac{1}{12}$ is replaced by 0.21 and (5.4) becomes

$$
\begin{equation*}
\frac{512}{0.21+11369 \times 10^{18} \sigma_{t}^{2}}=\frac{2.22 \times 10^{8}}{80909+47394} \tag{5.5}
\end{equation*}
$$

Therefore, $\sigma_{\mathrm{t}}$ is obtained from $\mathrm{SNR}_{\text {skew(1) }}$. Likewise, $\mathrm{SNR}_{\text {skew(2) }}$ is used to calculate $\sigma_{\mathrm{t}}$.

We substitute $\operatorname{SNR}_{\text {skew(1) }}$ and $\mathrm{SNR}_{\text {_skew(2) }}$ into (5.5), and the range of $\sigma_{\mathrm{t}}$ in measurement is from 2.5 to 3.2 ps. However, the range of delay line in our design is only +1.8 ps as described in Section 4.2.4. Moreover, $\sigma_{t}$ is calculated from timing skew in each channel in reality and the value of timing skew will be native and positive. The range of delay line cannot cover timing skew. Therefore, with $\mathrm{F}_{\text {in }}$ of 1 GHz , adjustment of delay line does not improve the SNDR which is limited by timing skew.

Second, post-layout simulation is checked to find the relationship between simulation and measurement. As described in Section 4.2.4, the C+CC post-layout simulation is calculated. Table 54 shows C+CC post-layout simulation of timing skew from adjacent channels. There are 16 values of timing skew and $\sigma_{\mathrm{t}}$ is 1.6 ps from these values. As shown in Fig. 4.13 (a), two stage of a clock generator have four $\div 4$ dividers. Values of timing skew in each $\div 4$ divider have nearly the same trend
and we can divide these values of timing skew into 4 groups of timing skew.

Now, we restart to check $\sigma_{t}$ through the consideration of parasitic resistors by $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ postlayout simulation. Simulation of timing skew from adjacent channels are presented in Table 5-5. Maximum skew is $3.27 /-3.58 \mathrm{ps}$ and $\sigma_{\mathrm{t}}$ calculated from Table $5-5$ is around 3.1 ps . From 4 groups of timing skew, we can divide these values into a trend: $[-1,+3.2,-3.6,+1.2] \mathrm{ps}$. This trend of timing skew cause the spurs of timing skew occur as shown in Fig. 5.15 (b). There are three spurs of timing skew which all around -40 dB . The $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ post-layout simulation result meets the measurement and notifies that $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ post-layout simulation should be executed to ensure the range of delay line.

Table 5-4 C+CC post-layout simulation of timing skew from adjacent channels

|  | Group1 |  | Group2 |  | Group3 |  | Group4 |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Timing skew <br> $(\mathrm{ps})$ | $\mathrm{dt}_{1}$ | -0.55 | $\mathrm{dt}_{5}$ | 1.71 | $\mathrm{dt}_{9}$ | -1.41 | $\mathrm{dt}_{13}$ | 0.49 |
|  | $\mathrm{dt}_{2}$ | -0.53 | $\mathrm{dt}_{6}$ | 1.69 | $\mathrm{dt}_{10}$ | -1.43 | $\mathrm{dt}_{14}$ | 0.34 |
|  | $\mathrm{dt}_{3}$ | -0.55 | $\mathrm{dt}_{7}$ | 1.71 | $\mathrm{dt}_{11}$ | -1.41 | $\mathrm{dt}_{15}$ | 0.04 |
|  | $\mathrm{dt}_{4}$ | -0.54 | $\mathrm{dt}_{8}$ | 1.67 | $\mathrm{dt}_{12}$ | -1.35 | $\mathrm{dt}_{16}$ | 0.62 |

Table 5-5 R+C+CC post-layout simulation of timing skew from adjacent channels

|  | Group1 |  | Group2 |  | Group3 |  | Group4 |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Timing skew <br> $(\mathrm{ps})$ | $\mathrm{dt}_{1}$ | -0.99 | $\mathrm{dt}_{5}$ | 3.22 | $\mathrm{dt}_{9}$ | -3.58 | $\mathrm{dt}_{13}$ | 1.22 |
|  | $\mathrm{dt}_{2}$ | -0.91 | $\mathrm{dt}_{6}$ | 3.26 | $\mathrm{dt}_{10}$ | -3.44 | $\mathrm{dt}_{14}$ | 1.3 |
|  | $\mathrm{dt}_{3}$ | -1.15 | $\mathrm{dt}_{7}$ | 3.3 | $\mathrm{dt}_{11}$ | -3.42 | $\mathrm{dt}_{15}$ | 1.78 |
|  | $\mathrm{dt}_{4}$ | -0.98 | $\mathrm{dt}_{8}$ | 3.27 | $\mathrm{dt}_{12}$ | -3.6 | $\mathrm{dt}_{16}$ | 0.72 |

Moreover, the different arrangement of timing skew in 4 group causes different variation on these three spurs of timing skew. Assume that 4 group timing skew are $\Delta_{A}, \Delta_{B}, \Delta_{C}, \Delta_{D}$ and their values are $[-1,+3.2,-3.6,+1.2] \mathrm{ps}$ respectively. If the arrangement of $\Delta_{A}, \Delta_{B}, \Delta_{C}, \Delta_{D}$ is changed, there will be totally six cases of arrangement as show in Table 5-6. Through MATLAB behavior, variation of three timing skew spurs in FFT is presented in Table 5-6. We conclude that there are three types of variation of timing skew spurs. The first type is case 1 and case 6 . The measurement result is also this type. Spurs of three timing skew are all around -40 dB . Next, the second type is case 2 and case 4 . In this situation, the amplitude of Spur 2 is around 35 dB and it is 8 dB higher than Spur 1 and Spur 3. The third type is case 3 and case 5 . Spur 2 is around -65 dB and its amplitude is 28 dB lower than Spur 1 and Spur 3.

Table 5-6 Variation of timing skew spurs with different arrangement of timing skew

| Case | Arrangement of timing skew | Spur 1 (dB) | Spur 2 (dB) | Spur 3 (dB) |
| :---: | :---: | :---: | :---: | :---: |
| 1 | $\Delta_{A}, \Delta_{B}, \Delta_{C}, \Delta_{\mathrm{D}}$ | -38.85 | -40.62 | -38.73 |
| 2 | $\Delta_{A}, \Delta_{\mathrm{B}}, \Delta_{\mathrm{D}}, \Delta_{\mathrm{C}}$ | -44.18 | -35.59 | -43.32 |
| 3 | $\Delta_{\mathrm{A}}, \Delta_{\mathrm{C}}, \Delta_{\mathrm{B}}, \Delta_{\mathrm{D}}$ | -37.4 | -64.67 | -37.2 |
| 4 | $\Delta_{\mathrm{A}}, \Delta_{\mathrm{C}}, \Delta_{\mathrm{D}}, \Delta_{\mathrm{B}}$ | -43.23 | -35.77 | -43.64 |
| 5 | $\Delta_{\mathrm{A}}, \Delta_{\mathrm{D}}, \Delta_{\mathrm{B}}, \Delta_{\mathrm{C}}$ | -37.25 | -66.18 | -37.35 |
| 6 | $\Delta_{\mathrm{A}}, \Delta_{\mathrm{D}}, \Delta_{\mathrm{C}}, \Delta_{\mathrm{B}}$ | -39.17 | -40.42 | -41.44 |

Finally, besides the reconsideration of delay line, we try to reduce timing skew of the clock generator further. As shown in Fig. 4.13, the main source of timing skew is the layout routing that connects the output of first stage to the input of second stage in the clock generator. The layout routing
includes clock phases $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$. Fig. 5.21 shows the floor plan of the clock generator. From this design, we increase routing length to make clock phases $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$ have the same length. The length $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$ is nearly 280 um.

However, we discover that if the routing length of $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$ is compressed as short as possible, timing skew will decrease although $\Phi_{\mathrm{G} 1 \sim} \sim \Phi_{\mathrm{G} 4}$ does not have the same length. Fig. 5.22 is the new floor plan of the clock generator. The length of $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$ is reduced to 50 um . In addition, as shown in Fig. 5.23 , routing of $\Phi_{\mathrm{G} 2} \sim \Phi_{\mathrm{G} 3}$ are surrounded by a grounded shield consisting a higher and a lower metal layer to make $\Phi_{\mathrm{G} 1} \sim \Phi_{\mathrm{G} 4}$ have the same parasitic capacitance. Table 5-7 shows the $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ postlayout simulation of timing skew of the new version of layout. $\sigma_{t}$ calculated from Table 5-7 is reduce to around 0.56 ps now.

In conclusion, through analysis between formulas and measurement, we obtain $\sigma_{t}$ from measurement. Then, we restart to check post-layout simulation to ensure that the $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ postlayout simulation meets the measurement. $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ post-layout simulation should be executed to ensure the range of delay line. Finally, the new layout version of clock generator is tried to reduce timing skew.


Fig. 5.21 Floor plan of the clock generator


Fig. 5.22 New floor plan of the clock generator


Fig. 5.23 Shielding a clock phases $\Phi_{\mathrm{Gi}}(\mathrm{i}=2 \sim 3)$

Table 5-7 $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ post-layout simulation of timing skew of the new version of layout

|  | Group1 |  | Group2 |  | Group3 |  | Group4 |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Timing skew <br> $(\mathrm{ps})$ | $\mathrm{dt}_{1}$ | 0.005 | $\mathrm{dt}_{5}$ | 0.59 | $\mathrm{dt}_{9}$ | -0.75 | $\mathrm{dt}_{13}$ | 0.26 |
|  | $\mathrm{dt}_{2}$ | 0.0009 | $\mathrm{dt}_{6}$ | 0.71 | $\mathrm{dt}_{10}$ | -0.75 | $\mathrm{dt}_{14}$ | 0.04 |
|  | $\mathrm{dt}_{3}$ | -0.0009 | $\mathrm{dt}_{7}$ | 0.69 | $\mathrm{dt}_{11}$ | -0.79 | $\mathrm{dt}_{15}$ | 0.5 |
|  | $\mathrm{dt}_{4}$ | 0.05 | $\mathrm{dt}_{8}$ | 0.74 | $\mathrm{dt}_{12}$ | -0.82 | $\mathrm{dt}_{16}$ | 0.4 |

### 5.7 Summary

Table 5-8 shows the comparison with previous high-speed, low-to-medium resolution TI ADCs
(with $6-8$ bit resolution). Power consuming proportion is shown in Fig. 5.24.

Table 5-8 Time-interleaved ADC Comparison Table

| Parameter | This work | $\text { JSSC } 2012$ <br> [17] | $\begin{gathered} \text { ASSCC } \\ 2013 \text { [34] } \end{gathered}$ | $\begin{gathered} \text { TVLSI } \\ 2014 \text { [35] } \end{gathered}$ | $\begin{gathered} \text { TCAS-I } \\ 2015 \text { [36] } \end{gathered}$ | $\text { JSSC } 2016$ [42] |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Architecture | TI-SAR | TI- <br> Subranging | FATI-SAR | TI-SAR | TI-SAR | TI-SAR |
| Technology (nm) | 40 | 65 | 45 | 65 | 40 (LP) | 65 |
| Sample rate (GS/s) | 4.5 | 2.2 | 2 | 2.5 | 2.64 | 5 |
| \# of channels | 16 | 4 | 4 | 12 | 16 | 4 |
| Supply Voltage (V) | 1.2 | 1 | 1.2 | 1.2 | 1.2 | 1 |
| Resolution (bit) | 6 | 7 | 6 | 6 | 8 | 6 |
| Power (mW) | 24.9 | 40 | 14.4 | 22 | 39 | 5.5 |
| $\mathrm{SNDR}_{\text {peak }}(\mathrm{dB})$ | 35.1 | 38.9 | 35.5 | 32 | 42.7 | 30.76 |
| $\mathrm{SNDR}_{\text {Nyq. }}(\mathrm{dB})$ | $32.2{ }^{(1)}$ | 38 | 33.1 | 31.9 | 37.9 | 30.25 |
| FoM (fJ/c.-s.) | $159{ }^{(1)}$ | 280 | 195 | 270 | 230 | 39 |
| Area (mm ${ }^{2}$ ) | 0.2 | 0.3 | 0.16 | 0.27 | 0.17 | 0.09 |

[^0]

Fig. 5.24 Power consuming proportion of the proposed ADC

## Chapter 6

## Conclusions

In this thesis, a 16 -channel 6 -bit $4.5 \mathrm{Gs} / \mathrm{s}$ time-interleaved SAR ADC is realized in TSMC 40 nm

GP. Combining the front-end input buffers and the grouping technique into time-interleaved architecture, proposed ADC achieves the performance of high-speed sampling rate and high input bandwidth. Gain and offset mismatch between ADCs are calibrated in the digital domain. A zerocrossing detection technique is employed to correct timing skew mismatch. Asynchronous processing and monotonic capacitor switching technique used in the single-channel SAR ADC makes the subADC high power-efficiency. AWCA technique solves the dynamic offset problem in the comparator of the sub-ADC.

The whole chip including pads occupies $1.275 \mathrm{~mm}^{2}$ while area of core circuit is $0.195 \mathrm{~mm}^{2}$. The measurement results show that DNL is $+0.17 /-0.29 \mathrm{LSB}$ and INL is $+0.20 /-0.18 \mathrm{LSB}$ at $4 \mathrm{GS} / \mathrm{s}$ with $\mathrm{F}_{\text {in }}$ of 50 MHz . SNDR and SFDR are 32.15 dB and 41.04 dB at $4.5 \mathrm{GS} / \mathrm{s}$ with 1 GHz input frequency by applying offset and gain calibrations. Timing-skew match is not improved because the range of delay line is insufficient. The power consumption is 24.9 mW at 1.2 V supply voltage. As a result, the FoM is $159 \mathrm{fJ} /$ conversion-step.

## Chapter 7

## Future Work

First, in order to make the performance of ADC not be limited by harmonic distortion when $\mathrm{F}_{\text {in }}$ is above 1 GHz , there are some methods which may improve the issue. First, different transformers and splitter should be tested. A broadband power splitter PSPL5320B BALUN [41] provides the same function as RF transformers. It may has lower harmonic distortion than RF transformers. Second, the operation of RF transformers may degrade due to input buffers with effect of bond wire, the lowdropout regulator is added to suppress bond-wire effect.

Second, when $\mathrm{F}_{\mathrm{s}}$ of ADC requires operating above 5 GHz , some suggestions are provide. The input common mode of a clock buffer should be designed off-chip to avoid bond-wire effect. The low-dropout regulator would be added to make the power supply of a clock buffer clean. Moreover, if a differential clock generator is used, the frequency provided by an external signal generator will be half $\mathrm{F}_{\mathrm{s}}$ of ADC and bond-wire effect will be halved.

Finally, for the calibration of timing skew, the estimated timing skew should take $\mathrm{R}+\mathrm{C}+\mathrm{CC}$ postlayout and the Monte Carlo simulation into consideration. And thus the simulated timing skew will fit the measure result. The total range of delay line should be over the $\pm 3 \sigma$ of timing skew to ensure the range of calibration.

## Bibliography

[1] K. Okada et al., "A full 4-channel $6.3 \mathrm{~Gb} / \mathrm{s} 60 \mathrm{GHz}$ direct-conversion transceiver with low-power analog and digital baseband circuitry," in IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, Feb. 2012, pp. 218-220.
[2] T. Tsukizawa et al., "A fully integrated 60 GHz CMOS transceiver chipset based on WiGig/IEEE802.11ad with built-in self calibration for mobile applications," in IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, Feb. 2013, pp. 230-231.
[3] S.-W.M. Chen and R.W. Brodersen, "A 6-bit $600-\mathrm{MS} / \mathrm{s} 5.3-\mathrm{mW}$ asynchronous ADC in $0.13-\mu \mathrm{m}$ CMOS," IEEE J. Solid-State Circuits, vol. 41, no. 12, pp. 2669-2680, Dec. 2006.
[4] J. Yang, T.L. Naing, and R.W. Brodersen, "A 1 GS/s 6 Bit 6.7 mW successive approximation ADC using asynchronous processing," IEEE J. Solid-State Circuits, vol. 45, no. 8, pp. 1469-1478, Aug. 2010.
[5] T. Jiang et al, "A single-channel, $1.25-\mathrm{GS} / \mathrm{s}$, 6 -bit, $6.08-\mathrm{mW}$ asynchronous successive approximation ADC with improved feedback delay in 40-nm CMOS," IEEE J. Solid-State Circuits, vol. 47, no. 10, pp. 2444-2453, Oct. 2012.
[6] L. Kull et al, "A $3.1 \mathrm{~mW} 8 \mathrm{~b} 1.2 \mathrm{GS} / \mathrm{s}$ single-channel asynchronous SAR ADC with alternate comparators for enhanced speed in 32 nm digital SOI CMOS," IEEE J. Solid-State Circuits, vol. 48, no. 12, pp. 3049-3058, Dec. 2013.
[7] E. Alpman et al., "A 1.1V 50mW 2.5GS/s 7b time-interleaved C-2C SAR ADC in 45nm LP digital

CMOS," in IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, Feb. 2009, pp. 76-77.
[8] P.J.A. Harpe et al., "A $0.47-1.6 \mathrm{~mW} 5$-bit $0.5-1 \mathrm{GS} / \mathrm{s}$ time-interleaved SAR ADC for low-power UWB radios," IEEE J. Solid-State Circuits, vol. 47, no. 7, pp. 1594-1602, Jul. 2012.
[9] K. Doris et al., "A $480 \mathrm{~mW} 2.6 \mathrm{GS} / \mathrm{s} 10 \mathrm{~b}$ time-interleaved ADC With 48.5 dB SNDR up to nyquist in 65 nm CMOS," IEEE J. Solid-State Circuits, vol. 46, no. 12, pp. 2821-2833, Dec. 2011.
[10] D. Stepanovic and B. Nikolic, "A 2.8 GS/s 44.6 mW time-interleaved ADC achieving 50.9 dB SNDR and 3 dB effective resolution bandwidth of 1.5 GHz in 65 nm CMOS," IEEE J. Solid-State Circuits, vol. 48, no. 4, pp. 971-982, Apr. 2013.
[11] N. Kurosawa et al., "Explicit analysis of channel mismatch effects in time-interleaved ADC systems" IEEE Trans. Circuits Syst. I, Fundam. Theory Appl., vol.48, no.3, pp.261-271, Mar. 2001. [12] M. El-Chammas and B. Murmann, "A $12 \mathrm{GS} / \mathrm{s} 81-\mathrm{mW}$ 5-bit time-in-terleaved flash ADC with background timing skew calibration," IEEE J. Solid-State Circuits, vol. 46, no. 4, pp. 838-847, Apr. 2011.
[13] C.-C. Huang, C.-Y. Wang, and J.-T. Wu, "A CMOS 6-bit 16-GS/s time-interleaved ADC using digital background calibration techniques," IEEE J. Solid-State Circuits, vol. 46, no. 4, pp. 848-858, Apr. 2011.
[14] S. Lee et al., "A 1 GS/s 10b 18.9 mW time-interleaved SAR ADC with background timing skew calibration," IEEE J. Solid-State Circuits, vol. 49, no. 12, pp. 2846-2856, Dec. 2014.
[15] N. Le Dortz et al., "A 1.62 GS/s time-interleaved SAR ADC with digital background mismatch
calibration achieving interleaving spurs below 70 dBFS ," in IEEE Int. Solid-State Circuits Conf., Dig.

Tech. Papers, Feb. 2014, pp. 386-388.
[16] C.-H. Tsai, "A single-channel 6-bit 800MS/s two-step SAR ADC", Master Thesis, Graduate Institute of Electronics Engineering, College of Electrical Engineering \& Computer Science, National Taiwan University, Nov. 2014.
[17] I.-N. Ku et al., "A 40-mW 7-bit 2.2-GS/s time-interleaved subranging CMOS ADC for lowpower gigabit wireless communications," IEEE J. Solid-State Circuits, vol. 47, no. 8, pp. 1854-1865, Aug. 2012.
[18] S.K. Gupta, M.A. Inerfield, and J. Wang, "A 1-GS/s 11-bit ADC with 55-dB SNDR, 250-mW power realized by a high bandwidth scalable time-interleaved architecture," IEEE J. Solid-State Circuits, vol. 41, no. 12, pp. 2650-2657, Dec. 2006.
[19] C.-C. Hsu et al., "An 11b 800MS/s time-interleaved ADC with digital background calibration," in IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, Feb. 2007, pp. 464-465.
[20] S. Louwsma et al., "A $1.35 \mathrm{GS} / \mathrm{s}, 10 \mathrm{~b}, 175 \mathrm{~mW}$ time-interleaved AD converter in $0.13 \mu \mathrm{~m}$

CMOS," IEEE J. Solid-State Circuits, vol. 43, no.4, pp. 778-786, Apr. 2008.
[21] A. Varzaghani et al., "A 10.3-GS/s, 6-bit flash ADC for 10G ethernet applications," IEEE J. Solid-State Circuits, vol. 48, no.12, pp. 3038-3048, Dec. 2013.
[22] H. Wei et al., "An 8 bit 4GS/s120 mW CMOS ADC," IEEE J. Solid-State Circuits, vol. 49, no.8, pp. 1751-1761, Aug. 2014.
[23] Y. Duan and E. Alon, "A 12.8 GS/s time-interleaved ADC with 25 GHz effective resolution bandwidth and 4.6 ENOB," IEEE J. Solid-State Circuits, vol. 49, no.8, pp. 1725-1738, Aug. 2014.
[24] P. Schvan et al., "A 24GS/s 6b ADC in 90nm CMOS," in IEEE Int. Solid-State Circuits Conf., Dig. Tech. Papers, Feb. 2008, pp. 544-545.
[25] H.-Y. Tai et al., "A 6-bit 1-GS/s two-step SAR ADC in 40-nm CMOS," IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 61, no. 5, pp.339-343, May 2014.
[26] T.-Y. Tang, T.-H. Tsai and K. Chen et al., "Timing mismatch background calibration for timeinterleaved ADCs," in TENCON 2012-2012 IEEE Region 10 Conference, Nov. 2012, pp. 1-4.
[27] B.P. Ginsburg et al., " $500-\mathrm{MS} / \mathrm{s} 5$-bit ADC in $65-\mathrm{nm}$ CMOS with split capacitor array DAC," IEEE J. Solid-State Circuits, vol. 42, no. 4, pp. 739-747, Apr. 2007.
[28] Y. Zhu et al., "A 10-bit 100-MS/s reference-free SAR ADC in 90 nm CMOS," IEEE J. SolidState Circuits, vol.45, no.6, pp.1111-1121, Jun. 2010.
[29] C.-C. Liu et al., "A 10-bit 50-MS/s SAR ADC with a monotonic capacitor switching procedure," IEEE J. Solid-State Circuits, vol.45, no.4, pp.731-740, Apr. 2010.
[30] B. Wicht, T. Nirschl, D. Schmitt-Landsiedel, "Yield and speed optimization of a latch-type voltage sense amplifier," IEEE J. Solid-State Circuits, vol.39, no.7, pp.1148-1158, Jul. 2004.
[31] S. Louwsma, E.V. Tuijl and B. Nauta, Time-interleaved analog-to-digital converters. New York: Springer, 2011, ch.2.
[32] W. C. Black, JR., and D. A. Hodges, "Time interleaved converter arrays," IEEE J. Solid-State

Circuits, vol. SC-15, no.6, pp. 1022-1029, Dec. 1980.
[33] K. Poulton, J.J. Corcoran, and T. Hornak. "A 1-GHz 6-bit ADC system," IEEE J. Solid-State Circuits, vol. SC-22, no.6, pp. 962-970, Dec. 1987.
[34] B.-R.-S. Sung et al., "A 6 bit 2 GS/s flash-assisted time-interleaved (FATI) SAR ADC with background offset calibration," in Proc. IEEE Asian Solid-State Circuits Conf., 2013, pp. 281-284.
[35] H. Lee et al., "A 6-bit 2.5-GS/s time-interleaved analog-to-digital converter using resistor-array sharing digital-to-analog converter," IEEE Trans. Very Large Scale Integr. (VLSI) Syst., pp. 23712383, Nov. 2014.
[36] S. Kundu et al., "A 1.2 V 2.64 GS/s 8 bit 39 mW skew-tolerant time-interleaved SAR ADC in 40 nm digital LP CMOS for 60 GHz WLAN," IEEE Trans. Circuits Syst. I, Reg. Papers, vol.62, no.8, pp.1929-1939, Aug. 2015.
[37] R. Reeder and R. Ramachandran. "Wideband A/D converter front-end design considerations When to use a double transformer configuration." Analog Dialogue 40-3. pp. 19-22. 2006.
[38] Mini-Circuits, "Surface mount RF transformer JTX-4-10T+,"
http://www.minicircuits.com/pdfs/JTX-4-10T+.pdf
[39] Mini-Circuits, "Surface mount RF transformer TCM4-14+,"

## http://www.minicircuits.com/pdfs/TCM4-14+.pdf

[40] Mini-Circuits, "Surface mount RF transformer TCM4-452X+,"
http://www.minicircuits.com/pdfs/TCM4-452X+.pdf
[41] Tektronix, " 11 GHz differential pulse splitter PSPL5320B datasheet,"
http://www.tek.com/sites/tek.com/files/media/media/resources/PSPL5320B-Datasheet-0.pdf
[42] C.-H. Chan et al., "A 6 b 5 GS/s 4 interleaved 3 b/cycle SAR ADC," IEEE J. Solid-State

Circuits, vol.51, no.2, pp.365-377, Feb. 2016.


[^0]:    (1) SNDR under $\mathrm{F}_{\text {in }}$ of 1 GHz

