# A General Sign Bit Error Correction Scheme for Approximate Adders 

Rui Zhou and Weikang Qian<br>University of Michigan-Shanghai Jiao Tong University Joint Institute<br>Shanghai Jiao Tong University, Shanghai, China<br>Email: \{zhouruisjtu, qianwk\}@sjtu.edu.cn


#### Abstract

Approximate computing is an emerging design technique for error-tolerant applications. As adders are the key building blocks in many applications, approximate adders have been widely studied recently. However, existing approximate adders may introduce sign bit error when doing two's complement signed addition, which is not tolerable for some applications. In this work, we propose a scheme that can correct sign bit error with low area and delay overhead. It is a general design applicable to many block-based approximate adders. This design not only can correct the sign bit error when it occurs, but also can fix some errors in the most significant bits even if there is no sign bit error. Experimental results on a real application, namely edge detection, showed that the approximate adders with our sign bit error correction module were up to 5.5 times better in peak signal-tonoise ratio than the original approximate adders, while the area and delay overhead is small.


## Keywords

Approximate computing, approximate adder, sign bit error correction

## 1. INTRODUCTION

With the continuous scaling down of CMOS transistors, energy consumption has become a major concern in designing VLSI chips. Approximate computing, an emerging technology that improves circuit performance and energy efficiency by allowing a little amount of inaccuracy in the computation result, attracts many research efforts recently, due to its applicability to many error-tolerant applications, such as image processing, machine learning, multimedia, and data mining [1].

As adders are key building blocks for many error-tolerant applications, a number of approximate adders were proposed recently [2-9]. Most of them fall into the category of block-based approximate adder. It splits the entire adder into several blocks of sub-adders in order to reduce computation delay. Each sub-adder takes a speculated carry-in, which is produced based on some previous input bits. The computation of each sub-adder is correct. Due to the popularity of the block-based approximate adder, it is the focus of this work.

[^0]$A_{m-1}\left(a_{\left.k_{m-1}^{m-1} \ldots a_{0}^{m-1}\right)} \quad A_{m-2}\left(a_{k_{m-2}-1}^{m-2} \ldots a_{0}^{m-2}\right) A_{m-3}\left(a_{k_{m-3}-1}^{m-3} \ldots a_{0}^{m-3}\right)\right.$
$\underbrace{B_{m-1}\left(b_{k_{m-1}-1}^{m-1} \ldots b_{0}^{m-1}\right)} \underbrace{B_{m-2}\left(b_{k_{m-2}-1}^{m-2} \ldots b_{0}^{m-2}\right)} \underbrace{B_{m-3}\left(b_{k_{m-3}-1}^{m-3} \ldots b_{0}^{m-3}\right)}$ )


Fig. 1: Error Tolerant Adder Type II (ETAII) proposed in [2].
A representative of the block-based approximate adder is the Error Tolerant Adder II (ETAII) proposed in [2], which is shown in Fig. 1. In ETAII, it segments the whole adder into $m$ blocks, and the carry-in signal of each sub-adder is produced by a carry generator operating on the input bits in the previous block. The carry-in signal of each carry generator is set as 0 . There are many other examples of block-based approximate adder. The Carry Skip Approximate Adder (CSA) proposed in [3] also has $m$ blocks, but the carry-in signal of the $i$-th sub-adder depends on input bits of the $(i-1)$-th block and the $(i-2)$-th block. The Low Relative Error Approximate Adder (LREA) proposed in [4] and the Speculative Carry Select Adder (SCSA) proposed in [5] have similar design styles.


Fig. 2: An example of sign bit error in ETAII.
However, all these typical approximate adders mentioned above may generate an error in the sign bit when doing two's complement signed addition. For example, in a 6-bit 3-block ETAII adder, the sign bit is wrong when the two inputs are 011111 (base 2) and 100010 (base 2), as shown in Fig. 2. The approximate sum is 110001 (base 2), but the accurate sum is 000001 (base 2). The error is caused by the wrong carry-in signal fed into the leftmost sub-adder, which is produced by the carry generator in the previous block.

Having a sign bit error in approximate addition may cause a catastrophic failure for many applications and may not be tolerable. There are only few existing approximate adders which can correct the sign bit error, and their designs all have some limitations. For example, the Variable Latency Speculative Adder (VLSA) proposed in [6] combines a proposed approximate adder with an accurate adder for error recovery. However, it requires a large circuit area. To ensure correct sign bit calculation, it takes
two clock cycles. Furthermore, when the correction module is enabled, its result is always correct, which is not necessary for error-tolerant applications. In the LREA, the authors proposed a sign bit error correction module, but it is only applicable to LREA itself [4].

In this paper, we proposed a low-overhead module to correct sign bit error for block-based approximate adders. Compared to the previous approaches used in [4] and [6], our module is a general design which is applicable to many existing approximate adders regardless how the speculated carry-in to the sub-adder is generated. The only requirement is that all sub-adders are correct.

In summary, the main contributions of our work are as follows:

1) We give a general and comprehensive analysis on when sign bit error occurs.
2) Based on the error analysis, we propose an efficient design to correct the sign bit error.
3) We apply the proposed design to several existing approximate adders and experimentally demonstrate its good performance.
The remainder of this paper is organized as follows. In Section 2, we give an analysis on when sign bit error occurs. In Section 3, we propose a method to correct sign bit error. In Section 4, we show our optimized circuit design. In Section 5, we show the experimental results. Finally, in Section 6, we draw the conclusion.

## 2. ERROR ANALYSIS

In this section, we will analyze all kinds of situations where a sign bit error might occur in the computation of approximate adder and provide the correction mechanism for each situation. In our analysis, we assume that the inputs do not cause an overflow.

The only pre-requisite of our analysis is that the sub-adder of each block is correct. This requirement is satisfied for all the existing block-based approximate adders. Before the formal analysis, we define the following notations.

- $A=\left(a_{n-1} a_{n-2} \ldots a_{0}\right)$ and $B=\left(b_{n-1} b_{n-2} \ldots b_{0}\right)$ are the two inputs of the adder, where $a_{i}$ and $b_{i}$ represent the $i$-th bits of $A$ and $B$, respectively
- We assume the approximate adder has $m$ blocks and the $i$-th ( $0 \leq i \leq m-1$ ) block has $k_{i}$ bits. Therefore, we have $n=$ $\sum_{i=0}^{m-1} k_{i}$.
- $s_{i}, c_{i}, p_{i}$, and $g_{i}$ represent the $i$-th $(0 \leq i \leq n-1)$ sum bit, carry-out bit, propagation bit, and generation bit of the accurate adder, respectively. $c_{\text {in }}$ represents the carry-in of the entire adder. For the ease of discussion, we define $c_{-1}=c_{i n}$. By the behavior of the adder, we have

$$
\begin{gathered}
s_{i}=a_{i} \oplus b_{i} \oplus c_{i-1}, \quad c_{i}=a_{i} b_{i}+\left(a_{i}+b_{i}\right) c_{i-1} \\
p_{i}=a_{i} \oplus b_{i}, \quad g_{i}=a_{i} b_{i} .
\end{gathered}
$$

Note that the sum bit and the carry-out bit can be calculated by the propagate bit and the generate bit as follows:

$$
s_{i}=p_{i} \oplus c_{i-1}, \quad c_{i}=g_{i}+p_{i} c_{i-1}
$$

For analysis purpose, we also split the accurate adder into $m$ blocks. For each block of approximate adder and accurate adder, we define the following signals.

- $C_{\text {app_sum_in }}^{i}$ : the carry-in signal of the $i$-th sub-adder of the approximate adder. This signal varies for different approximate adders.
- $C_{\text {acc_sum_in }}^{i}$ : the carry-in signal of the $i$-th block of the accurate adder.
- $S_{\text {app }}^{i}$ : the $k_{i}$-bit sum of the $i$-th sub-adder of the approximate adder.
- $S_{a c c}^{i}$ : the $k_{i}$-bit sum of the $i$-th block of the accurate adder.
- $s_{a p p, j}^{i}\left(0 \leq j \leq k_{i}-1\right)$ : the sum bit at position $j$ in block $i$ of the approximate adder.
- $s_{a c c, j}^{i}\left(0 \leq j \leq k_{i}-1\right)$ : the sum bit at position $j$ in block $i$ of the accurate adder.
- $c_{a p p, j}^{i}\left(0 \leq j \leq k_{i}-2\right)$ : the carry-out bit at position $j$ in the $i$ th sub-adder of the approximate adder.
- $c_{a c c, j}^{i}\left(0 \leq j \leq k_{i}-2\right)$ : the carry-out bit at position $j$ in block $i$ of the accurate adder.
- $A_{i}$ and $B_{i}$ : the $2 k_{i}$-bit inputs of block $i$.
- $a_{j}^{i}$ and $b_{j}^{i}\left(0 \leq j \leq k_{i}-1\right)$ : the 2 input bits at position $j$ in block $i$.
- $p_{j}^{i}\left(0 \leq j \leq k_{i}-1\right)$ : the propagate bit at position $j$ in block $i$.
- $g_{j}^{i}\left(0 \leq j \leq k_{i}-1\right)$ : the generate bit at position $j$ in block $i$.

Some of the above signals for approximate adder and accurate adder are shown in Fig. 3(a) and 3(b), respectively.

(a)

(b)

Fig. 3: Signal definitions for (a) an approximate adder and (b) an accurate adder.

The necessary condition for a sign bit error to occur is that for block $m-1, C_{\text {app_sum_in }}^{m-1} \neq C_{\text {acc_sum_in }}^{m-1}$. This condition can be further divided into two cases:

- Case 2.1: $C_{\text {app_sum_in }}^{m-1}=0$ and $C_{\text {acc_-sum_in }}^{m-1}=1$;
- Case 2.2: $C_{\text {app_sum_in }}^{m-1}=1$ and $C_{\text {acc_sum_in }}^{m-1}=0$.

We will discuss them separately in the following sections.
$2.1 C_{a p p_{-} s u m_{-} i n}^{m-1}=0$ and $C_{a c c_{-} s u m_{-} i n}^{m-1}=1$
Without loss of generality, we assume that the input $A \geq B$. Then, there are four cases where a sign bit error occurs:

1. $A \geq 0, B \geq 0, S_{\text {app }}^{m-1}<0$;
2. $A<0, B<0, S_{a p p}^{m-1} \geq 0$;
3. $A \geq 0, B<0, S_{a c c}^{m-1} \geq 0, S_{a p p}^{m-1}<0$;
4. $A \geq 0, B<0, S_{a c c}^{m-1}<0, S_{a p p}^{m-1} \geq 0$.

We consider these four cases in the following subsections.

### 2.1.1 The Case where $A \geq 0, B \geq 0, S_{a p p}^{m-1}<0$

This case is shown in Fig. 4(a). In this case, we must have $S_{a p p}^{m-1}=11 \ldots 11$ and $S_{a c c}^{m-1}=00 \ldots 00$. However, since $C_{\text {app_sum_in }}^{m-1}=0$ and $s_{a p p, j}^{m-1}=1$ for $j=0,1, \ldots, k_{m-1}-2$, we must have $a_{j}^{m-1} \oplus b_{j}^{m-1}=1$ for $j=0,1, \ldots, k_{m-1}-2$. Thus, the speculated carry-in to the sign bit is $c_{a p p, k_{m-1}-2}^{m-1}=0$. Given that the sign bits of the inputs $A$ and $B$ are both 0 , we must have $S_{a p p}^{m-1}=01 \ldots 11$, which contradicts with the claim that $S_{a p p}^{m-1}=$ $11 \ldots 11$. Therefore, this situation cannot happen.

### 2.1.2 The Case where $A<0, B<0, S_{a p p}^{m-1} \geq 0$

This case is shown in Fig. 4(b). In this case, we must have $S_{a p p}^{m-1}=01 \ldots 11$ and $S_{a c c}^{m-1}=10 \ldots 00$. This situation is possible. For example, when $A_{m-1}=1010$ and $B_{m-1}=1101$, then we have $S_{a p p}^{m-1}=0111$ and $S_{a c c}^{m-1}=1000$. To correct the error in this case, we need to invert the sign bit and set the other sum bits in block $(m-1)$ to 0 .


Fig. 4: Illustration of (a) Case 2.1.1, (b) Case 2.1.2, (c) Case 2.1.3, and (d) Case 2.1.4.


Fig. 5: Illustration of (a) Case 2.2.1, (b) Case 2.2.2, (c) Case 2.2.3, and (d) Case 2.2.4.

### 2.1.3 The Case where $A \geq 0, B<0, S_{a c c}^{m-1} \geq$ $0, S_{a p p}^{m-1}<0$

This case is shown in Fig. 4(c). In this case, we must have $S_{a p p}^{m-1}=11 \ldots 11$ and $S_{a c c}^{m-1}=00 \ldots 00$. This situation is possible. For example, when $A_{m-1}=0010$ and $B_{m-1}=1101$, then we have $S_{a p p}^{m-1}=1111$ and $S_{a c c}^{m-1}=0000$. To correct the error in this case, we need to invert the sign bit and set the other sum bits in block $(m-1)$ to 0 .

> 2.1.4 The Case where $A \geq 0, B<0, S_{a c c}^{m-1}<$ $0, S_{a p p}^{m-1} \geq 0$

This case is shown in Fig. 4(d). In this case, we must have $S_{a p p}^{m-1}=01 \ldots 11$ and $S_{a c c}^{m-1}=10 \ldots 00$. However, applying the same argument used for Case 2.1.1, we can show that $S_{a p p}^{m-1}$ should be $11 \ldots 11$. Therefore, this situation cannot happen.

## $2.2 C_{a p p_{-} \text {sum_in }}^{m-1}=1$ and $C_{a c c \_s u m_{-} i n}^{m-1}=0$

We consider the same four cases as shown in Section 2.1.

### 2.2.1 The Case where $A \geq 0, B \geq 0, S_{a p p}^{m-1}<0$

This case is shown in Fig. 5(a). In this case, we must have $S_{a p p}^{m-1}=10 \ldots 00$ and $S_{a c c}^{m-1}=01 \ldots 11$. This situation is possible. For example, when $A_{m-1}=0010$ and $B_{m-1}=0101$, then we have $S_{a p p}^{m-1}=1000$ and $S_{a c c}^{m-1}=0111$. To correct the error in this case, we need to invert the sign bit and set the other sum bits in block $(m-1)$ to 1 .

### 2.2.2 The Case where $A<0, B<0, S_{a p p}^{m-1} \geq 0$

This case is shown in Fig. 5(b). In this case, we must have $S_{a p p}^{m-1}=00 \ldots 00$ and $S_{a c c}^{m-1}=11 \ldots 11$. However, since $C_{a c c \_ \text {sum_in }}^{m-1}=0$ and $s_{a c c, j}^{m-1}=1$ for $j=0,1, \ldots, k_{m-1}-2$, we must have $a_{j}^{m-1} \oplus b_{j}^{m-1}=1$ for $j=0,1, \ldots, k_{m-1}-2$. Thus, the correct carry-in to the sign bit is $c_{a c c, k_{m-1}-2}^{m-1}=0$. Given that the sign bits of the inputs $A$ and $B$ are both 1, we must have $S_{a c c}^{m-1}=$ $01 \ldots 11$, which contradicts with the claim that $S_{a c c}^{m-1}=11 \ldots 11$. Therefore, this situation cannot happen.

### 2.2.3 The Case where $A \geq 0, B<0, S_{a c c}^{m-1} \geq$ $0, S_{a p p}^{m-1}<0$

This case is shown in Fig. 5(c). In this case, we must have $S_{a p p}^{m-1}=10 \ldots 00$ and $S_{a c c}^{m-1}=01 \ldots 11$. However, applying the same argument used for Case 2.2.2, we can show that $S_{a c c}^{m-1}$ should be $11 \ldots 11$. Therefore, this situation cannot happen.
2.2.4 The Case where $A \geq 0, B<0, S_{a c c}^{m-1}<$
$0, S_{a p p}^{m-1} \geq 0$
This case is shown in Fig. 5(d). In this case, we must have $S_{a p p}^{m-1}=00 \ldots 00$ and $S_{a c c}^{m-1}=11 \ldots 11$. This situation is possible. For example, when $A_{m-1}=0010$ and $B_{m-1}=1101$, then we have $S_{a p p}^{m-1}=0000$ and $S_{a c c}^{m-1}=1111$. To correct the error in this case, we need to invert the sign bit and set the other sum bits in block $(m-1)$ to 1 .

### 2.3 Summary

The above discussion considers all cases where a sign bit error might occur when we do two's complement addition. To sum up, if a sign bit error happens, we need to identify whether the case is Case 2.1 or Case 2.2. To correct a sign bit error for Case 2.1, we invert the sign bit and set all the other sum bits in block $(m-1)$ to 0 . To correct a sign bit error for Case 2.2, we invert the sign bit and set all the other sum bits in block $(m-1)$ to 1 .

## 3. METHODOLOGY FOR CORRECTING SIGN BIT ERROR

In this section, we show a general methodology for correcting sign bit error.

First, we define the group propagate signal $p p_{i}$ for each block. For any block $0 \leq i \leq m-2, p p_{i}=\prod_{j=0}^{k_{i}-1} p_{j}^{i}$. If $p p_{i}=1$, then all the propagate signals of block $i$ are 1 . Thus, in the accurate adder, the carry-in of block $i$ propagates to the carry-out of block $i$, which is just the carry-in of block $(i+1)$. Therefore, we have $C_{\text {acc_sum_in }}^{i+1}=C_{\text {acc_sum_in }}^{i}$ when $p p_{i}=1$.

The group propagate signal for block $(m-1)$ is defined slightly different from the other blocks in that it does not include
$p_{k_{m-1}-1}^{m-1}$. Specifically, $p p_{m-1}=\prod_{j=0}^{k_{m-1}-2} p_{j}^{m-1}$. For the ease of discussion, we also define $p p_{-1}=0$.

When $p p_{m-1}=0$, the carry-out signal $c_{a p p, k_{m-1}-2}^{m-1}$ in the ( $m-$ 1)-th sub-adder of the approximate adder does not depend on the speculated carry-in signal of the sub-adder. Therefore, it is equal to the correct carry-out signal $c_{a c c, k_{m-1}-2}^{m-1}$. As a result, the sign bit produced by the approximate adder is correct. Thus, in the following, we assume that $p p_{m-1}=1$.

From the analysis shown in Section 2, we can see that to check whether there is a sign bit error, we need to know $C_{\text {acc_sum_in }}^{m-1}$.

To obtain $C_{\text {acc_sum_in }}^{m-1}$, we first modify each sub-adder in the approximate adder by adding some extra logic to produce a carryout. The carry-out of the $i$-th sub-adder, denoted as $C_{\text {app_out }}^{i}$, can be calculated by the existing signals in the sub-adder as

$$
C_{\text {app_out }}^{i}=g_{k_{i}-1}^{i}+p_{k_{i}-1}^{i} \cdot c_{\text {app }, k_{i}-2}^{i} .
$$

The circuit is shown in Fig. 6. For the ease of discussion, we define $C_{\text {app_out }}^{-1}=c_{\text {in }}$.


Fig. 6: Circuits for generating the carry-out signal of a subadder.

It is easy to see that when $p p_{i}=0$, the carry-out $C_{a p p_{-} \text {out }}^{i}$ is equal to the correct carry-out in the accurate adder. Indeed, in our proposed design, the modification to each sub-adder is not necessary, as long as when $p p_{i}=0$, there is a signal equal to the correct carry-out of block $i$. This is true for many existing approximate adders in which for any block $0 \leq i \leq m-2$, there is a carry generator that produces a speculated carry-out signal based on at least all the input bits in that block. For example, the ETAII shown in Fig. 1, the CSA proposed in [3], and the LREA proposed in [4] all satisfy this condition. For this type of approximate adder, when $p p_{i}=0$, the speculated carry-out produced by the carry generator is equal to the correct carry-out of block $i$. We can just use the speculated carry-out as the signal $C_{\text {app_out }}^{i}$, which takes no extra circuitry.

Since we assume $p p_{m-1}=1$ and $p p_{-1}=0$, there must exist a $0 \leq t \leq m-1$ such that $p p_{m-1}=p p_{m-2}=\cdots=p p_{t}=1$ and $p p_{t-1}=0$. This value $t$ plays an important role in our correction scheme. Since $p p_{t-1}=0$, by the above discussion, the signal $C_{\text {app_out }}^{t-1}$ in the approximate adder gives the correct carry-out of block $t-1$, which is just the correct carry-in of block $t$, $C_{\text {acc_sum_in }}^{t}$. Because $p p_{m-2}=p p_{m-3}=\cdots=p p_{t}=1$, the accurate carry-in of block $m-1, C_{\text {acc_sum_in }}^{m-1}$ is equal to $C_{\text {acc_sum_in }}^{t}$, which in turn is equal to $C_{\text {app_out }}^{t-1}$. This gives us a way to obtain $C_{\text {acc_sum_in }}^{m-1}$. Note that the speculated carry-in $C_{\text {app_sum_in }}^{m-1}$ is also available in the given approximate adder. Then we can compare these two signals to check for sign bit error. If $C_{\text {acc_sum_in }}^{m-1}=C_{\text {app_sum_in }}^{m-1}$, there is no sign bit error. Otherwise, the computation might have a sign bit error and we can fix the error based on the different correction strategies discussed in Section 2.

However, in our actual implementation, we use a better strategy that not only can fix sign bit error when it occurs, but also can fix some errors in the most significant bits even if there is no sign bit error.

Our idea utilizes the signal $C_{\text {app_out }}^{t-1}$ and the fact that $p p_{m-1}=$ $\cdots=p p_{t}=1$. When $C_{\text {app_out }}^{t-1}=0$, for the accurate adder, we have $s_{a c c, j}^{i}=1$ for $t \leq i \leq m-2$ and $0 \leq j \leq k_{i}-1, s_{a c c, j}^{m-1}=1$ for $0 \leq j \leq k_{m-1}-2$, and $c_{\text {acc, } k_{m-1}-2}^{m-1}=0$. When $C_{\text {app_out }}^{t-1}=1$, for the accurate adder, we have $s_{a c c, j}^{i}=0$ for $t \leq i \leq m-2$ and $0 \leq j \leq k_{i}-1, \quad s_{a c c, j}^{m-1}=0 \quad$ for $0 \leq j \leq k_{m-1}-2, \quad$ and $c_{a c c, k_{m-1}-2}^{m-1}=1$.

Summarizing the above patterns, we can conclude that no matter whether $C_{\text {app_out }}^{t-1}$ is 0 or 1, for the accurate adder, we have $\frac{s_{\text {acc, }, j}^{i}}{}=\overline{C_{\text {app_out }}^{t-1}}$ for $t \leq i \leq m-2$ and $0 \leq j \leq k_{i}-1, s_{\text {acc, } j}^{m-1}=$
$C_{\text {app_out }}^{t-1}$ for $0 \leq j \leq k_{m-1}-2$, and $c_{\text {acc, } k_{m-1}-2}^{m-1}=C_{\text {app_out }}^{t-1}$. Thus, no matter whether the above output bits in the approximate adder have error or not, we can simply set them to their correct values shown above. Once we get the correct carry-out $c_{a c c, k_{m-1}-2}^{m-1}$, the correct sign bit can be easily obtained as

$$
s_{a c c, k_{m-1}-1}^{m-1}=p_{k_{m-1}-1}^{m-1} \oplus c_{a c c, k_{m-1}-2}^{m-1}
$$

For the remaining sum bits in blocks $t-1, \ldots, 0$, we just keep them unchanged.

## 4. CIRCUIT DESIGN

In this section, we show the circuit design for correcting the sign bit error, which implements the methodology discussed in Section 3.
The implementation needs to address two key questions: (1) how to identify the specific value $t$ for any input vector? (2) how to obtain the value $C_{\text {app_out }}^{t-1}$ ?

To address the above questions, we define the following signals.

- $p c_{i}=p p_{m-1} \cdot p p_{m-2} \cdot \cdots \cdot p p_{i}$, for $0 \leq i \leq m-1$. By the definition of the value $t$, for $t \leq i \leq m-1$, we have $p c_{i}=1$. For $0 \leq i<t$, since $p p_{t-1}=0$, we have $p c_{i}=0$. This gives us a way to test whether $i \geq t$ or $i<t$.
- $p t_{i}=p c_{i} \cdot \overline{p p_{i-1}}$, for $0 \leq i \leq m-1$. By the definition of the value $t$, we have $p t_{t}=1$. For any $t<i \leq m-1$, since $p p_{i-1}=1$, we have $p t_{i}=0$. For any $0 \leq i<t$, since $p c_{i}=0$, we have $p t_{i}=0$. In summary, among all $p t_{i}$ 's, only $p t_{t}=1$. This gives us a way to identify $t$.
- $c v=\sum_{i=0}^{m-1} p t_{i} \cdot C_{\text {app_out }}^{i-1}$. Since among all $p t_{i}$ 's, only $p t_{t}=1$, we have $c v=C_{\text {app_out }}^{t-1}$. This gives us a way to obtain $C_{\text {app_out }}^{t-1}$. The circuits for the signals $p c_{i}, p t_{i}$, and $c v$ are shown in Fig. 7.


Fig. 7: Circuits for (a) the signal $p c_{i}$, (b) the signal $p t_{i}$, and (c) the signal $\boldsymbol{c v}$.

For any $0 \leq i \leq m-2$, if $p c_{i}=1$, then by the property of signal $p c_{i}$, we must have $i \geq t$. Based on the correction methodology discussed in Section 3, for any $0 \leq j \leq k_{i}-1$, we set $s_{\text {new }, j}^{i}=\overline{c v}$, where $s_{\text {new }, j}^{i}$ is the new sum bit at position $j$ in block $i$ of the approximate adder. For any $0 \leq i \leq m-2$, if $p c_{i}=0$, then we must have $i<t$. Based on the correction methodology, we keep $s_{n e w, j}^{i}$ as the old approximate value $s_{\text {app }, j}^{i}$. If $p c_{m-1}=1$, then we set $s_{n e w, j}^{m-1}=\overline{c v}$, for any $0 \leq j \leq k_{m-1}-2$,
and set $c_{\text {new }, k_{m-1}-2}^{m-1}=c v$, where $c_{\text {new }, k_{m-1}-2}^{m-1}$ is the new carry-out bit at position $k_{m-1}-2$ in block $m-1$ of the approximate adder. If $p c_{m-1}=0$, the old sum bits $s_{a p p, j}^{m-1}\left(0 \leq j \leq k_{m-1}-2\right)$ and the old carry-out bit $c_{a p p, k_{m-1}-2}^{m-1}$ are kept.

Based on the above discussion, we can realize $s_{\text {new }, j}^{i}$ by a MUX, as shown in Fig. 8(a). The new sign bit $s_{n e w, k_{m-1}-1}^{m-1}$ is realized by the circuit shown in Fig. 8(b).


Fig. 8: Circuits for (a) the new sum bits and (b) the new sign bit of the approximate adder.

## 5. EXPERIMENTAL RESULTS

In this section, we present the experimental results on our proposed sign bit error correction module. The circuits were synthesized using Synopsys Design Compiler [10] and mapped to a 45 nm Nangate cell library [11].

### 5.1 Overhead for Different Approximate Adders

We applied the proposed sign bit error correction module to three existing approximate adders, ETAII [2], CSA [3], and LREA [4]. Table 1 shows the area, delay, power consumption, and power-delay product (PDP) of the three approximate adders with (denoted by "sign") and without (denoted by "basic") the sign bit error correction module. For comparison purpose, we also list the four metrics of the conventional ripple carry adder (RCA) and carry look-ahead adder (CLA). All the adders are 16-bit adders. The three approximate adders are divided into 4 blocks of equal size of 4 . For each approximate adder with the sign bit error correction module, the percentage increase of each metric due to the sign bit error correction module is shown inside the parentheses after the absolute value.
Table 1. Area, delay, power consumption, and power-delay product (PDP) of different 16-bit adders.

|  | Area <br> $\left(\mu m^{2}\right)$ | Delay <br> $(n s)$ | Power <br> $(m W)$ | PDP <br> $(\mathrm{fJ})$ |
| :--- | :---: | :---: | :---: | :---: |
| ETAII(basic) | 99.5 | 0.97 | 22.3 | 21.6 |
| ETAII (sign) | $129.1(30)$ | $1.12(15)$ | $22.8(2.2)$ | $25.5(18)$ |
| CSA (basic) | 114.6 | 1.16 | 23.0 | 26.7 |
| CSA (sign) | $138.1(21)$ | $1.31(13)$ | $23.9(3.9)$ | $31.3(17)$ |
| LREA(basic) | 121.0 | 1.24 | 25.8 | 31.9 |
| LREA (sign) | $139.6(15)$ | $1.28(3.2)$ | $26.2(1.6)$ | $33.5(5.0)$ |
| RCA | 109.8 | 2.32 | 23.9 | 55.4 |
| CLA | 145.3 | 1.78 | 29.9 | 53.2 |

From the table, we can see that by adding the sign bit error correction module, the area increases $15 \% \sim 30 \%$, the delay increases $3.2 \% \sim 15 \%$, the power consumption increases $1.6 \% \sim 3.9 \%$ and the PDP increases $5.0 \% \sim 18 \%$. The overhead for ETAII is the largest, since ETAII design is the simplest among the three. Overall, the overhead of the sign bit error correction module is small. Compared with RCA, although the approximate adders with the sign bit error correction module have slightly larger
power consumption, they have much smaller delay and PDP. Compared with CLA, all of the four metrics of the approximate adders with the sign bit error correction module are smaller.

### 5.2 Overhead for Same Approximate Adder of Different Sizes

In this experiment, we applied the proposed sign bit error correction module to ETAIIs of different design parameters. We considered 5 different ETAIIs, which are listed in Table 2. The pair $(n, k)$ indicates the adder is an $n$-bit adder with $n / k$ blocks of equal size $k$. For example, the pair $(32,4)$ indicates a 32-bit ETAII with 8 blocks of equal size 4 . Table 2 shows the area, delay, and power consumption of them with (column "sign") and without (column "basic") the sign bit error correction module. The values in the parentheses are the percentage increases.

Table 2. Area, delay, and power consumption of different ETAIIs with and without the sign bit error correction module.

| ETAII <br> parameters | Area $\left(\mu m^{2}\right)$ |  | Delay $(n s)$ |  | Power $(m W)$ |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  | basic | sign | basic | sign | basic | sign |
| $(16,2)$ | 91.0 | 112.4 <br> $(24)$ | 0.68 | 1.05 <br> $(54)$ | 22.3 | 22.4 <br> $(0.4)$ |
| $(16,4)$ | 99.5 | 129.1 <br> $(30)$ | 0.97 | 1.12 <br> $(15)$ | 22.3 | 22.8 <br> $(2.2)$ |
| $(32,2)$ | 178.8 | 255.3 <br> $(43)$ | 0.57 | 1.76 <br> $(209)$ | 23.2 | 23.8 <br> $(2.6)$ |
| $(32,4)$ | 208.5 | 261.8 <br> $(26)$ | 0.97 | 1.42 <br> $(46)$ | 23.6 | 23.8 <br> $(0.8)$ |
| $(32,8)$ | 223.4 | 257.8 <br> $(15)$ | 1.64 | 1.76 <br> $(7.3)$ | 25.9 | 26.3 <br> $(1.5)$ |

From the table, we can see that when the number of blocks increases, the overhead in area and delay due to the sign bit error correction module also increases. The reason is that some submodules in the sign bit error correction module, such as the circuits for the signals $p c_{i}$ and $c v$, have area and delay complexity proportional to the number of blocks.

### 5.3 Performance Study on Edge Detection Application

To evaluate the effect of our sign bit error correction module in real applications, we applied several approximate adders with and without the sign bit error correction module to an image processing application, namely Roberts cross-based edge detection [12]. We used peak signal-to-noise ratio (PSNR) of the image as the performance metric. It is defined as

$$
P S N R=10 \cdot \log _{10}\left(\frac{M A X_{I}^{2}}{M S E}\right),
$$

where $M A X_{I}$ is the maximum possible pixel value of the image. MSE is the mean square error (MSE), which is defined as

$$
M S E=\frac{1}{W H} \sum_{i=0}^{W-1} \sum_{j=0}^{H-1}[I(i, j)-K(i, j)]^{2},
$$

where $W$ and $H$ are the width and the height of an image, respectively. The values $I(i, j)$ and $K(i, j)$ are the results calculated by the accurate adder and the approximate adder, respectively, at location $(i, j)$ in the image. For an accurate adder, its $M S E=0$ and hence, its $P S N R=+\infty$. The higher the PSNR is, the better the performance of the adder is.
We used three existing approximate adders, ETAII, CSA, and LREA in our experiment. We applied these adders with and without the sign bit error correction module to perform edge detection on 10 sample images chosen from an online library [13].

Table 3 shows the average PSNR over the 10 images in our simulation. The adder parameter pairs $(10,2)$ and $(10,5)$ in the table have the same meaning as before. The pair $(10,3)$ refers to an adder with 4 blocks, of which the leftmost three are of size 3 and the rightmost one is of size 1 . Note that some entries in the table is $+\infty$ because the MSE is either zero or very small. From the table, we can see that the approximate adders with the error correction module are up to $5.5 \times$ better in PSNR than the original ones.

Table 3. PSNRs of different approximate adders with and without the sign bit error correction module.

| Adder <br> parameter | ETAII |  | CSA |  | LREA |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  | basic | sign | basic | sign | basic | sign |
| $(10,2)$ | 9.34 | 28.2 | 7.95 | 33.8 | 9.88 | 42.7 |
| $(10,3)$ | 11.0 | 31.7 | 13.3 | 42.6 | 9.83 | 54.5 |
| $(10,5)$ | $+\infty$ | $+\infty$ | $+\infty$ | $+\infty$ | 17.8 | $+\infty$ |

To visually demonstrate the effectiveness of our proposed module, we show one sample image processed by the accurate adder and the LREAs with and without the sign bit error correction module for $n=10$ and $k=2,3,5$ in Fig. 9 .

(a) Input image

(c) $k=2$, no correction module

(e) $k=3$, no correction module

(g) $k=5$, no correction module

(b) Accurate adder

(d) $k=2$, with correction module

(f) $k=3$, with correction module

(h) $k=5$, with correction module

Fig. 9: Roberts cross-based edge detection using different adders.

From Fig. 9, we can clearly see that the approximate adders with our sign bit error correction module have much better performance than those adders without the correction module.

Finally, we did another comparison between two approximate adders. One is an LREA with $(n, k)=(10,5)$ and no sign bit error correction module. The other is an LREA with $(n, k)=$
$(10,2)$ and the sign bit error correction module. The comparison of their PSNRs, areas, delays, and power consumptions is shown in Table 4.

Table 4. Comparison of two approximate adders.

|  | PSNR | Area <br> $\left(\mu \mathrm{m}^{2}\right)$ | Delay <br> $(n s)$ | Power <br> $(\mathrm{mW})$ |
| :---: | :---: | :---: | :---: | :---: |
| LREA (10, 5), basic | 17.8 | 76.3 | 1.40 | 19.0 |
| LREA (10, 2), sign | 42.7 | 85.3 | 1.01 | 18.2 |

From the table, we can see that the LREA with $(n, k)=(10,2)$ and the sign bit error correction module is $12 \%$ larger in area than the LREA with $(n, k)=(10,5)$ and no sign bit error correction module. However, the delay and the power consumption of the former are $28 \%$ and $4.2 \%$ smaller than those of the latter, respectively. Furthermore, the PSNR of the former is $140 \%$ better than that of the latter. This result shows that in the case where the circuit area, delay, and power consumption are comparable, an approximate adder with the proposed sign bit error correction module is much better in PSNR than an approximate adder without the correction module.

## 6. CONCLUSION

In this paper, we proposed a general sign bit error correction scheme, which is applicable to many block-based approximate adders. The only requirement our scheme relies on is that each sub-adder in the approximate adder is correct, which is usually held. Our design is very efficient with low area, delay, and energy overhead. According to the properties of each specific approximate adder, the proposed design may be further simplified. The proposed module not only can correct the sign bit error when it occurs, but also can fix some errors in the most significant bits even if there is no sign bit error. Thus, it can further reduce the relative error and error rate of the original approximate adder. We applied our proposed module to an edge detection application and demonstrated the good performance of it.

## ACKNOWLEDGEMENTS

This work is supported by National Natural Science Foundation of China (NSFC) under Grant No. 61574089.

## 7. REFERENCES

[1] J. Han and M. Orshansky, "Approximate computing: An emerging paradigm for energy-efficient design," in ETS, 2013, pp. 1-6.
[2] N. Zhu, W. L. Goh, and K. S. Yeo, "An enhanced low-power high speed adder for error-tolerant application," in ISIC, 2009, pp. 69-72.
[3] Y. Kim, Y. Zhang, and P. Li, "An energy efficient approximate adder with carry skip for error resilient neuromorphic VLSI systems," in ICCAD, 2013, pp. 130-137.
[4] J. Hu and W. Qian, "A new approximate adder with low relative error and correct sign calculation," in DATE, 2015, pp. 1449-1454.
[5] K. Du, P. Varman, and K. Mohanram, "High performance reliable variable latency carry select addition," in DATE, 2012, pp. 12571262.
[6] A. K. Verma, P. Brisk, and P. Ienne, "Variable latency speculative addition: A new paradigm for arithmetic circuit design," in DATE, 2008, pp. 1250-1255.
[7] A. B. Kahng and S. Kang, "Accuracy-configurable adder for approximate arithmetic designs," in DAC, 2012, pp. 820-825.
[8] R. Ye, T. Wang, F. Yuan, R. Kumar, and Q. Xu, "On reconfiguration-oriented approximate adder design and its application," in ICCAD, 2013, pp. 48-54.
[9] H. R. Mahdiani, A. Ahmadi, S. M. Fakhraie, and C. Lucas, "Bioinspired imprecise computational blocks for efficient VLSI implementation of softcomputing applications," IEEE Transactions on Circuits and Systems I, vol. 57, no. 4, pp. 850-862, 2010.
[10] Design Compiler, Synopsys Inc., http://www.synopsys.com
[11] Nangate 45nm Library, Nangate Inc., http://www.nangate.com
[12] L. G. Roberts, "Machine perception of three-dimensional soups," Ph.D. dissertation, Massachusetts Institute of Technology, 1963.
[13] Image databases: standard test images, http://www.imageprocessingplace.com/root_files_V3/image_databas es.htm


[^0]:    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org.
    GLSVLSI'16, May 18-20, 2016, Boston, MA, USA.
    © 2016 ACM. ISBN 978-1-4503-4274-2/16/05 ... $\$ 15.00$
    DOI: http://dx.doi.org/10.1145/2902961.2903012

