EGT1
ENGINEERING TRIPOS PART IIA

Tuesday 03 May 2022 9:30 to 11:10

Module 3B2

## ELECTRICAL ENGINEERING

Answer not more than three questions.
All questions carry the same number of marks.
The approximate percentage of marks allocated to each part of a question is indicated in the right margin.

Write your candidate number not your name on the cover sheet.

## STATIONERY REQUIREMENTS

Single-sided script paper.

## SPECIAL REQUIREMENTS TO BE SUPPLIED FOR THIS EXAM

CUED approved calculator allowed.
Engineering Data Book.

10 minutes reading time is allowed for this paper at the start of the exam.

You may not start to read the questions printed on the subsequent pages of this question paper until instructed to do so.

You may not remove any stationery from the Examination Room.

## Version OBA/2

1 (a) Explain how field-programmable gate arrays (FPGA) implement logic functions, such as gates, in a programmable way. How are the inputs and outputs of a logic function realised in this concept? In what way does this differ from complex programmable logic devices (CPLD)? Which circuit elements can implement the programmable part representing logic functions in an FPGA?

## Solution:

A memory (also called look-up table in the FPGA world) represents logic functions. The content of the memory cell represents the output of the logic function (and is programmable). The address of the memory represents the input. CPLDs, an alternative approach for programmable logic, in contrast have mostly standard logic functions inside and mostly provides programmable interconnection resources in array form (which FPGAs often have on top). The memory in FPGAs can be implemented with typical memory circuits, such as all kinds of programmable ROMs (EPROM) or static memory (SRAM; Flip-Flops, etc.).
(b) Which two ways of describing the behaviour of a logic block (entity) exist in VHDL? What is the key difference between both?

## Solution:

Architecture: Architectures describe the logic functions and interconnections as a declaration. The individual lines of the code are interpreted during the synthesis phase to form an implementation that follows that declaration. Importantly, all lines are practically running at the same time so that the order does in most cases not matter.

Process: A process mimics higher-level programming languages, such as $\mathrm{C} / \mathrm{C}++$. The individual lines are practically executed sequentially, one by one.
(c) Which function is represented by the VHDL code in Fig. 1? Simplify the function and write it as a product of sums or a sum of products.

## Version OBA/2

## Solution:

$$
\begin{aligned}
f(x) & =\left(\overline{\left(x_{4} x_{3} x_{2} x_{1}+x_{4} \overline{\left.\overline{x_{3}} \overline{x_{2}} x_{1}+x_{1} \overline{x_{4}}+x_{1} x_{3}+x_{1} \overline{x_{2}} \overline{x_{1}} x_{4}+x_{1} \overline{x_{2}} x_{3}+x_{1} x_{4} x_{2} \overline{x_{3}}\right)}\right.} \begin{array}{rl} 
& \left.=\overline{\left(x _ { 1 } \left(x_{4} x_{3} x_{2}+x_{4} \overline{\overline{x_{3}}} \overline{x_{2}}+\overline{x_{4}}+x_{3}+x_{4} x_{2} \overline{x_{3}}\right.\right.}\right) \\
& =\overline{\left(x_{1}\left(x_{4} x_{3} x_{2}+x_{4} \overline{\overline{x_{3}} \overline{x_{2}}+\overline{x_{4}}+x_{3}+x_{4} \overline{x_{2}} \overline{x_{3}}}\right) \text { with } x_{3}\right.}=x_{4} x_{3}+\overline{x_{4}} x_{3} \\
& =\overline{\left(x_{1}\left(x_{4}\left(x_{3} x_{2}+\overline{x_{3}} \overline{x_{2}}+x_{2} \overline{x_{3}}+x_{3}\right)+\overline{x_{4}}\left(1+x_{3}\right)\right)\right)} \\
& =\overline{\left(x_{1}\left(x_{4}\left(x_{3} x_{2}+\overline{x_{3}} \overline{x_{2}}+x_{2} \overline{\overline{x_{3}}}+x_{3}\right)+\overline{x_{4}}\right)\right)} \\
& =\overline{\left(x_{1}\left(x_{4}\left(\overline{x_{3}}\left(\overline{x_{2}}+x_{2}\right)+x_{3}\right)+\overline{x_{4}}\right)\right)} \\
& =\overline{\left(x_{1}\left(x_{4}\left(\overline{x_{3}}(1)+x_{3}\right)+\overline{x_{4}}\right)\right)} \\
& =\overline{\left(x_{1}\left(x_{4}(1)+\overline{x_{4}}\right)\right)} \\
& =\overline{\left(x_{1}\left(x_{4}+\overline{x_{4}}\right)\right)} \\
& =\overline{x_{1}}
\end{array}\right.
\end{aligned}
$$

(d) Derive a function in standard binary coded decimal (BCD) that outputs TRUE (1) for every prime number up to 15 as sum of products ( 0 and 1 are not considered prime numbers). In the application, the previous and subsequent processing steps do not evaluate multiples of 5 so that these can return any result. Simplify this function to use as few terms as possible.

## Solution:

Prime numbers:

$$
2(0010)
$$

$$
\begin{equation*}
3 \text { (0011) } \tag{0101}
\end{equation*}
$$

Do-not-cares:
5 (0101)
10 (1010)
15 (1111)
$\Longrightarrow f_{\text {prime }}(\mathrm{x}: 4$ downto 1$)=(\operatorname{NOT} \mathrm{x}(1)$ AND $\mathrm{x}(2)$ AND NOT $\mathrm{x}(3)$ AND NOT $\mathrm{x}(4))$ OR

## Version OBA/2

(x(1) AND $x(2)$ AND NOT $x(3)$ AND NOT $x(4))$ OR (x(1) AND NOT $x(2)$ AND $x(3)$ AND NOT $x(4)$ ) OR ( $x(1)$ AND $x(2)$ AND $x(3)$ AND NOT $x(4))$ OR (x(1) AND $x(2)$ AND NOT $x(3)$ AND $x(4))$ OR (x(1) AND NOT $x(2)$ AND $x(3)$ AND $x(4))$

$$
\begin{align*}
f_{\text {prime }} & =\overline{x_{4}} \overline{x_{3}} x_{2} \overline{x_{1}}+\overline{x_{4}} \overline{x_{3}} x_{2} x_{1}+\overline{x_{4}} x_{3} \overline{x_{2}} x_{1}+\overline{x_{4}} x_{3} x_{2} x_{1}+x_{4} \overline{x_{3}} x_{2} x_{1}+x_{4} x_{3} \overline{x_{2}} x_{1}  \tag{1}\\
f_{\text {dnc } 1} & =x_{4} \overline{x_{3}} x_{2} \overline{x_{1}}  \tag{2}\\
f_{\text {dnc } 2} & =\overline{x_{4}} x_{3} \overline{x_{2}} x_{1}  \tag{3}\\
f_{\text {dnc } 3} & =x_{4} x_{3} x_{2} x_{1} \tag{4}
\end{align*}
$$

Each can be added to $f$ in inverted and non-inverted if helpful for simplification.
Simplification (Karnaugh map or through de Morgan)


Hence, $f=x_{2} \overline{x_{3}}+x_{1} x_{3}$

## Version OBA/2

```
LIBRARY ieee;
USE ieee.std_logic_1164.all;
ENTITY Q1 IS
        PORT ( x : IN STD_LOGIC_VECTOR(4 DOWNTO 1);
            f : OUT STD_LOGIC );
END Q1;
ARCHITECTURE Behavior OF Q1 IS
    SIGNAL mysig1 : STD_LOGIC_VECTOR(4 DOWNTO 1) := "0000";
    SIGNAL mysig2 : STD_LOGIC_VECTOR(4 DOWNTO 1) := "0000";
    CONSTANT one : STD_LOGIC_VECTOR(4 DOWNTO 1) := "1111";
BEGIN
    mysig2 <= (x(1) AND NOT x(2) AND x(3)) OR (x(1) AND x(4)
AND x(2) AND NOT x(3));
        WITH x SELECT
            f <= 'O' WHEN one,
                'O' WHEN mysig1,
                '0' WHEN "1001",
                'O' WHEN mysig2,
                '1' WHEN OTHERS;
        mysig1 <= (x(1) AND NOT x(4)) OR (x(1) AND x(3)) OR (x(1)
AND x(2) AND NOT x(1) AND x(4));
END Behavior;
```

Fig. 1

Assessor's Comments: This was a popular question with 54 attempts out of a total of 75 students. The mean was 59.7 out of 100 with a standard deviation of 21.4. The part (a) was about the main principles and differences of FPGA and CPLD. Most students clearly explained the fundamentals for FPGA, CPLD and their working principles. Part (b) focused on different ways of describing the behaviour of a logic block. Vast majority of the students answered correctly as well. Part (c) was about assessing the basic understanding of a given VHDL code, which included a logic function. The students were mostly very successful reading the VHDL code, however, most did not entirely completed the simplification of the function. A large group of students have minimised the function counting prime numbers in part (d). Overall, the students showed that they grasped all basic concepts regarding the FPGAs, VHDL, deriving logic functions and their minimisation.

## Version OBA/2

2 Consider the circuit shown in Fig. 2.


Fig. 2


Fig. 3
(a) Find the simplest expression for the node $p$, where $p=f(k, l, m)$.

## Solution:

$n=k \oplus l, o=l \oplus m$ and $p=n \oplus o$. Thus, $p=(k \oplus l) \oplus(l \oplus m)$ and by simplifying this, we get $p=k \oplus m$.
(b) To implement the circuit in Fig. 2 with minimum number of lookup table(s), how many inputs do each of the lookup table(s) require? Explain your answer.

## Solution:

Any function with 4 inputs can be implemented with a 4-input lookup table (LUT). For the LUT equivalent to circuit in Fig. 2, the inputs of LUT will be $a, b, c, d$ and the output will be $p$.

## Version OBA/2

(c) If only 3-input lookup tables (3-LUTs) are available, what is the minimum number of 3-LUTs required to implement this circuit? Indicate the input and output of each LUT according to the node names in Fig. 2. Mark unused input(s) of a lookup table with X. Do not attempt to simplify the gate-level circuit shown in Fig. 2.

## Solution:

5 LUTs are required without simplifying the gate-level circuit shown in Fig. 2 as follows.

| LUT\# | INPUT1 | INPUT2 | INPUT3 | OUTPUT |
| :---: | :--- | :--- | :--- | :--- |
| $\mathbf{1}$ | b | c | d | k |
| $\mathbf{2}$ | a | b | x | g |
| $\mathbf{3}$ | g | c | d | l |
| $\mathbf{4}$ | a | c | d | m |
| $\mathbf{5}$ | k | l | m | p |

The answers which simplify the circuit based on results of part (a) would require 3 LUTs (LUT\#2 and LUT\#3 would not be needed and inputs of LUT\#5 would be $k, m$, and $x$ ). This answer also gets full mark provided that input/output relations of the LUTs are correct.
(d) Using the Full-adder (FA) shown in Fig. 3, implement a logic circuit to subtract two 4-bit numbers.

## Solution:



Assessor's Comments: This question was about the LUTs and combinational logic circuits, which are among the fundamental building blocks of FPGA operation. Part (a) asked for the simplest expression for the node p in the LUT circuit given in Fig 2. It was pleasing that a large number of candidates could find it correctly. The main concepts of LUTs were examined in part (b), which were also mostly understood and well answered. In part (c), the 3-LUTs implementation of the circuit was asked, many students carried the results of part (a) in terms of the simplified version of the function, nevertheless they also received full mark provided that they have correctly indicated inputs and outputs of each LUT. Part (d) focused on implementing a 4-bit binary subtractor using the given full-adder in Fig 3. Again, a large group of candidates performed this implementation correctly. Overall, the question was attempted by less number of candidates, i.e., 49 out of 75 , however, pleasingly answered very well with the mean of 67.2 out of 100 .

## Version OBA/2

3 (a) Give two disadvantages of the use of resistive load compared to transistor (active) load in MOS logic circuit implementations.

## Solution:

Resistive loads are more expensive, they use the surface area inefficiently, the parasitic capacitances/inductances in the overall processor increases due to the use of long conductors in close vicinity.
(b) The propagation delay and power dissipation values of different technologies are given in Fig. 4.


Fig. 4
(i) Suggest a technology for an integrated digital electronics device with a longlasting battery, operating around $1-2 \mathrm{GHz}$. Give your reasoning.
Solution:
$1-2 \mathrm{GHz}$ corresponds to $0.5-1 \mathrm{~ns}$ propagation delay. All technologies shown here satisfies the frequency requirement. However, since we want the device to have a long-lasting battery, the technologies with lower power dissipation are chosen. Among CMOS and GaAs, the cheaper technology needs to be selected.
(ii) Compare NMOS and CMOS from the perspectives of power consumption, manufacturing cost and area efficiency.
Solution:
NMOS has a higher power consumption than CMOS due to the static leakage of NMOS. NMOS is cheaper to manufacture because for the PMOS part, we first need to build an n-well into the p-substrate. (Or simply saying that the procedure involves more steps is enough). NMOS technology is more area efficient than CMOS technology, because PMOS part of the CMOS occupies more area due to the low hole mobility.

Version OBA/2
(c) A CMOS 2-input NAND gate is given in Fig. 5. Take $k_{N}^{\prime}=2 k_{P}^{\prime}$. Use

$$
\begin{gathered}
k^{\prime}=\frac{\mu \varepsilon_{o x}}{t_{o x}} \\
k=k^{\prime}\left(\frac{W}{L}\right),
\end{gathered}
$$

where $k$ is the transconductance parameter, $\mu$ is the carrier mobility, $\varepsilon_{o x}$ is the permittivity of the oxide layer, $t_{o x}$ is the oxide thickness and $\left(\frac{W}{L}\right)$ is the width/length ratio of the device. For NMOS transistors, take $\left(\frac{W}{L}\right)=5$.


Fig. 5
(i) Find the required $\left(\frac{W}{L}\right)$ ratio for PMOS transistors of the NAND gate in Fig. 5 such that the rise time, $t_{r}^{N A N D}$, for the output when the input is $\mathrm{AB}=00$, is equal to the fall time, $t_{f}^{N A N D}$.
Solution:
We know that $t_{r}^{N A N D}$ and $t_{f}^{N A N D}$ are proportional to the time constants $\tau_{r}=R_{r} C_{L}$ and $\tau_{f}=R_{f} C_{L}$, respectively. Thus, if $t_{f}^{N A N D}=t_{r}^{N A N D}$ when $\mathrm{AB}=00, R_{f}=R_{r}$ when $\mathrm{AB}=00$. Therefore, $R_{f} \sim \frac{1}{k_{N}}+\frac{1}{k_{N}}=\frac{2}{k_{N}}=\frac{2}{5 k_{N}^{\prime}}$. For $\mathrm{AB}=00$, both PMOS transistors are ON , so that $R_{r}$ is composed of two parallel resistors, each proportional to $1 / k_{P}$, i.e., $R_{r} \sim \frac{1}{2 k_{P}}=\frac{2}{5 k_{N}^{\prime}}$
Substituting $k_{N}^{\prime}=2 k_{P}^{\prime}$, we have $\frac{1}{2 k_{P}}=\frac{1}{5 k_{P}^{\prime}}$, which gives $k_{P}=2.5 k_{P}^{\prime}$, and $(W / L)=2.5$.
(ii) Due to a fabrication error, $\left(\frac{W}{L}\right)$ for M2 is $20 \%$ less than what it should have been when matched properly. How do you think this will impact the overall operation of the gate?
Solution:
$I_{D S}$ is proportional with $k_{P}^{\prime}\left(\frac{W}{L}\right)$. Concurrently, if $W / L$ is smaller than what it should be, $I_{D S}$ is smaller. Since propagation delay is a measure of the charging of the load capacitor, it is inversely proportional with $k_{P}^{\prime}\left(\frac{W}{L}\right)$. As a result, whenever M 2 is ON , the propagation delay is higher than it would be for a non-erroneous M2. Thus, the maximum achievable operational frequency of this gate is lower than it would be for a correctly manufactured gate. To compansate for the fabrication errors, the overall operational frequency of the device should be reduced accordingly.
(iii) Draw a transistor-level implementation of a CMOS 2-input NOR gate.

## Solution:


(iv) Find the required $\left(\frac{W}{L}\right)$ ratio for all transistors of the CMOS 2-input NOR gate, if the fall time of NOR gate, $t_{f}^{N O R}$, when NOR input is $\mathrm{AB}=11$, is equal to the fall time of NAND gate, $t_{f}^{N A N D}$, and the rise time of NOR gate, $t_{r}^{N O R}$, is equal to the rise time of NAND gate, $t_{r}^{N A N D}$, when NAND input is $\mathrm{AB}=00$.
Solution:
In part c (i), we state that $t_{r}^{N A N D} \sim \tau_{r}=R_{r} C_{L}$ and $t_{f}^{N A N D} \sim \tau_{f}=R_{f} C_{L}$.
For $\mathrm{AB}=11$, NOR pull-down happens with two parallel NMOS transistors, so that $t_{f}^{N O R} \sim \frac{1}{2 k_{N}}$. Equating $t_{f}^{N O R}$ with $t_{f}^{N A N D}, \frac{1}{2 k_{N}}=\frac{2}{5 k_{N}^{\prime}} \Longrightarrow k_{N}=1.25 k_{N}^{\prime}$, which results in (W/L) $=1.25$ for NMOS transistors.
For $\mathrm{AB}=00$, NAND pull-up happens with two parallel PMOS transistors, and using our result in part c (i), we have $t_{r}^{N A N D} \sim \frac{1}{5 k_{P}^{\prime}}$. Since NOR pull-up happens through two series transistors $t_{r}^{N O R} \sim \frac{2}{k_{P}}$. Equating these two, $\frac{1}{5 k_{P}^{\prime}}=\frac{2}{k_{P}} \Longrightarrow k_{P}=10 k_{P}^{\prime}$, making $(W / L)=10$ for PMOS transistors.
(v) Both NAND and NOR gates are universal, i.e., any logic function can be expressed solely by NAND or NOR gates. Which one, i.e., NAND or NOR, is a better choice in terms of area efficiency? Give your reasoning.
Solution:
NAND gates are smaller in size due to the higher mobility of electrons, i.e., $k_{N}^{\prime}>k_{P}^{\prime}$, which is assumed to be $k_{N}^{\prime}=2 k_{P}^{\prime}$ in this question. Hence, using NMOS transistors in series is better in terms of area efficiency.

## Version OBA/2

Assessor's Comments: The question was about CMOS fundamentals and how they affect certain performance metrics such as power, cost, frequency of digital circuits. The question was also popular as 55 candidates out of 75 attempted. The average was 59.6 out of 100 with a standard deviation of 18 . Part (a) asked the main disadvantages of the use of resistive load compared to transistor load in MOS logic implementations. While overall majority of the candidates answered correctly, surprisingly a large number of candidates did not consider the surface area inefficiency of the resistive loads. Part (b)(i) was about finding suitable transistor technology for a given operating frequency and power constraints using Fig. 4. Most students successfully identified potential solutions, while few recommended to use ECL and BiCMoS despite more power-efficient alternatives are available. Majority of the students attempted the part (b)(ii) were very successful in comparing NMOS and CMOS from the perspectives of power, manufacturing cost and area efficiency. Part (c)(i),(ii),(iv) were about the relation between device dimensions, transistor currents, and hence, switching performance under different settings. Some of the candidates performed very well in these questions. Part (c)(iii) was about a transistor-level implementation of a CMOS 2-input NOR gate. The students were able to provide this implementation without major difficulty. Part (c)(v) focused on the comparison of NAND and NOR gates in terms of the area efficiency. While some students directly concluded that they would perform the same as they have the same number of transistors, however, most students acknowledged the differences in the circuit structures of these gates yielding NAND as more favorable in terms of the area efficiency.

## Version OBA/2

4 (a) What are the advantages and disadvantages of Schottky barrier diodes for logic gates compared to PN diodes?

## Solution:

SBD usually has a faster switching rate and better noise margins, but it performs very bad for large scale circuitry because it has a leakage current three orders larger than PN.
(b) How is the switching frequency of a bipolar transistor impacted by charge storage? How can this be mitigated?

## Solution:

Fast switching in bipolar transistors is prevented by the removal of minority carrier charges in the base region during saturation. There are three options to this effect: (i) Transistortransistor logic which uses high conductance paths to drive the base, (ii) Emitter-Coupled logic which avoids saturation by controlling base emitter voltage, (iii) Schottky-Transistor logic which uses Schottky transistors with modified base-collector structure, preventing saturation.
(c) Consider the circuit shown in Fig. 6. Two inputs, A and B are provided for an output $V_{\text {out }}$.
(i) Which technology is used for this circuit?

Solution:
BiCMoS
(ii) Which logic function is implemented by this circuit?

Solution:
NAND operation. The active load transistors, given in part (d) give clues on which transistors are responsible for the logical operations. When we remove the amplifier stage as well, which is given in part (e), the circuit is very similar to the NAND gate of CMOS.
(iii) Explain the operation of this circuit for $\mathrm{AB}=[00,10,11]$. Comment on which transistors are ON and which are OFF.
Solution:
For $\mathrm{AB}=00$, M 1 and M 2 are ON . Q 1 base voltage is $V_{C C}-V_{T P}$, so M 7 is ON . M7 discharges Q2 Base. Q1 base is charged by both M1 and M2. Q1 is in forward active.
For $\mathrm{AB}=10$, it is the same as $\mathrm{AB}=00$ except that M 2 is OFF . Q 1 is only charged by M1.
For $\mathrm{AB}=11$, M1 and M 2 are OFF. M3 and M 4 are ON . Q1 is discharged through M3-M4 path. M5 and M6 are ON. Q2 base is charged through M5-M6 path. Q2 is

## Version OBA/2



Fig. 6
in forward active.
(iv) The active loads M3, M4 and M7 are all NMOS. Briefly explain why active loads are NMOS rather than PMOS.

## Version OBA/2

## Solution:

NMOS devices are area efficient due to the higher mobility of electrons, therefore they are almost exclusively used as active loads. While this is enough, students can also mention that PMOS loads require negative potential, complicating the design further.
(v) Briefly explain why the amplifier transistors, Q1 and Q2, are both NPN, rather than PNP.

## Solution:

Similar to the case of NMOS, NPNs are also area efficient. Since the only purpose of Q1 and Q2 is to amplify the output current, there is no reason to use a PNP which would require a higher surface area for the same amplification.

Assessor's Comments: This was the most popular question, with 61 out 75 candidates attempted. It has the average of 69.4 and the standard deviation of 21.7. The question measures the candidates' understanding of the transistor logic, their shortcomings and mitigation approaches. Part (a) and (b) were about the fundamentals, pros/cons of Schottky barrier diodes and bipolar devices, respectively. Students have successfully answered these. Part (c) focused on a given BiCMOS NAND gate and its detailed operation with certain inputs, which were mostly correctly identified by a large number of candidates as well.

## END OF PAPER

Version OBA/2

THIS PAGE IS BLANK

Page 16 of 16

