Fixed-Point Representation

Fixed-point representation extends the 1s complement and 2s complement representations of negative numbers to fractions. We simply specify how many bits are used for the whole-number part and how many for the fraction part.

The name fixed-point comes from the fact that the position of the dot (i.e., the decimal point) is fixed within the number system.

Representation

Fixed-Point

Note that this dot is a purely imaginary construct: in the actual bit representation, none of the bits corresponds to the dot. For instance, with 8-bit numbers we can allocate 6 bits to the whole-number part and 2 bits to the fraction part. In this case, the assumed binary point¹ is as shown in the image on the right. The yellow boxes are the whole-number part and the green boxes are the fraction part.

Fixed-point representation works with any negative-number representation, although the Excess-N representation is rarely used.

Examples

Using 1s complement, we can represent the following numbers with a 6-bit whole number and a 2-bit fraction:

(26.75)₁₀ = (011010.11)_1s
(-1.25)₁₀ = (111110.10)_1s
- From (1.25)₁₀ = (000001.01)₂
- Invert all the bits: (111110.10)_1s

Using 2s complement, we can represent the following numbers with a 6-bit whole number and a 2-bit fraction:

(26.75)₁₀ = (011010.11)_2s
(-1.25)₁₀ = (111110.11)_2s
- From (1.25)₁₀ = (000001.01)₂
- Invert all the bits: (111110.10)_1s
- Add 1 at the least significant bit: (111110.11)_2s
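
The conversions above can be sketched in Python. This is a minimal illustration rather than a definitive implementation: the function name to_fixed and its parameters are hypothetical, and it assumes that excess fraction bits are simply truncated.

```python
def to_fixed(value, int_bits=6, frac_bits=2, scheme="2s"):
    """Encode value as a fixed-point bit string such as '011010.11'.

    scheme is "1s" or "2s" (hypothetical parameter names for this sketch).
    """
    total_bits = int_bits + frac_bits
    mask = 2**total_bits - 1
    # Scale the magnitude so the fraction bits become part of an integer.
    # int() truncates any bits beyond frac_bits.
    magnitude = int(abs(value) * 2**frac_bits)
    # Simplification: the most negative 2s complement value is rejected too.
    if magnitude >= 2 ** (total_bits - 1):
        raise ValueError("value does not fit in the given number of bits")
    if value < 0:
        pattern = magnitude ^ mask            # invert all the bits (1s complement)
        if scheme == "2s":
            pattern = (pattern + 1) & mask    # add 1 at the least significant bit
    else:
        pattern = magnitude
    bits = format(pattern, f"0{total_bits}b")
    # The binary point is not stored; it is only added back for display.
    return bits[:int_bits] + "." + bits[int_bits:]


print(to_fixed(26.75, scheme="1s"))   # 011010.11
print(to_fixed(-1.25, scheme="1s"))   # 111110.10
print(to_fixed(-1.25, scheme="2s"))   # 111110.11
```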

Resolution

As the examples above show, there is a smallest step, or resolution, between the numbers we can represent: every representable number is a multiple of this resolution. The resolution depends purely on the number of bits used for the fraction part; with f fraction bits, it is 2^(-f).

Consider the fixed-point representation above, with a 6-bit whole number and a 2-bit fraction. All of its numbers are multiples of 0.25.

26.75 = 107 × 0.25
-1.25 = -5 × 0.25

In fact, we can say something more about this resolution. First, look at the multiples in binary.

1s Complement

26.75 = 107 × 0.25
- (107)₁₀ = (01101011)_1s
- Add back binary point: (011010.11)_1s
-1.25 = -5 × 0.25
- (-5)₁₀ = (11111010)_1s
- Add back binary point: (111110.10)_1s

2s Complement

26.75 = 107 × 0.25
- (107)₁₀ = (01101011)_2s
- Add back binary point: (011010.11)_2s
-1.25 = -5 × 0.25
- (-5)₁₀ = (11111011)_2s
- Add back binary point: (111110.11)_2s

Notice how, in both cases, the binary representation of the multiple corresponds to the fixed-point representation with the binary point removed.
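
This observation suggests a way to decode a fixed-point number: drop the binary point, read the bits as a signed integer, and multiply by the resolution. The sketch below illustrates the idea; the function name from_fixed and the scheme parameter are hypothetical.

```python
def from_fixed(bits, scheme="2s"):
    """Decode a string such as '111110.11' into (multiple, value)."""
    whole, frac = bits.split(".")
    raw = whole + frac                  # drop the imaginary binary point
    n = len(raw)
    multiple = int(raw, 2)              # read the bits as an unsigned integer
    if raw[0] == "1":                   # sign bit set: the number is negative
        if scheme == "2s":
            multiple -= 2**n            # 2s complement: subtract 2^n
        else:
            multiple -= 2**n - 1        # 1s complement: subtract 2^n - 1
    resolution = 2.0 ** -len(frac)      # 0.25 for a 2-bit fraction
    return multiple, multiple * resolution


print(from_fixed("011010.11", "1s"))   # (107, 26.75)
print(from_fixed("111110.10", "1s"))   # (-5, -1.25)
print(from_fixed("111110.11", "2s"))   # (-5, -1.25)
```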

Approximation

The resolution brings us to the reason why fixed-point representation is merely an approximation of real numbers. Consider a number that is not a multiple of the resolution, such as (0.125)₁₀. Its binary representation is (0.001)₂, which needs 3 fraction bits to be represented exactly. As such, if we use a 6-bit whole number and a 2-bit fraction, we must represent it as one of the following:

Round Up: (000000.01)₂ = (0.25)₁₀
Round Down: (000000.00)₂ = (0.0)₁₀

There are many ways to round a number, and we will not discuss them here. Most often, we simply truncate the binary representation to the number of bits available.
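
As a rough illustration, the options for (0.125)₁₀ can be computed by dividing by the resolution and rounding the resulting multiple. The sketch below assumes a non-negative value, for which truncation coincides with rounding down; the function name approximations is hypothetical.

```python
import math


def approximations(value, frac_bits=2):
    """Nearest representable neighbours of a non-negative value."""
    resolution = 2.0 ** -frac_bits      # 0.25 for a 2-bit fraction
    scaled = value / resolution         # how many resolutions fit in value
    return {
        "round_down": math.floor(scaled) * resolution,
        "round_up": math.ceil(scaled) * resolution,
        "truncate": math.trunc(scaled) * resolution,
    }


print(approximations(0.125))
# {'round_down': 0.0, 'round_up': 0.25, 'truncate': 0.0}
```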

Exercises

Decimal to Fixed-Point

Question

Convert -36.03125₁₀ to a 16-bit fixed-point number in 1s complement with a 10-bit integer part and a 6-bit fraction part. The bit arrangement is shown below:

10-bits + 6-bits

Write your answer in binary. Truncate any excess bits (if any).

Answer

1111011011.111101_1s

Steps

Convert 36.03125₁₀ to binary:
- 100100.00001₂
Pad with zeros until we have a 10-bit integer part and a 6-bit fraction part:
- 0000100100.000010₂
Convert to negative using 1s complement:
- Invert: 1111011011.111101_1s
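
The steps above can be traced with a short Python sketch that works on the digits directly. The helper name negative_ones_complement is hypothetical, and the sketch assumes the magnitude fits in the integer bits.

```python
def negative_ones_complement(magnitude, int_bits, frac_bits):
    """Follow the three steps for the negative of a non-negative magnitude."""
    # Step 1: convert to binary, truncating the fraction to frac_bits digits.
    whole = int(magnitude)
    frac = magnitude - whole
    frac_digits = ""
    for _ in range(frac_bits):
        frac *= 2                        # each doubling yields one fraction bit
        frac_digits += str(int(frac))
        frac -= int(frac)
    # Step 2: pad the integer part with leading zeros.
    # (The loop above already yields exactly frac_bits fraction digits.)
    padded = format(whole, "b").zfill(int_bits) + "." + frac_digits
    # Step 3: invert every bit, leaving the binary point in place.
    flip = {"0": "1", "1": "0"}
    return "".join(flip.get(c, c) for c in padded)


print(negative_ones_complement(36.03125, 10, 6))   # 1111011011.111101
```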

¹ Strictly speaking, the decimal point belongs to base 10. Its base-2 counterpart is called the binary point.