Part 7 Implementing the Owl-2820 instruction set

Over the last couple of posts we’ve added memory to the Owl-2820 VM and we’ve seen how we can implement system calls.

However, we haven’t implemented all of the instructions in the Owl-2820 instruction set, as we’ve limited ourselves to just enough instructions to be able to run programs that implement Fibonacci. The Owl-2820 VM needs to be able to do more than run Fibonacci, so in this post we’re going to implement the remaining instructions in the Owl-2820 instruction set.

Recap

We implemented these instructions in part 1.

Instruction	Description
`add r0, r1, r2`	adds the values in registers `r1` and `r2` and stores the result into `r0`
`addi r0, r1, imm12`	adds the value in register `r1` to an immediate value and stores the result into `r0`
`beq r0, r1, offs12`	compares the values in registers `r0` and `r1` and branches to `offs12` if they’re equal
`bltu r0, r1, offs12`	compares the values in registers `r0` and `r1` as unsigned integers and branches to `offs12` if `r0` is less than `r1`
`j offs20`	jumps to `offs20`
`li r0, imm12`	loads an immediate value into register `r0`
`lui r0, uimm20`	loads an immediate value into the upper bits of register `r0`
`mv r0, r1`	copies the value in `r1` into `r0`

We implemented call, ret and ecall in part 4.

Instruction	Description
`call offs20`	calls a subroutine at `offs20`, saving the return address in the return address register, `ra`
`ret`	returns from a subroutine by jumping to the address in the return address register, `ra`
`ecall`	invokes a system call

We added memory access instructions in part 5.

Instruction	Description
`lb r0, imm12(r1)`	Loads a byte from address `imm12(r1)` and sign-extends it into `r0`
`lbu r0, imm12(r1)`	Loads a byte from address `imm12(r1)` and zero-extends it into `r0`
`lh r0, imm12(r1)`	Loads a little-endian halfword from address `imm12(r1)` and sign-extends it into `r0`
`lhu r0, imm12(r1)`	Loads a little-endian halfword from address `imm12(r1)` and zero-extends it into `r0`
`lw r0, imm12(r1)`	Loads a little-endian word from address `imm12(r1)` into `r0`
`sb r0, imm12(r1)`	Stores the lowest byte of `r0` into address `imm12(r1)`
`sh r0, imm12(r1)`	Stores the lower halfword of `r0` into address `imm12(r1)` in little-endian order
`sw r0, imm12(r1)`	Stores the word in `r0` into address `imm12(r1)` in little-endian order
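
To make the difference between sign-extending and zero-extending loads concrete, here is a minimal sketch of the load instructions. It is not the code from part 5: the `Memory` alias and the helper names are hypothetical, memory is assumed to be a flat byte vector, and `addr` is assumed to be the effective address already computed from `imm12(r1)`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumption: memory is modelled as a flat vector of bytes.
using Memory = std::vector<std::uint8_t>;

// lbu: read one byte and zero-extend it to 32 bits.
inline std::uint32_t load_byte_unsigned(const Memory& memory, std::size_t addr)
{
    assert(addr < memory.size());
    return memory[addr];
}

// lb: read one byte and sign-extend it, so bit 7 is copied into bits 8 to 31.
inline std::uint32_t load_byte_signed(const Memory& memory, std::size_t addr)
{
    assert(addr < memory.size());
    const auto byte = static_cast<std::int8_t>(memory[addr]);
    return static_cast<std::uint32_t>(static_cast<std::int32_t>(byte));
}

// lhu: read two bytes, lowest address first (little-endian), and zero-extend.
inline std::uint32_t load_half_unsigned(const Memory& memory, std::size_t addr)
{
    assert(addr + 1 < memory.size());
    return static_cast<std::uint32_t>(memory[addr])
         | (static_cast<std::uint32_t>(memory[addr + 1]) << 8);
}

// lh: as lhu, but sign-extend from bit 15.
inline std::uint32_t load_half_signed(const Memory& memory, std::size_t addr)
{
    const auto half = static_cast<std::int16_t>(load_half_unsigned(memory, addr));
    return static_cast<std::uint32_t>(static_cast<std::int32_t>(half));
}

// lw: read four bytes in little-endian order. No extension is needed.
inline std::uint32_t load_word(const Memory& memory, std::size_t addr)
{
    assert(addr + 3 < memory.size());
    return load_half_unsigned(memory, addr)
         | (load_half_unsigned(memory, addr + 2) << 16);
}
```

With memory holding the bytes `0x80 0xff 0x34 0x12`, loading the byte at address 0 gives `0x00000080` with `load_byte_unsigned` but `0xffffff80` with `load_byte_signed`, and `load_word` at address 0 gives `0x1234ff80`.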

The rest of the Owl

We learned in part 1 that the Owl-2820 CPU is based on RISC-V, specifically the RV32I base integer instruction set, which has forty unique instructions. We’ve implemented nineteen, fewer than half that number. In fact, some of what we’ve implemented doesn’t actually have an RV32I equivalent. For example, you may be surprised to learn that `j`, `li`, `mv`, `call` and `ret` are not really RISC-V instructions. In RISC-V they are pseudoinstructions that the assembler expands into base instructions, whereas I’ve added them to the Owl-2820 instruction set as real instructions to enhance decoding performance.

Unlike in previous posts, I am not going to go into the implementation of each and every instruction, otherwise this post would become very long. In most cases, we’ve already seen how at least one instruction in each group is implemented, and the remaining instructions in the group tend to be fairly similar to each other.

For example, instructions in the Branch instructions group differ only in the branch condition, such as less-than or greater-than. Similarly, arithmetic instructions in the Register-register instructions group differ only in the operator, such as plus or minus.
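
As a rough illustration of how little varies within a group, the sketch below writes one shared helper per group and passes in only the condition or the operator. This is not necessarily how the Owl-2820 VM is structured: the `Cpu` struct, its field names, and the helper functions are all hypothetical, and the branch target is assumed to be the current `pc` plus a signed byte offset.

```cpp
#include <cstdint>
#include <functional>

// Hypothetical CPU state for illustration only.
struct Cpu
{
    std::uint32_t pc = 0;      // address of the current instruction
    std::uint32_t next_pc = 0; // address of the next instruction to run
    std::uint32_t x[32] = {};  // integer registers
};

// Every branch does the same thing; only the condition differs.
template<typename Condition>
void branch(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::int32_t offs12, Condition condition)
{
    if (condition(cpu.x[r0], cpu.x[r1]))
    {
        // Assumption: the offset is relative to pc. Unsigned arithmetic wraps.
        cpu.next_pc = cpu.pc + static_cast<std::uint32_t>(offs12);
    }
}

// Every register-register instruction does the same thing; only the operator differs.
template<typename Operator>
void alu(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::uint32_t r2, Operator op)
{
    cpu.x[r0] = op(cpu.x[r1], cpu.x[r2]);
}

void beq(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::int32_t offs12)
{
    branch(cpu, r0, r1, offs12, std::equal_to<std::uint32_t>{});
}

// Unsigned less-than: compare the raw register values.
void bltu(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::int32_t offs12)
{
    branch(cpu, r0, r1, offs12, std::less<std::uint32_t>{});
}

// Signed less-than: reinterpret both registers as signed before comparing.
void blt(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::int32_t offs12)
{
    branch(cpu, r0, r1, offs12, [](std::uint32_t a, std::uint32_t b) {
        return static_cast<std::int32_t>(a) < static_cast<std::int32_t>(b);
    });
}

void add(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::uint32_t r2)
{
    alu(cpu, r0, r1, r2, std::plus<std::uint32_t>{});
}

void sub(Cpu& cpu, std::uint32_t r0, std::uint32_t r1, std::uint32_t r2)
{
    alu(cpu, r0, r1, r2, std::minus<std::uint32_t>{});
}
```

The signed and unsigned branches show why the distinction matters: with `5` in one register and `0xffffffff` in another, `bltu` treats the second value as a very large number and takes the branch, while `blt` treats it as -1 and does not.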

The full Owl-2820 instruction set

Here’s an overview of the full Owl-2820 instruction set. It is essentially the RV32I instruction set, translated into Owl-2820 terminology, plus some additional Owl-only instructions.