Title: Debug Diversion
Date: December 13, 2020
Tags: altair programming
========================================

As I was finishing up my assembler, I knew I was going to need a way to save
assembled programs off the Altair so I starting working on a new bootloader-type
program.  Writing, assembling, and testing revealed a new problem of trying to
debug programs that I don't have the hand assembled version of.

Without hand assembling, I don't easily know where instructions will land in
memory or what the value of the instructions will be.  Before, when I had a
problem, I could look at my assembled program and use the front panel to go to
where I though the problem was.  Usually I'd replace an instruction with HLT and
execute the program to that point.  Then remove the HLT and put the original
instruction back and single step from there.

But, now that I have an assembler doing all the work for me, it was suddenly
really easy to insert a CALL to a debug subroutine.  I can put the CALL where I
think the code goes awry, reassemble and execute it just like putting the HLT in
via the front panel.

I wrote the subroutine into the monitor I was working on so it would get
assembled with it and I could CALL it by a label.  The subroutine could also
reuse the existing serial IO routines.

The debug subroutine prints out the Stack Pointer and Program Counter from
before the CALL to debug, and all of the CPU registers.  Upon RET, the program
can continue where it left off.  I basically rediscovered breakpoints.

Using the front panel for debugging doesn't even give you access to see the
Stack Pointer (except during a stack operation) or the CPU registers.

With the source code in the text editor on my laptop (which is also my terminal)
and the symbol table output from the assembler showing the addresses of the
labels, I can easily keep track of where I am in the code and in memory and
what's expected to be in the registers while debugging a program.

I added a couple additional features for a good first pass of the debug
subroutine.  I print the flag register as a string instead of just the octal
byte so I don't have to remember which bit is the Zero flag or Carry flag, etc.
I also added the ability to examine the byte at any memory address.

I could add some extras like changing memory or even registers but I haven't
needed to change registers before and the assembler can change memory for me.
And I still have access to the front panel for anything I was doing before.


# Software Debuggers #

If you've used a software debugger, say in an IDE, you might already see what's
left to build a (more or less) fully featured debugger.  Instead of adding a
CALL and reassembling, I'd need to go back to my original process of replacing
instructions.  By saving the bytes of the assembled program and replacing them
with the CALL to the debug subroutine directly in memory I could dynamically
place breakpoints.  And, of course, remembering to replace the original bytes
and fixing up the program counter when continuing execution.

Setting dynamic breakpoints can be a bit tricky, though.  You don't want to have
to set it by address, because you don't know the addresses of the code anymore,
and you don't want to put the CALL in the middle of data or an immediate value
or over the address part of another CALL or a JMP.

The debugger would need to be aware of the source code so it can set breakpoints
on a line of executable code.  Couple the source with the symbol table (you've
heard of needing debug symbols when using a C debugger?) and you have a map of
addresses of where certain things are and could do some quick counting to get to
the specific instruction the breakpoint needs to go on.

Dynamically adding breakpoints opens up the possibility for single stepping,
stepping into a subroutine or stepping over a subroutine or other such features.
It can be done by dynamically adding or moving the breakpoint CALL.

When it has access to the symbol table, you can also view variables by name.  In
assembly, that would be an EQU, a SET or a labeled address used for data
storage.  An extra feature would be to be able to tell the debugger if the
variable is a byte, a word, or even a string so it can display the full multi-
byte value.

Of course, there is a lot of detail to get a reliable debugger that can
understand the source and the symbol table and the assembler has to be written
with that in mind.  I haven't done all of that work.  I have my laptop with the
source and I can save the symbol table output there as well.

I also make some assumptions here about the program being debugged like that
it's using serial IO, has set up the Stack Pointer and has enough stack room for
the debug subroutine to use, and is not using interrupts.  I don't know when I'd
not have serial IO or a stack but at least the interrupts would need to be
worked around.

What I've done for now is leave the debug subroutine in my new monitor located
at an easy to remember address so I can add a CALL to that address from any
program I am writing and reassemble it.


# Debug Subroutine #

Getting the Stack Pointer, Program Counter, and Registers was a fun project.  It
took me a couple rewrites to get it to do what I wanted and be reasonably
compact.  I'm sure smarter people than I could shave some more bytes off (which
goes for all my assembly, I'm not exactly brilliant at this, I'm just getting
by).

When executing a CALL, what happens is that the Program Counter gets pushed to
the stack and gets set to the address CALLed to where execution continues.  So
keep track of that.  If we want to print the Program Counter from before the
CALL, it just got pushed to the stack for us at the location the Stack Pointer
was at before the CALL.  That's 2 pieces of the information we want to output.

To preserve the CPU registers, we need to PUSH each pair to the stack before
doing anything else.  The value of the Stack Pointer continues to decrement by 2
for each PUSH.


DEBUG	PUSH	PSW	; save registers for resetting
	PUSH	BC
	PUSH	DE
	PUSH	HL


Now everything is safe and we just need to pull it back out to print it, but
also to save it so we can restore the registers back into the CPU, reset the
Stack Pointer, and then RET will bring us back to where the Program Counter was
pointing.

We need to move up the stack without moving the Stack Pointer because we use
CALLs while in the DEBUG subroutine which would overwrite whatever was on the
stack which we are trying to preserve.  We need our own Stack Pointer which we
can create by setting HL to 000000Q and then adding the current value of the
Stack Pointer to it.  This is the only way I know to get the value of the Stack
Pointer.  In C, and probably other languages, the stack is used to pass
variables and is called the stack frame so I'll call my copy of the Stack
Pointer the frame pointer.  I think a CALL or a RST (which is just a CALL to a
fixed address) and reading off the stack is the only way to get the Program
Counter.

So now we have a frame pointer and we know it was at Program Counter + 2 bytes
per Register down the stack.  So increment the frame pointer back up by that
amount.  The value is now what the Stack Pointer was before we CALLed debug.
Print the value of our frame pointer.


	LXI	HL,000Q		; get SP for frame pointer
	DAD	SP
	LXI	DE,012Q		; add 10 to get to top of stack
	DAD	DE
	XCHG			; save frame pointer in DE
;SP
	LXI	HL,SPSTR	; load string pointer
	CALL	PRNTSTR		; print string
	MOV	A,D		; high byte
	CALL	PRNTOCT		; print octal byte as ascii to terminal
	MVI	B,' '
	CALL	PRNTCHR
	MOV	A,E		; low byte
	CALL	PRNTOCT


Now we move back down the stack.  The next 2 bytes is the Program Counter that
the CALL to debug saved for us and we can print that.  Then each of the next 2
bytes are the register pairs.  Print as we go.


;PC
	LXI	HL,PCSTR	; load string pointer
	CALL	PRNTSTR		; print string
	LXI	BC,177775Q	; -3 
	DCX	DE
	LDAX	DE		; get high byte
	MOV	H,A
	DCX	DE
	LDAX	DE		; get low byte
	MOV	L,A
	DAD	BC		; subtract 3 because we inserted CALL DEBUG
	MOV	A,H
	CALL	PRNTOCT		; print an octal byte as ascii to terminal
	MVI	B,' '
	CALL	PRNTCHR
	MOV	A,L		; L is safe from PRNTOCT
	CALL	PRNTOCT
;PSW
	LXI	HL,AREGSTR
	CALL	PRNTREG
	LXI	HL,FREGSTR
	CALL	PRNTSTR
	DCX	DE
	LDAX	DE		; get F
	CALL	PRNTFLG		; special print the flags
;BC
	LXI	HL,BREGSTR
	CALL	PRNTREG
	LXI	HL,CREGSTR
	CALL	PRNTREG
;DE
	LXI	HL,DREGSTR
	CALL	PRNTREG
	LXI	HL,EREGSTR
	CALL	PRNTREG
;HL
	LXI	HL,HREGSTR
	CALL	PRNTREG
	LXI	HL,LREGSTR
	CALL	PRNTREG


Meanwhile. the real Stack Pointer remained at the bottom of the stack below all
the registers and we can CALL and RET all day long without overwriting them.

When we're done debugging, we can POP the registers back into place which will
use the real Stack Pointer and then RET back to where we left off in the program
and it has no idea that we were gone.


;continue
DASK	LXI	HL,CONTSTR
	CALL	PRNTSTR
DLOOP2	CALL	GETCHR
	MOV	A,B		; copy to A
	CPI	003Q		; ^C
	JZ	DCONT
	CPI	033Q		; esc
	JZ	RESET
	CPI	'R'
	JZ	DRMEM
	CPI	'r'
	JZ	DRMEM
	JMP	DLOOP2
DCONT	POP	HL		; restore registers
	POP	DE
	POP	BC
	POP	PSW
	RET			; go back


And here are the helper subroutines, most notably how I print the flag register.


;print reg
; HL = prefix string pointer
PRNTREG
	CALL	PRNTSTR
	DCX	DE
	LDAX	DE		; get reg value
	CALL	PRNTOCT
	RET

;read mem address
; TODO on error, drop back to DEBUG, not monitor
DRMEM	LXI	HL,ADDRSTR
	CALL	PRNTSTR
	CALL	READADDR
	MVI	B,':'
	CALL	PRNTCHR
	MVI	B,' '
	CALL	PRNTCHR
	MOV	A,M		; read byte
	CALL	PRNTOCT
	JMP	DASK

;print flags
; A: register
; SZ0Ac0P1C
; Sign Zero (0) Aux carry (0) Parity (1) Carry
PRNTFLG	LXI	HL,FLGSTR	; flag string
	MOV	C,A		; save flag reg
PFLOOP	MOV	A,M		; get char
	CPI	000Q		; check for \0
	RZ
	MOV	B,A		; save char
	MOV	A,C		; restore flag reg
	RLC			; flag into carry
	MOV	C,A		; save flag reg
	JC	PFPRNT
	MOV	A,B		; restore char
	ADI	040Q		; lower case
	MOV	B,A
PFPRNT	CALL	PRNTCHR
	INX	HL
	JMP	PFLOOP

FLGSTR	DB	"SZ"
	DB	020Q		; needed because the assembler doesn't let us
	DB	"A"		; input lowercase. :(
	DB	020Q		; TODO, use lowercase and SUI in subroutine so
	DB	"P1C\0"</pre>   ;   we can use all printable chars here


I left out the strings other than for printing the flags but you can see what
they are from the output when we CALL the debug subroutine.


SP: 370 002
PC: 003 073
A: 000 F: sZ0A0P1c
B: 060 C: 001
D: 000 E: 370
H: 374 L: 000
^C: CONT, R: READ MEM, ESC: QUIT TO MONITOR
ADDR? 200 000: 061
^C: CONT, R: READ MEM, ESC: QUIT TO MONITOR


There are some TODOs in the code and there are things to fix in general.  It
assumes a number of things and if you mistype when entering an address to view,
it uses the monitor's error subroutine (as it's borrowing the monitor's GETADDR
subroutine) and will drop you to the monitor resetting the stack and basically
blowing up your debugging.

I've already burned it into a PROM chip so, eh, I'll fix it eventually.