Make BASIC Fast Again, part 4

Welcome to part 4 of a multi-part series on simple things you can do to speed up Microsoft BASIC on the Radio Shack Color Computer. These tips may be applicable to other systems with Microsoft (or other) BASICs.

See also: Introduction, part 1 , part 2 and part 3.

Part 4 – GOTO and GOSUB

Unless you are writing a program that starts at the top and runs to the end, you will likely need to make use of the GOTO command. Nothing defines a BASIC program more than GOTO. It was one of the first commands I learned so I could do something like this…

10 PRINT "HELLO WORLD!"
20 GOTO 10

As we learn more BASIC commands, we learn about GOSUB. GOSUB will go to a line number, run some code, then the RETURN command will return to directly after the GOSUB. It is very useful for reusing code.

10 PRINT "ALLEN";
20 GOSUB 100
30 PRINT "ICE CREAM";
40 GOSUB 100
50 PRINT "MAKING BASIC FASTER";
60 GOSUB 100
70 END
100 PRINT " IS GREAT!"
110 RETURN

In the above silly example, we get to re-use the “function” at line 100, calling it three times. When line 20 does a GOSUB 100, execution skips down to line 100 and runs to the RETURN statement, then it pops back up to directly after the GOSUB 100 command and continues from there.

With that in mind, let’s get two quizzes out of the way…

GOTO: Which is faster?

Version 1

10 GOTO 200
...lots of code, numbered by 10s...
200 REM SOME STUFF
210 END

Version 2

10 END
...lots of code, numbered by 10s...
200 GOTO 10

Yeah, it’s confusing. Pretend that in version 1, you started with something like “RUN 10”, and with version 2, you started with something like “RUN 200”.

The question here is … is it faster to GOTO forwards or backwards?

For word, or for worse

When you use GOTO, BASIC first has to answer one question:

Is the line number you are trying to reach AFTER where you currently are, or BEFORE?

If the line you are trying to GOTO is after your current line, BASIC will scan forward trying to find that line. This is fairly fast, since BASIC doesn’t have to parse each line as it scans. At the start of each line in memory is the line number and how long the line is. BASIC simply looks at that number, and if it’s not the line you are looking for (move along), it jumps past the length of that line and looks at the next line. If BASIC finds a line number higher than the one it is trying to find, it knows it must not exist and you get the fun “unlisted line number” error:

?UL ERROR

If the line number you are trying to GOTO is before your current line, BASIC will jump to the first line of the program and start scanning from there. Again, BASIC skips any line number that doesn’t match and as soon as it finds a number higher than what it is looking for, you get the “?UL ERROR” message.

With that said, you really can’t answer the quiz without knowing more about the program. Consider this:

0 GOTO 11
1 REM THIS IS SOME LINE
2 REM THIS IS SOME LINE
3 REM THIS IS SOME LINE
4 REM THIS IS SOME LINE
5 REM THIS IS SOME LINE
6 REM THIS IS SOME LINE
7 REM THIS IS SOME LINE
8 REM THIS IS SOME LINE
9 REM THIS IS SOME LINE
10 REM THIS IS SOME LINE
11 PRINT "THIS IS SOME LINE!"

If you run that, in line 0, BASIC has to start scanning forward looking for line 11. Since all BASIC is doing is looking at line numbers, then skipping the length of the line if it’s not a match, the length of the lines probably won’t make too much of a difference. (You may want to test this and prove me wrong.) In this example, BASIC has to scan through 10 line number entries to find line 11.

If it had looked like this:

1 REM THIS IS SOME LINE
2 REM THIS IS SOME LINE
3 REM THIS IS SOME LINE
4 REM THIS IS SOME LINE
5 REM THIS IS SOME LINE
6 GOTO 12
7 REM THIS IS SOME LINE
8 REM THIS IS SOME LINE
9 REM THIS IS SOME LINE
10 REM THIS IS SOME LINE
11 REM THIS IS SOME LINE
12 PRINT "THIS IS SOME LINE!"

…the BASIC would only have to scan through 5 line number entries to find line 12. That GOTO should be twice as fast as the previous one.

When the line number is earlier in the program:

0 PRINT "THIS IS SOME LINE!":END
1 REM THIS IS SOME LINE
2 REM THIS IS SOME LINE
3 REM THIS IS SOME LINE
4 REM THIS IS SOME LINE
5 REM THIS IS SOME LINE
6 REM THIS IS SOME LINE
7 REM THIS IS SOME LINE
8 REM THIS IS SOME LINE
9 REM THIS IS SOME LINE
10 REM THIS IS SOME LINE
11 GOTO 5

…BASIC rewinds to the first line of the program and starts there. In this example, if you started it with RUN 11, it would find line 0 immediately. But, if the line number was further down…

1 REM THIS IS SOME LINE
2 REM THIS IS SOME LINE
3 REM THIS IS SOME LINE
4 REM THIS IS SOME LINE
5 REM THIS IS SOME LINE
6 REM THIS IS SOME LINE
7 REM THIS IS SOME LINE
8 REM THIS IS SOME LINE
9 REM THIS IS SOME LINE
10 REM THIS IS SOME LINE
11 PRINT "THIS IS SOME LINE!"
12 GOTO 11

…BASIC rewinds to the first line, then scans forward through every line trying to find line 11. Thus, doing a GOTO from the first line of a program to the last line should be about as slow as doing a GOTO from the last line of the program to the line directly before it. It is for this reason that many programs put the main code at the top, and stuff subroutines way at the bottom. This makes the subroutines slower, but makes the core code faster as it is either jumping back to the front or moving forward, never touching all the high line numbers used for subroutines.

But that’s not good if you want your subroutine to be fast.

GO SUB YOURSELF!

Another option is GOSUB, which will jump to routine and then return back to where it was called from. Just like GOTO, a GOSUB has to scan to find the target line number, and the same rules apply (scanning forward or backwards).

But, unlike GOTO, GOSUB will remember exactly where it came from, so when RETURN is executed, it can quickly pop back there without any line number scanning. This is can be a huge time savings!

For example:

0 GOSUB 11:END
1 REM THIS IS SOME LINE
2 REM THIS IS SOME LINE
3 REM THIS IS SOME LINE
4 REM THIS IS SOME LINE
5 REM THIS IS SOME LINE
6 REM THIS IS SOME LINE
7 REM THIS IS SOME LINE
8 REM THIS IS SOME LINE
9 REM THIS IS SOME LINE
10 REM THIS IS SOME LINE
11 PRINT "THIS IS A SUBROUTINE!"
12 RETURN

At the GOSUB in line 0, the program would have to scan forward through 10 lines to find the subroutine in line 11. When the RETURN is encountered in line 12, the program immediately jumps back to directly after the GOSUB 11 without any line scanning at all! This would be significantly faster than…

0 GOTO 0
1 END
2 REM THIS IS SOME LINE
3 REM THIS IS SOME LINE
4 REM THIS IS SOME LINE
5 REM THIS IS SOME LINE
6 REM THIS IS SOME LINE
7 REM THIS IS SOME LINE
8 REM THIS IS SOME LINE
9 REM THIS IS SOME LINE
10 REM THIS IS SOME LINE
11 PRINT "THIS IS A SUBROUTINE!"
12 GOTO 1

But how do we benchmark that?  One way might be to call it as a subroutine. This would add the overhead of the subroutine, but at least that would be consistent between tests. Let’s try a GOTO/GOTO version like this:

0 REM BENCH.BAS
5 DIM TE,TM,B,A,TT
10 FORA=0TO3:TIMER=0:TM=TIMER
20 FORB=0TO1000
30 GOSUB 100
70 NEXT
80 TE=TIMER-TM:PRINTA,TE
90 TT=TT+TE:NEXT:PRINTTT/A:END
100 GOTO 205
105 RETURN
110 REM THIS IS SOME LINE
120 REM THIS IS SOME LINE
130 REM THIS IS SOME LINE
140 REM THIS IS SOME LINE
150 REM THIS IS SOME LINE
160 REM THIS IS SOME LINE
170 REM THIS IS SOME LINE
180 REM THIS IS SOME LINE
190 REM THIS IS SOME LINE
200 REM THIS IS SOME LINE
205 REM THIS IS A SUBROUTINE
210 GOTO 105

This ugly code will sit in the benchmark loop and call GOSUB 100 each time. At 100, the test code will GOTO down to 205, then GOTO back to 105, where it returns to directly after the GOSUB 100.

This test shows 308.

Now let’s do the bottom test code using GOSUB/RETURN:

0 REM BENCH.BAS
5 DIM TE,TM,B,A,TT
10 FORA=0TO3:TIMER=0:TM=TIMER
20 FORB=0TO1000
30 GOSUB 100
70 NEXT
80 TE=TIMER-TM:PRINTA,TE
90 TT=TT+TE:NEXT:PRINTTT/A:END
100 GOSUB 205
105 RETURN
110 REM THIS IS SOME LINE
120 REM THIS IS SOME LINE
130 REM THIS IS SOME LINE
140 REM THIS IS SOME LINE
150 REM THIS IS SOME LINE
160 REM THIS IS SOME LINE
170 REM THIS IS SOME LINE
180 REM THIS IS SOME LINE
190 REM THIS IS SOME LINE
200 REM THIS IS SOME LINE
205 REM THIS IS A SUBROUTINE
210 RETURN

This version is almost identical, except we do a GOSUB 205, and then RETURN in line 210, versus a GOTO 205 and a GOTO 105 in the previous example.

Running this version gives us … 351. Well, that was a bit of a let down… I thought it would be faster!

In this case, it seems the first example that does “GOTO 105” knows it has to pop to the top of the program, then then only skips over 9 lines to get to where it wants to go. Meanwhile, the GOSUB version has the same amount of forward scanning, but has to do a bit more work to save off where it is (so it can return) and it looks like that overhead might take more than just scanning 9 lines.

As you have seen so far, many things in BASIC are somewhat predictable. There are specific places where you get a break-even at doing something a different way. Had this example been 1000 lines instead of 10, GOSUB would surely be faster.

Or would it? Am I telling the truth? Does anyone want to test this? :) I’d love to see what you come up with.  (Hint: You would want to make the returning GOTO do more work, which would give GOSUB/RETURN an advantage.)

Even though this example was a fail, it does reinforce how useful benchmarking is at understanding what the code is doing. We could figure out how many lines of scanning does it take before the GOSUB and its overhead is faster.

This is similar to “profiling” in higher level languages, where the program is run and monitored, and at the end, you get a summary of where the program spent most of it’s time. Those become the places you would optimize.

But I digress.

Know your place

Because of this, any routines you want to be executed as fast as possible should be either at the start of the program, or as close as possible after the routine that is using them. This gets real tricky, but here is an example.

Suppose you want to build your own custom input as people are typing. I made one like this:

1000 REM IN : IM=INPUT MAX
1001 REM RET: IN$=STR, IN=SIZE
1002 REM USE: CH$
1010 IN$="":IN=0
1015 IF IN<IM THEN PRINT CHR$(128);
1020 CH$=INKEY$:IF CH$="" THEN 1020
1025 IF IN<IM THEN PRINT CHR$(8);
1030 CH=ASC(CH$)
1040 IF CH=8 THEN IF IN=0 THEN 1090 ELSE IN=IN-1:IN$=LEFT$(IN$,IN):PRINT CH$;:GOTO 1015
1050 IF CH=13 THEN RETURN
1060 IF CH>=32 AND CH<=127 AND IN<IM THEN IN$=IN$+CH$:IN=IN+1:PRINT CH$;:GOTO 1015
1090 SOUND 200,1:GOTO 1015

This is a BASIC input routine I came up with in another series on my website. It echos characters as they are typed, but stops at a max length (beeping to let you know you are at the end of the input), or when you press ENTER. It handles backspace, and only allows printable characters to be typed. It even prints a black square as the cursor.

But it’s not very fast.

You can use it like this:

10 PRINT"USERNAME :";:IM=20:GOSUB 1000
20 PRINT:PRINT"YOU TYPED: ";IN$
30 PRINT"LENGTH   :";IM
40 END

(In line 20, I print an extra line, because the input routine does not echo ENTER. This allows it to be used in forms on the screen without erasing anything after or below it, similar to what something like PRINT@64,”PASSWORD:”; would do.)

If you try this, you will be able to type up to a 20 character username. But, if you type fast, you may see it loses characters. If you are a slow typist, you probably won’t notice ;-)

However, if this was at the bottom of a huge program with hundreds of lines, those GOTO 1015s would really slow things down. They would jump back to the start of the BASIC program and scan forward through every line number trying to find 1015 at the bottom! And, the INKEY$ loop that loops back to 1020 may look simple, but every time it is executed BASIC starts at the top of the program and scans down until it finds 1020. If the program is large enough, things would slow down enough to bother even a hunt-and-peck typist.

(I did test this one. I created bogus REM lines from 100 to 999, and then typing became unbearable. Even though the routine at the end was the same!)

No cheating

A simple solution to this is just to put the subroutines that have to be fast at the start of the program. If typing is the most important user experience, you might want it to be first. Then you could start your program with a simple GOTO to the real start line number, so typing “RUN” would still work. This one line would be a quick scan to get to the subroutines:

0 GOTO 100
1 REM IN : IM=INPUT MAX
2 REM RET: IN$=STR, IN=SIZE
3 REM USE: CH$
10 IN$="":IN=0
15 IF IN<IM THEN PRINT CHR$(128);
20 CH$=INKEY$:IF CH$="" THEN 20
25 IF IN<IM THEN PRINT CHR$(8);
30 CH=ASC(CH$)
40 IF CH=8 THEN IF IN=0 THEN 1090 ELSE IN=IN-1:IN$=LEFT$(IN$,IN):PRINT CH$;:GOTO 15
50 IF CH=13 THEN RETURN
60 IF CH>=32 AND CH<=127 AND IN<IM THEN IN$=IN$+CH$:IN=IN+1:PRINT CH$;:GOTO 15
90 SOUND 200,1:GOTO 15
100 REM BEGIN REAL PROGRAM

By doing this, now the GOTOs in the routine only have to scan through a few lines of the routine. If I try this, along with the hundreds of REM lines I created earlier, typing remains fast.

However…

…unless you really want to cheat, that is

It was pointed out to me by a commenter named George Phillips (and James Gerrie and Johann Klasek) that you could do a rather sneaky trick and get the benefits of a RETURN without needing a GOSUB. He suggested something like this:

10 FORA=1TO2STEP0
...do stuff here...
20 NEXT

The NEXT returns to just after the FOR statement… But since STEP 0 is there, it never increments, and never escapes! The “NEXT” is now like a RETURN, without a GOSUB. To get out, just set I=2 (the value of the end number of the FOR), and the loop is complete.

Sneaky.

And, more recently, VitNO contributor James Jones (who is writing a series on the speedy BASIC09), proposed more sneakiness:

10 FOR I=0 TO 0
...do stuff here...
XX I=(expression that is TRUE if you want another iteration)
20 NEXT

In his case, the default is to drop out, unless I is otherwise set to TRUE (-1 in BASIC). This means you could simulate something like C’s do/while:

do
{
  /* do some code */
  answer = GetAnswer();
} while (answer != 42);

In BASIC, that might be:

10 FOR I=0 TO 0
20 INPUT AN
30 I=(AN<>42)
40 NEXT

*blink*

If you check the return value of a comparison, you either get 0 (false) or -1 (true):

PRINT 5=6 'False
0

PRINT 5=5 'True
-1

With this knowledge, you can set a variable to a comparison and get the true/false result of the comparison.

Wow, do/while in BASIC! That’s just crazy.

Unfortunately, the overhead of doing comparisons like that “I=(AN<>42)” are quite slow, but you could certainly speed things up using variables and just set I=TR or I=FL to escape (true and false, get it?).

Leftovers

One other downside of this trick is that there is a bit of overhead every time you start a new FOR loop with a new variable. That FOR “status” is preserved until the loop is complete. Thus, if you did this trick and decided to GOTO outside of the loop, you would still have the memory associated with the FOR reserved. This is why you should not escape from a loop like this:

10 FOR I=1 TO 3
20 INPUT "ENTER PIN CODE";AN
30 IF AN=1234 THEN 100
40 NEXT
50 PRINT "YOU ONLY GET THREE TRIES!"
60 GOTO 60 'LOCK UP!
100 PRINT "YOU GOT IT!"
...and then the rest of the program...

If you did something like that, and the user types the correct PIN and escapes the FOR/NEXT loop with a GOTO, but the status of that FOR would remain until a NEXT completes it.

FOR/NEXT consumes memory until the loop is satisfied.

Above, we start out with 22823 bytes of free memory for BASIC.

When we allocate memory for a variable using “DIM I”, memory decreases to 22816 bytes. Each variable takes up 7 bytes of overhead.

When we create a FOR loop, memory decreases to 22798. This tells us FOR uses 18 bytes of memory. It needs to know the current value and the end value, and some kind of pointer in the code where NEXT should return to.

After the FOR/NEXT loop is complete, that 18 bytes is released, and memory returns to 22816. Only the variable I is still allocated.

If we go a bit further…

FOR/NEXT memory remains in use until the loop is complete.

Above, we start start with our 22816 memory (only variable I is allocated).

We create a FOR/NEXT loop, and check the memory, which is now down to 22798.

From this point on, until we NEXT to complete that loop, that memory remains allocated by BASIC. We can reuse the I variable, but the FOR memory remains.

If you ever want to escape from a FOR/NEXT loop, you can do this by setting the variable to the end value, doing a NEXT, and then the GOTO:

0 M=MEM
10 FOR I=1 TO 3
20 INPUT "ENTER PIN CODE";AN
30 IF AN=1234 THEN 100
40 NEXT
50 PRINT "YOU ONLY GET THREE TRIES!"
60 GOTO 60 'LOCK UP!
100 PRINT "YOU GOT IT!"
110 PRINT M,MEM

If you run this, and you enter the correct PIN (1234, of course), you will see it print how much memory was free at the start of the program, and at the end. On my system I see 22668 and 22635 (33 bytes less). The 33 bytes is the three variables (M, I and AN, each taking 5 bytes) plus the overhead of the FOR loop (18 bytes).

If I make a change so it forces the FOR loop to complete before I exit the program:

0 M=MEM
10 FOR I=1 TO 3
20 INPUT "ENTER PIN CODE";AN
30 IF AN=1234 THEN I=3:NEXT:GOTO 100
40 NEXT
50 PRINT "YOU ONLY GET THREE TRIES!"
60 GOTO 60 'LOCK UP!
100 PRINT "YOU GOT IT!"
110 PRINT M,MEM

… I see 22659 and 22644 (15 bytes less). The reason the numbers are different is because I’ve added more code (shown in red). It is the difference we care about. 15 bytes is what the three variables being used would consume (M, I and AN). The FOR has been released.

And yes, the program grew by 9 bytes to add this extra code, but that still saves more than letting the FOR loop linger.

I guess this one was more about saving memory than making things faster.

Forget I mentioned it.

Next time, maybe we can show some “real world use” of items discussed here, like an example of loading DATA statements, doing INPUT and PRINTing items to the screen.

Until then…

In 1982, I received my first computer: a $299.99 Commodore VIC-20. A year later, I moved on to a 64K Radio Shack Color Computer ("CoCo"). In 1990, I co-founded Sub-Etha Software "in Support of the CoCo and OS-9". This later led me to a job at Microware, creator of OS-9. I am author of the CoCoFest Chronicles, a compilation of my fest reports covering the 1990s era. I also host the CoCopedia.com wiki. These days, I am enjoying excavating my original VIC-20 tapes and thousands of CoCo floppy disks...

2 thoughts on “Make BASIC Fast Again, part 4

  1. Very interesting too
    But, it would be better than the basic, in the first run (changing the basic of Microsoft clear), scan the program (or is is written and saved with the source) and in each goto or gosub will keep the exact adress of the lines of destiny ?, is a small change in the basic that would make all the difference, besides the people would not have to worry about count place the things or where they are going.
    I think that sincerely the basic is very bad, I suppose that due to the hurry in its construction, we already know that Microsoft is not efficient.
    And if you ask me what happens if you write a line or character between the goto and its destination line, then, as I said, just execute it (or precompile) and the gout and others will be reappointed.
    That is the basic we need.
    Also you can have labels. and have or not line numbers
    Additionally you can have goto x, where x = 10, x = 20 or so. so if you have to look in the traditional way.

    1. I like the idea of storing the offset for the GOTO, just like Assembly does. But every time you edited a line, it would need to rescan and update all the pointers, which would be slow at creation. It would be nice to have that option.

Leave a Reply

Your email address will not be published. Required fields are marked *