#forth on 2020-09-28 — irc logs at freenode.irclog.whitequark.org

00:32 mark4 has quit [Ping timeout: 264 seconds]

01:24 dave0 has quit [Quit: dave's not here]

01:50 boru` has joined #forth

01:50 boru has quit [Disconnected by services]

01:50 boru` is now known as boru

03:01 <MrMobius> cheater, it depends on what you're trying to accomplish. if you need good performance, forth on the 6502 is an especially bad match

03:02 <MrMobius> but if you're drawn to forth for other reasons like playing around for fun or being able to compile it on the machine rather than cross compiling, it could be good

03:03 <MrMobius> cheater, I did this comparison of assembly, C, and forth for the 6502 and forth was 10-20x slower than assembly http://calc6502.com/RobotGame/summary.html

03:03 <MrMobius> ymmv if youre not going for raw performance though

03:05 <cheater> MrMobius: why is it so bad for 6502?

03:05 <cheater> and yeah i was thinking about performance

03:07 <MrMobius> for one, cells are 16 bits in size and the 6502 is an 8 bit processor so 8 bit computations are automatically more than twice as slow

03:07 <MrMobius> as opposed to a 16 bit processor where 8 and 16 bit calculations may take the same amount of time
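
A rough sketch of the point about 16 bit cells on an 8 bit CPU, in Python rather than 6502 assembly. The names `add8` and `add16_on_8bit` are made up for illustration, and the step count only counts 8 bit adds; it is not a real 6502 cycle count.

```python
def add8(a, b, carry_in=0):
    """One 8-bit add with carry, roughly what a single ADC does."""
    total = a + b + carry_in
    return total & 0xFF, total >> 8  # (result byte, carry out)


def add16_on_8bit(x, y):
    """Add two 16-bit cells using only 8-bit adds; also report how many adds."""
    lo, carry = add8(x & 0xFF, y & 0xFF)            # low bytes first
    hi, _ = add8((x >> 8) & 0xFF, (y >> 8) & 0xFF, carry)  # high bytes plus carry
    return (hi << 8) | lo, 2                        # result, number of 8-bit adds


result, steps = add16_on_8bit(0x12FF, 0x0001)
print(hex(result), steps)
```

Even when the values would fit in one byte, a 16 bit cell forces the second add (plus the loads and stores around it), which is presumably where the "more than twice as slow" comes from.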

03:08 <MrMobius> another is that to get good performance on the 6502 when looping through data, you need to use the X or Y registers to index into memory

03:08 <MrMobius> keeping a 16 bit pointer on the stack and incrementing it like you do in forth is many times slower than indexing

03:09 <MrMobius> also you take a big speed hit using stack addressing on the 6502 compared to using static zero page addresses

03:11 <MrMobius> these are all performance issues though. you may not care about any of that if you dont need max performance

03:13 <MrMobius> other reasons too. you can also use index registers as counters. storing loop counters on the return stack as forth does is several times slower

03:22 <MrMobius> cheater, thinking of doing forth on a 6502?

03:30 jsoft has joined #forth

03:53 gravicappa has joined #forth

05:12 <cheater> MrMobius: i'm thinking what a good higher level language would be for a 6502 based computer

05:12 <cheater> MrMobius: how do you think forth compares to basic?

05:12 <cheater> in terms of speed

05:14 <cheater> MrMobius: also, why do cells have to be 16 bit?

05:28 <siraben> MrMobius: oh, I recall your article on comparing assembly, C and Forth.

05:29 <siraben> the performance hit seems very high

05:51 xek has joined #forth

05:58 xek has quit [Ping timeout: 240 seconds]

06:09 <proteusguy> would be incredibly rare for a basic to outperform a decent implementation of forth. basic is usually implemented as a token interpreter. lots of runtime overhead to keep its space small.

07:25 tabemann has quit [Remote host closed the connection]

07:25 tabemann has joined #forth

08:00 mtsd has joined #forth

08:32 gravicappa has quit [Ping timeout: 246 seconds]

08:50 xek has joined #forth

09:43 mtsd has quit [Quit: mtsd]

09:56 jsoft has quit [Quit: Leaving]

10:05 mtsd has joined #forth

10:12 mtsd has quit [Quit: mtsd]

10:18 dave0 has joined #forth

10:46 xek_ has joined #forth

10:49 xek__ has joined #forth

10:50 xek has quit [Ping timeout: 272 seconds]

10:52 xek_ has quit [Ping timeout: 264 seconds]

11:00 <MrMobius> cheater, probably pretty well if you go with an STC like Tali Forth 2

11:01 <cheater> STC?

11:01 <MrMobius> subroutine threaded forth

11:02 <MrMobius> there are different ways to organize things internally. one way is to have a list of addresses of subroutines and jump to each one by one

11:03 <MrMobius> it's a lot faster though to have the same list with a jump to subroutine instruction before each address

11:03 <MrMobius> so the call takes up 50% more room but you avoid pointer calculations which are really slow on the 6502
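
A small Python model of the two layouts described above, as a sketch only. `run_thread`, `square_stc` and `thread_bytes` are hypothetical names; the byte sizes assume 2 byte addresses and a 1 byte call opcode (JSR on the 6502).

```python
def dup(stack):
    stack.append(stack[-1])


def mul(stack):
    b, a = stack.pop(), stack.pop()
    stack.append(a * b)


# Address-list threading: the word is data, and an inner interpreter
# has to fetch each entry and dispatch to it.
SQUARE_THREAD = [dup, mul]


def run_thread(thread, stack):
    for word in thread:  # stands in for the fetch-and-jump step
        word(stack)


# Subroutine threading: the word is itself code made of direct calls,
# so no interpreter loop sits between the calls.
def square_stc(stack):
    dup(stack)
    mul(stack)


def thread_bytes(n_words, subroutine_threaded):
    """Size of a word body: 2 bytes per address, or 3 per call instruction."""
    return n_words * (3 if subroutine_threaded else 2)


s1, s2 = [7], [7]
run_thread(SQUARE_THREAD, s1)
square_stc(s2)
print(s1, s2, thread_bytes(2, False), thread_bytes(2, True))
```

Both versions compute the same result; the subroutine threaded body is 3 bytes per word instead of 2, which is the 50% size increase mentioned above.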

11:04 <MrMobius> cheater, cells could also be 32 bit or even 64 bit. it just makes sense to keep them big enough to hold a pointer

11:07 <cheater> why not 8 bit?

11:08 <MrMobius> then you would have 8 bit pointers and only be able to address 256 bytes

11:09 <MrMobius> or have a scheme where pointers take up 2 cells
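
The arithmetic behind the pointer size argument, as a Python sketch. The helper names are invented here; `split_pointer` and `join_pointer` show one possible version of the "pointers take up 2 cells" scheme with 8 bit cells.

```python
def addressable_bytes(pointer_bits):
    """How many distinct byte addresses a pointer of this width can name."""
    return 2 ** pointer_bits


def split_pointer(addr):
    """Split a 16-bit address into two 8-bit cells (low byte, high byte)."""
    return addr & 0xFF, (addr >> 8) & 0xFF


def join_pointer(lo, hi):
    """Rebuild the 16-bit address from its two 8-bit cells."""
    return (hi << 8) | lo


for bits in (8, 16, 24):
    print(bits, addressable_bytes(bits))

lo, hi = split_pointer(0xC000)
print(hex(lo), hex(hi), hex(join_pointer(lo, hi)))
```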

11:17 <cheater> i mean for short jumps that's fine enough, right?

11:17 <cheater> it's only long jumps where that's a problem, and you should have two or more cells

11:17 Zarutian_HTC has joined #forth

11:18 <cheater> 65c816 has 24 bit address space

11:18 <cheater> so you might want to use 3 bytes for the pointers

11:18 <cheater> i guess you'd have to push them all on the stack and then jump, or something

11:19 <cheater> for example, you could actually push the machine code instruction for jump onto the stack, then the 24 bits of the pointer, and then have a forth function called "interpret_jump"

11:19 <MrMobius> they will basically all be long jumps and even short jumps wouldn't work well since you're either jumping into the first 256 bytes where the stack is or doing a relative jump which will be slow

11:20 <cheater> yeah

11:21 <MrMobius> ya someone made a forth for 816 and says he got a huge speedup

11:21 <cheater> wonder how?

11:21 <cheater> oh yea the 816 has 16-bit registers

11:21 <cheater> nice

11:22 <cheater> i wonder if you could make a basic interpreter that runs on forth?

11:22 <MrMobius> and better addressing so some of that is less painful. haven't used it myself though. some people say the processor is unpleasant to work with

11:23 <cheater> wonder why they say that

11:23 <MrMobius> sure. you would hate your life and it would be ungodly slow if on the 6502 but you could do it

11:24 <MrMobius> several reasons. its not a 16 bit redesign. its an 8 bit 6502 with 16 bit stuff frankensteined on top

11:27 <cheater> i'm mostly thinking of putting it on the 816 to be honest

11:28 <cheater> also let's be honest with each other no one's going to write huge programs like this, for anything even medium complex they'd write some other language on some other machine and cross compile

11:29 <cheater> so maybe if you have an 8 bit pointer and can only address 256 bytes at first, maybe that's perfectly fine for a simple program

11:33 <MrMobius> like most 16 bit machines will have an 8 bit and a 16 bit version of an instruction like ADD but the 816 just has an 8 bit mode and a 16 bit mode for each register and only one ADD instruction

11:33 <cheater> oh

11:33 <cheater> interesting

11:33 <cheater> MrMobius, basically the 6502 was meant to treat the zero page as registers, that's why operating on that is cheap (or supposed to be, i don't actually know)

11:34 <MrMobius> so when you jump into a function or into an interrupt you may not know what modes you're in which is a real pain
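
A simplified Python illustration of why the mode matters. `adc_816` is a hypothetical helper, not a real emulator: it ignores carry and decimal mode, and only models the idea that the same add instruction acts on 8 or 16 bits depending on a mode flag.

```python
def adc_816(acc, operand, m_flag):
    """Add `operand` to a 16-bit accumulator value.

    m_flag=True models 8-bit accumulator mode, m_flag=False 16-bit mode.
    """
    if m_flag:
        low = (acc + operand) & 0xFF   # only the low byte takes part
        return (acc & 0xFF00) | low    # high byte is left untouched
    return (acc + operand) & 0xFFFF


# Same values, same "instruction", different result depending on the mode.
print(hex(adc_816(0x12FF, 1, m_flag=True)))
print(hex(adc_816(0x12FF, 1, m_flag=False)))
```

Code that is entered without knowing the current mode cannot tell which of the two results it will get, so it has to set the mode itself first.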

11:35 <MrMobius> cheater, the problem is though that using 8 bit pointers doesn't just mean your program has to fit in 256 bytes but also that the whole forth interpreter does if you want to be able to access the address of primitives or other stuff

11:36 <cheater> hmmm right

11:36 <MrMobius> and even then you cant fit much of a meaningful program in 256 bytes. the code for my robot game is 40k or so I think

11:36 <cheater> what if i made it a thing where /writing/ to the stack is more expensive but reading from it is cheaper?

11:37 <MrMobius> how would you do that?

11:37 <cheater> i.e. when you write to the stack, you basically write the machine code you'd want to have

11:37 <cheater> but that'll have to be taken from memory somewhere else, which is more expensive

11:37 <cheater> some sort of partly pre-compiled forth

11:38 <MrMobius> hmm, it doesnt work like what youre describing

11:38 <cheater> why not?

11:38 <MrMobius> what machine code are you writing when you write to the stack? you dont put instructions on the stack, just data

11:41 <cheater> i was thinking of one of two designs. design 1: have the stack grow backwards, i.e. opposite to the way machine code is interpreted. then run that. design 2: when committing data to the stack, pre-place holes that you will later populate with instructions once you're ready to. once you're at the point of executing that instruction together with its data, you take the ultimate result, place it where the instruction was, and discard the rest (i.e. the

11:41 <cheater> locations where the hole's data was). this means the stack structure is retained and the result of this instruction can be used in the next instruction, for which there will be a hole as well.

11:42 <MrMobius> why would you put machine code on the stack though? there's no reason to jump into the stack and start executing things

11:43 <MrMobius> if you're leaving a hole there to jump into, you can only put one instruction there

11:44 <MrMobius> even the simplest forth instruction takes several steps so needs a dozen or more bytes of instructions

11:45 <cheater> i mean yeah. i guess this is a thing that would need to be integrated into the design. i was just trying to present the idea in a simple fashion

11:46 <cheater> nothing's stopping you from having larger holes, or having multiple-hole systems

11:46 <cheater> like maybe commit to two holes: the instruction to jump to, followed by its data (not holes), then at the end you have a hole for another jump instruction

11:47 <MrMobius> doesn't sound like it would be any faster than what already exists, but what you could do is get a forth for your pc and learn it really well then see if you can implement your improved version on the 816 or some other platform

11:48 <cheater> tbh i'd probably write an 816 emulator and try to figure how to build something for that

11:48 <MrMobius> ya what you're describing is the thread of instructions not the data stack. you can inline data in the thread and use the return address to get the address of the data

11:48 <cheater> i mean 8502

11:48 <cheater> er 6502

11:48 <cheater> what's wrong with me today

11:48 <cheater> what is the thread of instructions? that's not a standard forth feature, is it?

11:49 <MrMobius> its the list of subroutine calls I was talking about before. every forth has some sort of thread

11:50 <MrMobius> go for it. there are a lot of 6502 emulators. it's a good project to learn about the chip. there's even a pretty well tested verification program you can run to test your emulator

11:56 <cheater> don't you think a 6502 forth would be faster without a subroutine thread?

11:56 <cheater> looks like referring to non-immediate data is the real perf killer here

11:57 mtsd has joined #forth

11:59 <cheater> that's why i was thinking of this hole system, because that makes the data always immediate to the instructions

12:01 <cheater> how many bytes of arguments does a 6502 machine instruction take? is this variable? what about the 816?

12:03 <MrMobius> cheater, faster? how else are you going to run anything?

12:04 <MrMobius> I see what you mean. there is a form of this you can use in some situations

12:05 <MrMobius> you write instructions to memory and modify their arguments at run time. this is called self modifying code and only works in specific circumstances. you couldn't do everything that way. there is overhead to filling the holes
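
A toy Python model of self modifying code, assuming a tiny interpreter for four real 6502 opcodes (LDA immediate, ADC immediate, CLC, BRK). The `run` loop and the `HOLE_A`/`HOLE_B` offsets are illustrative inventions; decimal mode and flags other than carry are ignored.

```python
LDA_IMM, ADC_IMM, CLC, BRK = 0xA9, 0x69, 0x18, 0x00


def run(mem, pc=0):
    """Execute a tiny subset of 6502 opcodes and return the accumulator."""
    a, carry = 0, 0
    while True:
        op = mem[pc]
        if op == LDA_IMM:
            a = mem[pc + 1]
            pc += 2
        elif op == ADC_IMM:
            total = a + mem[pc + 1] + carry
            a, carry = total & 0xFF, total >> 8
            pc += 2
        elif op == CLC:
            carry = 0
            pc += 1
        elif op == BRK:
            return a
        else:
            raise ValueError(f"unsupported opcode {op:#04x}")


# LDA #$00 / CLC / ADC #$00 / BRK, with the two operand bytes used as holes.
program = bytearray([LDA_IMM, 0x00, CLC, ADC_IMM, 0x00, BRK])
HOLE_A, HOLE_B = 1, 4


def add_via_patching(x, y):
    program[HOLE_A] = x & 0xFF  # filling the holes is the overhead
    program[HOLE_B] = y & 0xFF
    return run(program)


print(add_via_patching(2, 3))
```

The two stores in `add_via_patching` are the cost of filling the holes; they only pay off if the patched code then runs many times, for example inside a loop.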

12:06 <MrMobius> yes variable. 0-2 on 6502. 0-3 I think on 816 but not sure

12:15 gravicappa has joined #forth

12:21 dave0 has quit [Quit: dave's not here]

12:48 <MrMobius> hmm, looks like they added incremental compilation to the zig programming language

12:48 <cheater> yeah, i was thinking of smc, but instead of modifying the instruction arguments, you put up arguments and you modify the instruction that's being called

12:48 <MrMobius> I wonder if we'll see NASA running that instead of forth on satellites

12:49 <MrMobius> cheater, how would that be better though? it takes time to write the instruction there and the instruction only does one thing. it takes longer to put the instruction in the hole than to execute it

12:50 <MrMobius> it only makes sense if the overhead of changing the instruction is less than the improvement you get from using the immediate version which seems like it would only be in a loop

12:51 <cheater> it's a trade off, either the cost is large at write (my way) or at read (your way)

12:53 <cheater> so for example, you might be able to optimize the writes. if you know you'll be doing only one specific thing with the data once that data is on the stack, just include the instruction with the data, so it's put on the stack together; and the jump-back instruction, which is the other part that gets modified, that still gets "fixed up" by the runtime

12:54 <cheater> or let's say you have a bunch of data, and you want to fold or map over it, doing basically the same thing to each piece of the data

12:54 <cheater> you'd load the data into memory with holes, then load one instruction into a zero-page "register" for easy access, and write it to all the holes

12:56 <cheater> you might need to do that multiple times due to there being multiple things you need to write into the holes

12:56 <cheater> but in general you could transform data like this. it's very reminiscent of the map/fold paradigm you'd find in functional programming.

12:58 mtsd has quit [Quit: Leaving]

12:58 <MrMobius> cheater, that's just not how that works at all. what youre describing does not match the way forth works or more importantly how the processor works

13:00 <cheater> would you like to explain the issues to me? it's okay if you don't feel like it, i'm just curious

13:01 <MrMobius> lets say you have an 8 bit stack, which I think is a bad idea and you should not do

13:01 <MrMobius> you can put an instruction to add for example in the hole and that will add the value to the accumulator but what then?

13:01 <MrMobius> that value then needs to be written somewhere and the stack pointer needs to be adjusted

13:02 <MrMobius> it would be better to use a 16 bit stack but that is even worse since nothing you put in the hole can do a 16 bit add

13:03 <cheater> the hole can be multiple bytes long

13:03 <MrMobius> so you could leave an enormous hole and put a whole bunch of instructions in there but then we're back to the huge overhead of copying that in

13:03 <cheater> also if you just have one stack, then you can use the program counter

13:03 <cheater> which is 16 bit

13:04 <MrMobius> again, thats just not how that works. the program and stack are different

13:04 <cheater> why must they be different?

13:04 <MrMobius> also, if you have the hole multiple bytes long, you arent using the immediate mode any more so you lose the speed up you get from loading an immediate

13:05 <cheater> not necessarily. the hole can be actually multiple holes: one small hole for the instruction with arguments following it, and then another longer hole for the handler

13:06 <MrMobius> because as the program counter advances, you leave the results of calculations behind you in the space for data. you need some way to pull that forward and keep working on it. thats what the data stack does. its always in the same place no matter where your program counter is
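
A minimal Python stack machine to illustrate the separation described above. The opcode names `lit`, `+` and `*` and the `run` function are assumptions for the sketch, not any particular Forth's internals.

```python
def run(program):
    """Walk a program with a program counter while results live on a data stack."""
    stack = []  # data stack: stays in one place wherever pc happens to be
    pc = 0
    while pc < len(program):
        op = program[pc]
        pc += 1
        if op == "lit":                # inline literal: value follows the opcode
            stack.append(program[pc])
            pc += 1
        elif op == "+":
            b, a = stack.pop(), stack.pop()
            stack.append(a + b)
        elif op == "*":
            b, a = stack.pop(), stack.pop()
            stack.append(a * b)
        else:
            raise ValueError(f"unknown op {op!r}")
    return stack


# (2 + 3) * 4: the sum computed early is still reachable three steps later.
print(run(["lit", 2, "lit", 3, "+", "lit", 4, "*"]))
```

The program counter only ever moves forward through the program, while intermediate results are carried along on the stack instead of being left behind at the addresses where they were computed.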

13:07 <MrMobius> cheater, are you on windows?

13:07 <cheater> yeah

13:08 <MrMobius> download this 6502 simulator+assembler and give it a try http://exifpro.com/utils.html

13:08 <MrMobius> its a really cool program. i keep it running on my taskbar 24/7 to test asm snippets

13:08 <cheater> haha that's nice

13:09 <MrMobius> try to implement what you're describing there and you'll see what I mean

13:09 <MrMobius> you may not be able to see that this doesnt work until you implement it yourself

13:09 <MrMobius> also ##6502 is very active

13:11 <cheater> *hacker voice* i'm in

13:11 <cheater> what do i do now?

13:11 <cheater> i guess i'll look for a hello world

13:11 <cheater> http://www.rosettacode.org/wiki/6502_Assembly

13:11 <cheater> nice

13:13 <cheater> http://www.rosettacode.org/wiki/Even_or_odd#6502_Assembly hmm... this is Apple II code... is the syntax right for the exifpro assembler? and the apple ii specific stuff?

13:13 <MrMobius> hmm, no

13:13 <MrMobius> probably better to discuss in ##6502

13:15 <cheater> yep let me join

13:50 <cheater> cool tidbit about forth on c64 https://www.reddit.com/r/c64/comments/j0y1rv/is_forth_a_good_match_for_6502/g6whc4p/?context=3

14:06 cp- has quit [Quit: Disappeared in a puff of smoke]

14:07 cp- has joined #forth

14:09 cp- has quit [Client Quit]

14:09 cp- has joined #forth

15:28 mark4 has joined #forth

15:29 <mark4> tabemann, that would actually be the primary use for zero page :)

15:30 <mark4> i would consider using index x for a stack pointer somewhere other than 0x100 area, leave that for the processor stack but have two software stacks elsewhere in ram. 64k is freaking huge :)

15:30 <mark4> i would also put the forth kernel under the basic rom which can be paged out to reveal the ram under it

15:31 Zarutian_HTC has quit [Ping timeout: 258 seconds]

15:39 <MrMobius> mark4, yep. probably want to put the x-based data stack in zero page
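
One way an X indexed data stack in zero page could look, modelled in Python as a sketch. The class name, the downward growth and the starting offset 0x80 are all assumptions; the comments name the 6502 instructions each step loosely corresponds to.

```python
class ZeroPageDataStack:
    """16-bit cells stored in a 256-byte 'zero page', indexed by an 8-bit X."""

    def __init__(self, top=0x80):  # 0x80 is an arbitrary starting offset
        self.zp = bytearray(256)
        self.x = top

    def push(self, value):
        self.x = (self.x - 2) & 0xFF                        # like DEX DEX
        self.zp[self.x] = value & 0xFF                      # low byte, like STA 0,X
        self.zp[(self.x + 1) & 0xFF] = (value >> 8) & 0xFF  # high byte, like STA 1,X

    def pop(self):
        value = self.zp[self.x] | (self.zp[(self.x + 1) & 0xFF] << 8)
        self.x = (self.x + 2) & 0xFF                        # like INX INX
        return value


stack = ZeroPageDataStack()
stack.push(0x1234)
stack.push(0xBEEF)
print(hex(stack.pop()), hex(stack.pop()), hex(stack.x))
```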

15:46 <mark4> sp yes, stack no :)

15:46 <mark4> i would have sp and rp on zero page variables pointing to 0x200 and 0x300

15:46 <mark4> tho for a c64 that would probably be $200 and $300 :)

15:48 <MrMobius> mark4, hardware stack is fixed at $100 so no choice about that

15:48 <mark4> right but i would not use that for parameters

15:48 <mark4> actually i guess that could be the return stack?

15:49 <MrMobius> right, r stack

15:49 <mark4> i forget do you have access to SP on the c64?

15:49 <MrMobius> not directly. you can transfer SP to X, modify then transfer it back

15:49 <mark4> 0x00 to 0xff is zero page, page 1 is stack

15:49 <mark4> aha thats right. thats good enough :)

15:50 <mark4> been a very very long time since i looked at a c64 :)

15:50 <mark4> very many fond memories :)

15:50 <MrMobius> hehe

15:51 <mark4> i learned 6502 in 2 weeks and then bought a c64 :)

15:51 <mark4> i did a hand reverse engineer of a disassembler that was published in CCI :)

15:51 <mark4> took 2 weeks and then i knew almost every opcode by heart :)

15:52 <MrMobius> nice :)

15:52 <MrMobius> I didnt get one until 10 or 15 years ago off ebay. it was fun to play around with but didnt work any more when I dug it out of the closet a few years ago

15:53 <mark4> awww

15:53 <mark4> wish i still had mine

15:53 <MrMobius> tried to repair it last year and some of the ttl chips had been replaced and socketed before it got to me, since they didn't do that at the factory. problems match that chip being bad

15:54 <mark4> thers a guy on youtube that repairs c64s :)

15:54 <MrMobius> they are neat. I had a lot of ideas for neat things but kind of lost interest when I found out the process they used for the chips was flawed and the chips' lives shorten the longer you use it

15:54 <MrMobius> so no sense in leaving it on to run an 8 hour simulation or something even if it would be neat or running a server on it

15:55 <MrMobius> yep! I would be totally lost trying to fix anything without that :P

15:58 <mark4> yea and you can do it in an emulator too :)

15:58 <mark4> bbl have an errand to run

16:01 Zarutian_HTC has joined #forth

16:23 cantstanya has quit [Ping timeout: 240 seconds]

16:26 xek__ has quit [Ping timeout: 264 seconds]

17:19 xek__ has joined #forth

17:51 WickedShell has joined #forth

18:15 Zarutian_HTC has quit [Remote host closed the connection]

18:28 gravicappa has quit [Ping timeout: 256 seconds]

18:29 gravicappa has joined #forth

19:37 <bluekelp> MrMobius: say more about the chip life being shortened with use?

19:40 <bluekelp> I have an old 64 in a box again after booting it up and confirming it worked a few years ago. much nostalgia.

19:43 <MrMobius> it was boron or something they used in the process and didnt have it exactly right so the chips MOS made themselves will die with use

19:49 <bluekelp> interesting. I'll research more.

20:02 <MrMobius> bluekelp, I think I might be partly wrong. I heard Bil Herd talk about it in a video but it might just be the PLAs

20:02 <MrMobius> and you can get PLA replacements

20:11 remexre has quit [Ping timeout: 240 seconds]

20:13 remexre has joined #forth

20:45 gravicappa has quit [Ping timeout: 256 seconds]

20:58 xek__ has quit [Ping timeout: 256 seconds]

21:08 _whitelogger has joined #forth

21:12 <f-a> a friend of mine is trying to get me into soldering

21:12 <f-a> maybe I should dust off forth too

21:26 WQX has joined #forth

21:54 f-a has quit [Quit: leaving]

22:16 WQX has left #forth [#forth]

23:49 dave0 has joined #forth