#forth on 2020-02-28 — irc logs at freenode.irclog.whitequark.org

00:12 kori has joined #forth

00:12 kori has quit [Changing host]

00:12 kori has joined #forth

00:23 kori has quit [Read error: Connection reset by peer]

00:44 kori has joined #forth

01:08 tabemann has joined #forth

01:27 jsoft has joined #forth

01:29 <tabemann> hey guys

01:29 <Croran> Hi Tabemann

02:07 X-Scale` has joined #forth

02:07 X-Scale has quit [Ping timeout: 258 seconds]

02:07 X-Scale` is now known as X-Scale

02:12 tabemann has quit [Ping timeout: 240 seconds]

02:24 boru` has joined #forth

02:24 boru has quit [Disconnected by services]

02:24 boru` is now known as boru

02:30 proteus-guy has joined #forth

03:00 tabemann has joined #forth

03:13 tp has quit [Read error: Connection reset by peer]

03:14 tp has joined #forth

03:14 tp has quit [Changing host]

03:14 tp has joined #forth

03:22 rdrop-exit has joined #forth

03:22 <tabemann> wahoo - I fixed my problems with loops!

03:22 <tabemann> it turned out that the PC is offset by 4 in Thumb-2

03:22 <rdrop-exit> kudos!

03:22 <tabemann> so I was miscalculating my jump offsets

03:22 <tp> ahh

03:22 <tp> thats because 4 8 * = 32

03:22 <tp> ?

03:23 <tabemann> that's because the processor does instruction lookahead

03:23 <tabemann> so the PC is always the current instruction's address plus 4

03:23 <tp> I know I have to add 4 to the pc when reading 32 bit words

03:24 <tp> oops

03:24 <tp> I mean add 4 to the address!

03:24 <rdrop-exit> that's how PCs usually work, they point to the next instruction while the current instruction is executed

03:24 <tp> I have no idea what my PC is doing

03:26 <tabemann> rdrop-exit: for Thumb-2, though, instructions can be 16 or 32 bit

03:27 <tabemann> so with 16-bit instructions, it actually points to the instruction *after* the next instruction

03:28 <rdrop-exit> that sounds wonky, when is it adjusted back?

03:30 <rdrop-exit> (i.e. at what point in the instruction cycle?)

03:31 <tabemann> the reason why is that it fetches 16 bits, and then fetches another 16 bits, and decides whether the instruction is to be 16-bit or 32-bit

03:31 <tabemann> the reason why it does two fetches is that 32 bit instructions are only 16 bit aligned

03:32 <rdrop-exit> aha
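
The PC offset tabemann describes can be sketched as a small branch-offset calculation. This is only an illustration, not zeptoforth's code: the function name `thumb_b_t2` is made up here, and it covers only the 16-bit unconditional `B` (encoding T2), assuming the PC reads as the branch's own address plus 4.

```python
def thumb_b_t2(branch_addr, target_addr):
    """Encode a 16-bit Thumb unconditional branch (B, encoding T2).

    Hypothetical helper for illustration only.
    """
    # In Thumb state the PC reads as the current instruction's address + 4,
    # so the stored offset is relative to that, not to the branch itself.
    offset = target_addr - (branch_addr + 4)
    if offset % 2 != 0:
        raise ValueError("target must be halfword aligned")
    if not -2048 <= offset <= 2046:
        raise ValueError("target out of range for a 16-bit B")
    # 11100 imm11, where imm11 holds offset >> 1 in two's complement.
    return 0xE000 | ((offset >> 1) & 0x7FF)


# A branch to itself needs an offset of -4, not 0.
print(hex(thumb_b_t2(0x100, 0x100)))
```

Forgetting the +4 would make every jump land two halfwords away from where it was meant to go, which matches the miscalculated loop offsets mentioned above.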

03:34 <rdrop-exit> thanks, my Google-fu didn't bring up a quick diagram of the ARM instruction cycle

03:34 <rdrop-exit> actually, my duckduckgo-fu

03:35 <rdrop-exit> thumb seems a hack

03:35 <tabemann> thumb is annoying

03:35 <tp> probably is as it started life as the acorn

03:36 <tabemann> the thing about thumb is it isn't consistent

03:36 <rdrop-exit> the way RISC-V handles this aspect is much better

03:37 <tabemann> like why in hell are instructions that set the status register typically 16 bit and instructions that do not are typically 32 bit

03:37 <tp> the good thing about Forth is it makes thumb skills unnecessary usually

03:37 <tabemann> I can't decide whether to implement inlining or fix string handling next

03:37 <tp> apparently ALL internal ARM instructions are 32 bit anyway

03:38 <rdrop-exit> what's fix string handling?

03:38 <tabemann> rdrop-exit: words like .( and ." are horrifically broken in zeptoforth ATM

03:38 <rdrop-exit> ah, string literals

03:38 <tp> gtg, back in a few hours, cya folks!

03:39 <tabemann> see ya tp

03:39 <rdrop-exit> ciao tp!

03:42 <rdrop-exit> IIRC "standard" Forth doesn't provide explicit inlining related words

03:43 <tabemann> the thing is that RAM to Flash calls in zeptoforth are really expensive

03:43 <rdrop-exit> Some Forths do implicit inlining in their optimizer, I prefer explicit inlining

03:43 <tabemann> so a significant gain can be obtained through inlining common builtin words

03:44 <tabemann> a single call from RAM to Flash is 10 bytes

03:44 <rdrop-exit> inlining can also be useful for factoring return stack related code

03:45 <tabemann> it'd also make it cheaper processing-time-wise, as you say, because then I don't need to push and pop return addresses

03:45 <rdrop-exit> might be cheaper for you to load code from flash into ram at startup

03:46 <tabemann> rdrop-exit: yes, Flash to Flash calls are far cheaper, only being 4 bytes

03:46 <rdrop-exit> explicit inlining is very simple to implement, I've never bothered with implicit inlining

03:47 <tabemann> I could probably implement implicit inlining pretty easily

03:47 <rdrop-exit> implicit requires too much special handling

03:48 <tabemann> when compiling a word, check to see if any calls are made in the word, or if the word is over a certain size

03:48 <tabemann> if neither are true, set a flag in the word

03:48 <tabemann> when the word is finalized

03:49 <tabemann> then, when compiling that word into another word

03:49 <tabemann> check for that flag

03:49 <tabemann> if it is set

03:49 <tabemann> strip the push {lr} and pop {pc} instructions from the start and end of the word

03:49 <tabemann> and insert it into the word being compiled
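
The stripping step just described might look roughly like the sketch below. It is an assumption-laden model, not zeptoforth's implementation: a word's code is represented as a list of 16-bit halfwords, the helper names are invented, and the early-exit check is deliberately naive.

```python
PUSH_LR = 0xB500  # Thumb encoding of push {lr}
POP_PC = 0xBD00   # Thumb encoding of pop {pc}


def inlinable_body(code):
    """Return the halfwords between push {lr} and pop {pc}, or None."""
    if len(code) < 2 or code[0] != PUSH_LR or code[-1] != POP_PC:
        return None
    body = code[1:-1]
    # A pop {pc} inside the body would be an early exit, so refuse to inline.
    # Naive: this also rejects a 32-bit instruction that merely contains
    # the halfword 0xBD00.
    if POP_PC in body:
        return None
    return body


def compile_inline(target, code):
    """Append the stripped body to target; return False if not inlinable."""
    body = inlinable_body(code)
    if body is None:
        return False
    target.extend(body)
    return True


# Illustrative word: push {lr} ; adds r0, #1 ; pop {pc}
word = [PUSH_LR, 0x3001, POP_PC]
caller = []
compile_inline(caller, word)
print([hex(h) for h in caller])
```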

03:50 <rdrop-exit> you're assuming the word doesn't do anything special with the return stack

03:50 <tabemann> that's true

03:50 <rdrop-exit> if the word has early exits for example

03:50 <tabemann> I'm better off only doing that if an inline flag is set explicitly by the user

03:50 <rdrop-exit> implicit requires understanding what the word is up to

03:51 <rdrop-exit> explicit doesn't

03:51 <tabemann> with explicit I'll probably still have a check for whether any calls are made

03:51 <rdrop-exit> it's just you explicitly saying this word is for inlining rather than calling

03:52 <rdrop-exit> much simpler

03:52 <rdrop-exit> I use ;inline

03:52 <rdrop-exit> e.g. : foo ... ;inline

03:53 <rdrop-exit> that's me explicitly indicating to the compiler that this word should be inlined at compile time

03:53 <rdrop-exit> the compiler doesn't require any smarts, I told it what I want

03:55 <rdrop-exit> COMPILE, takes care of the actual inlining

03:56 <rdrop-exit> it's up to me to make sure I know what I want

03:57 <rdrop-exit> and that it makes sense

03:58 <rdrop-exit> If the inline-able word is not compile-only then it should have a ret at the end that doesn't get inlined

03:58 <rdrop-exit> that way you can interpret it normally
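
A toy model of the explicit scheme rdrop-exit outlines: the word carries a flag, the compile step copies its body when the flag is set, and the definition keeps a trailing ret so it can still be run on its own. Everything here (`define`, `compile_comma`, the opcode strings) is hypothetical and only stands in for a real Forth's `;inline` and `COMPILE,`.

```python
RET = "ret"


def define(dictionary, name, body, inline=False):
    # Every definition ends in RET so it can still be called or interpreted.
    dictionary[name] = {"code": list(body) + [RET], "inline": inline}


def compile_comma(dictionary, target, name):
    """Compile a reference to `name` into the word being built (target)."""
    word = dictionary[name]
    if word["inline"]:
        # Copy the body but leave the trailing RET behind.
        target.extend(word["code"][:-1])
    else:
        target.append(("call", name))


words = {}
define(words, "2dup+", ["over", "over", "+"], inline=True)
define(words, "square", ["dup", "*"])

caller = []
compile_comma(words, caller, "2dup+")
compile_comma(words, caller, "square")
print(caller)
```

The compiler makes no decisions of its own here; it only honours the flag, which is the point being made about explicit inlining.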

04:02 <rdrop-exit> I find implicit inlining has too many gotchas which require too much smarts to be built into the optimizer/compiler, and then you need workarounds when the smarts get in the way of what you actually need

04:02 <rdrop-exit> But that's just my personal take on inlining in Forth

04:03 <tabemann> back

04:03 <rdrop-exit> wb :)

04:04 <tabemann> yeah, I'm just going to do explicit

04:05 <rdrop-exit> I posted a description of one approach to explicit inlining on reddit a while back

04:05 <rdrop-exit> just a sec

04:07 <rdrop-exit> https://www.reddit.com/r/Forth/comments/9u0h7u/idea_for_making_forth_compiler_more_pluggable/e91ne3l?utm_source=share&utm_medium=web2x

04:07 <rdrop-exit> see if that link works for you

04:10 gravicappa has joined #forth

04:21 <rdrop-exit> the most difficult part of x11 so far is figuring out what's unofficially deprecated

04:32 <tabemann> back

04:33 <tabemann> like server-side font rendering

04:33 <rdrop-exit> WB

04:33 <tabemann> apparently everyone just renders their fonts client-side these days, and then sends over the rendered fonts as images

04:34 <tabemann> you'd better familiarize yourself with the likes of freetype2

04:34 <tabemann> (which, if you're doing this all in forth, requires writing either an FFI layer, or a separate process that offloads font rendering)

04:35 <rdrop-exit> I'm planning on using my own raster font, won't need to deal with that, i'll just be pushing pixmaps to the server with the font already rendered

04:36 <rdrop-exit> X won't have anything to do with my fonts

04:37 <rdrop-exit> my window will be fixed-sized so no rescaling of fonts required

04:39 <rdrop-exit> X will only receive pixels to put on the screen

04:45 <rdrop-exit> I'm not planning on using any libraries, just the x11 wire protocol
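
For a sense of what talking the wire protocol directly involves, here is a sketch of the first message a client sends, the X11 connection setup request. The function name is made up, and opening the socket and parsing the server's reply are left out.

```python
import struct


def x11_setup_request(auth_name=b"", auth_data=b""):
    """Build the little-endian X11 connection setup request.

    Hypothetical helper; authorization details depend on the local setup.
    """
    def pad4(data):
        # Variable-length fields are padded to a multiple of 4 bytes.
        return data + b"\x00" * (-len(data) % 4)

    header = struct.pack(
        "<BxHHHH2x",
        ord("l"),        # byte order: 'l' means least significant byte first
        11,              # protocol major version
        0,               # protocol minor version
        len(auth_name),  # length of authorization protocol name
        len(auth_data),  # length of authorization data
    )
    return header + pad4(auth_name) + pad4(auth_data)


print(x11_setup_request().hex())
```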

04:46 <rdrop-exit> my needs are simple enough that I think I can get away with ignoring big chunks of the X ecosystem

04:48 <rdrop-exit> I do need to figure out which protocol extensions I need though, probably the Sync one, the newer keyboard one, and if performance is too slow I might have no choice but to look into the shared memory extension

04:50 <rdrop-exit> apparently the double buffering extension is deprecated, but there are simpler ways of accomplishing its purpose

04:52 <rdrop-exit> I'm spending most of my time figuring out what can or should be ignored, so much cruft

04:52 <tabemann> yess!! inlining works!

04:53 <rdrop-exit> bravo! :)

04:56 <rdrop-exit> late lunch, catch you later :)

04:56 rdrop-exit has quit [Quit: Lost terminal]

05:05 WickedShell has quit [Remote host closed the connection]

05:52 tp has quit [Read error: Connection reset by peer]

05:52 tp has joined #forth

05:52 tp has quit [Changing host]

05:55 nonlinear has joined #forth

07:19 dddddd has quit [Ping timeout: 255 seconds]

08:05 dys has joined #forth

08:38 <veltas> tabemann: That's how Z80's PC works as well for relative jumping

08:39 <veltas> The relative offset is referred to as 'e', but the stored value in machine code is e-2

08:40 <veltas> And the range of e is -126 to 129
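
The e versus e-2 relationship veltas mentions can be written out for the Z80's unconditional `JR e`. This is a minimal sketch with an invented function name.

```python
def z80_jr(instr_addr, target_addr):
    """Encode an unconditional Z80 JR to target_addr (hypothetical helper)."""
    # e is the distance from the JR instruction itself to the target.
    e = target_addr - instr_addr
    if not -126 <= e <= 129:
        raise ValueError("target out of range for JR")
    # The CPU adds the stored byte to the PC after fetching the 2-byte
    # instruction, so the machine code holds e - 2 as a signed byte.
    return bytes([0x18, (e - 2) & 0xFF])


# A jump to itself has e = 0 and stores -2 (0xFE).
print(z80_jr(0x8000, 0x8000).hex())
```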

08:47 rdrop-exit has joined #forth

08:50 <rdrop-exit> veltas, it seems ARM thumb works differently from what tabemann described

08:54 <rdrop-exit> something about the intermixing of 16-bit and 32-bit instructions

08:59 <veltas> Right

08:59 <veltas> I thought thumb was all 16-bit instructions

09:02 <rdrop-exit> I have just about zero ARM knowledge

09:07 <veltas> Me too but that is part of my approx. 0

09:12 <tp> hey guys

09:12 <rdrop-exit> I remember reading in the RISC-V docs that ARM's thumb is a separate ISA, while with RISC-V 16 bits is just an optional extension

09:12 <tp> arm uses 32 bits internally always

09:12 <rdrop-exit> hello Forth Master Technician (tm)!

09:12 <tp> hey rdrop-exit, Zen Forth Guru!

09:12 <tp> welcome back veltas !

09:13 <tp> but thumb goes thru a code converter so that while the user code is thumb, it all winds up as 32 bit arm before executing in the cpu

09:14 <tp> unlike risc-v you dont get a choice of 32 or 16 bit with cortex-m0 at least

09:15 <veltas> I mean the instruction sizes

09:15 <rdrop-exit> So if I understood correctly with ARM you're either in thumb mode or regular mode, while with RISC-V the compressed instructions are just extra instructions added to the base ISA

09:15 <veltas> ARM is like 32-bit PowerPC in that everything is done with 32-bit words, right?

09:15 <veltas> Going to work!

09:15 <tp> rdrop-exit, with cortex-m0 there is no 'regular' mode, it's thumb or nothing

09:16 <tp> even tho deep in the guts of the cpu, it's all 32 bit

09:16 <tp> i think m3 is the same

09:17 <rdrop-exit> some chips may be limited to one ISA or the other, but the point I think is that it's not a superset/subset relation

09:17 <rdrop-exit> here's a quote from some of the RISC-V lit:

09:17 <rdrop-exit> "

09:17 <rdrop-exit> RV32I instructions are indistinguishable in RV32IC. Thumb-2 is actually a separate ISA with 16-bit instructions plus most but not all of ARMv7. For example, Compare and Branch on Zero is in Thumb-2 but not ARMv7, and vice versa for Reverse Subtract with Carry."

09:18 <tp> Cortex-M use the 32/16 bit Thumb2 instruction set, except the M0/M0+ which use almost pure Thumb1 16 bit instructions with just a few system management 32 bit instructions. Choose Thumb2 or ARMv7-M for both. They don’t support original ARM instructions at all.

09:19 <rdrop-exit> the point I think is that there is no superset/subset relation between non-thumb and thumb ISAs

09:19 <tp> i think risc-v probably has a lot of modern advantages but it's still new

09:19 <tp> rdrop-exit, agreed

09:20 <tp> for instance Mecrisp-Stellaris on STM32F103 (cortex-m3) is faster than mecrisp-quintus on the gd32VF103 risc-v

09:21 <rdrop-exit> yes, RV is definitely still wet behind the ears

09:22 <tp> my GCD benchmark is the same for cortex-m3 at 75 mhz and risc-v at 104 MHz

09:22 <tp> which surprised me

09:22 <tp> by the same token, at 75 Mhz, cortex-m3 is 3.5x faster than cortex-m0 at the same speed with the same benchmark

09:23 <tp> I'm not one for benchmarks anyway as I have all the speed I'll ever need with a m0 at 8 MHz

09:23 <rdrop-exit> I would assume your ARM optimizer is more mature than your RV one

09:23 <rdrop-exit> or is this a pure assembly benchmark?

09:23 <tp> thats what Im assuming also

09:24 <tp> no, theyre all Forth benchmarks

09:24 <tp> mecrisp-quintus is fairly new compared to the arm versions

09:24 <rdrop-exit> makes sense

09:24 <tp> the arm versions are years old in fact

09:25 <tp> the dodgy risc-v doc isnt helping either

09:26 <tp> in comparison the STM/ARM documentation is a masterpiece of comprehensive and easy to read tech info

09:26 <X-Scale> If you want to get a feel of the earlier ARM and Thumb, check this precious little doc from 1996

09:26 <X-Scale> http://www.home.marutan.net/arcemdocs/ARM-ARM-RevB.pdf

09:26 <rdrop-exit> I've only read the standards for RV, not the docs of a particular implementation.

09:27 <rdrop-exit> I found the standards to be fairly legible as far as such things go.

09:27 <tp> I'm now forming the opinion that the Chinese Gigadevice company who make the GD32VF103 have copied a lot of the ARM doc

09:27 <tp> and they have been quite slack about it

09:28 <rdrop-exit> For peripherals that's not surprising

09:29 <tp> for instance the GD32VF103 (risc-v) doc just copies stuff from the GD32F103 (cortex-m3) doc verbatim in some cases, yet they are utterly different ISA

09:29 <tp> thats what I assumed at first also

09:29 <tp> I mean the GD32F103 is STM32F103 'compatible'

09:29 <tp> both are M3 cores

09:30 <tp> the GD32VF103 is a risc-v core

09:30 <tp> they even have the same pinouts

09:30 <tp> thanks X-Scale !

09:31 <tp> I think all this copying will set GD back years because of all the confusion

09:31 <tp> customers will just tire of all the wrong info and wasted time

09:32 <rdrop-exit> It may be an RV core, but I imagine they're trying to ease adoption for current ARM users by making the rest as close as possible to what you'd expect from existing ARM based products

09:32 <tp> i dont think thats the reason

09:32 <rdrop-exit> (by they, I mean Gigadevices)

09:33 <rdrop-exit> cheaper for them too, if they make them as close as they can

09:33 <tp> I think GD 'cloned' the STM32F103 peripherals so they could make immediate profits

09:33 <tp> i mean if you clone a chip, mark it as the chip that you cloned, you can sell it immediately into established markets

09:34 <tp> if you make your own version with your own part numbering the profits will take years longer to be realised

09:34 <tp> as the market adopts it or not etc

09:35 <tp> sadly I think GD had no Zen masters on the board because they didnt seem to realise that everything has two sides

09:36 <tp> quick profits have resulted in a market that has little trust of GD

09:36 <rdrop-exit> sure, that's the case with their ARM clone, but the RV is a different CPU

09:36 <tp> thats right ... except ....

09:36 <rdrop-exit> drum roll

09:36 <tp> again to realise quick profits, GD used the same 'compatible' peripherals they used in their STM32F103 fakes

09:37 <tp> I agree the GD32VF103 is a huge improvement, a chance for GD to go 'legit'

09:39 <tp> my new stm32f103 diagnostics binary found a GD32F103 in a Chinese 'blue pill' board recently

09:39 <rdrop-exit> neat

09:39 <tp> the chip has poor quality markings but they are of a STM32F103

09:39 <tp> so a blatant fake

09:40 <tp> someone is relabelling GD32F103's in China as STM32F103's

09:41 <tp> and putting them in 'blue pill' boards, but the Western buyers are now wise to this stuff

09:41 <rdrop-exit> hopefully the RV market will mature, so that we see more than just clones of ARM-based products with the CPU switched out

09:42 <tp> yeah, it's time for Chinese boards with their own Chinese MCUs, no fakes

09:42 <tp> their prices are good, the chips seem fine

09:42 <tp> the doc sucks, but that can improve

09:44 xek__ has joined #forth

09:44 <tp> I also found out that Russian interests have licensed some ARM cores

09:45 <tp> theyre making low volume variants legally, mainly rad hardened versions!

09:45 <tp> so silicon on sapphire I guess

09:46 gravicappa has quit [Ping timeout: 265 seconds]

09:46 <rdrop-exit> gotta walk the dogs, catch you later :)

09:46 rdrop-exit has quit [Quit: Lost terminal]

09:46 <tp> cya, thanks for the chat

09:47 gravicappa has joined #forth

10:56 iyzsong has joined #forth

13:11 iyzsong has quit [Quit: ZNC 1.7.1 - https://znc.in]

13:25 dddddd has joined #forth

13:43 Kumool has quit [Ping timeout: 240 seconds]

14:08 Kumool has joined #forth

15:05 <tabemann> tp: I got inlining working

15:15 jsoft has quit [Ping timeout: 258 seconds]

15:32 tabemann has quit [Ping timeout: 240 seconds]

15:47 tpbsd has joined #forth

15:47 tpbsd has quit [Changing host]

15:47 tpbsd has joined #forth

15:47 tp has quit [Read error: Connection reset by peer]

16:02 tp___ has joined #forth

16:02 tp___ has quit [Changing host]

16:02 tp___ has joined #forth

16:02 tpbsd has quit [Read error: Connection reset by peer]

16:06 tp___ has quit [Read error: Connection reset by peer]

16:06 tpbsd has joined #forth

16:34 tp___ has joined #forth

16:34 tp___ has quit [Changing host]

16:34 tp___ has joined #forth

16:34 tpbsd has quit [Remote host closed the connection]

16:39 tp___ has quit [Ping timeout: 240 seconds]

16:39 tp has joined #forth

16:39 tp has quit [Changing host]

16:39 tp has joined #forth

16:41 jpsamaroo has quit [Ping timeout: 260 seconds]

16:41 tp has quit [Read error: Connection reset by peer]

16:41 tp has joined #forth

16:41 tp has quit [Changing host]

16:41 tp has joined #forth

16:45 tpbsd has joined #forth

16:45 tpbsd has quit [Changing host]

16:45 tpbsd has joined #forth

16:45 tp has quit [Read error: Connection reset by peer]

16:59 tpbsd has quit [Remote host closed the connection]

17:00 tpbsd has joined #forth

17:00 tpbsd has quit [Changing host]

17:00 tpbsd has joined #forth

18:03 dave0 has quit [Quit: dave's not here]

18:26 dys has quit [Ping timeout: 248 seconds]

19:39 WickedShell has joined #forth

19:44 gravicappa has quit [Ping timeout: 258 seconds]

20:32 nmz has joined #forth

20:32 [2]MrMobius has joined #forth

20:33 MrMobius has quit [Ping timeout: 258 seconds]

20:49 jpsamaroo has joined #forth

20:56 MrMobius has joined #forth

20:58 [1]MrMobius has joined #forth

20:58 [2]MrMobius has quit [Ping timeout: 258 seconds]

21:01 MrMobius has quit [Ping timeout: 255 seconds]

21:01 [1]MrMobius is now known as MrMobius

21:24 [1]MrMobius has joined #forth

21:26 MrMobius has quit [Ping timeout: 255 seconds]

21:26 [1]MrMobius is now known as MrMobius

21:47 [1]MrMobius has joined #forth

21:48 MrMobius has quit [Ping timeout: 240 seconds]

21:48 [1]MrMobius is now known as MrMobius

22:03 _whitelogger has joined #forth

22:03 xek_ has joined #forth

22:05 xek__ has quit [Ping timeout: 240 seconds]

22:46 xek__ has joined #forth

22:49 xek_ has quit [Ping timeout: 258 seconds]

23:11 <veltas> SRC DST LD or DST SRC LD ?

23:11 <veltas> Replace 'LD' with 'MOV' if you want

23:42 rdrop-exit has joined #forth

23:46 <rdrop-exit> veltas, parameter order is the same whether prefix or postfix is used

23:48 <veltas> Thanks

23:49 <rdrop-exit> np

23:50 <veltas> Writing Z80 assembly words at the moment

23:50 <veltas> as an exercise

23:50 <rdrop-exit> cool

23:57 <rdrop-exit> it's not uncommon for Forth assemblers to use a comma at the end of an opcode's name, e.g.

23:58 <rdrop-exit> bx cx mov,
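
A toy illustration of the postfix convention: operands are collected first, and a token ending in a comma is the opcode that consumes them, keeping the same operand order as the prefix form. The `assemble` function is hypothetical and emits prefix-style text rather than machine code.

```python
def assemble(source):
    """Turn postfix lines like 'bx cx mov,' into prefix text (toy sketch)."""
    stack, out = [], []
    for token in source.split():
        if token.endswith(","):
            # Opcode word: consume the pending operands in the order given.
            mnemonic = token[:-1]
            out.append(f"{mnemonic} {', '.join(stack)}")
            stack.clear()
        else:
            # Operand: just remember it until the opcode arrives.
            stack.append(token)
    return out


print(assemble("bx cx mov,"))
```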