#nmigen on 2020-05-23 — irc logs at freenode.irclog.whitequark.org

2020-01-27 18:31 ChanServ changed the topic of #nmigen to: nMigen hardware description language · code at https://github.com/nmigen · logs at https://freenode.irclog.whitequark.org/nmigen

03:07 Degi has quit [Ping timeout: 272 seconds]

03:10 Degi has joined #nmigen

06:36 <_whitenotifier-c> [nmigen/nmigen-yosys] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/JfaLa

06:36 <_whitenotifier-c> [nmigen/nmigen-yosys] whitequark 03c80dc - Add ccache support for parity with yosys-pypi.

08:49 <kbeckmann> whitequark: Are you planning on supporting xdr>2 where a second clock is needed? I tried adding support for ODDRX2F for the ECP5 but had to change a bit in io.py to add the second clock/eclk and it gets a bit ECP5-specific.

09:00 <whitequark> kbeckmann: yes, but this will require some thought

09:01 <whitequark> actually, if you're acutely interested in the topic, I have a suggestion for you

09:01 <whitequark> right now the Pin() object doesn't state what the phase of DDR output is, compared with the clock

09:01 <whitequark> we should expose that information

09:01 <whitequark> in fact, we should at least collect it, to begin with

09:11 <kbeckmann> oh I see, yeah that would be useful.

09:17 <whitequark> if you collect it for the supported platforms (ideally with hw verification) then we could expose it through some sort of attribute

09:18 <whitequark> I have all of our supported hardware platforms on hand so we can check it there

09:18 <whitequark> with the exception of, I think, ultrascale, but that's still easy to arrange

09:23 <kbeckmann> oh that's great. I'll implement it for the ECP5 first since that's what I have personally.

09:23 Guest30583 has joined #nmigen

09:25 <whitequark> cool!

09:46 Asu has joined #nmigen

13:41 Guest30583 has quit [Quit: Nettalk6 - www.ntalk.de]

15:32 chipmuenk has joined #nmigen

17:50 electronic_eel has quit [Ping timeout: 258 seconds]

17:52 electronic_eel has joined #nmigen

19:07 Asuu has joined #nmigen

19:08 Asu has quit [Ping timeout: 264 seconds]

20:07 <smkz> if i want to use the nmigen simulator and extract signal values from the simulated logic (for processing / evaluation in python, not for making a .vcd); the right way to do so is by including code in the function fed to add_sync_process, right?

20:09 <whitequark> correct

20:10 <smkz> perfect

20:10 <smkz> thank you

20:10 <whitequark> in case you want to analyze simulation traces passively (as opposed to interacting with the logic or checking it as it goes) there is also https://github.com/nmigen/nmigen/issues/327

20:10 <whitequark> it's not currently implemented but it wouldn't be hard to add that at all

20:11 * smkz nods

20:11 <smkz> also is there a way to introspect into a module's internal Signals (rather than just the signals which are self.whatever defined in the __init__) from a process?

20:13 <whitequark> unfortunately not since they live inside the closure and so are hidden from any normal introspection

20:13 <whitequark> there ought to be a good way to do this but currently isn't

20:13 <whitequark> I would for now recommend exposing them through attributes but it is inelegant and we'll improve it

20:14 * smkz nods

20:15 <smkz> (thanks for answering my nmigen questions, i appreciate it a lot (as well as the work you do on nmigen))

20:17 <whitequark> no problem! glad i can help

20:20 <smkz> oh other question; is there a way to access the contents of a Memory from the process function? (ideally to grab all its state at once)

20:21 <whitequark> at once no, but you can do `yield mem[0]` where 0 is any index

20:21 <smkz> ahhh

20:21 <smkz> perfect ^^;

20:21 <whitequark> I'll keep the desire to be able to grab the entire thing in mind

20:22 <whitequark> right now it's using a highly inefficient approach (https://github.com/nmigen/nmigen/issues/359) but with the experience from cxxrtl I know how to improve it significantly

20:23 <whitequark> it could even then use a dense array rather than a list of ints

20:24 <smkz> cxxrtl is the thing where you convert the nmigen design into a c++ file for simulation that way?

20:24 <whitequark> any design yosys can parse, actually!

20:24 <whitequark> you can cosimulate nmigen code, verilog code and vhdl code with it

20:24 <whitequark> (synthesizable verilog and vhdl)

20:26 <smkz> neat!!

20:27 <smkz> will there be a way to get the results from the cxxrtl simulation for processing in python?

20:27 <whitequark> yep, that's actually what i'm working on right now

20:27 <whitequark> well

20:28 <whitequark> technically right now i am fixing bugs in the wasm toolchain but that's just three yak shaving levels deep into that task

20:28 <whitequark> see, using cxxrtl would require yosys from master, and it's a pain to require everyone to install it

20:28 <whitequark> so i thought i'd ship yosys in pypi

20:28 <whitequark> using cxxrtl from python*

20:28 <whitequark> though i guess no version of cxxrtl is released yet so it's true in general too

20:29 <awygle> lol poor wq is incapable of ignoring a wooly yak

20:29 <whitequark> awygle: look. at least today i was fixing bugs in wasi-libc which at least tangentially relate to the task at hadn

20:29 <whitequark> two days ago i was learning about windows x32 seh

20:29 <awygle> :)

20:30 <whitequark> four days ago i was looking into 32 bit x86 SIMD instructions

20:30 <whitequark> why 32 bit? because the only 64 bit shift 32-bit x86 has is a SIMD shift

20:30 <awygle> mhm

20:30 <awygle> 32-bit simd sucks even more than regular simd

20:30 <awygle> because you can't even assume SSE2

20:31 <whitequark> well

20:31 <whitequark> that's why cranelift's "i686" backend currently just crashes if you don't enable SSE4.1

20:31 <awygle> <_<

20:32 <whitequark> fun fact

20:32 <whitequark> psllq/psrlq are SSE2

20:32 <whitequark> but pinstr/pextr are SSE4.1

20:32 <whitequark> why the fuck?..

20:32 <awygle> yep

20:32 <whitequark> no but i mean why

20:32 <awygle> simd is a nightmare hellscape universally

20:32 <awygle> even Neon is bad

20:32 <whitequark> did they make it bad on purpose

20:33 <whitequark> did they seriously not consider people might want to extract elements from lanes

20:33 <awygle> which is a shame because simd is one of the coolest things

20:33 <whitequark> i never seriously looked into simd and tbh leaning towards keeping it that way

20:33 <whitequark> someone else with more tolerance for pain can do it, i'm sure

20:33 <ZirconiumX> "this time we'll do it right" - people who did not do it right

20:33 <awygle> i resolved the tension by just refusing to deal with anything that doesn't support AVX2

20:35 <whitequark> i know avx as "that thing i disable to get higher turbo frequency"

20:35 <whitequark> and measurable improvement in benchmarks (benchmark: 30 minutes of vivado junk) too

20:36 <awygle> AVX is horrifyingly non-orthogonal, but AVX2 fills in most of the gaps

20:36 <awygle> even if you only use it on 128-bit registers

20:36 <awygle> and therefore don't kill your turbo perf

20:36 <whitequark> hm

20:36 <whitequark> ok that's cool

20:36 <whitequark> who decided that glibc uses avx256 for memcpy?

20:36 <awygle> *shrug* RMS?

20:38 <awygle> what were you doing this benchmarking on? i thought Skylake was supposed to pay a lot less for AVX2

20:39 <whitequark> uh, i don't recall exactly

20:39 <whitequark> it's been two years or so

20:39 <awygle> mk

20:40 <awygle> Ryzen only just got 256-bit execution units so on Zen <2 you get really bad perf with 256-bit ops, no reason to use them, but AVX2 support is still useful for the 128-bit stuff

20:49 <sorear> I get the impression that there are a close to infinite number of simd instructions somebody wants

20:49 <sorear> and the glibc maintainers (rms not involved) can't turn down a new memcpy implementation that's x% faster on memcpy-only microbenchmarks

21:00 <awygle> do you _have_ to link to a libc to write a program for linux?

21:04 * awygle shakes away the terrible ideas brewing, gets back to work

21:07 <whitequark> awygle: nope!

21:07 <whitequark> now you're thinking with, uh, golang

21:07 <awygle> oh really? i didn't know go didn't use libc

21:07 <whitequark> they tried doing this on macos too and it broke *wonderfully*

21:07 <whitequark> absolute carnage

21:07 <awygle> that's actually kind of cool, it may be the only cool thing i've ever heard about go

21:08 <whitequark> so they had to choke on their pride and dynamically link libc on macos and windows after all

21:08 <awygle> yeah macos and the bsds are different

21:08 <awygle> unlike linux they don't consider the kernel the operating system

21:08 <whitequark> heh

21:08 <awygle> ... you know this why am i explaining it to you.

21:09 <whitequark> fun fact

21:09 <whitequark> you can make a 64-bit windows binary that does a far call into a 32-bit linux segment and then runs linux syscalls from it

21:09 <awygle> that's..... exactly what i want to do

21:09 <whitequark> in case you ever want me to install a rootkit that's probably how

21:09 <awygle> actually

21:09 <whitequark> what the actual fuck

21:09 <awygle> except for the 32-bit linux part

21:09 <awygle> 64-bit is fine

21:11 <whitequark> ok i mean it still bothers me substantially

21:11 <whitequark> what are you *doing*

21:11 <awygle> for posterity, here is my terrible idea - i want to write an N64 emulator, using mmap to emulate the TLB, which means i can't do it on Windows because of the 64kB segment size for MapViewOfFile. but in WSL 2, i can use mmap as god intended, but i need a way to make that seamless to the user. so my plan would be to run a WSL 2 process which does all the backend shit and do IPC to Windows for the GUI so the GUI can be native and i don't need X11.

21:11 <awygle> but i don't want to fuck around with distros or statically link musl, so.... raw syscalls, no libc, profit!

21:11 <whitequark> er

21:11 <whitequark> that doesn't apply to wsl. that applies to wine

21:11 <awygle> i figured lol. bummer

21:11 <whitequark> wsl (both 1 and 2) are lightweight virtualizations

21:12 <whitequark> i think the windows syscalls aren't even mapped inside the container

21:12 <awygle> 2 is significanly more virtual, is my understanding. 1 was more like Solaris LX branded Zones, where they wrote a syscall emulation table on top of NT. but 2 is just... a VM. at least this is my understanding.

21:12 <whitequark> not emulation

21:13 <whitequark> they added a new NT subsystem

21:13 <whitequark> it was basically like Interix

21:13 <awygle> right, ok

21:13 <whitequark> 2 is basically like CoLinux

21:13 <whitequark> in neither of these cases you could possibly use the GUI

21:13 <whitequark> in fact, you can't even use win32k from win32 console subsystem

21:13 <awygle> the thing you refer to with "basically like" should generally be _less_ obscure than the original thing :p

21:13 <whitequark> lol

21:14 <awygle> yeah hence the IPC bit. you can do an AF_UNIX socket from WSL 2 to Windows

21:14 <whitequark> oh hmm

21:14 <awygle> performance? :shrug: who knows

21:15 <whitequark> just map the same file from inside and outside of wsl

21:15 <whitequark> and use the socket as a mailbox

21:15 <awygle> i wonder if you can pass a shm between the two....

21:15 <whitequark> not sure, but i don't see why not

21:16 <awygle> really i wonder if you can map a shm into a windows process' space

21:16 <awygle> idk how to do that

21:17 <awygle> maybe MapViewOfFile would be good enough idk

21:17 <awygle> i'd have to test it

21:17 <awygle> and uh... .there are Many More Important Things i should be doing instead

21:17 <whitequark> haah

21:17 <whitequark> also

21:17 <awygle> (cr1901_modern do you have Cursed Knowledge here?)

21:18 <whitequark> is MapViewOfFile the reason wasm has a 64k page size?

21:18 <awygle> oh shit probably

21:18 <whitequark> I was wondering

21:18 <whitequark> I think it's *originally* related to NT on DEC Alpha?

21:19 <awygle> yeah something like that, there's a raymond chen blog post but i can't find it right now

21:19 <cr1901_modern> awygle: wasm's 64kB page size is the reason I haven't looked much further into using it as a VM for cursed vintage shit

21:19 <awygle> mm

21:19 <cr1901_modern> So no, I wouldn't have much to say about it

21:20 <whitequark> cr1901_modern: nah that's not really an issue afaik

21:20 <TD-Linux> awygle, fwiw intel *to this day* ships CPUs that don't support AVX2, so you can't generally make that assumption

21:20 <whitequark> people use wasm on microcontrollers that don't even have 64k of memory

21:20 <awygle> arright well time to pretend i'm useful and poke around LA design spaces for nmigen

21:21 <cr1901_modern> Hmmm, I do recall seeing that. But I was under the impression said applications don't allocate at all. Which I guess is fine for anything I want to do

21:21 <awygle> TD-Linux: yeah but nothing where i'd care about the performance gainst of AVX2/manual SIMD would run well on those anyway.

21:21 <whitequark> cr1901_modern: ahh possible

21:21 <whitequark> i mean... isn't one wasm page exactly one COM segment?

21:21 <whitequark> that actually seems pretty reasonable to me

21:21 <awygle> 4k is too small for pages :p

21:21 <cr1901_modern> yes lmao. But I wanted to try it on 65xx, and other friends w/ 16-bit addr space

21:22 <whitequark> oh yeah no that's not happening

21:22 <whitequark> you can't even target C to them

21:23 <awygle> aw bummer, you can't do AF_UNIX/SOCK_DGRAM on windows. or at least you couldn't when this article was written, maybe it's different now

21:23 <TD-Linux> awygle, yeah maybe. I mean they are modern 4ghz comet lake CPUs, so e.g. for dav1d we have to worry about them

21:23 <awygle> fair

21:23 <awygle> that's like, general utility software, whereas this is an emulator

21:23 <awygle> so i'm not too sad if i cut off some potential users

21:24 <awygle> i'd have different priorities for something like dav1d

21:24 <cr1901_modern> whitequark: Indeed. Although fun fact: I've been reading about TinyBASIC, one of the earlier BASICs, and a... well, Tiny one. Was developed in the mid 70s. >>

21:24 <awygle> my kingdom for a reliable packet-oriented interface :/

21:24 <cr1901_modern> The TinyBASIC interpreter is actually implemented as a VM. You're expected to implement the VM on your target to get a "free" TinyBASIC interpreter.

21:25 <cr1901_modern> (stack machine VM, like WASM)

21:25 <cr1901_modern> so the idea certainly goes back that far :P

21:25 <TD-Linux> pascal and pcode easily has that beat

21:25 <whitequark> cr1901_modern: wasm is uhhhh, you could call it a stack machine, sure

21:26 <whitequark> my understanding is that the two good reasons wasm is a stack machine is because it makes formal semantics tractable, and because the encoding is compact

21:26 <whitequark> but it's not a stack machine in the way forth or jvm are

21:27 <whitequark> cr1901_modern: http://troubles.md/posts/wasm-is-not-a-stack-machine/

21:27 <whitequark> here

21:27 <whitequark> that says it better than i can phrase right now

21:27 <cr1901_modern> Ahhh hrmmm

21:27 <whitequark> >This essentially makes WebAssembly a register machine without liveness analysis, but not only that, it’s a register machine that isn’t even in SSA form - both of the tools at our disposal to do optimisation are unavailable.

21:28 * cr1901_modern spends some time reading

21:29 <cr1901_modern> TD-Linux: I'm sure p-code is cool too, but I've not studied it (nor pascal) much.

21:29 <cr1901_modern> I went on a TinyBASIC diversion about a week ago b/c I read that even BASIC had trouble fitting into some home computers in the 70s. Which surprised me.

21:30 <cr1901_modern> And TinyBASIC was a compromise- trade speed to fit into small places via a carefully-designed stack machine VM

21:31 <awygle> whitequark: the difference between `lookup` and `request` is that lookup can be called multiple times, right? when is `lookup` intended to be used?

21:33 <whitequark> basically never, it's just there in case you somehow need it

21:34 <whitequark> and because it's needed internally

21:34 <sorear> whitequark: some ppc/mips/arm hardware strongly wants 64kb pages I think?

21:34 <awygle> ok

21:35 <whitequark> mhm

21:36 Cynthia has joined #nmigen

21:43 <awygle> so something like FFSynchronizer takes a string for the domain, and is a module, and creates the domain in its module object, and then after everything's elaborated the domains of the modules and submodules all the way up to the root are unified? is that approximately correct?

21:45 <whitequark> almost

21:46 <whitequark> FFSynchronizer doesn't create the domain

21:46 <whitequark> `m.d.foo += ...` doesn't *create* foo

21:46 <whitequark> it's like referencing an external symbol in C

21:47 <awygle> ah. so somebody somewhere is doing `m.domains += ClockDomain("foo")`

21:47 <awygle> and if not it'll be an error

21:47 <whitequark> yes.

21:47 <whitequark> the manual i'm not currently writing will explain that in painstaking detail

21:47 <awygle> and that somebody has to be above the FFSynchronizer in the hierarchy? or no

21:47 <whitequark> anywhere

21:47 <awygle> hm ok

21:47 <whitequark> it's how domains work in migen

21:48 <whitequark> fairly reasonable overall, though lack of local domains in migen hurts it significantly

21:57 <awygle> you are in a maze of twisty little _ModuleBuilder*s, all slightly different

21:59 <awygle> oh god you tweeted the dumb thing i said and now i have 20 notifications lol

21:59 <awygle> (to be clear this is fine it's just amusing)

22:03 thinknok has joined #nmigen

22:10 chipmuenk has quit [Quit: chipmuenk]

22:45 Asuu has quit [Quit: Konversation terminated!]

23:37 thinknok has quit [Ping timeout: 246 seconds]

23:53 <smkz> i have additional question; if i have a memory i want to introspect into during simulation; i know how to do so if the memory is defined directly in the module i'm feeding to the simulator,

23:53 <smkz> but in this case the memory is defined inside a submodule of a submodule of the module i'm simulating

23:53 <smkz> how would i thread it through / how would i access it?

23:54 <smkz> i've tried a few things like doing "self.memory_introspection_port = my_memory" inside the elaborate() function where it's defined

23:54 <smkz> but that doesn't seem to create something that can be accessed by the module that has that module as its submodule ;;