Degi has quit [Ping timeout: 255 seconds]
Degi has joined ##openfpga
X-Scale` has joined ##openfpga
X-Scale has quit [Ping timeout: 260 seconds]
X-Scale` is now known as X-Scale
Lord_Nightmare has joined ##openfpga
X-Scale` has joined ##openfpga
X-Scale has quit [Ping timeout: 260 seconds]
X-Scale` is now known as X-Scale
Bike has joined ##openfpga
zng has quit [Quit: ZNC 1.7.2 - https://znc.in]
zng has joined ##openfpga
rohitksingh has joined ##openfpga
emeb has left ##openfpga [##openfpga]
lopsided98 has quit [Remote host closed the connection]
ym has quit [Remote host closed the connection]
lopsided98 has joined ##openfpga
emeb_mac has joined ##openfpga
genii has quit [Quit: Morning comes early.... GO LEAFS GO!]
rohitksingh has quit [Ping timeout: 268 seconds]
rohitksingh has joined ##openfpga
Maylay has quit [Ping timeout: 240 seconds]
Degi has quit [Ping timeout: 256 seconds]
Degi has joined ##openfpga
Maylay has joined ##openfpga
Bike has quit [Quit: Lost terminal]
genii has joined ##openfpga
genii has quit [Quit: Morning comes early.... GO LEAFS GO!]
____ has joined ##openfpga
_whitelogger has joined ##openfpga
emeb_mac has quit [Quit: Leaving.]
OmniMancer has joined ##openfpga
<____> Is anyone familiar with a 16-bit hyperbus? Gowin uses one for the internal connection to whatever that on-chip thing they call PSRAM is.
<____> Afaik the Cypress spec talks about 8 bits only.
<tnt> Doesn't the gowin doc have info ?
<tnt> Also, does that mean the connections to it have io buffers / io ffs and you need to do the normal clock-to-out / input setup-hold time analysis ?
<____> Gowin doc is pretty limited, and is mostly about their custom hyperbus core interface, not about the raw interface. Or maybe I missed something.
<____> The PSRAM is not a primitive. It is magically connected somewhere at the synthesis stage, if you give the corresponding top level ports the corresponding magical names.
<tnt> Looking at the doc, it really just looks like you have a classic PSRAM chip connected to some IO of the FPGAs.
<tnt> instead of being broken out to pads, they go to another die in the package.
<____> So, i guess that constraint-wise, the PSRAM ports are treated the same way as any other port.
<tnt> But the doc I'm reading shows the data width as 8 bits, not 16.
<____> For my specific part it is specified that the PSRAM width is 8 bit, but the DQ is 16 bit.
<____> Gowin's core user interface uses 64-bit ports and a minimum 16-byte burst. I suppose there is some muxing magic involved.
<____> Well, I guess this would require some hacking with internal LA.
<tnt> Well if psram is x8 and dq is x16 you just have two psram dies in parallel.
<tnt> so you'd have two rwds as well.
<tnt> and CS / CK / CK_n are shared.
<tnt> Actually looking at table 5-2 of IP UG525, it's two completely independent psram dies ...
<____> Wow, that actually seems true: every one of them is doubled, even CK and CS.
<____> Thanks
<tnt> I'm kind of curious why there is a differentiation between PSRAM and HyperRAM in that document.
<tnt> (like in Table 2-3)
<tnt> of DS861
<____> The interface is hyperbus for both of them, and IPUG525 says that there's no difference. Maybe it's about marketing?
<tnt> yup maybe. I guess maybe the psram is some custom silicon while hyperram is "official". It might not support the same configuration register options.
emily has quit [Quit: killed]
eddyb has quit [Quit: killed]
promach3 has quit [Quit: killed]
swedishhat[m] has quit [Quit: killed]
jfng has quit [Quit: killed]
indefini[m] has quit [Quit: killed]
nrossi has quit [Quit: killed]
henriknj has quit [Quit: killed]
john_k[m] has quit [Quit: killed]
omnitechnomancer has quit [Quit: killed]
scream has quit [Quit: killed]
xobs has quit [Quit: killed]
indefini[m] has joined ##openfpga
<azonenberg> fffuuuuuu i just spent the last 4 hours chasing a bug caused by copying code from another project and not patching up one net name
<azonenberg> Which led to my I2C IP not having a clock
<azonenberg> Aaaaand the ONE file in the entire project without `default_nettype none was, you guessed it, the top level
<tnt> that sounds way too familiar
<azonenberg> My coding style requires it but i don't automatically enforce it
<azonenberg> i really need to find a standardized system for creating new projects that avoids some of these issues
<tnt> Can yosys default be changed ?
<azonenberg> I'm actually using vivado for this, and i'm not sure about the yosys default
<tnt> What annoys me is that this is not per-file and so when I use lattice or xilinx IPs, it often breaks :/
<azonenberg> i have thought about using the yosys parser to write a linter that enforces my style guidelines though
<azonenberg> Yes. My general solution to this is simple, don't use third party IP :P
<azonenberg> the only xilinx IP i use with any degree of regularity is the ILA, and the design i had this problem on is actually a testbench for the latest version of my own ILA
<tnt> heh, sure but not always an option ... (I often do changes / additions to existing projects, so rewriting the whole thing is not viable :p)
<azonenberg> with a view towards eventually discontinuing use of the vivado ILA permanently
<azonenberg> after all, not much sense having code buildable with f/oss tools if you depend on non-free blobs
<azonenberg> that can't be compiled except with vivado
<tnt> I end up having the default_nettype wrapped in `ifdef and then I use iverilog as a syntax checker.
<azonenberg> well i want to do more
<q3k> same, iverilog for linting
<azonenberg> i want to do things like alerting on a flag which is set to 1 in a state machine but never cleared to 0
<azonenberg> lack of default_nettype none
<q3k> azonenberg | i want to do things like alerting on a flag which is set to 1 in a state machine but never cleared to 0
<q3k> that will require some level of formal verification in order to be done well
<q3k> probably
<azonenberg> initially it would be quite simple
<q3k> or you can just detect when a line gets turned into a constant driver
<azonenberg> within one always block, if you see assignments to 1'b1 only
<azonenberg> and never to any other value
<azonenberg> that's a warning
<azonenberg> because you probably intended this to be a single cycle pulse and forgot to add the default-zero
<azonenberg> (this has bit me a lot)
<azonenberg> it should be quite easy to do at the AST level, harder on synthesized logic
<q3k> doing it at AST level sounds like just tons of false positives (ie. code that's broken that passes your simplistic checks)
<azonenberg> that's false negatives
<azonenberg> false positives is warnings for logic that doesn't meet the filter
<q3k> depends how you look at it, that's why i added the (explanation)
<azonenberg> or logic that meets the filter but isn't broken
<azonenberg> my general rule is that a linter/warning tool needs to have an extremely low false positive rate to be useful, or people ignore the spam
<q3k> it sounds like you're trying to write a 'go vet' but for verilog
<q3k> ie. something a bit smarter than a linter
<azonenberg> while false negatives are bad, halting problem says you can't catch all bugs
<azonenberg> any bugs i catch are better than none
<azonenberg> other rules i want to enforce: no mixing <= and = in one always block
<q3k> which is interesting, because both go and verilog are braindead languages, and you write tools around them to discover bugs, instead of making the language less braindead
<azonenberg> no latches in always_comb blocks
<q3k> so that's a quaint parallel.
<azonenberg> no use of numbered module ports or synthesis constraints in comments
<azonenberg> in FPGA mode: mandatory initial value for all registers
<azonenberg> in ASIC mode: use of "initial" is an error
<azonenberg> (this can lead to sim-synthesis mismatches with asic HDL if the simulator isn't explicitly instructed to ignore initial values)
<azonenberg> I might also ban the ?: operator. Certainly nested instances of it
<azonenberg> multiple drivers from different always blocks, use of # delays
<azonenberg> statically impossible conditionals/assignments due to width mismatch
<azonenberg> (post elaboration maybe)
<azonenberg> for example reg[3:0] foo; foo <= 32;
<azonenberg> but foo <= 8 should be legal despite the unsized 8 being expanded to 32'd8 per the LRM
<azonenberg> Systemverilog fixes a lot of the things in my original list
<azonenberg> but it does still have issues :p
ZipCPU has joined ##openfpga
henriknj has joined ##openfpga
eddyb has joined ##openfpga
jfng has joined ##openfpga
swedishhat[m] has joined ##openfpga
emily has joined ##openfpga
xobs has joined ##openfpga
john_k[m] has joined ##openfpga
promach3 has joined ##openfpga
omnitechnomancer has joined ##openfpga
nrossi has joined ##openfpga
scream has joined ##openfpga
rohitksingh has quit [Ping timeout: 240 seconds]
fjullien has quit [Ping timeout: 255 seconds]
____ has quit [Quit: Nettalk6 - www.ntalk.de]
ZipCPU has quit [Ping timeout: 265 seconds]
OmniMancer has quit [Quit: Leaving.]
genii has joined ##openfpga
<lambda> azonenberg: have you tried VHDL? it doesn't have most of those problems either ;)
<azonenberg> lambda: it also looks like ada and i find it pretty much unreadable
<azonenberg> i have lots of ideas for things i want in languages, which seem to not be what anyone else wants
<azonenberg> for example, rust looks like an awesome memory safe C replacement
<azonenberg> But I want "rust++"
<lambda> fair enough, it takes some getting used to. I'm just always glad for its strictness and somewhat decent type system whenever I hear these verilog horror stories
<azonenberg> i.e. full OO on a statically memory safe, non-GC'd bare-metal-friendly platform
<azonenberg> AFAIK this does not currently exist
<lambda> there's probably always just One More Thing™, no matter how many languages there are to be honest
<azonenberg> well the single big blocker to me moving ~95% of my code to rust is the lack of proper OO
<azonenberg> structs or whatever they're called don't count
<azonenberg> in particular, my code tends to make heavy use of base classes that provide common functionality which is occasionally overridden
<azonenberg> so full inheritance, not just interfaces
<azonenberg> and sometimes even multiple inheritance, which i use a lot in jtaghal
<azonenberg> OO just fits very naturally to a lot of hardware problems like making drivers for a peripheral
<azonenberg> as well as a lot of UI stuff
<lambda> true, maybe eventually something will come along and fill that gap
Zorix has quit [Ping timeout: 248 seconds]
<anuejn> azonenberg: there is the Deref pattern in rust, which gives you something quite similar to inheritance
<anuejn> but it is a rather evil hack
<anuejn> >We do intend to add a mechanism for inheritance similar to this to Rust, but it is likely to be some time before it reaches stable Rust
<anuejn> *says
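A minimal sketch of the Deref pattern anuejn mentions, with made-up Base/Derived names: the wrapper forwards to the embedded type via auto-deref, so the base type's methods look inherited, but it is method forwarding rather than real subtyping, which is part of why it's considered a hack.

    use std::ops::Deref;

    struct Base;

    impl Base {
        fn common(&self) -> &'static str {
            "shared behaviour provided by Base"
        }
    }

    // "Derived" embeds Base and forwards to it through Deref.
    struct Derived {
        base: Base,
    }

    impl Deref for Derived {
        type Target = Base;
        fn deref(&self) -> &Base {
            &self.base
        }
    }

    fn main() {
        let d = Derived { base: Base };
        // Auto-deref makes Base's methods callable on Derived, which looks
        // like inheritance, but Derived is not a Base to the type system.
        println!("{}", d.common());
    }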
Zorix has joined ##openfpga
tlwoerner is now known as tw-eh
tw-eh is now known as tlwoerner
<q3k> azonenberg: i mean, it's a different pattern. if you're applying java-style OO to rust you'll feel extremely constrained
<q3k> azonenberg: you can implement all these things, just thinking differently, in the system that rust gives you
<q3k> azonenberg: and generally, IMO, end up with easier to grok code
<q3k> azonenberg: (multiple inheritance is evil)
<q3k> i had a very similar issue when I was trying to port some of my old C++ code (which was similar to your C-with-classes C++) to rust
<q3k> just took a while to adjust
<azonenberg> q3k: i mean, you can also write full java style OO in C
<azonenberg> using get/set functions, virtual functions, etc. Doesn't make it a good idea, or the right tool for the job
<q3k> my point was more that if you let go of patterns from other languages you usually end up more productive
<azonenberg> And this is why i like C++, it doesnt really enforce much in the way of patterns/paradigms on you
<azonenberg> you can go full C imperative, you can go full java OO, you can do C-with-classes
<azonenberg> you can even do functional stuff up to a point
<q3k> i mean, that's still one single pattern on a spectrum
<q3k> rust doesn't enforce that much either
<azonenberg> its not things that rust enforces that i complain about, so much as the lack of syntax for various things
<azonenberg> Like inheritance
<q3k> that's not syntax, it's different semantics
<azonenberg> I also remember not being super happy with the way it handled object lifetimes
<azonenberg> i forget the specifics but it seemed like the default type for arguments etc was readonly reference or something and that led to lots of annoyance if you actually wanted to make a copy of something
<q3k> i think you just need to spend more time with the language
<azonenberg> oh actually no, i think it had to do with a pass-by-writable-reference turning into a transfer of ownership?
<q3k> it's a complex little thing and it just takes time
<q3k> you can definitely pass a mutable borrow without transferring ownership
<q3k> so i'm still not sure what you mean
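A tiny sketch of what q3k means by passing a mutable borrow without transferring ownership (the names are illustrative): the callee mutates through the &mut for the duration of the call, and the caller still owns the value afterwards.

    fn bump(counter: &mut u32) {
        // The callee can mutate through the borrow...
        *counter += 1;
    }

    fn main() {
        let mut counter = 0u32;
        bump(&mut counter); // ...but ownership never leaves main.
        bump(&mut counter);
        println!("{}", counter); // prints 2
    }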
<azonenberg> i havent actually used it much. I just remember spending most of my time fighting the tools to express what should have been simple concepts
<q3k> the borrow checker will piss you off at first but it's a tool for correctness that will find issues with your code
<q3k> same as a type checker will piss you off if you're not used to complex static typing
<azonenberg> my recollection was that the borrow checker was overly paranoid and would complain about things that were obviously safe and could be easily statically verified as safe
<q3k> also not sure when you last used rust, but some new lifetime elisions have been introduced that make more simple use cases easier to express without annotating everything
<azonenberg> That could have been part of it. It was a while ago
<q3k> generally that's the borrow checker's job, to be paranoid
<q3k> if you're sure you're right then use unsafe {}
<azonenberg> At which point i've lost the whole benefit of using a memory safe language :p
<q3k> i mean, pick one - memory safety or not
<q3k> you can't both complain about the borrow checker being paranoid and that you want memory safety
<azonenberg> Paranoid meaning it's shooting at shadows rather than finding actual problems
<q3k> i would need to see some concrete examples
<azonenberg> yeah i havent touched it in a while
<azonenberg> But IMO a well designed memory safe language stays out of your way and doesn't let you do anything stupid
<q3k> if you read TRPL it generally shows you how to do things in a way that appease the borrow checker
<azonenberg> But also doesn't force you to explicitly say everything you want
<q3k> so do you have any examples of such a language?
<azonenberg> No, i dont think it exists :p
<q3k> there might be a reason for that
<azonenberg> Lol
<q3k> i mean, rust is not perfect, i don't even like rust that much
<q3k> but that borrow checker is usually right
<azonenberg> My interpretation of paranoid was "no false negatives but a ton of false positives"
<azonenberg> if the false positive rate could be reduced a lot i'd be more happy
<azonenberg> One thing i recall not obviously seeing in rust was the ability to create a memory safe mapping for a region of absolute physical memory
<azonenberg> for example a memory mapped packet buffer for an ethernet IP
<q3k> i'm not well versed in embedded rust, but i'm sure you can find examples on how to do that in redox
<q3k> also a lot of times you can just 'cheat' by using Arc/Rc when dealing with lifetimes
<q3k> and that's generally fine in my book, reference counts are cheap
<emily> 16:57 <azonenberg> my recollection was that the borrow checker was overly paranoid and would complain about things that were obviously safe and could be easily statically verified as safe
<emily> i'm sure if you sent an easy hundred-line patch to make the compiler statically verify these patterns as safe it would be accepted (but in reality it's almost certainly not so easy)
<azonenberg> hmm i think it might have been the "exactly one mutable reference" rule i had trouble with?
<azonenberg> how are you supposed to have global state you can change from two places at once with proper synchronization?
rohitksingh has joined ##openfpga
<azonenberg> e.g. a fifo you can pop from one of two worker threads
<q3k> you wrap it in a Mutex
<q3k> or you use an existing FIFO construct that allows for two ends across different threads
<sensille> yeah, you really have to let go of the notion that you can do everything yourself in safe rust
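A minimal sketch of the Mutex-wrapped FIFO q3k and sensille describe, using a plain VecDeque (names are illustrative): Arc gives both workers shared ownership, the Arc/Rc "cheat" mentioned earlier, and the Mutex serializes the pops, which is what satisfies the borrow rules across threads.

    use std::collections::VecDeque;
    use std::sync::{Arc, Mutex};
    use std::thread;

    fn main() {
        // Shared FIFO: Arc for shared ownership, Mutex for exclusive access.
        let fifo = Arc::new(Mutex::new((0..10).collect::<VecDeque<u32>>()));

        let workers: Vec<_> = (0..2)
            .map(|id| {
                let fifo = Arc::clone(&fifo);
                thread::spawn(move || loop {
                    // Lock, pop one item, and release the lock at the end
                    // of the statement before doing any work on the item.
                    let item = fifo.lock().unwrap().pop_front();
                    match item {
                        Some(item) => println!("worker {} got {}", id, item),
                        None => break,
                    }
                })
            })
            .collect();

        for w in workers {
            w.join().unwrap();
        }
    }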
oter has quit [Quit: Textual IRC Client: www.textualapp.com]
Stary has quit [Quit: ZNC - http://znc.in]
Stary has joined ##openfpga
fjullien has joined ##openfpga
_franck_ has quit [Ping timeout: 272 seconds]
Lord_Nightmare has quit [Quit: ZNC - http://znc.in]
<azonenberg> q3k: yeah the fifo was just an example
<azonenberg> i was envisioning more complex data structures not in a library, like traversing some complex directed graph
<azonenberg> and possibly modifying state at various nodes
<q3k> i mean, you have to prove to the borrow checker that you can do this thread safely
<q3k> either by writing unsafe code that just tells it 'fuck off i know what i'm doing', or using existing code that does that
<q3k> with the simplest example being sync::RwLock for instance
<azonenberg> Also, this is a bit more living-on-the-edge
<azonenberg> but on most platforms you can do lock-free data structures by taking advantage of the fact that byte/word sized operations are inherently atomic
<azonenberg> Without a need to explicitly mutex
<azonenberg> A language that recognizes that e.g. incrementing a uint32 from two threads at once is safe would be nice
<q3k> rust also has this
<azonenberg> obviously this might be OK or not depending on the specific ISA, but you get the idea of the sort of low-level stuff that matters to me as an embedded guy
<q3k> via explicit atomics
<emily> rust is about "correct by construction", not "you do whatever and it tries its damnedest to prove that it's not broken yet"
<emily> because not broken yet can become broken in the future
<emily> but yes you can achieve the same result here just by declaring you explicitly want and depend on that property
<emily> rather than just partying on it and having it at most documented in a comment
<azonenberg> yeah my point is, i want to be able to do low-level performance stuff while also being sure it will work
<q3k> but rust lets you do that
<q3k> you just have to prove the compiler that it's safe
<azonenberg> But this is a bit of a moot point for now, because most of the important stuff i've done lately has been in HDL
<q3k> it's just you can't rely on UB like accessing u32s across threads
<q3k> you have to explicitly say 'i want an atomic u32'
<q3k> and your platform might as well implement it using a simple u32 that is accessed using normal load/store instructions
<q3k> bonus: your code is then actually portable
<azonenberg> Fair enough, my point is more that i would prefer such stuff to not *be* UB
<q3k> it's not in rust
<azonenberg> e.g. language semantics that incrementing an int always wraps mod 2^32
<azonenberg> re HDL, this allows a lot of correctness properties to be ensured just by the thing compiling, like only writing to a register in one always block
<azonenberg> or pointers not being used outside the bram that they're intended to go to
<azonenberg> and you can use formal for higher level properties
<q3k> you should take a look at bluespec now that it's open
<TD-Linux> in fact, AtomicU32 in Rust is defined to wrap at mod 2^32 :)
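A small sketch of the explicit-atomics point (plain std, nothing embedded-specific assumed): two threads increment a shared AtomicU32, and fetch_add is defined to wrap modulo 2^32 on overflow, which is the behaviour TD-Linux refers to.

    use std::sync::atomic::{AtomicU32, Ordering};
    use std::thread;

    // A shared counter with no mutex; fetch_add wraps modulo 2^32 on overflow.
    static COUNTER: AtomicU32 = AtomicU32::new(0);

    fn main() {
        let handles: Vec<_> = (0..2)
            .map(|_| {
                thread::spawn(|| {
                    for _ in 0..1_000 {
                        COUNTER.fetch_add(1, Ordering::Relaxed);
                    }
                })
            })
            .collect();

        for h in handles {
            h.join().unwrap();
        }
        println!("{}", COUNTER.load(Ordering::Relaxed)); // 2000
    }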
<azonenberg> Does rust have arbitrarily sized "int", btw, or did they kill that?
<azonenberg> that's one of the biggest complaints i have with C :p
<azonenberg> I'm thinking of enforcing a C++ coding style that only allows stdint numeric types
<q3k> there's a bigint crate
<azonenberg> i.e. you can never say int, you have to explicitly say int32
<azonenberg> no i mean, "integer of unknown platform dependent size"
<azonenberg> should IMO not be a type
<TD-Linux> azonenberg, in general yes, the normal types are like "i32" and "u32". the biggest exception is "usize"
<q3k> yes, isize and usize
<q3k> which are explicitly arch specific
<TD-Linux> equivalent to size_t
<azonenberg> well ok having a size be dependent on address length makes sense for pointers or array indexes
<azonenberg> But only for that purpose
<q3k> otherwise you have {u,i}{8,16,32,64,128}
<azonenberg> (and it's something you'd never use in serialization)
<TD-Linux> rust doesn't have structs that you can dd to disk and it makes me a bit sad :(
<azonenberg> yes exactly
<TD-Linux> btw there is ##bluespec but no activity on it in a while
<azonenberg> my ideal embedded language would statically enforce a bunch of safety properties but also let you take advantage of low level stuff
<azonenberg> for example having well defined bitfields and packed structs with explicit endianness and msb-lsb ordering
<azonenberg> basically allow a struct to be declared either as optimized for efficient access with arbitrary platform dependent alignment etc, or serializable
<azonenberg> in the latter case it would have well defined in memory representation
<azonenberg> i.e. all floats are ieee754, all multibyte fields big endian, all struct members consecutive with no padding, etc
<qu1j0t3> is there an Embedded Rust working group that azonenberg could join
<azonenberg> better yet, allow a struct to simply be declared and used, by default optimized for in memory use
<azonenberg> but have some sort of serialize property you could apply to it
<levi> Yes, but it's generally working at a higher level than the kinds of things being discussed right now.
<azonenberg> i.e. you could have Foo bar / Serializable Foo bar
<azonenberg> and cast between them, shuffling bits as needed
<azonenberg> But you could also do something like
<azonenberg> Serializable Foo bar [at 0x40030800]
<azonenberg> in order to memory map a SFR as a struct
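For comparison, a rough sketch of the serializable-struct half of that wish list in today's Rust (the PacketHeader layout is made up): #[repr(C, packed)] pins the field order with no padding, and the encoder spells out big-endian byte order field by field rather than reinterpreting the struct as raw bytes. The memory-mapped register half comes up again in the MMIO discussion further down.

    // Hypothetical wire-format header: #[repr(C, packed)] removes padding,
    // so the layout is exactly the declared fields back to back (8 bytes).
    #[repr(C, packed)]
    struct PacketHeader {
        magic: u32,
        length: u16,
        flags: u8,
        version: u8,
    }

    fn serialize(h: &PacketHeader) -> [u8; 8] {
        let mut out = [0u8; 8];
        // Explicit big-endian encoding, field by field; fields are read by
        // value, so no reference to an unaligned packed field is created.
        out[0..4].copy_from_slice(&h.magic.to_be_bytes());
        out[4..6].copy_from_slice(&h.length.to_be_bytes());
        out[6] = h.flags;
        out[7] = h.version;
        out
    }

    fn main() {
        let h = PacketHeader { magic: 0xDEAD_BEEF, length: 64, flags: 0x01, version: 2 };
        println!("{:02x?}", serialize(&h));
    }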
<levi> Rust is *usable* for embedded, but definitely wasn't designed as "embedded-first".
<azonenberg> Exactly
<azonenberg> I want an embedded-first OO language that provides as many safety properties as reasonably practical without compromising the core mission
<azonenberg> Not a safe language shoehorned into embedded, which is what rust looks like
Asu has joined ##openfpga
<azonenberg> bonus points if you can figure out how to implement ownership semantics in a way that's compatible with ping-pong DMA buffers etc
<azonenberg> so you can change ownership of a block of memory to hardware, then when you get an interrupt mark it as usable by the app again
<levi> I don't think it's really "shoehorned" any more than C or C++ are.
<azonenberg> But enforce bounds checking when the app is using it
<azonenberg> levi: yes, C/C++ are not ideal for embedded either
<azonenberg> e.g. UB around struct packing, bitfield ordering, and endianness when casting complex data types to a byte*
<azonenberg> Hence why i want a language that is explicitly designed to support memory mapped peripherals in a well defined fashion
<levi> The embedded working group has come up with some pretty interesting tools with that regard, but they're definitely not to everyone's liking.
<qu1j0t3> there's some prior art, since operating systems have been written in HLLs for about 70 years now
<qu1j0t3> well, 60+
<levi> PL/I, Ada, and BLISS had a lot of interesting ways to specify low-level details that no one really looks at when designing languages these days.
<azonenberg> qu1j0t3: yes, and the vast majority are C/C++ and just assume the UB
<qu1j0t3> no
<qu1j0t3> there have been many languages used. lots of prior art.
<qu1j0t3> C wasn't used until 1973-odd
<azonenberg> well ok i dont have a ton of exposure to REALLY old OSes
<qu1j0t3> right, but could make an interesting survey
<qu1j0t3> Per Brinch Hansen's "Classic Operating Systems" mentions some, but definitely not all
<azonenberg> my OS knowledge is windows, linux, uc/os, freertos, vxworks, and probably a few other more obscure ones
<levi> There was also some interesting research on bitdata types from the Haskell-OS community that no one has tried to do much with beyond experiments.
<azonenberg> There is of course a totally different tangent
<qu1j0t3> DEC had BLISS
<azonenberg> which is to design the hardware for safe access and not require all of this BS
<azonenberg> like, in Antikernel the memory manager is in hardware and operates at a page level
<levi> That was a common approach back in the 60s, actually.
<azonenberg> and enforces single-owner semantics per PID (no multithreading, one thread per process)
<azonenberg> this means that a peripheral can malloc a page on its own with no software interaction
<levi> Through the 70s-80s to some degree too, when everything was made with bit-slice ICs and microcode.
<azonenberg> DMA some data to it, then chown it to your app and send you a pointer
<azonenberg> or you can fill a buffer with data, flush cache, chown it to the NIC
<azonenberg> no races possible because you forfeit your own rights to the page as you do so
<levi> The lowest-level language available on the Burroughs B5000 series machines was a variant of Algol.
<levi> It had a descriptor-based memory management scheme as well; not super familiar with how it worked though.
<azonenberg> also there were no SFRs in antikernel
<emily> 18:09 <azonenberg> e.g. language semantics that incrementing an int always wraps mod 2^32
<azonenberg> Memory mapping was only used for bulk data transfer
<emily> this is already the case in rust because i32 is the default-inferred integer type btw
<emily> (sorry for resurrecting the old topic)
<azonenberg> and the basic primitive for control plane activity was either an RPC or an interrupt (request-response or unidirectional "FYI" with optional data attached)
<levi> Rust also only supports 2's complement signed integer semantics, so that's helpful.
<azonenberg> in the RPC case you had explicit handshaking
<emily> also re all the rust stuff: you can, fundamentally, just use unsafe if you know what you're doing is safe
<azonenberg> levi: no unsigned?
<emily> but it does help to internalize the rust mindset enough that you know how to do things without just sprinkling it everywhere
<azonenberg> thats another thing i disliked about java
<levi> It has unsigned, but doesn't support other kinds of signed integers.
<azonenberg> makes doing crypto, ECC, etc hard
<emily> it has unsigned
<azonenberg> oh ok
<azonenberg> so unsigned or twos complement. That's a sane choice
<emily> c'mon, it's a systems language, of course it has unsigned :p
<azonenberg> i've seen a lot of idiocy from language designers over the years sooo... :p
<emily> sudden flashbacks of those Java OSes
<azonenberg> Exactly
<azonenberg> Reminds me of the programming languages class i took as an undergrad
<azonenberg> we had to do a project in SALSA, an actor-based language built on top of Java
<azonenberg> meant for distributed systems
<azonenberg> i chose a password cracker and demonstrated beautiful scaling from 1 to 32 nodes of the department x86 cluster
<azonenberg> what i did not send the professor, out of fear for my grade (this was his pet project, descended from his dissertation) were the results for my C++/sse2 implementation that ran faster on one core of my laptop than salsa on 32 nodes of the cluster
<azonenberg> Or the CUDA implementation that, on a handful of GPUs in my living room, ran faster than linear scaling of SALSA extrapolated to the #1 top500 system at the time
<levi> Yeah; fortunately the people writing code for top500 machines aren't doing it in Java-based languages.
<levi> Mostly in C and Fortran still, unless things have changed drastically recently.
<azonenberg> yeah that sounds right. LAMMPS is a mix of C++ and FORTRAN, that was the last large HPC codebase i did much with
<azonenberg> back as an undergrad i never got access to more than half a rack of BlueGene/L
<azonenberg> it's funny, my RTX 2080 Ti probably has more flops than that
<azonenberg> Let's see, 2.9 Tflops/rack in coprocessor mode, so 1.45 Tflops/midplane, a 2080 Ti can do 14.2 Tflops
<azonenberg> So my GPU is actually equivalent to just shy of five racks of BG/L lol
<q3k> azonenberg: i mean, you care more in distributed systems than plain number crunching
<q3k> just because it was a bad choice for your usecase doesn't mean it's not a valuable tool
<azonenberg> well yes, my point was more to illustrate just how obsolete BG/L is by modern standards
<azonenberg> (not that a half rack was that much even back then compared to the size of the whole... 16 rack? system)
<q3k> do people actually use SALSA to do number crunching on bluegenes? i would expect to see a lot of c++/fortran with MPI
<q3k> your example doesn't really tell me anything about the power of your GPU vs a bluegene cluster, just that with a poor choice of tool just throwing more compute at a problem ain't gonna solve things
rohitksingh has quit [Ping timeout: 256 seconds]
<levi> I think SALSA is mostly an academic proof-of-concept sort of language, and is probably only really used in academic environments to teach distributed computing concepts.
<q3k> yeah, that's kind of my point (and it's IMO a good goal for a language)
<levi> I would also expect all production code on scientific compute clusters to be in C/C++/Fortran with MPI, mostly because scientists.
<q3k> i'm just not ready to go from 'i wrote this slow SALSA implementation of X' to 'my gpu is more powerful than a bluegene cluster'
<azonenberg> q3k: i was comparing peak flops in that example
<azonenberg> and i'm comparing a bluegene from 2008 to a modern GPU
<azonenberg> and no i dont think anyone uses salsa on it
<q3k> 'Or the CUDA implementation that [...] ran faster than linear scaling of SALSA [on bluegene]'
<azonenberg> i was extrapolating to roadrunner actually
<azonenberg> which i dont think was a bluegene, it was #1 on top500 at the time
<azonenberg> and my point was more to show how slow salsa was :p
<q3k> and that's fine that it's slow?
<q3k> like you started with azonenberg | i've seen a lot of idiocy from language designers over the years sooo... :p
<q3k> and i really am not sure what sort of idiocy you're talking about
Jybz has joined ##openfpga
Jybz has quit [Remote host closed the connection]
rohitksingh has joined ##openfpga
<levi> Looking at the Blue Gene/L architecture, it seems Java was a particularly poor choice for scaling performance-oriented numeric code on that platform.
<levi> And having a modern high-end desktop GPU outperform 2 racks of it in peak flops doesn't sound ridiculous either. The actual top-performing Blue Gene/L systems were around ~100 racks.
<q3k> especially for things like pure bruteforce, where a GPU just fits better than a bunch of CPUs
rohitksingh has quit [Ping timeout: 255 seconds]
emeb has joined ##openfpga
<adamgreig> azonenberg: i'm late to the party [because the rust embedded wg meeting is on right now too] but rust does have a lot of things you just mentioned on embedded
<adamgreig> packed structs that live at some memory address are absolutely a thing and is how most embedded devices do MMIO
<adamgreig> including thread-safe "ownership" thereof, so that only one thing can modify at a time in a way that can be safely moved between threads/interrupt handlers/etc
<adamgreig> even clever type level systems to automatically set the system interrupt floor based on current context to ensure no pre-emption on shared resources etc
<adamgreig> a lot of systems for extremely easy struct serialisation, and you can have structs with C layout semantics, or specifically packed, etc
<TD-Linux> oh are the HALs packed struct backed now?
<adamgreig> when were they not?
<TD-Linux> maybe they always were. I mean it's not obvious when I'm writing something like
<TD-Linux> rcc.apb1enr.write(|w| { w.pwren().bit(true) });
<adamgreig> that's a method-based api on top of a packed struct that lives in memory
<adamgreig> it compiles down to a normal read-modify-write or write to the relevant register address
<TD-Linux> yeah, I guess I assumed the backing was a direct mmio write
<adamgreig> though actually that might change due to a hilarious llvm issue, heh
<TD-Linux> link?
<adamgreig> technically llvm is allowed to insert reads to any dereferenceable reference, so you might end up with an unexpected read action on a mmio which is read sensitive
<adamgreig> only way around is to never construct a reference to mmio, so only do explicit pointer-based atomic reads
<adamgreig> that's sort of irrelevant though
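A sketch of the reference-free access pattern adamgreig describes (the addresses and register names are invented, not a real device): keep MMIO behind raw pointers and use read_volatile / write_volatile, so no Rust reference to a read-sensitive register ever exists for the compiler to speculate a load through. It is a compile-only sketch; running it against these addresses on a host would fault.

    use core::ptr;

    // Hypothetical read-sensitive peripheral; addresses are illustrative only.
    const FIFO_DATA: usize = 0x4000_1000; // reading pops the hardware FIFO
    const FIFO_CTRL: usize = 0x4000_1004;

    fn fifo_enable() {
        // Write through a raw pointer: no &mut u32 to the register is ever
        // created, so there is nothing "dereferenceable" for LLVM to
        // speculate a read through.
        unsafe { ptr::write_volatile(FIFO_CTRL as *mut u32, 1) };
    }

    fn fifo_pop() -> u32 {
        // Exactly one volatile load per call, never elided or reordered
        // relative to other volatile accesses.
        unsafe { ptr::read_volatile(FIFO_DATA as *const u32) }
    }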
mumptai has joined ##openfpga
rohitksingh has joined ##openfpga
rohitksingh has quit [Ping timeout: 240 seconds]
<tnt> So, correct me if I'm wrong, but there is no way in the ice40 (using a single pll, target is up5k) to have two clock outputs of the same frequency with a dynamically variable phase shift between them.
<tnt> Ok, nobody corrected me so ... damnit.
<levi> Wouldn't take that as definitive evidence; no one's said anything.
<kc8apf> Finally sorted out all the dependencies to build jtaghal. C++ dependencies are such a mess
Asu has quit [Quit: Konversation terminated!]
<tnt> So in the python wrapper, doing PyErr_SetString(...) isn't enough to raise an exception, the function actually needs to return ...
<tpw_rules> tnt: have you consulted the manual
<tnt> tpw_rules: the manual ?
<tpw_rules> lattice tech note 1251
<tnt> oh of the pll you mean ?
<tpw_rules> "iCE40 sysCLOCK PLL Design and Usage Guide"
<tnt> Well it doesn't even list the two PLL outputs ...
<tnt> and yeah, obviously I looked at it, and I couldn't find any way to do what I want.
<tpw_rules> then it must not be possible
<tpw_rules> it says tho "The PLL provides two optional fine delay adjustment blocks that control the delay of the PLLOUT output relative to the input reference clock, to an external feedback signal, or relative to the selected quadrant phase shifted clock."
<tnt> what's possible is way different than what's in the lattice docs ...
<tpw_rules> i'm not sure what you're proposing
<tnt> well in _this_ case nothing. But my point is that not being in the lattice docs is not exactly enough for me to conclude something is not possible. (like for instance outputting two clocks 90 deg apart. Nowhere in TN1251 will it tell you that you can do it, but you can ...)
<tpw_rules> maybe that's the wrong note
<tpw_rules> because i remember finding out that fact from a tech note and using it
<tnt> the icecube UI allows you to do that.
<tnt> The FPGA library reference also references the two PLL outputs.
<tpw_rules> there is a document on it: FPGA-TN-02052-1.0
mumptai has quit [Remote host closed the connection]
<tpw_rules> hmm, maybe not?
<tpw_rules> page 101
<tpw_rules> it looks like you can do it, as long as the pll's frequency is the same as the input frequency
<tnt> yeah, using bypass mode. Not an option, I need a synthesized clock.
<tpw_rules> the 2F variety might be able to do it
<tpw_rules> page 104
<tnt> lattice download ... 37 min left ...
<tnt> I don't understand how their website can be so bad.
<tpw_rules> it went okay for me, but yeah it is amazing
<tnt> But anyway, I know the 2F variant. And unless I'm missing something, I don't see how to achieve that. (which is why I asked here in the first place in case I missed something or misread ...)
<tpw_rules> so the dynamicdelay delays both pins the same?
<tnt> GENCLK or GENCLK_HALF don't have the dynamic delay applied. SHIFTREG_{0,90} have the delay applied but are 1:4 of the frequency of the GENCLK output. And there are only two dynamic delay controls: 1 in the feedback path to control phase wrt the input clock, and 1 in the output path.
<tpw_rules> and the output goes to both A and B
<tnt> So the "best" I could possibly do is output shiftreg_0 and then genclk_half and post divide genclk_half in the fabric.
<tnt> but in my case genclk_half would be 280 MHz which is a tad high for a UP5k.
<tpw_rules> alright. sorry for wasting your time
<tnt> np, appreciate having a second pair of eyes on it.
renze has quit [Quit: Spaceserver reboot?!]
renze has joined ##openfpga
Bike has joined ##openfpga
unixb0y has quit [Ping timeout: 258 seconds]