##openfpga on 2018-12-24 — irc logs at freenode.irclog.whitequark.org

00:00 pie___ has quit [Ping timeout: 250 seconds]

00:03 ayjay_t has quit [Read error: Connection reset by peer]

00:03 ayjay_t has joined ##openfpga

00:16 GuzTech has quit [Ping timeout: 240 seconds]

01:00 Flea86 has joined ##openfpga

01:03 <whitequark> tnt: there is a ? you can put in casez

01:05 oter has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

01:20 pie__ has joined ##openfpga

01:29 Richard_Simmons has quit [Ping timeout: 264 seconds]

01:35 Bob_Dole has joined ##openfpga

02:00 zng has joined ##openfpga

02:16 unixb0y has quit [Ping timeout: 272 seconds]

02:17 unixb0y has joined ##openfpga

02:26 _whitelogger has joined ##openfpga

03:28 JSharp has quit [Quit: Updating details, brb]

03:28 JSharp has joined ##openfpga

03:32 Richard_Simmons has joined ##openfpga

03:35 Bob_Dole has quit [Ping timeout: 250 seconds]

03:39 <_whitenotifier-6> [GitHub] Responsive is better than fast.

03:39 <_whitenotifier-6> [Boneless-CPU] whitequark created branch master - https://git.io/fhUTh

03:39 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 38 commits to master [+11/-4/±50] https://git.io/fhUTj

03:39 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 936d96b - arch.boneless: new architecture (Boneless v2).

03:39 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 211dd22 - arch.boneless: make sure NOP is 0x0000.

03:39 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 2050d71 - arch.boneless: add assembler with labels.

03:39 <_whitenotifier-6> [whitequark/Boneless-CPU] ... and 35 more commits.

03:45 <cr1901_modern> "Responsive is better than fast." Huh...

03:46 <whitequark> it's some github "zen" thing

03:47 <whitequark> which they send as a test payload

03:56 azonenberg_work has joined ##openfpga

04:17 _whitelogger has joined ##openfpga

04:23 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 2 commits to master [+0/-0/±2] https://git.io/fhUk2

04:23 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark bc3945b - Minor clarify fixes.

04:23 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark cb79d60 - Simplify ALU instruction decoding (-4 LUT).

05:34 rohitksingh has joined ##openfpga

06:01 zng has quit [Quit: ZNC 1.8.x-nightly-20181211-72c5f57b - https://znc.in]

06:06 zng has joined ##openfpga

06:13 Miyu has quit [Ping timeout: 240 seconds]

06:49 ayjay_t has quit [Read error: Connection reset by peer]

06:50 ayjay_t has joined ##openfpga

06:52 <tnt> whitequark: yeah, but casez(mysig) will also make a 'z' in 'mysig' be a don't care. I'd like to have only the case options have don't cares, not the input signal I'm matching against.

06:59 rohitksingh has quit [Ping timeout: 250 seconds]

07:01 <whitequark> right

07:01 <whitequark> no idea how to do that

07:16 rohitksingh has joined ##openfpga

07:18 catplant has quit [Quit: WeeChat 2.2]

07:21 catplant has joined ##openfpga

07:28 rohitksingh has quit [Remote host closed the connection]

07:30 rohitksingh has joined ##openfpga

07:36 <whitequark> ZipCPU: so, I am trying to convert an FSM-driven CPU into a pipelined CPU, for various reasons

07:36 <whitequark> I am interested in formally proving their equivalence

07:36 <ZipCPU> nMigen?

07:36 <whitequark> yeah

07:36 <whitequark> both in nMigen

07:36 <whitequark> really, the pipelined one will have the same CPI as FSM-driven one

07:36 <whitequark> but it turns out that FPGA tooling is kind of bad at optimizing FSM-driven CPUs

07:37 <whitequark> right now the FSM logic takes well over 50% of LUT count

07:37 <ZipCPU> Ouch

07:37 <whitequark> and that's with many different toolchains

07:37 <ZipCPU> Ok

07:37 <whitequark> yeah, it should be somewhere in the range of 200 LUT, and in reality it's more like 500

07:37 <whitequark> they will be both in-order and scalar

07:37 <ZipCPU> How much of a proof do you have in place already?

07:38 <whitequark> absolutely nothing, I have a set of testcases

07:38 <whitequark> (and a number of bugs in them, some of which I probably haven't found yet)

07:38 <ZipCPU> RISC-V CPU?

07:38 <whitequark> no, it's my own architecture

07:38 <ZipCPU> Ok

07:39 <ZipCPU> Clock for clock equivalence?

07:39 <ZipCPU> Or more general?

07:39 <whitequark> general. I think I need a bit of background

07:39 <ZipCPU> Never done the formal thing before?

07:39 <whitequark> this CPU is intended for FPGA control plane and the idea is that it takes absolute minimal amount of resources

07:39 <whitequark> so it stores everything except PC and flags in a single block RAM

07:40 <whitequark> it is also designed to exploit single-port RAM like in UP5K, but scale up to true multiport RAM

07:40 <whitequark> there are two reasons I am converting it to a pipeline design in spite of the fact that every instruction, by design, involves up to 3 loads and up to 1 memory store

07:40 <whitequark> to a single BRAM

07:40 <whitequark> the first one is that inferredd FSM control logic is unusably bloated

07:40 <ZipCPU> "stores everything ... in a single block RAM": does that include CPU state variables, or just CPU registers?

07:41 <whitequark> general-purpose registers

07:41 <whitequark> the only CPU state variables that are defined by architecture is PC and flags (ZSCV)

07:41 <ZipCPU> Just GP registers, or ... other memory as well?

07:41 <whitequark> GP registers, code and data share address space

07:41 <whitequark> and live in a single BRAM

07:42 <whitequark> it is an exceptionally compact design, which AFAIK has no precedent, or at least I haven't seen any

07:42 <ZipCPU> How much formal verification have you done so far?

07:42 <whitequark> it takes a number of lessons from PicoBlaze though

07:42 <whitequark> I have never done any formal verification for programmable logic, though I have some experience with Coq

07:42 <whitequark> I've looked at RVFI.

07:43 <ZipCPU> Ok ... so you'll be working through a learning curve at the same time. Sure. Here's the lesson I learned recently on CPU verification:

07:43 <ZipCPU> Keep the instruction around. Re-decode it within your formal logic at every step of the pipeline, and then verify that all of the local values within the pipeline continue to match the instruction you are processing.

07:44 <ZipCPU> That's similar to what RVFI is doing. They use a packet that goes through the pipeline, and then verify it at the end

07:44 <ZipCPU> s/They use/It uses/

07:45 <ZipCPU> You can do something similar, but have (as a minimum) the instruction as your packet, and then verify everything against it.

07:45 <whitequark> right, so what I'm thinking is: have an external interface where each clock cycle it is announced: a) the instruction opcode, b) what it is storing to memory, c) what is the new state of flags (if changed) and PC

07:46 <whitequark> because this should match for all in-order scalar implementations, no matter what they do inside.

07:46 <ZipCPU> Why an *external* interface?

07:46 <sorear> if you have a mechanism to $display instructions one at a time as they're executed, you have what you need for RVFI-alike

07:46 <whitequark> external to the implementation, I mean

07:46 <whitequark> hm

07:47 <whitequark> what I was thinking about is using the FSM implementation as a reference and comparing the pipelined implementation against it

07:47 <whitequark> but maybe that's a bad idea

07:47 <ZipCPU> The difficult part will be induction

07:47 <ZipCPU> You might manage to go several cycles with that in BMC, but ... induction can be more of a challenge

07:48 <ZipCPU> So there's somewhat of a division between Clifford and I: I use induction for everything, he does not

07:48 <ZipCPU> We joke about my being crazy for doing it, but I like the extra assurance it provides. That and ... somethings can only be proven via induction

07:49 <ZipCPU> Induction requires .... more intrusive properties to make certain the CPU state is consistent at every state

07:50 <ZipCPU> This is why I now "re-decode" the instruction at every CPU state--it helps to insure a consistent state within the CPU at every stage of the pipeline

07:50 <whitequark> hmmm so this is how my pipeline looks: https://pbs.twimg.com/media/DvKpLyzXQAI88lz.jpg:large

07:51 <ZipCPU> I might also recommend, since you are starting from scratch, that you don't worry so much about verifying the equivalence of the implementations, but rather that one of the implementations (or both) is equivalent to the ISA behavior

07:51 * ZipCPU clicks on the link

07:51 <whitequark> essentially, it has three load/decode stages, then some combinatorial logic that actually executes the instruction, then a writeback stage.

07:51 * ZipCPU had forgotten yo uwere using a spreadsheet for your pipeline ;)

07:51 <ZipCPU> Four stages?

07:51 <whitequark> yes.

07:52 <whitequark> three-address code, so load opA, load opB, store result, and fetch.

07:52 <whitequark> in the case of ALU instructions.

07:52 <whitequark> of course, since every stage is competing for memory access, it will be always mostly stalled. but this is fine.

07:53 <ZipCPU> Memory ... everything is using block RAM, right?

07:53 <whitequark> yep.

07:53 <ZipCPU> How big is the block RAM?

07:53 <whitequark> 16 bit by however much storage you want.

07:53 <whitequark> it should run with 256x16 iCE40 fine, and address space permits up to 65536x16.

07:54 <whitequark> which would be single port RAM territory on iCE40UP5K.

07:54 <ZipCPU> Ok, so ... 65kx16 then for the size of the proof

07:54 <whitequark> that seems like a very large state space...

07:54 <ZipCPU> Is the memory external or internal to the CPU?

07:55 <whitequark> internal to CPU.

07:55 <whitequark> (there is an external bus also, which you can use to attach peripherals, expand memory, etc.)

07:55 <ZipCPU> (It's not nearly as large a state space ... if you only have a 10 stage pipeline, only 10 memory values will ever be relevant in any proof)

07:55 <whitequark> ah I see.

07:55 <ZipCPU> So ... after your memory is larger than 10, bigger memories don't really change things

07:56 <ZipCPU> But it is internal to the CPU so .... you'll need assertions mid-pipeline that the instructions are consistent with what's in memory ...

07:56 <ZipCPU> What kind of states can the CPU ever be halted within? Are there specific places where it can be halted?

07:57 <whitequark> it doesn't have opcode space for a halt instruction. right now I use a special jump in a simulator to signal halt.

07:57 oter_ has quit [Quit: My iMac has gone to sleep. ZZZzzz…]

07:57 <whitequark> so, HLT ~ J .-1

07:57 <ZipCPU> (One of my frustrations is that, with the ZipCPU, it can be halted in *any* state ... that necessitated a method of verification)

07:58 <ZipCPU> Ok, what sort of stalls might you expect? Does the CPU ever stall?

07:58 <sorear> what does "state" mean in a pipelined context? do you not just stall the frontend and wait for everything to drain?

07:58 <whitequark> a lot. with single port RAM, each ALU instruction will stall four times

07:59 <ZipCPU> "stall" four times? But your memory is internal to the CPU. What's causing the stalls?

07:59 <whitequark> the memory may be single-port.

07:59 <whitequark> so only one access to the memory may go on per cycle

08:01 <whitequark> basically the challenge with this CPU is scheduling access to its internal memory

08:01 <whitequark> using the FSM as an abstraction for that did not work out

08:01 <whitequark> so now I am using a pipeline as an abstraction for that

08:01 <ZipCPU> whitequark: This sounds like a fun problem, and a good one for formal verification

08:01 <whitequark> right? I think so too

08:02 <whitequark> there are definitely some very good formal properties to be stated, like "in a SPRAM implementation, RE and WE are never on at the same time"

08:02 <ZipCPU> I'm not sure I have any wonderful ideas for how to get started. Usually I recommend starting with something simpler, but ... the problem at hand is always the one you want to work with

08:02 <ZipCPU> That's good

08:02 <ZipCPU> How about for instruction I, that reads from memory A, that it should be reading from memory A ?

08:03 <whitequark> I was thinking about this, but, depending on the implementation, reads may well go in different order

08:03 <ZipCPU> Or that, in your pipeline implementation, only instruction I is ever in the pipeline at a time

08:03 <ZipCPU> I think that was what you had described to me, no?

08:04 <whitequark> I think these are both not very good properties for formal verification of an *arbitrary* implementation, as opposed to a specific one

08:04 <whitequark> for example:

08:04 <whitequark> for any ALU instruction, the Load A and Load B stages are not interdependent

08:04 <whitequark> so they can be swapped

08:04 <whitequark> and loads will go in different order

08:04 <ZipCPU> The tricky (but important) one is writeback. If instruction I is an ADD instruction, then OpA should match the memory, OpB should match memory, and OpA+OpB should be written back

08:05 <ZipCPU> You could start there

08:05 <whitequark> hmmm

08:06 <ZipCPU> Be aware of multiplies ... those will definitely get in your way, so leave those for the end when you have everything else verified

08:06 <whitequark> so, in my formal model, I parse the instruction, extract operands from memory, execute the instruction, and verify that the output matches the FI packet?

08:06 <whitequark> there are no multiples in this CPU :D

08:07 <ZipCPU> Oh, sorry, looking closer at the "M-class" instructions it looks like 'M' stood for memory, rather than multiply

08:08 <ZipCPU> Since you have specific classes, you could create an assertion for each class: if the instruction being retired at this point is of class X, then ... these properties should hold

08:09 <ZipCPU> (Whatever they may be)

08:09 <whitequark> let's look at just AND.

08:09 <whitequark> I'd write something like...

08:10 <whitequark> if (i_opcode == OPCODE_AND) assert fv_stored_value == mem[r_win|i_regX] & mem[r_win|i_regY];

08:10 <whitequark> where fv_stored_value is the information from the packet about the instruction being retired,

08:11 <whitequark> i_* is the fields from the decoded instruction,

08:11 <whitequark> and r_win is the register window position (currently implemented as external to CPU, will probably become uarch state in the future)

08:11 <ZipCPU> Sounds like a good start

08:12 <whitequark> ok, I can definitely do that for all instructions.

08:12 <whitequark> how do I write the testbench and run it? is there a guide?

08:12 <ZipCPU> Do not confuse FV with a testbench. They are two completely different methodologies

08:12 <ZipCPU> Yes, there is a guide

08:12 <whitequark> ok, I do not know the terminology

08:12 <ZipCPU> (A couple even)

08:12 <whitequark> I need to instantiate the CPU and feed it to the toolchain somehow

08:12 <whitequark> that's what I don't know how to write

08:12 <ZipCPU> Yes

08:13 <ZipCPU> Let's see ... where's the best start for using SymbiYosys

08:13 <ZipCPU> There's a good readthedocs page for SymbiYosys

08:13 <ZipCPU> Have you written yosys scripts before? ... like for building things for iCE40's?

08:14 <whitequark> yeah, I use the Yosys script interface a lot

08:14 <whitequark> even wrote a few commands that are upstream, like equiv_opt

08:14 <whitequark> for proving equivalence of RTL before and after some pass

08:15 <ZipCPU> Have you looked through my tutorial at all? That's another place for getting started with SymbiYosys information

08:15 <whitequark> this one? https://zipcpu.com/zipcpu/2018/12/20/sby-makefile.html

08:15 <ZipCPU> That's also got some good information in it

08:16 <ZipCPU> I was going to recommend this one: https://zipcpu.com/tutorial

08:16 <ZipCPU> Lesson 3 gets into using SymbiYosys (don't give up on the lesson too early for being too basic)

08:16 <ZipCPU> Yeah, slide 30 of lesson three shows the basic outline of a SymbiYosys script

08:16 <whitequark> ah I see! I haven't looked at that since I don't really write Verilog directly

08:16 <whitequark> in fact nMigen does not use Verilog anywhere, it synthesizes to RTLIL

08:17 <ZipCPU> So ... how is your flow going to work then? nMigen->RTLIL->Verilog->SymbiYosys ?

08:17 <whitequark> I think so, yeah, since Verilog is (a) the human-readable output (b) can be used with other toolchains.

08:18 <whitequark> so, making sure that Yosys write_verilog pass did not mess up is important

08:18 <ZipCPU> Looks like lesson three of the tutorial walks you through the initial basics of using SymbiYosys and all the details of how it works

08:18 <whitequark> thanks, will use that

08:18 <daveshah> Making sure write verilog works is the only reason to use verilog

08:18 <whitequark> hehehe

08:18 <daveshah> read_ilang should be fine in SymbiYosys too

08:18 <ZipCPU> Will you be creating your formal statements within nMigen?

08:19 <ZipCPU> formal *properties* .. sorry

08:19 <whitequark> nMigen doesn't yet support formal properties, and I'm not entirely sure if they survive write_verilog in Yosys yet

08:19 <daveshah> The alternative to that is just exposing the necessary internal state as external outputs

08:19 <whitequark> so I think I'll start with writing them in Verilog

08:19 <daveshah> like RVFI

08:20 <whitequark> yeah, I'm going to follow RVFI

08:20 <whitequark> right now I really want to treat the CPU as a black box

08:20 <whitequark> maybe later I will add some assertions to the CPU itself

08:20 <ZipCPU> Ok ... that'll work until you get to induction ... then you need properties describing internal states

08:20 <ZipCPU> In other words, start out with "mode bmc" in your sby file

08:21 <daveshah> I'm not convinced you ever need induction

08:21 <whitequark> ahhh I think I see why induction needs properties describing internal states

08:21 <daveshah> riscv-formal gets away wiyhout it

08:21 <whitequark> because it ignores initial statements

08:21 <daveshah> Exactly

08:21 <ZipCPU> whitequark: Didn't I share that there was a division between Clifford (and now David) and I over using induction?

08:22 <ZipCPU> :D

08:22 <whitequark> yeah, but I didn't really understand why, until now

08:22 <ZipCPU> For now, it's a healthy debate

08:22 <daveshah> The third option is abc pdr. This provides a full proof too, but doesn't tend to need so many internal assertions as it determines reachability itself

08:22 <ZipCPU> None of us are religious about it (yet--or at least, I'm not)

08:22 <daveshah> However, I think a CPU might be too much for it

08:22 <whitequark> daveshah: even a boneless CPU? :P

08:22 <daveshah> Might be worth a try

08:22 <ZipCPU> daveshah: whitequark is dealing with up to 65kx16 of memory. I didn't think abc pdr handled memory well

08:23 <daveshah> Sure, but there's no need for more than a few 10s or 100s of words in that case

08:23 <ZipCPU> whitequark: The fastest engines right now for CPU work are yices and v3 of boolector. If you just use the "smtbmc" engine in SymbiYosys, you should do well

08:24 <whitequark> ZipCPU: thanks. I am off to watch FV find the bugs in my CPU I currently know the existence of.

08:24 <ZipCPU> Wheeee!!!

08:24 <ZipCPU> My guess is that you'll start finding a bunch of bugs in your properties first ;)

08:25 <whitequark> yeah, when I was writing testcases, it was about 50/50 split.

08:25 <whitequark> but I expect this to change once I switch to a pipelined design.

08:25 <whitequark> especially once I give it a true multiport memory and hazards become something that needs to be handled...

08:27 * ZipCPU found a lot of unchecked hazards when he first verified his ZipCPU

08:27 GuzTech has joined ##openfpga

08:27 <whitequark> yeah, I am *definitely* not trusting a pipelined design without a full formal proof

08:27 <whitequark> humans are just not good at finding this kind of bugs at all

08:28 * ZipCPU trusted his design before the formal proof, and now realizes his error

08:28 <SolraBizna> I didn't verify my pipelined design at all

08:28 <whitequark> ever since I seriously looked at FV of software, I am acutely aware that most code I write works, at most, by coincidence

08:28 <SolraBizna> (Relatedly, I never got it fully working)

08:28 <ZipCPU> I'm not sure if I'd write a CPU again without starting with formal at the beginning, just based on all I've learned in the process

08:28 <ZipCPU> ;D

08:28 <whitequark> but for software, FV is still often intractable

08:28 <whitequark> you cannot really use it for e.g. Python code

08:29 <ZipCPU> There are approaches to doing so ... I've just never tried them

08:29 <whitequark> there are well established approaches

08:29 <ZipCPU> SolraBizna: Still working with it and want to give formal a try?

08:29 <whitequark> for example, Coq extraction, and there's Frama-C for C code

08:29 <SolraBizna> I've been thinking about starting from scratch again and using formal verification as an excuse to do it

08:29 <ZipCPU> I like your excuse

08:30 <SolraBizna> It's one of those things where I'm going to hate learning it, but once I know it and use it it will save countless hours of work

08:30 <ZipCPU> I actually enjoyed learning FV. I had anticipated that I would not, but I found it to be a lot of fun

08:31 <whitequark> I really liked learning Coq

08:31 <sorear> my impression is that the major difference is that in RTLIL your state space is always finite, while nontrivial C or Python programs have infinite state spaces

08:31 <SolraBizna> I expect it will be similar with me, once I get past the "quit dithering and actually figure out how to talk to SymbiYosys" stage

08:31 <whitequark> however, I never got far in Coq

08:31 <whitequark> working with natural numbers is very different from working with inherently bounded state spaces like in programmable logic

08:31 <ZipCPU> sorear: You may have hit the nail on the head there

08:31 <sorear> one of these is NP-hard, the other is undecidable

08:32 <whitequark> it's an excellent mental exercise btu I think if you want it to be practical, you need many years of it

08:32 <whitequark> and maybe a PhD or two

08:32 <whitequark> that you write while learning

08:32 <whitequark> such is the state of the field...

08:32 <ZipCPU> No, I disagree

08:32 <ZipCPU> I found bugs in the first design that I (as a complete FV newbie) tried to formally verify

08:32 <whitequark> no no, I am talking about Coq h ere

08:32 <ZipCPU> Ah, ok

08:33 <whitequark> Coq is far more general, it is a bit like a formal verifier construction kit

08:33 <whitequark> there is a verified C compiler called CompCert, in Coq

08:33 <ZipCPU> whitequark: Do keep me posted on how this works out for you. I'd love to hear (and catalog) your experiences.

08:33 <whitequark> and most of the work that went into it, was in figuring out how to make *any* compiler verified in Coq

08:34 <whitequark> sure!

08:35 <ZipCPU> whitequark: Ever seen my ORCONF 2018 presentation? https://github.com/ZipCPU/zipcpu/raw/master/doc/orconf2018.pdf ?

08:35 <whitequark> not yet

08:36 <ZipCPU> The bottom of slide four discusses the differences between induction and BMC

08:37 <whitequark> ah I see, that's a good explanation

08:37 <whitequark> but here's something weird

08:38 <whitequark> I used induction a lot in Coq and in Coq, it works like this: you prove the basis of induction, and then you prove the step

08:38 <sorear> then cakeml went and did an arguably stronger approach using a completely different system

08:38 <ZipCPU> whitequark: Go on

08:39 <whitequark> but in Yosys it seems that the proof of the basis is missing; instead, you take an arbitrary but valid state of the basis, without trying to prove that it is reachable

08:39 <whitequark> it feels to me that there should be something in the middle between Yosys BMC and Yosys induction

08:39 <daveshah> The proof of the basis is done by running BMC first

08:39 <sorear> "proving that a state is reachable" is … harder than NP

08:39 <whitequark> hmm, but then why do you need a white box approach?

08:40 <daveshah> sorear: that's what abc pdr does

08:40 <whitequark> can you take the results of BMC and feed them into induction?

08:40 <daveshah> It either completes in a second, or doesn't complete at all

08:40 <daveshah> whitequark: no, because the starting state is arbitrary

08:40 <whitequark> yeah, that's what I don't understand

08:40 <whitequark> why is it arbitrary?

08:40 <ZipCPU> To avoid calculating reachability ;)

08:41 <sorear> state reachability is PSPACE-complete

08:41 <sorear> both "BMC" and "induction" are only solving NP problems

08:41 <whitequark> ok, so it is basically an optimization

08:41 <whitequark> I see

08:41 <ZipCPU> Optimization? Not what I'd call it

08:41 <ZipCPU> It's not optimizing an existing solution to work faster, it's just not calculating reachability at all

08:42 <whitequark> well, optimization might not be a good word

08:42 <ZipCPU> Your assertions are used to force your design into a reachable state

08:42 <whitequark> but I think I understand why this happens now

08:42 <ZipCPU> This was the important part of the last FV quiz: if you find a bug in section (A) (the first part of your trace), you don't have enough assertions

08:42 * sorear has an irrational dislike of automatic theorem provers with an exponential or larger gap between average/heuristic case and worst/provable case, which is most of them

08:43 <whitequark> of course, I still want reachability, but I assume that there are very good reasons it isn't done

08:43 <ZipCPU> If you find a bug in section (B) or (C) of the trace (last clock, or last timestep), then you may have a logic bug

08:43 <daveshah> Well, try abc pdr and see if it works

08:43 <ZipCPU> No ... run from abc pdr!

08:43 * whitequark has an irrational dislike of abc

08:43 <whitequark> (or maybe rational)

08:43 <ZipCPU> Somtimes abc pdr works fast, and gives you a solution very quickly

08:43 <sorear> a circuit with 200 state bits can have states which are reachable, but only after 2^200 time steps

08:44 <ZipCPU> Other times, smtbmc gives you a result and ... abc pdr gets stuck

08:51 * ZipCPU heads back to bed

08:55 rohitksingh has quit [Ping timeout: 250 seconds]

09:03 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 1 commit to master [+0/-0/±2] https://git.io/fhUYK

09:03 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 26d7202 - Remove explicit HALT state, use J(-1) instead.

09:29 _whitelogger has joined ##openfpga

09:48 <tnt> Does yosys take advantage of 'x' ? Like if in a always_comb I assign a buch of signals to 'x' most of the time, does it know I don't care about the output in this case and its free to pick whatever is most convenient to generate ?

11:43 rohitksingh has joined ##openfpga

11:52 rohitksingh has quit [Ping timeout: 272 seconds]

12:10 Miyu has joined ##openfpga

12:14 Miyu has quit [Ping timeout: 244 seconds]

12:28 pie__ has quit [Ping timeout: 252 seconds]

12:31 sgstair has quit [Remote host closed the connection]

12:45 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 3 commits to master [+6/-1/±2] https://git.io/fhUcC

12:45 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 92fee90 - Separate core and testbench code.

12:45 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 9bef717 - Add a disassembler, usable as a GTKWave filter.

12:45 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 455dc55 - Add Python package.

12:51 sgstair has joined ##openfpga

13:50 GenTooMan has joined ##openfpga

14:00 Flea86 has quit [Ping timeout: 240 seconds]

14:10 _whitelogger has joined ##openfpga

14:35 Miyu has joined ##openfpga

14:48 s_frit has quit [Ping timeout: 250 seconds]

15:07 X-Scale has quit [Read error: Connection reset by peer]

16:17 zng has quit [Quit: ZNC 1.8.x-nightly-20181211-72c5f57b - https://znc.in]

16:19 zng has joined ##openfpga

16:25 pie__ has joined ##openfpga

17:10 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 2 commits to master [+5/-0/±2] https://git.io/fhUun

17:10 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 47db93e - Teach disassembler to emit NOPs.

17:10 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 6fd5f1a - Write a formal specification for the instruction set.

17:33 rohitksingh has joined ##openfpga

18:00 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 3 commits to master [+0/-0/±6] https://git.io/fhUzd

18:00 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark d36736b - Fix JR/JAL immediate width. (Bug found with formal methods!)

18:00 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark f615859 - Fix ADDI not setting flags. (Bug found with formal methods!)

18:00 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 5183ba2 - Add some delay to pins testbench, now that ADDI/SUBI work properly.

18:15 oter has joined ##openfpga

18:16 oter has left ##openfpga ["Textual IRC Client: www.textualapp.com"]

18:26 azonenberg_work has quit [Ping timeout: 240 seconds]

18:47 GenTooMan has quit [Read error: No route to host]

18:52 oter has joined ##openfpga

18:53 oter has quit [Client Quit]

18:58 oter has joined ##openfpga

19:10 oter has quit [Quit: My iMac has gone to sleep. ZZZzzz…]

19:11 azonenberg_work has joined ##openfpga

19:14 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 2 commits to master [+3/-0/±2] https://git.io/fhUaT

19:14 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 14d9d10 - Make the disassembly a bit more compact.

19:14 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark f4c0504 - Add iCEblink40-HX1K example.

19:14 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 1 commit to master [+4/-0/±1] https://git.io/fhUak

19:14 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark d81a52c - Add iCEblink40-HX1K example.

19:21 m_w has joined ##openfpga

19:27 rohitksingh has quit [Ping timeout: 246 seconds]

19:37 <SolraBizna> tnt: I vaguely remember checking this and finding that does

19:38 <SolraBizna> *it does

20:09 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark pushed 2 commits to master [+0/-0/±4] https://git.io/fhUVs

20:10 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark a6a4b36 - Add XCHG pseudo, expanding to XOR swap.

20:10 <_whitenotifier-6> [whitequark/Boneless-CPU] whitequark 0fc6d57 - Change the pins test to use PWM.

20:12 <SolraBizna> I'll test it again

20:23 <whitequark> ZipCPU: okay, so, i'm done

20:23 <whitequark> here is my specification: https://github.com/whitequark/Boneless-CPU/blob/master/formal/formal.sv

20:24 <whitequark> I still need an assertion that the CPU retires at least one instruction each 20 clock cycles (20 can happen with a really large shift)

20:25 <whitequark> and that flags are the same as when the previous instruction was retired, for most of them

20:25 <whitequark> for the former, I think I need SystemVerilog $past()[*20] or something

20:25 <whitequark> which doesn't seem to be supported in Yosys

20:25 <whitequark> for the latter, I need to figure out if $past() actually gives me a value, or if there is no such past value

20:26 <whitequark> which I don't know how to express

20:27 <SolraBizna> tnt: it does

20:55 oter has joined ##openfpga

21:21 <pointfree> qu1j0t3: Nifty floating-gate analog memory chip http://www.kowatec.com/prod/ap/doc/apr6016-v13.pdf

21:35 kristianpaul has quit [Quit: Lost terminal]

21:36 <qu1j0t3> pointfree: interesting, thanks

21:56 <TD-Linux> whitequark, did you consider just a shift-by-1 instruction and requiring a software loop?

21:56 <TD-Linux> tho I suppose non pipelined jumps make that really bad

21:56 oter has quit [Quit: My iMac has gone to sleep. ZZZzzz…]

22:01 <sorear> that's what lm32 does, I think it came up yesterday

22:02 <sorear> (you don't need shift left by 1 since it's just ADD)

22:02 oter has joined ##openfpga

22:02 <sorear> (technically you can construct a right shift by 1 from arithmetic and jumps but it's Not Pretty)

22:05 <emily> is that actually... practical to do? that sounds like implementing addition in terms of a successor instruction, but /me is clueless

22:44 oter has quit [Quit: My iMac has gone to sleep. ZZZzzz…]

22:52 Flea86 has joined ##openfpga

23:04 <mithro> First attempt at new SymbiFlow website -> https://symbiflow.github.io/

23:09 m_w has quit [Quit: Leaving]

23:10 <mithro> daveshah: Would love help fixing up the Project Trellis section

23:10 m_w has joined ##openfpga

23:24 oter has joined ##openfpga

23:25 <tnt> SolraBizna: thanks :)

23:41 oter has quit [Quit: My iMac has gone to sleep. ZZZzzz…]

23:51 <q3k> mithro: there needs to be more 'get started' and 'boards to buy' sections before 'how it works' imo

23:51 <q3k> mithro: the whole 'how it works' is a wall of text i think

23:52 <q3k> mithro: a lot of the graphics don't even fit on my screen at once https://q3k.org/u/e439169bd003c1e9725f800eb9c5aa8a4fa525dbda9e1547f232c11442ca3a93.png

23:52 <q3k> mithro: really needs more entry-level high-overview graphics much higher up

23:52 <q3k> mithro: and you should also repeat the get started 'call to action' a bunch of more times across the page

23:53 <q3k> mithro: since if you scroll past it, it's not even on the horizontal menu on top

23:54 <q3k> mithro: also that 'open source FPGA tooling for rapid innovation' tagline is way too long

23:55 <q3k> also migen is not spelle MiGen

23:56 <mithro> https://github.com/SymbiFlow/symbiflow-website/issues

23:56 <q3k> i's also bump that contrast up quite a bit, that gray is barely readable