<awygle> i specifically wanted polarized, i never buy non-polarized sunglasses anymore
<azonenberg_work> Yeah i dont care about the polarized aspect
<azonenberg_work> My requirements were a single frame available with replaceable clear and dark lenses
<azonenberg_work> Adjustable earpieces, rubberized nose to help hold it in place when heavily sweating etc
<azonenberg_work> then Z87.1 high impact as well as MIL-PRF-31013 impact standards
<whitequark> azonenberg_work: done
<whitequark> oh sec
<awygle> i can't find any actual, like, manufacturer website for these sunglasses
<openfpga-bot> [jtaghal] whitequark pushed 1 new commit to master: https://git.io/fAwGe
<openfpga-bot> jtaghal/master e19e380 whitequark: Implement device enumeration for GlasgowSWDInterface.
<rqou> azonenberg_work: so, i just discovered that apparently you can get slide-out storage units that fit in server racks
<rqou> did you know about that?
<awygle> they're, uh... not cheap
<rqou> heh, figured as much
<rqou> i didn't even know these existed
<awygle> we had a couple at planetary
<awygle> rack mount shelving ditto
<azonenberg_work> whitequark: gaah the forward is broken again i think
<rqou> lool
<rqou> use ipv6 without a forward? :P
<whitequark> azonenberg_work: fixed i think
<whitequark> let me look up some keepalive options for ssh
<whitequark> azonenberg_work: should do keepalive now
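For reference, the usual OpenSSH keepalive options look like this (the interval values here are arbitrary examples, not what whitequark actually set):

```
# client side, ~/.ssh/config
Host *
    ServerAliveInterval 30    # probe the server every 30 s
    ServerAliveCountMax 3     # drop the connection after 3 unanswered probes

# server side, /etc/ssh/sshd_config equivalent:
# ClientAliveInterval 30
# ClientAliveCountMax 3
```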
<whitequark> rqou: russia
<whitequark> does not have ipv6
<whitequark> anywhere afaik
<rqou> what
<whitequark> we just have NAT. a lot of NAT
<rqou> whyyy?!
<whitequark> most big ISPs don't give you real IPs anymore
<whitequark> my guess is there's no real incentive
<whitequark> there was NAT even before IPv4 exhaustion
<whitequark> also, shitty SOHO routers don't do IPv6 and no one wants to upgrade all that
<rqou> there's no "real" incentive in the us and yet isps are very slowly upgrading
<whitequark> for some reason ipv6 is quite popular in the US on mobile
<whitequark> no idea why
<rqou> meanwhile supposedly some isps in brazil are doing ipv6-only with ipv4 over ipv6
<rqou> i learned this from a mojang bug report because mojang thought they were really clever by totally disabling ipv6 in their launcher
<rqou> interestingly, I don't have ipv6 on mobile, just nat
<openfpga-bot> [jtaghal-apps] azonenberg pushed 1 new commit to master: https://git.io/fAwGM
<openfpga-bot> jtaghal-apps/master df3d9d7 Andrew Zonenberg: Added initial enumeration support for Glasgow
<openfpga-bot> [jtaghal-cmake] azonenberg pushed 1 new commit to master: https://git.io/fAwGD
<openfpga-bot> jtaghal-cmake/master 0bdeb6c Andrew Zonenberg: Updated to latest submodules
<rqou> surprisingly, comcrap in the us has a really competent backend team and has been deploying native dual stack for quite some time
<azonenberg_work> rqou: yeah looking forward to getting a proper dual stack setup on a static allocation here once i'm set up
<azonenberg_work> i had a tunnel before just because i didnt want to renumber the network too many times
<azonenberg_work> but once i'm set up at the new lab i'm going full static /56
<rqou> oh yeah, one stupid thing is that comcrap assigns dynamic ipv6 prefixes
<azonenberg_work> I think you can get static on business class
<azonenberg_work> it takes some effort but i got one
<azonenberg_work> (comcast has dynamic v4 too fwiw)
<rqou> supposedly according to the interwebs you'll usually keep it until the cmts gets rebooted
<azonenberg_work> yeah thats standard practice for dynamic ips in general
<zkms> whitequark: major US cell carriers built their LTE packet cores on ipv6 and also the fruit company has been pushing for ipv6 pretty hard
<rqou> why does my phone still not have ipv6?
<azonenberg_work> Glasgow API version:
<azonenberg_work> Serial number: (error)
<azonenberg_work> User ID: (error)
<azonenberg_work> Interface 0: Glasgow revA
<azonenberg_work> Enumerating interfaces... 1 found
<azonenberg_work> whitequark: ^
<whitequark> azonenberg_work: hmmmm let's see
<whitequark> azonenberg_work: uhhhh
<whitequark> LeakSanitizer does not work under ptrace (strace, gdb, etc)
<whitequark> this is a reason to not always enable sanitizers.
<azonenberg_work> Thats not relevant to the error, is it?
<whitequark> it is
<whitequark> I tried to strace jtagd
<whitequark> to see why it breaks
<whitequark> and I cant
<azonenberg_work> Well, i guess disable sanitizers in your local build temporarily
<azonenberg_work> and make a ticket for "only enable sanitizers in some specific build config" or something?
<whitequark> sure I did that
<whitequark> azonenberg_work: fixed
<openfpga-bot> [jtaghal] whitequark pushed 1 new commit to master: https://git.io/fAwZa
<openfpga-bot> jtaghal/master 6844146 whitequark: Fix GlasgowSWDInterface serial number discovery.
<openfpga-bot> [jtaghal-cmake] azonenberg pushed 1 new commit to master: https://git.io/fAwnX
<openfpga-bot> jtaghal-cmake/master 248db6f Andrew Zonenberg: Updated to latest submodules
<openfpga-bot> [jtaghal-apps] azonenberg pushed 1 new commit to master: https://git.io/fAwnH
<openfpga-bot> jtaghal-apps/master 21e9142 Andrew Zonenberg: Added --api glasgow switch
<openfpga-bot> [jtaghal-cmake] azonenberg pushed 1 new commit to master: https://git.io/fAwn7
<openfpga-bot> jtaghal-cmake/master b22db16 Andrew Zonenberg: Updated to latest submodules
<azonenberg_work> OK jtagd now starts and runs with a glasgow attached
<azonenberg_work> Doesn't do much yet because i havent added socket commands for the SWD protocol yet
<azonenberg_work> I also have not yet added support for the client to query the transport layer protocol
<azonenberg_work> So right now it gets very confused trying to send jtag commands to a swd interface
<whitequark> found some memory leaks
<whitequark> lemme fix those
<azonenberg_work> The server correctly ignores the JTAG commands in SWD mode but the client doesn't yet know it should be trying to do SWD :p
<azonenberg_work> I'm about to start doing some cable plant work downstairs so will have to leave this for a bit
<azonenberg_work> But will try to get back to it tonight in 3-4 hours
<whitequark> ah ok!
unixb0y has quit [Ping timeout: 240 seconds]
unixb0y has joined ##openfpga
<rqou> awygle: ^
Miyu has joined ##openfpga
hackkitten has quit [Ping timeout: 244 seconds]
emeb has quit [Quit: Leaving.]
<rqou> huh this is new -- contactless payment on a gas pump
<rqou> still no emv though
<whitequark> contactless is emv i think
<rqou> yeah, but this pump doesn't support a physical chip card
<rqou> only magstripe and contactless
<whitequark> rqou: no skimmers?
pie___ has joined ##openfpga
<awygle> rqou: I don't disagree with anything said there, but the focus on VC-backed open source is weird, as is the idea, implied by the statement that lack of adoption doesn't help developers, that adoption somehow *does* help developers
pie__ has quit [Ping timeout: 240 seconds]
<whitequark> yeah
<awygle> In addition to Tidelift, I'd call out License Zero as a cool thing I've learned about recently which is relevant to this topic
<awygle> long term i'd like to see us figure out how to adapt the worker-owned cooperative model to a world where a project has potentially thousands of workers and it's very difficult to gauge the relative amounts of their work
<whitequark> lol stripe account
<whitequark> so, not anything i could potentially use
<awygle> whitequark: yes, that sucks for a number of reasons. i like the concept much more than the implementation.
<whitequark> something something bitcoin
<awygle> whitequark: have you checked out uh... Stellar i think? supposedly much more usable for transactions than bitcoin?
GenTooMan has quit [Quit: Leaving]
* awygle is hugely ignorant here and hopes to gain knowledge
<whitequark> oh there's a number of networks like that
<whitequark> none of them have the adoption of bitcoin though
<whitequark> i'm looking forward to using something other than the bitcoin tire fire
<awygle> ah okay so it's adoption limited
<awygle> i was wondering if stellar's backing by stripe meant it was susceptible to all the bullshit gatekeeping or something
<awygle> woo pcbs
<awygle> 2-5 days
<whitequark> awygle: it probably is, but bitcoin exchanges aren't immune from AML either
Maya-sama has joined ##openfpga
Maya-sama has quit [Ping timeout: 272 seconds]
<TD-Linux> awygle, stellar doesn't have the distributed consensus model that bitcoin has, making it not really better than just using stripe
<awygle> TD-Linux: i mean, i feel like "better for what purpose" is a relevant question here, but like i said i have no real knowledge in this space
<azonenberg_work> awygle: better for collecting VC dollars?
<TD-Linux> heh. my more serious (but no less truthful) answer is that I think there's plenty of room for high level improvements to bitcoin (e.g. lightning) that don't throw away properties that make bitcoin as successful as it is.
<TD-Linux> t. slightly biased as I've done some power sidechannel work on libsecp256k1, bitcoin's elliptic curve implementation
<rqou> tbh i don't really trust any asymmetric crypto to be properly hardened against side channels?
<rqou> i've actually implemented secp256r1 and i have no idea if it's even correct, let alone secure
<rqou> and i don't really know how i would go about confirming that it is
<TD-Linux> it's really hard to do correctly. secp256k1 has pretty extensive mitigations - e.g. when there is a branch it computes values for both sides of the branch and then does a cmov
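The compute-both-sides-then-cmov pattern TD-Linux describes can be illustrated in Python. This is illustrative only: Python's arbitrary-precision integers are not constant-time, so it only shows the shape of the trick, which libraries like libsecp256k1 implement in C.

```python
MASK32 = 0xFFFFFFFF

def ct_select(cond_bit, a, b):
    """Branch-free 32-bit select: a if cond_bit == 1, else b.

    Both inputs are always read and combined; there is no
    data-dependent branch, mirroring the cmov mitigation.
    """
    mask = (-cond_bit) & MASK32          # all-ones if cond_bit, else zero
    return (a & mask) | (b & ~mask & MASK32)
```

A real implementation would compute both candidate values first (doing the work for both sides of the "branch"), then select with this kind of masking.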
<TD-Linux> but it's still mostly guesswork. one piece of hardware that I keep on not finishing is a stm32 board that has current sense resistors and amplifiers in front of each decoupling cap, giving me really high bandwidth current measuring capability, also synchronized to the clock
<TD-Linux> with the goal of making a ci test of sorts for power sidechannels in crypto algorithms
<rqou> and then intel manages to make you a new side channel anyways :P
<TD-Linux> yeah, that's going to be a gift that keeps on giving :)
_whitelogger has joined ##openfpga
<whitequark> azonenberg_work: are you here yet?
Bike has quit [Quit: Lost terminal]
ZipCPU has quit [Ping timeout: 245 seconds]
ZipCPU has joined ##openfpga
<azonenberg_work> whitequark: just packing up
<azonenberg_work> TD-Linux: awesome
<azonenberg_work> TD-Linux: personally, i think that side-channel-free crypto in software on a GP CPU is impossible, period
<azonenberg_work> If you need to eliminate side channels do it in hardware with guaranteed constant timing
<azonenberg_work> no caches, no buses, no shared resources of any kind that you can have contention that affects timing
<azonenberg_work> Custom hardware makes power tweaking much easier too
<azonenberg_work> TD-Linux: did you see the CPU architecture i designed for running crypto algorithms?
<azonenberg_work> TD-Linux: http://paste.debian.net/plainh/79aca68f
<azonenberg_work> Meant to run any current or future hash or cipher efficiently. Not for pubkey at all
<rqou> <bullshit>what about post-quantum cryptography?????</bullshit>
<azonenberg_work> I mean i can't predict the future, but since the days of DES basically all block/stream ciphers and hashes have involved a bunch of high fan-in bitshifts, additions, bitwise operations, table lookups, etc
<TD-Linux> azonenberg_work, I wouldn't say it's impossible, but it's extremely implementation dependent which is unfortunate
<azonenberg_work> Minimal use of conditionals, multiply/divide, etc
<TD-Linux> part of the reason I'm using stm32 is a lot of the hardware wallets use it
<azonenberg_work> TD-Linux: Without microarchitectural information the chip vendors don't release?
<azonenberg_work> i don't think it's possible
<azonenberg_work> certainly not on an applications processor with any kind of OoO engine - may be possible on an in-order MCU core
<TD-Linux> yeah the m4 is pretty simple and in order
<azonenberg_work> Yeah but there is still potential for some stuff with the flash prefetch engine etc
<azonenberg_work> or AHB bus contention
<TD-Linux> indeed, that's the reason why I'm measuring. if I already knew the answer I wouldn't do it :)
<azonenberg_work> Lol
<azonenberg_work> Anyway, i'm curious what you think of that ISA
<rqou> lol i totally forgot that m4s do have prefetch/cache/etc
<azonenberg_work> I never actually implemented it, and i had some more work to do on the control plane side of things i think (this was mostly datapath)
<azonenberg_work> The goal was to maximize instructions per clock for a crypto-specialized CPU with a fully deterministic in-order pipeline and a single register file write port
<azonenberg_work> i.e. you cannot retire >1 reg write per clock
<TD-Linux> rqou, only on flash but yes. the m7 supports external dram and has cache on that too iirc
<rqou> i've also been bit by the store buffer
<azonenberg_work> TD-Linux: It has a Y-shaped pipeline made of three ternary ALUs :D
<rqou> if you clear a timer interrupt bit too close to the end of the isr handler the handler will trigger a second time
<TD-Linux> what's the utility of the ternary?
<azonenberg_work> No R/I type format
<azonenberg_work> Everything is r32 op r32 op imm32
<azonenberg_work> So you have two parallel execution units like that
<azonenberg_work> Total of four registers, two opcodes, two immediates
<azonenberg_work> Then you have a third execution unit that, instead of using registers as inputs, operates on the outputs of the previous ALUs
<azonenberg_work> with a third opcode
<azonenberg_work> and writes to a single register
<azonenberg_work> Did i mention this uarch was targeting VERY high fan-in operations? :D
<TD-Linux> rqou, yeah the nvic is also really complex. I haven't had many problems with it though, but I've always turned off interrupt preemption
<rqou> have you thought about maybe doing an explicit datagraph ISA instead?
<azonenberg_work> rqou: It might be hard to make that deterministic runtime
<rqou> hmm ok
<azonenberg_work> Which was the other goal, i wanted cycle accurate performance with no data dependent control flow whatsoever
<rqou> i don't really know anything about the design space
<TD-Linux> presumably this is primarily targeting AES?
<rqou> and MSFT/QCOM didn't say much about it
<azonenberg_work> TD-Linux: it's designed to run AES, MD5, SHA1/2, and any future replacements
<azonenberg_work> The vision was that you could bake this into an asic that will be in service for decades
<azonenberg_work> and have performance better than a stock MCU core but more flexibility than a hard accelerator
<azonenberg_work> and probably much less area than an eFPGA
<TD-Linux> why the ternary tho
<azonenberg_work> Because a lot of hashes mix a constant in every round
<TD-Linux> ohhhh
<TD-Linux> I thought you were actually implementing ternary *base* arithmetic
<azonenberg_work> No
<azonenberg_work> I meant 3-input arithmetic
<TD-Linux> (there are some horrific base 3 hash algorithms)
<azonenberg_work> The whole ~140 bit instruction word implements
<azonenberg_work> (reg1 op1 reg2) op3 (reg3 op2 reg4)
<azonenberg_work> sorry i missed the immediates
<azonenberg_work> (reg1 op1 reg2 op1 imm1) op3 (reg3 op2 reg4 op2 imm2)
<azonenberg_work> i think op3 might take an immediate too
<azonenberg_work> So you could do (r0 + r2 + 0xdeadbeef) ^ (r4 & r6 & 0x41414141) in one instruction
<azonenberg_work> You see why this would excel at crypto now? :)
<azonenberg_work> oh and i think all of the regs input to the ALUs can be bitwise complemented too
<TD-Linux> it looks okay. the latency is kind of enormous though
<azonenberg_work> So you could do the MD5 (b & c) | (!b & d)
<TD-Linux> this is probably ok for aes-ctr and the like though
<azonenberg_work> in one instruction
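The instruction shape azonenberg_work describes — (reg1 op1 reg2 op1 imm1) op3 (reg3 op2 reg4 op2 imm2) — can be modeled in a few lines of Python. This is a hypothetical functional model: the opcode names and argument order are invented for illustration, not taken from the actual ISA document.

```python
MASK32 = 0xFFFFFFFF

# Hypothetical opcode table; the real ISA sketch has more operations.
OPS = {
    'add': lambda a, b: (a + b) & MASK32,
    'and': lambda a, b: a & b,
    'or':  lambda a, b: a | b,
    'xor': lambda a, b: a ^ b,
}

def ternary_insn(op1, r1, r2, imm1, op2, r3, r4, imm2, op3):
    """(r1 op1 r2 op1 imm1) op3 (r3 op2 r4 op2 imm2) in one 'cycle'."""
    left  = OPS[op1](OPS[op1](r1, r2), imm1)   # first parallel ALU
    right = OPS[op2](OPS[op2](r3, r4), imm2)   # second parallel ALU
    return OPS[op3](left, right)               # merging third ALU

# The example from the discussion:
# (r0 + r2 + 0xdeadbeef) ^ (r4 & r6 & 0x41414141)
result = ternary_insn('add', 0x1000, 0x2000, 0xdeadbeef,
                      'and', 0xffff0000, 0xf0f0f0f0, 0x41414141,
                      'xor')
```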
<azonenberg_work> And well, this is meant for stream processing in general
<azonenberg_work> crypto tends not to have much data-dependent operations
<azonenberg_work> So you can have a deep pipeline
<azonenberg_work> in fact, if you look at the memory map
<azonenberg_work> input and output are memory mapped FIFOs :D
<azonenberg_work> this core is meant to be a black box where data goes in and data comes out
<azonenberg_work> and it sits between say application logic and a TCP offload engine or something
<azonenberg_work> Like i said before, I never actually *built* it so I don't know how performance would be - it'd need to be tested and tweaked a lot
<azonenberg_work> It was mostly just an architectural experiment that targeted a less-common point in the design space
<rqou> inb4 you built another preshot
<azonenberg_work> whats that?
<rqou> mocking name for intel's prescott uarch
<rqou> also wtf prescott was "only" 90nm?
<rqou> i thought it was a smaller node than that
<TD-Linux> azonenberg_work, you'll know you've hit peak prescott housefire when you have to clock your alus at half speed
<rqou> wait it does that?
<rqou> hey azonenberg_work, you're working on your jtag tools right?
<rqou> want to finish up coolrunner-ii support for all parts?
<rqou> also add the "crbit" format support?
<TD-Linux> also two ports run at double speed
<azonenberg_work> rqou: I'm working on features i can justify for work right now
<azonenberg_work> so arm stuff and swd
<azonenberg_work> But if i have time, sure
<azonenberg_work> if there are not tickets on the github already, file them
<rqou> which repo?
<azonenberg_work> jtaghal
<TD-Linux> this has probably already been discussed to death but I was happy to see the talos ii mobo uses icestorm and friends on a hx1k https://git.raptorcs.com/git/talos-system-fpga/tree/
m_w has quit [Ping timeout: 244 seconds]
m_w has joined ##openfpga
sensille has joined ##openfpga
<sensille> i'm using yosys/arachne-pnr for the first time. how do i set a timing constraint on the clock?
<azonenberg_work> sensille: i could be wrong (I'm less familiar with it than some other people)
<azonenberg_work> but my understanding is that arachne is not a timing-driven PAR
<azonenberg_work> it does the best job it can, then you run static timing to see if it worked
<sensille> i was afraid of that, thanks
<azonenberg_work> nextpnr is the next-generation tool that i believe is timing driven
<sensille> output from icetime: Unable to resolve delay for path ce -> ltout in cell type LogicCell40!
<azonenberg_work> i think it works on ice40 and ecp5?
<azonenberg_work> mithro: ^^
GuzTech has joined ##openfpga
<sensille> looks like i'm having a hard time fitting my design into a hx8k :-/
<sensille> 24% luts of a artix-7/35T
rohitksingh_work has quit [Quit: Leaving.]
<azonenberg_work> sensille: have you considered optimizing it? What's it do?
<sensille> 4-channel stepper motor controller, the design i discussed with ZipCPU the other day
<azonenberg_work> Hmm, might be worth trying to figure out where your area is going
<azonenberg_work> and see if you can share some resources between channels or otherwise shrink it
<azonenberg_work> that sounds awfully large
<sensille> i don't know. the old xilinx parts had a local 4-bit RAM in each cell. i used that for a cpu to switch state between threads. maybe current fpgas are similar and i can very cheaply multiplex the logic between the controllers
<azonenberg_work> well, i was actually going to suggest that you overclock the design
<azonenberg_work> and have one copy at 4x the rate
<azonenberg_work> and just have a shift register or something on the io cells
<azonenberg_work> stepper control doesnt sound like it needs 100 MHz clock frequencies
<sensille> the area goes into adders
<sensille> i'm running it at 20Mhz
<azonenberg_work> So can you make timing at 80?
<azonenberg_work> if so, you can have one copy of the core logic
<azonenberg_work> just replicate state
<sensille> i know as soon as icetime works :)
<azonenberg_work> Lol
<sensille> vivado reported 35ns
<sensille> but it was only constrained to 50ns, so i don't know the limit
<azonenberg_work> Vivado timing doesnt mean much unless you set a tight constraint
<azonenberg_work> exactly
<azonenberg_work> it wasnt trying hard
<sensille> is multiplexing cheap? does it map well to the cells?
<sensille> i guess it depends much on the architecture
<azonenberg_work> Yeah it should be, a 4:1 mux is one lut
<azonenberg_work> so basically replace every state bit with a lut and four dffs
<sensille> so 4 channels would be the sweet spot :)
<azonenberg_work> With a 6LUT xilinx arch
<azonenberg_work> On a lattice 4LUT arch you would need two luts per mux i think
<azonenberg_work> unless there is hard mux ip
<sensille> one channel implementation: LCs 6151 / 7680
<azonenberg_work> (you might also be able to optimize the channel logic itself but this is a start)
<azonenberg_work> The naive solution to everything in FPGA is throw hardware at it
<azonenberg_work> but sometimes you can run faster with less hardware and do the same work
<sensille> i can cut channel logic by 30% without trying too hard i guess
<sensille> but to be honest i was hoping to have some headroom :)
<azonenberg_work> Yeah before you do any muxing of channels
<azonenberg_work> See how small you can get one
<azonenberg_work> Then let me take a look at the RTL and i might have some suggestions on how to shrink it more
<sensille> you will recoil in horror when you see the 208-bit-adders :)
* GuzTech gasps
<sensille> :)
<GuzTech> Dare I ask?
<GuzTech> Why do you have/need 208-bit adders? :D
<sensille> i can probably cut them down to 180 bits ;)
<azonenberg_work> why
<azonenberg_work> That is probably your #1 problem :p
<azonenberg_work> that's at least 208 luts per adder
<sensille> to calculate a 5th order polynomial differentially over 20M steps without accumulating too much error
<GuzTech> If it's a ripple carry adder...
<GuzTech> Or else it's even more (but most likely faster).
<sensille> so i don't need multiplication
<azonenberg_work> sensille: uh... i feel like there has to be a better solution :p
<azonenberg_work> on a xilinx part i'd use the hard multiplier block
<GuzTech> How large is your bitwidth?
<GuzTech> I feel like you could implement a small multiplier with carry-save adders, which would be faster and smaller.
<sensille> i don't know what bitwidth i'd need, depends if i implement floating point or not i guess
<sensille> but t^5 is large for t==20M, no idea how i would approach that
<sensille> of course the coefficient is small
<rqou> i would suggest a microcoded approach
<rqou> with some ram and a smaller adder
<sensille> but that would have to run at a quite high clock, if i want to end up with 20M evaluations/s
<rqou> hmm
<rqou> I'm not familiar with your design but can you use precomputed lookup tables?
<sensille> no
m4ssi has joined ##openfpga
knielsen has quit [Ping timeout: 246 seconds]
rohitksingh_work has joined ##openfpga
<rqou> not even by cheating? I'm not aware of stepper controllers being that complicated
<sensille> my current approach is something like the mechanical difference engine. babbage also solved the error propagation problem by making the adders larger :)
<sensille> he used 35 digits or such
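The difference-engine scheme sensille describes — stepping a polynomial with nothing but additions, holding error at bay via wide integer registers rather than floating point — sketches out as the following all-integer Python (the wide hardware adders correspond to these integer additions):

```python
def forward_difference_table(poly):
    """Build the initial difference table [p(0), dp, d2p, ...] for
    p(t) = poly[0] + poly[1]*t + poly[2]*t**2 + ...

    All-integer state: like Babbage, error control comes from making
    the registers wide enough, not from rounding floats.
    """
    deg = len(poly) - 1
    # The first deg+1 values of p(t) seed the table.
    vals = [sum(c * t**i for i, c in enumerate(poly)) for t in range(deg + 1)]
    table = [vals[0]]
    while len(vals) > 1:
        vals = [b - a for a, b in zip(vals, vals[1:])]
        table.append(vals[0])
    return table

def step(table):
    """Advance one step: p(t+1) appears in table[0].

    Only additions per step -- this is the work the wide adders do,
    once per output sample, with no multiplies at all.
    """
    for i in range(len(table) - 1):
        table[i] += table[i + 1]
    return table[0]
```

For a 5th-order polynomial the table has six entries, so each output costs five additions; running it 20M times only ever adds, which is why the adders (not multipliers) dominate the area.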
<rqou> why is there a giant polynomial involved?
<sensille> i want to use 5th order to make the math for the approximation very easy. with some tricks a 4th order polynomial might also do
<sensille> end-to-end the design is dead simple. but also relatively expensive in the fpga, but cheap on the host
<rqou> uh... don't stepper motors normally just have a "step" and "direction" pin? what does the polynomial do?
<sensille> so i can do the host part on an rpi
<sensille> calculate the step
<rqou> ah, you're building a cnc machine?
<rqou> why does this calculation need to be in the fpga? can it be in the rpi instead?
<sensille> 3d printer
<rqou> alternatively, can you evaluate your polynomial with Horner's method?
<sensille> the biggest problem of all hobby 3d-printers is step calculation. they currently do that in an mcu and have to make tons of horrible simplifications
<sensille> and the rpi doesn't have a good enough timing to do the control directly
knielsen has joined ##openfpga
<rqou> wait, the printer is entirely open-loop, right?
<sensille> yes
<rqou> it seems like it should be possible to precompute all the step counts ahead of time and just feed that into an fpga to generate step pulses?
<sensille> my goal is to control the jerk of the motion, to reduce vibrations
<sensille> yes, but that approach has 2 problems: it might bring the rpi to its limits, and you need to transfer the step data to the fpga
<sensille> for the latter you need some kind of compression
<sensille> and you could say i use a polynomial for compression :)
<rqou> does the rpi have dma-capable qspi?
<sensille> qspi? quad? afaik (i'm actually using an orange pi), dma yes, but only one data line
<sensille> not sure
<rqou> I'd definitely investigate doing all the hard stuff on the pi and have the fpga just generate pulses
<sensille> but it's also a matter of latency. linux isn't very good at it out of the box
<GuzTech> Can't you run a bare-metal program that does all the calculation? At least the latency would be much better.
<rqou> yeah i was thinking just mmaping the dma controller and the spi peripheral and poking them directly
<rqou> overall i think you should pick a better hardware-software tradeoff :P
<sensille> bare metal, without linux? then i'd need another board for the UI stuff
<GuzTech> Oh nvm then :P
<rqou> alternatively do look into Horner's method which avoids your giant exponentiation problem
<sensille> rqou: should i manage to fit the design in an hx8k i think the tradeoff would be not too bad
<GuzTech> But yeah, as rqou said, you should think about a better tradeoff.
<sensille> of course i don't want to spend $50 on the fpga
<rqou> i also tend to bias my tradeoff towards software because software is easier with shorter dev cycles
<rqou> azonenberg_work tends to bias towards hardware because hardware is easier to verify
<sensille> my current tradeoff is the simplest overall design
<GuzTech> Horner's method seems simple enough to implement.
<sensille> yeah, looking :)
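Horner's method, as suggested, rewrites c0 + c1*t + ... + c5*t^5 as c0 + t*(c1 + t*(c2 + t*(...))), so no large power of t is ever formed on its own:

```python
def horner(coeffs, t):
    """Evaluate sum(coeffs[i] * t**i) with n multiplies and n adds.

    coeffs are given lowest-order first. The accumulator only ever
    holds intermediate sums, never a bare t**5-sized power.
    """
    acc = 0
    for c in reversed(coeffs):
        acc = acc * t + c
    return acc
```

For a 5th-order polynomial this costs five multiply-adds per evaluation, traded against the add-only difference scheme's five plain additions per step.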
<sensille> you think a, lets say, 30 bit multiplier is cheaper than a 200 bit adder?
<s_frit> sensille: just curious: do you think this jerk control business is going to work better than building a closed-loop solution?
<sensille> s_frit: closed loop would mean a completely different kind of hardware, much more expensive
<s_frit> sensille: i see. i guess even with a closed loop controller, you still want the control input to be as smooth as possible.
<sensille> i want to raise the cheap hobby-solutions to the next level, by spending $10-$20 on an fpga
<GuzTech> sensille, a 30x30 bit multiplier is basically a 900 bit adder. But if you build it with carry-save adders, then your latency is much lower.
<rqou> how do the current solutions all work?
<GuzTech> So it's a area/performance tradeoff.
<rqou> GuzTech: uh, the final answer should only be 60 bits, not 900?
<GuzTech> You'd need 30 times 30 bit adders, no?
<GuzTech> Or am I not awake yet?
<sensille> rqou: they use an mcu to generate the steps. they max out at 60-100kHz with an awful lot of jitter
<sensille> they can't even control the acceleration properly, let alone the jerk
<rqou> and you feed them gcode?
<rqou> GuzTech: not with the usual wallace tree structure
<s_frit> am i correct to presume there is sigma-delta modulation of the pulses involved?
<sensille> also, the current stepper driver technology allows for a microstep resolution of 1/256 to get smooth motions, of course there's no way you can make use of it with an mcu
<rqou> why not?
<sensille> rqou: yes, they currently get fed gcode
<rqou> i get the feeling this entire thing can be implemented by intelligently programming an stm32f4
<sensille> rqou: 1/256 means step rates into the low MHz
<rqou> using its hardware timers to generate the proper pulses
<GuzTech> rqou: True, a Wallace/Dadda/HPM structure uses less hardware, but it would still be more than just 60 bit adders.
<sensille> and of course in theory step rate changes with every step when doing a curve
<s_frit> use a high-speed spi port to stream out a pulse stream
<sensille> s_frit: sigma-delta?
<sensille> pre-calculating the exact step-data would be too much for an rpi i guess
<rqou> how computationally expensive can these calculations all be?
<s_frit> sigma-delta modulation -- noise-shaped pulse generation (tbh i don't know if it is applicable to stepper motor control, i just assumed)
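In its simplest form, that kind of noise-shaped pulse generation is just a phase accumulator that pulses on overflow — a first-order sigma-delta. Whether it suits stepper drive is, as s_frit says, an open question; this is only a sketch of the mechanism:

```python
def sigma_delta_pulses(increment, n, accumulator_bits=32):
    """First-order sigma-delta / phase-accumulator pulse generator.

    Emits one pulse each time the accumulator overflows, giving an
    average rate of increment / 2**accumulator_bits pulses per clock;
    the quantization error is carried forward, not discarded.
    """
    mod = 1 << accumulator_bits
    acc = 0
    pulses = []
    for _ in range(n):
        acc += increment
        pulses.append(1 if acc >= mod else 0)
        acc %= mod            # keep the residual error for next clock
    return pulses
```

Varying `increment` each clock would give a pulse rate that tracks an arbitrary profile, which is the connection to the step-rate problem above.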
<sensille> rqou: evaluating a 5th order polynomial or sin/cos :)
<rqou> how frequently?
<sensille> once per step
<sensille> so a few 100k/s
<sensille> on multiple channels
<rqou> O_o
<rqou> I'd personally go with an approach of calculating all of this ahead of time on a PC and generating a list of steps
<rqou> and then you can figure out how to actually "play" this list
<s_frit> or compute the high-res verson at a lower rate, and interpolate to upsample
<sensille> producing GB of data?
<rqou> would it really be that much?
<rqou> in any case, GBs of storage are pretty cheap nowadays :P
<sensille> you could compress it by approximating it with a polynomial ;)
<s_frit> sensille: how are you evaluating the polynomial? as a p(x) type thing, or p(x) = f(p(x-1))
<sensille> also, pre-calculating would take too much time
<sensille> s_frit: currently the latter
<sensille> rqou: if you can't do it in realtime, you can't wait for it. a print may take 20h
<azonenberg_work> sensille: personally, i would indeed go closed loop
<sensille> azonenberg_work: why?
<azonenberg_work> Because it eliminates all this math and you can do basic PID control or something
<azonenberg_work> much more precise too
<azonenberg_work> of course, i also wouldn't build a FDM 3d printer
<rqou> given these constraints I'd probably feed Ethernet into an fpga
<azonenberg_work> i'd go with stereolithography or similar
<azonenberg_work> or SLS
<sensille> and what would be the advantage? you'd still need to calculate the path
<rqou> a program on a PC doing math and feeding a step list into the fpga
<sensille> SLA needs a path for the laser, same problem, higher rates :)
nurelin has quit [Ping timeout: 244 seconds]
<azonenberg_work> sensille: i'd feed a set of coordinates to the FPGA over Ethernet from a CPU of some sort
<azonenberg_work> Then have the FPGA do simple closed loop PID control to reach those coordinates
<sensille> but you also need to control the speed and coordinate that with the extrusion
<sensille> i think closed loop would just add another layer of problems
<s_frit> maybe feed the coordinates to the FPGA at low rate, then upsample using some high-quality interpolation to generate the pulses. assuming that the control signal is bandlimited this will work just as well as running the polynomials at high rate
<rqou> at this point I'd start looking into approximations
<rqou> since somehow those Arduino-powered things work
<sensille> ok, i can fit 2 channels into an hx8k, there's hope :)
<sensille> s_frit: that is not far from what i do
<sensille> rqou: the polynomials are already the approximations
<s_frit> sensille: the advantage of what i'm proposing is there is a single stream of numbers for each channel, and the interpolator just needs to generate a fixed number of interpolated points for each input sample, you may be able to store the interpolation weights in ram
<sensille> otherwise i'd need to calculate sin/cos
<s_frit> for example you could use 8th order interpolation, then each oversampling tap will need 32x8 32-bit coefficients and you'll need to perform 8 multiplies per output sample
<s_frit> erm that should be 8 x 32-bit coefficients
<azonenberg_work> sensille: sin/cos to how many bits?
<sensille> azonenberg_work: haven't really thought about this, maybe 24?
<rqou> wait, I'm not quite seeing why you need to calculate sin/cos every single step
<rqou> what exactly is the performance improvement you want to make?
<sensille> control the jerk
<sensille> but maybe i can do with a linear interpolation between 16 steps or such
<sensille> or more
<rqou> don't you only need accurate calculations around direction changes?
<sensille> my goal is to print models that are described by splines
<sensille> so the direction changes with each step
<s_frit> cubic hermite interpolation is also an option maybe
<sensille> and even with gcode, when transitioning between the lines, the head has to move along a curve
<rqou> yeah, I'd go with the "stream data via Ethernet" approach
<azonenberg_work> honestly, i wouldn't even use gcode
<sensille> s_frit: ZipCPU explained a way to interpolate from coordinates to me, that ended up using 4th order polynomials
<sensille> azonenberg_work: i don't want to use gcode
<sensille> that was my starting point of this adventure
<azonenberg_work> i'd precompute a full toolpath in real time on the CPU, with target positions streaming over ethernet every few microseconds
<azonenberg_work> then the FPGA just does closed-loop control to ensure you go to that position
<azonenberg_work> you could also go with something like a Zynq and nix the ethernet and have a super low latency link
<sensille> closed-loop would probably triple the price of the printer
<azonenberg_work> I didn't say i built cheap stuff :)
<azonenberg_work> I'm the wrong guy to ask if you're cost optimizing
<azonenberg_work> I go for quality, accuracy, and reliability
<s_frit> sensille: ZipCPU knows more math than me, but i was just thinking if you're contemplating linear interpolation, cubic hermite is a nice step up without going to your full 5th order solution. maybe it doesn't fit for x,y paths quite so well, i don't know
<azonenberg_work> Think Mitutoyo, not Aoyue :p
<sensille> currently my only concern is if i need a $50 fpga or a $10 one :)
<sensille> with the $50 my approach works fine
<sensille> s_frit: i tried to write down the requirements: http://3dpfs.sensille.com/index.php?title=Mathematics
<sensille> but hey, if 2 controllers already fit, i can probably optimize it to 4 and i'm good :)
<s_frit> sensille: from my point of view, assuming a low-rate-control-with-upsampling/interpolation solution, a lot of the performance requirements amount to trading off between source data rate, source data bandwidth (i.e. max velocity delta), and interpolation quality.
nurelin has joined ##openfpga
<sensille> s_frit: yes. i'd like to stay well below 1MBit for the source data rate
nurelin has quit [Ping timeout: 246 seconds]
<s_frit> sensille: and you want > 1us pulse resolution?
<s_frit> sensille: or >1MHz pulse rate?
<sensille> for extreme moves, step rate may go up to 6 MHz
<sensille> probably not needed during prints, but 1MHz is reached easily
<s_frit> sensille: how are the steps transmitted to the motor? what's the data look like?
<sensille> 2 lines, step/dir. pulses on step (or double edge)
<s_frit> so it's purely about pulse rate, pulse width doesn't come into it
<sensille> yes
soylentyellow has quit [Remote host closed the connection]
<s_frit> but you're going to get some "issues" with pulse spacing being quantized to the fpga clock speed
<sensille> the pulses are sampled by the stepper driver at about 16MHz
<s_frit> ah ok. so the stepper driver has its own filtering
soylentyellow has joined ##openfpga
<s_frit> does the stepper driver have a bandwidth specification? when you say it samples at 16MHz do you mean it's rated to properly handle input pulse rates up to 8MHz?
nurelin has joined ##openfpga
<sensille> i think that's the specified limit, yes
<s_frit> sensille: 1mbit/sec for 4 channels of 2-tuple jerks (say 32-bits per value) is a sample rate of 4kHz so that's approx 256x to 2048x upsampling ratio. nyquist rate would be 2kHz -- maybe you could calculate how that relates to spatial resolution
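As a quick check of the arithmetic above (channel count and bit widths taken from the discussion; a sketch):

```python
# Check the data-rate arithmetic: 4 channels of 2-tuples,
# 32 bits per value, within a 1 Mbit/s budget.
BITS_PER_SAMPLE_SET = 4 * 2 * 32            # 256 bits per control sample
BUDGET_BPS = 1_000_000                      # 1 Mbit/s source data rate

sample_rate = BUDGET_BPS // BITS_PER_SAMPLE_SET   # ~3906 Hz, call it ~4 kHz
nyquist = sample_rate / 2                         # ~2 kHz

# Upsampling ratio to reach 1 MHz .. 6 MHz pulse rates:
print(sample_rate)                 # 3906
print(1_000_000 // sample_rate)    # 256x
print(6_000_000 // sample_rate)    # 1536x
```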
<sensille> erm - no ;) what does the 2kHz relate to? curvature change?
pie___ has quit [Ping timeout: 244 seconds]
<s_frit> it's the maximum representable frequency in your control signal (whatever the control signal represents, presumably some derivative of position, that gets integrated)
<sensille> yes, but how does 'frequency' relate here to the physical reality? assuming the speed is constant, that would be something like the change of curvature?
<s_frit> so, if max |v| is 1 m/sec then i guess that 2kHz maps to 0.5mm spatial period
<sensille> 1m/sec is a realistic upper bound
<s_frit> like you should be able to represent a little left-right wiggling sine wave / zig/zag with that period, maybe
<s_frit> probably not though, because that's a totally theoretical upper bound, and your interpolators may not be that good
<s_frit> note that this is for representing sharp high-frequency stuff like corners
<s_frit> using a 4kHz sampled control stream
<s_frit> if you use something more like a spline "display list" of course you'll get much tighter temporal/spatial resolution
<sensille> there are no real corners, the point is to transform corners into curves
<s_frit> i just wanted to think through the uniformly sampled case
<s_frit> well then i guess you get best-case 0.5mm rounded corners with this regularly sampled scheme
<sensille> would the maximum frequency determine the maximum 'slew rate' and thus the maximum velocity?
<sensille> hm
<sensille> 0.5mm at 1m/s, ok
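The bandwidth-to-spatial-period conversion being used here is just |v|/f — a sketch of the back-of-envelope reasoning, not a rigorous slew-rate bound:

```python
# Spatial period of the highest representable frequency component:
# a sine of frequency f traversed at speed |v| repeats every v/f metres.
def spatial_period_mm(v_m_per_s: float, f_hz: float) -> float:
    return v_m_per_s / f_hz * 1000.0   # convert metres to millimetres

print(spatial_period_mm(1.0, 2000.0))   # 0.5 mm at 1 m/s, 2 kHz Nyquist
print(spatial_period_mm(0.1, 2000.0))   # 0.05 mm at 0.1 m/s
```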
<s_frit> yeah i'm not exactly sure how to convert bandwidth to slewrate
<sensille> the hardware can't deliver that anyway
<s_frit> they are convertible, for sure
<sensille> i'm aiming for 0.1mm corners at 0.1m/s
<s_frit> that seems totally doable
<s_frit> i mean, totally doable with a 4kHz stream of jerks (uniformly sampled) then interpolated with linear or cubic interpolation
<s_frit> of course you would still generate the 4kHz stream using your "smooth curves" techniques
<s_frit> hopefully i didn't make a mistake with the math ;)
<sensille> a 4kHz stream of jerks with my old implementation would certainly be good enough. but i failed at generating that stream, so i thought of a way to make the generation easy
<s_frit> haven't you just moved the generation to the fpga? where you still haven't made it easy?
<sensille> in fpga it's easy, only 6 lines of code
<sensille> but also a bit costly
<sensille> not in azonenberg_work's terms of 'costly', though)
<s_frit> what was the problem with the "old implementation" then? why did you fail at generating the stream?
<s_frit> i mean, you could stream out 8 channels of 4kHz 32 bit data using the i2s audio interface on a RasPi
<sensille> i have no idea how to approach the math
<s_frit> oh
<sensille> the main point is that errors must not accumulate
<s_frit> yeah, i was wondering about that
<s_frit> so you need to interpolate the data in such a way that the errors don't accumulate
* s_frit thinks hard
<sensille> i would want to start with 3 basic geometries: lines with s-curve motion profile, circles with constant velocity and clothoids with constant velocity
<sensille> but later on i want to use nurbs as source. this is where things really start to get nasty
<s_frit> numerical integration schemes are always vexed
<sensille> i'll need to approximate the curve length for that
<s_frit> is all this predicated on the idea that the motors don't skip pulses (i.e. pulses map precisely to particular x, y positions, relative to the starting point)
<s_frit> ?
<sensille> yes
<s_frit> and how do current controllers work? do they transmit x,y coords and then compute the deltas in the controller?
<s_frit> i mean, maybe it's easier to just spit out a 4kHz stream of x,y coordinates, interpolate the data on the fpga, and then compute the jerks from the interpolated data
<sensille> depends on what you call 'controller' in the chain. the current control board (mcu based) reads gcode (a description based solely on linear moves) and directly generates the pulses
<s_frit> you can still embed all of the 5th order stuff into the x,y generation to keep the motion smooth
knielsen has quit [Read error: Connection reset by peer]
knielsen has joined ##openfpga
<s_frit> right, so at a minimum you want to replace gcode with something that can represent curves
<s_frit> or generate 4kHz gcode
<s_frit> btw. let me know if i'm annoying you with my questions... just seems like an interesting problem
<sensille> in a first implementation i need to read gcode and enrich it by adding smooth transitions between the lines. the controllers currently do that, too, but not in a way that satisfies me
<sensille> no, i'm happy to've found someone who has taken an interest in it :)
<sensille> my math skills are weak, so i need every help i can get ;)
m_w has quit [Ping timeout: 245 seconds]
<s_frit> my math skills are not great, but i'm currently back at uni studying math, trying to improve
<sensille> i have to admit i have a math pre-grade from uni, but that was a long time ago. so i'm back at school level
<s_frit> but i know a bit about signal processing, that's why i mention the sample-rate stuff
<sensille> i see. i tried to wrap my head around that several times in the past, without success
<s_frit> well, the main thing is, if you have a regularly sampled stream with sample rate N, you can represent signals with frequencies (i.e. sine wave components) up to N/2.
<sensille> yes, nyquist, that's where my knowledge starts and end :)
<sensille> and a bit of FFT
<s_frit> then if you want to represent a "corner" (e.g. a sharp direction change like a triangle wave in a one dimensional signal) then you won't be able to make it super-sharp, because then it would have energy above the nyquist rate
<s_frit> you're going to have some energy above the nyquist rate with your piecewise curves, but if you transmit absolute position that should be no big deal (it might cause some wobbles, i'm not sure)
soylentyellow_ has joined ##openfpga
<s_frit> the serious problems will start if you are transmitting velocity or jerk, and you violate the nyquist limit, and then you interpolate the result, and then you "integrate" by feeding pulses to the motor
soylentyellow has quit [Ping timeout: 240 seconds]
<sensille> i would first generate a source curve that doesn't violate the limits, and sample that
<sensille> my current approach samples postion, velocity and acceleration, and calculate a 5th order curve between each 2 points. very simple math
<sensille> ZipCPU proposed to just sample position and interpolate over a window with a 4th order curve
<s_frit> makes sense
<s_frit> how many luts does a 32-bit multiplier use?
<sensille> but the latter would give me only indirect control over the velocity, and none over acceleration
<s_frit> assuming that you are producing equi-spaced time series data for position, that already encodes velocity and acceleration
<sensille> with "space" meaning the 2-dimensional curve length
<sensille> if i want constant velocity
<s_frit> hmm, i'm not exactly following, i think we need to use clearer language
<s_frit> the output of this system is a series of pulses, which are discrete position change commands, no?
<sensille> it is important to see that we control the motor per axis, meaning 2 motors, while the velocity is the 2-dimension velocity, |v|
<sensille> yes
<s_frit> right, so |v| = sqrt(x'^2 + y'^2) if i remember correctly
<s_frit> where x' = dx/dt and y' = dy/dt
<s_frit> i'm imagining the time-series is a series of (x,y) pairs
<s_frit> (x,y) pairs get generated from the source curves using a parameterisation that gives you the |v| properties that you want
<s_frit> by x,y i mean absolute position
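s_frit's point that an equi-spaced (x, y) series already encodes velocity can be sketched with finite differences (illustrative only):

```python
import math

# An equi-spaced (x, y) time series already encodes velocity:
# finite differences recover x', y' and hence |v| = sqrt(x'^2 + y'^2).
def speeds(points, dt):
    """Approximate |v| between consecutive (x, y) samples."""
    out = []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        vx = (x1 - x0) / dt
        vy = (y1 - y0) / dt
        out.append(math.hypot(vx, vy))
    return out

# A 3-4-5 move over one sample period:
print(speeds([(0.0, 0.0), (3.0, 4.0)], dt=1.0))   # [5.0]
```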
soylentyellow__ has joined ##openfpga
<sensille> in ZipCPU's approach, yes
soylentyellow_ has quit [Ping timeout: 264 seconds]
<s_frit> your control step for each axis looks something like (in a c-like language): at each over-sampled time step: newx = interpolatedataforx(); errx = newx - currentx; if (errx >= stepsize) { output_up_pulse(); currentx += stepsize; } else if (errx <= -stepsize) { output_down_pulse(); currentx -= stepsize; }
<s_frit> this way you're always comparing an interpolated position to the actual position, and there's no chance of drift/error accumulation
<s_frit> as soon as you start doing open-loop numerical integration you're going to accumulate round-off errors -- i guess what you're proposing is to reset to a known position at the start of every curve segment, and make sure your numerics are accurate enough to not accumulate significant error during the curve
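A runnable sketch of the per-axis loop above (names like `axis_step` and the sample targets are illustrative, not from any real controller):

```python
# Per-axis pulse generation: compare the interpolated target position to
# the actual position and emit at most one step/dir pulse per time step.
STEP_SIZE = 1   # one motor step, in position units

def axis_step(target_x: float, current_x: float):
    """Return (new position, pulse): +1 step up, -1 step down, 0 none."""
    err = target_x - current_x
    if err >= STEP_SIZE:
        return current_x + STEP_SIZE, +1
    elif err <= -STEP_SIZE:
        return current_x - STEP_SIZE, -1
    return current_x, 0

# Because we always compare against the commanded absolute position,
# rounding errors cannot accumulate: the axis tracks the target to
# within one step, with no open-loop integration drift.
x, pulses = 0, []
for target in [0.0, 0.4, 1.2, 2.9, 2.1, 0.3]:
    x, p = axis_step(target, x)
    pulses.append(p)
print(pulses)   # [0, 0, 1, 1, 0, -1]
```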
<s_frit> i think ZipCPU and my method are the same, except maybe i'd use a different interpolator, but not qualified to give advice on fpga implementation strategies, so i'd defer to him on that
<s_frit> i do think it's worth noting that in a regularly sampled system, your interpolation times are going to be some fixed multiple of the base sample rate so you can potentially store the interpolation coefficients for each sub-sample tap in block ram
<s_frit> perhaps more useful: interpolation times will be at some fixed subdivision of the base sample period
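The precomputed-coefficient idea can be sketched with Catmull-Rom weights (one standard cubic Hermite variant, with tangents taken as centred differences) tabulated per sub-sample tap — the upsampling ratio here is illustrative:

```python
# With a fixed upsampling ratio, the cubic weights for every sub-sample
# tap can be precomputed once and stored in a table (the block-RAM idea
# above). Each output sample is then just 4 multiplies and 3 adds.
R = 8   # upsampling ratio (illustrative)

def catmull_rom_weights(t):
    """Catmull-Rom basis weights for the four samples around t in [0,1)."""
    return (
        0.5 * (-t**3 + 2*t**2 - t),
        0.5 * (3*t**3 - 5*t**2 + 2),
        0.5 * (-3*t**3 + 4*t**2 + t),
        0.5 * (t**3 - t**2),
    )

TABLE = [catmull_rom_weights(i / R) for i in range(R)]

def upsample_tap(p0, p1, p2, p3, tap):
    """Interpolate between p1 and p2 at sub-sample position tap/R."""
    w = TABLE[tap]
    return w[0]*p0 + w[1]*p1 + w[2]*p2 + w[3]*p3

print(upsample_tap(0.0, 0.0, 1.0, 1.0, 0))   # tap 0 reproduces p1: 0.0
```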
<sensille> s_frit: yes, each segment has to be calculated from the real position incl. error, not the theoretical one. an interesting point: to keep the derivatives continuous, interpolation also has to take these into consideration. except one decides that the error is too small to matter
<s_frit> sensille: this runs through a bunch of low-order polynomial interpolators: http://yehar.com/blog/wp-content/uploads/2009/08/deip.pdf
<sensille> that's a long read
<s_frit> well the short summary is: "just use cubic hermite"
<sensille> "... for audio oversampling"
<s_frit> sensille: there is one with simple coefficients here: http://www.musicdsp.org/archive.php?classid=0#16 see Oscillator::UpdateWithCubicInterpolation
<s_frit> hmm, maybe not that last one
<s_frit> i don't see any difference between audio sampling and what we are discussing here
<s_frit> it's all signals
<s_frit> the code in that last one is more confusing than i thought
<s_frit> thing with that 3rd order cubic is i know it will give you significantly better performance than linear
<s_frit> *linear interpolation
<s_frit> it's pretty much flat for frequencies below N/4
<sensille> the difference (or not?) is that i care about a continuous 2nd derivative
<s_frit> that matters for audio too
<sensille> so can i just use a sound chip to control my steppers? :P
<s_frit> if you can live with pulse rate below 22kHz sure
<sensille> use x/y as stereo input and derive steps from the magnitude of the output signal
<s_frit> anyhow, we're talking about taking your low-rate 4kHz position data (equivalent to the audio signal) and upsampling it in the fpga to 1Mhz or more
<s_frit> out of a raspberry pi you can easily stream 44kHz 64-bit digital data out of the audio interface, so you could easily output 8 8kHz 32-bit values to the fpga that way
<s_frit> i'm talking about the i2s serial audio interface here
soylentyellow__ has quit [Quit: Leaving]
<s_frit> i'm assuming 32-bits is enough resolution to represent the x,y positions on your printer, is that correct?
soylentyellow has joined ##openfpga
<sensille> 20 bits are enough for my printer, and 24 bits would probably be enough for any printer
<s_frit> ok
<s_frit> i'm just thinking some more about this interpolator, and comparing it to usual audio applications
<s_frit> usually you would probably use something better than cubic for such a high oversampling ratio i think
<s_frit> you want to go from 4kHz to 1Mhz say
<sensille> where 4kHz @32 bit and 1Mhz @1 bit
<s_frit> well, kinda, yeah
<s_frit> actually i was thinking 1MHz 32bit and then you generate the 1 bit signal from that
<s_frit> if you want this thing to output 6Mhz pulses, then you'd ideally want to be able to upsample to 6Mhz, but who knows what the fpga can do
<s_frit> i suspect that the optimal solution looks something like: output data from the rpi at the highest feasible rate (ie you generate high-quality, very smooth data using your fancy algorithm) that will probably be something in the region of ~16kHz, but spatially bandlimited to like 1 or 2 kHz, then the fpga runs the most complex interpolation it can afford to get the data up to 6MHz, then you
<s_frit> run the loop i mentioned above to output the pulses
digshadow has joined ##openfpga
<s_frit> the point here is that the better the data is that you generate with the "smooth" algorithm, the less work the interpolator/up-sampler needs to do
<s_frit> you might also want to consider outputting 32-bit fixed-point position data from the smooth algorithm, so the interpolator has more detail to work with
<sensille> what do you mean by 'better'? closer to the original, or modified in a way that the interpolator generates the best result?
<s_frit> i mean closer to the original / more information
<s_frit> in a theoretical sense a perfect interpolator will recover the original signal even if it contains components at nyquist, but there is no perfect interpolator, and low-order polynomial interpolators won't perform that well (although we're talking about very smooth/low frequency source material, so i'd expect them to perform pretty well)
m_w has joined ##openfpga
<s_frit> sensille, where are you up to with the implementation?
<sensille> running for a circle
<sensille> but need to do the corexy coordinate transformation yet
m_w has quit [Ping timeout: 252 seconds]
<s_frit> what does "corexy coordinate transformation" mean?
<sensille> the motor movement does not directly relate to x/y movements
<sensille> but that's a simple linear transformation
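The CoreXY transform sensille mentions is the standard one: motor moves are sums and differences of the Cartesian deltas, so the map and its inverse are a 2x2 linear transform. A sketch:

```python
# Standard CoreXY kinematics: both belts (motors) participate in every
# Cartesian move, but the mapping is purely linear.
def xy_to_motors(x, y):
    """Forward transform: motor A = x + y, motor B = x - y."""
    return x + y, x - y

def motors_to_xy(a, b):
    """Inverse: x = (a + b) / 2, y = (a - b) / 2."""
    return (a + b) / 2, (a - b) / 2

# Round-trips exactly, since the map is linear (determinant -2):
print(motors_to_xy(*xy_to_motors(3.0, 4.0)))   # (3.0, 4.0)
```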
<sensille> other printers may need more complex transformations, like sqrt or transformation into polar coordinates. but that should influence the interpolation much
<s_frit> interesting
<sensille> should not, of course
<s_frit> depending on what scheme you use, you could do the transformation on the low-frequency data prior to interpolation
<sensille> yes, that's the plan
<sensille> in my current implementation i need to transform position, speed and acceleration
<s_frit> is it better to interpolate transformed data, or transform interpolated data? i don't know. it will make a subtle difference if the transformations are not linear
<s_frit> fun :)
<sensille> the interpolation needs to have a step size based on the actual implementation, so i guess the former
soylentyellow_ has joined ##openfpga
soylentyellow has quit [Read error: Connection reset by peer]
soylentyellow_ has quit [Ping timeout: 252 seconds]
ondrej3 has quit [Ping timeout: 245 seconds]
ondrej3 has joined ##openfpga
Miyu is now known as hackkitten
<sensille> maybe now someone can give me a hint with this, from earlier this day: "Unable to resolve delay for path ce -> ltout in cell type LogicCell40!"
<sensille> from icetime
rohitksingh_work has quit [Read error: Connection reset by peer]
lain has quit [Ping timeout: 240 seconds]
Bike has joined ##openfpga
lain has joined ##openfpga
rohitksingh has joined ##openfpga
rohitksingh has quit [Ping timeout: 264 seconds]
Maya-sama has joined ##openfpga
Maya-sama is now known as Miyu
rohitksingh has joined ##openfpga
soylentyellow has joined ##openfpga
rohitksingh has quit [Ping timeout: 240 seconds]
Miyu has quit [Ping timeout: 246 seconds]
rohitksingh has joined ##openfpga
<mithro> azonenberg_work / sensille: Yes nextpnr is timing driven -- can you log a bug about that ce -> ltout delay issue?
pie_ has joined ##openfpga
hackkitten has quit [Read error: Connection reset by peer]
hackkitten has joined ##openfpga
<mithro> sensille: The issue is lack of timing data - if you log a bug I think it will get fixed pretty quickly
GuzTech has quit [Quit: Leaving]
<whitequark> azonenberg_work: poke
rohitksingh has quit [Quit: Leaving.]
emeb has joined ##openfpga
pie_ has quit [Read error: Connection reset by peer]
pie__ has joined ##openfpga
Miyu has joined ##openfpga
Miyu has quit [Read error: Connection reset by peer]
Miyu has joined ##openfpga
pie_ has joined ##openfpga
pie__ has quit [Ping timeout: 240 seconds]
emeb has quit [Ping timeout: 240 seconds]
emeb has joined ##openfpga
m4ssi has quit [Remote host closed the connection]
rainey has left ##openfpga ["Leaving"]
<pie_> lol;
<prpplague> pie_: hehe
ym has quit [Remote host closed the connection]
GuzTech has joined ##openfpga
<awygle> i just accidentally opened a second desktop, i didn't know windows could do that
<whitequark> what
<whitequark> oh windows has a proper WM now
<whitequark> it does tiling too
<pie_> yeh
<awygle> yeah it would have been really cool if i'd... done it on purpose
<awygle> i also had remote desktop open so i got Confused
m_w has joined ##openfpga
digshadow has quit [Ping timeout: 240 seconds]
<rqou> wait windows has a tiling mode now?
azonenberg_work has quit [Ping timeout: 240 seconds]
Ultrasauce has quit [Quit: No Ping reply in 180 seconds.]
Ultrasauce has joined ##openfpga
azonenberg_work has joined ##openfpga
knielsen has quit [Read error: Connection reset by peer]
knielsen has joined ##openfpga
<rqou> wtf is up with network equipment that uses commands like "no foobar" to disable "foobar"?
<rqou> is this a Cisco thing?
<tnt> yes
<whitequark> no u
<Ultrasauce> http://bcas.tv/paste/results/rJ8C5n44.html this is the shit i'm dealing with today
<Ultrasauce> about as inconsistent as naming can be using a single language i think
<Ultrasauce> no public documentation of course, doubt atheros would give me the time of day
<awygle> I dislike how vim does :set novar
<azonenberg_work> rqou: fwiw, LATENT* will probably use an IOS-esque CLI just for ease of use by people who are familiar with it
<azonenberg_work> Not a 100% command clone, but close enough it will be easy to learn
<azonenberg_work> (for example, Quagga is a f/oss implementation of OSPF, BGP, and some other stuff that uses a very IOS-like shell)
<tnt> yeah we actually extracted their vty implementation into a re-usable library that we use for all the osmocom gsm network stuff configuration.
<kc8apf> azonenberg_work: yuck. I much prefer Junos style over Cisco style
<azonenberg_work> kc8apf: i learned on cisco so that's what i know
<azonenberg_work> havent used juniper
<azonenberg_work> That being said, the switching core is going to be pretty well separated from the CLI
<azonenberg_work> So it wouldn't be that difficult for someone to write a replacement CLI that has a different command set
<whitequark> azonenberg_work: poke
<azonenberg_work> whitequark: ack
<kc8apf> wtf? Since IIS 6.0, HTTP request parsing is done in the windows kernel
<whitequark> azonenberg_work: any luck with SWD?
<azonenberg_work> whitequark: got pulled away to do something else for a bit, about to try some more stuff now
<whitequark> sweet
<azonenberg_work> whitequark: right now i'm working on adding swd support to the network protocol as well as adding checks to prevent confusion if you have a swd client connect to a jtag cable and vice versa
<awygle> oh wtf
<awygle> siglent doesn't even sell a rack mount kit for the SSA3000X series
<azonenberg_work> awygle: hint that you're not dealing with professional grade hardware? :p
<awygle> azonenberg_work: i mean rigol sells one for the 1000Z series :p
<azonenberg_work> lol yes i know
<azonenberg_work> i have one for my 1102d
<awygle> want VNA
<azonenberg_work> awygle: i want to be able to do s-parameters through a diffpair
<awygle> don't we all
<azonenberg_work> So that would need a 4-port VNA right?
<azonenberg_work> Not a 2-port like that
<azonenberg_work> i'm honestly not sure what i'd do with a 2-port VNA because all of my high speed stuff is differential
<azonenberg_work> i guess i could characterize each leg independently and hope to extrapolate or something
kuldeep has quit [Read error: Connection reset by peer]
<awygle> i wonder if a 2:1 balun would give you what you need... probably not
<azonenberg_work> awygle: on a different note, some time soon i want to wriracing software
<azonenberg_work> to write some curve tracing software*
<azonenberg_work> (gaah vmware stealing my keyboard focus)
<awygle> oh good, there were more letters there lol
<azonenberg_work> basically bolt my DSO to my variable PSUs
<azonenberg_work> and add support for things like plotting I/V through a FET as a 3D function of Vgs
<awygle> should get a load pull machine and wire that in too
<awygle> or build one
<awygle> i wonder if a variable DC load like you can just buy would work for that...
<azonenberg_work> Not sure
<azonenberg_work> Short term i just want to be able to do things like plot I/V through a diode
<azonenberg_work> i mean i have all of the hardware i need already
<azonenberg_work> I just need to bolt the pieces together
<awygle> "just" :p
<reportingsjr> tools, tools, tools!
<reportingsjr> I keep wanting/needing a DC load and I'm torn between building one and buying a cheap one :P
<awygle> argh lol
<awygle> reportingsjr: same. also a lab psu
<reportingsjr> awygle: I bought the EEZ H24005 when the crowdsupply/whatever run happened, I've been pretty pleased with it
<reportingsjr> a bit pricey for my normal budget though
<awygle> oh cool. Missed this. What did it cost you?
<reportingsjr> I think it was around $400
<azonenberg_work> I want to build my own lab PSU because these rohde & schwarz ones are soooo slow
<reportingsjr> I normally budget about $300/year for a piece of test equipment, so it was a bit higher than I like to spend
<azonenberg_work> like, 1 FPS or less update rate over the SCPI interface
<awygle> yeah I wonder how fast this h24005 is
<azonenberg_work> If i could have the same PSU core and a better management interface i'd be happy
<reportingsjr> good question, I haven't pushed it
<awygle> if it's super slow we could always replace or augment the cpu with an fpga I guess
<awygle> it *is* open source
<reportingsjr> I know that it has the ability to output a "waveform" somewhat fast
<reportingsjr> I don't know if there is going to be another run of the HW at some point. I had heard that was going to happen at some point.
<awygle> How complex is it? We could always build, like, three lol
<reportingsjr> more complex than that
<reportingsjr> well, you could, the price would just be way higher than that
<reportingsjr> it looks like the creator is working on a new "rev" of the powersupply atm
<awygle> speaking of building stuff, glasgow boards arrive thursday
<reportingsjr> nice
<reportingsjr> rev b?
<awygle> yep
Bike has quit [Ping timeout: 252 seconds]
<reportingsjr> how many did you end up ordering?
<awygle> 10 PCBs, kit for 5
<reportingsjr> planning on sending any of them out to other people yet?
<awygle> one for me, two for whitequark, one for azonenberg_work, one to sacrifice to the yield gods
<reportingsjr> haha
Miyu has quit [Ping timeout: 252 seconds]
m_w has quit [Quit: Leaving]
kuldeep has joined ##openfpga
GuzTech has quit [Ping timeout: 246 seconds]
kuldeep has quit [Ping timeout: 252 seconds]
Bike has joined ##openfpga
kuldeep has joined ##openfpga
azonenberg_work has quit [Quit: Leaving.]
azonenberg_work has joined ##openfpga
kuldeep has quit [Ping timeout: 240 seconds]
kuldeep has joined ##openfpga
<awygle> what's the state of the art in open source DDR* controllers?
<q3k> litedram is pretty neat
<q3k> doesn't have phy code for ecp5 tho
<q3k> (i was supposed to write it but then I got distracted)
<gruetzkopf> doit
<q3k> (I have DDR working from migen/litex though, which is not as easy as it should be, though)
<q3k> (you need to manually string together a ddr block and a delay block and some custom logic :/)
kuldeep has quit [Ping timeout: 244 seconds]
kuldeep has joined ##openfpga
<awygle> litedram does look cool
<awygle> i don't love migen/litex tho. but I'll get over it I guess.
<q3k> one of us
<q3k> i don't love it either
<q3k> but then again, there's nothing about computers that I love anymore
<awygle> too dark for a tuesday afternoon lol
<awygle> i already have compilers gaslighting me i don't need an extra helping of ennui :p
<whitequark> awygle: why not?
<awygle> whitequark: i guess i just don't get it?
<whitequark> huh?
<awygle> like, i will totally admit that verilog is suboptimal in a lot of ways
<awygle> but i don't get why using python instead is somehow amazing
<q3k> it's not really instead
<awygle> it just means i have to learn a whole new set of conventions and spend _another_ six months not knowing whether i'm doing a <= or a = assignment
<whitequark> awygle: oh
<q3k> once you wrap your head around the fact that all the python is just run at compile time, it clicks
<awygle> i did that with verilog in college, i don't know why i'd throw away that knowledge
<whitequark> well uh, two things
<q3k> that knowledge still kind of applies, and migen actually simplifies it
<whitequark> first, it's not useful knowledge
<q3k> you have self.comb and self.seq and that's it
<whitequark> it's an artifact of verilog being shit. it's like remembering all the implicit promotion rules in javascript.
<q3k> sorry, self.sync
<q3k> ^ this
<whitequark> second, the usefulness of migen is not in that it uses python
<awygle> i will take that as a typo and not you proving my point for me lol
<whitequark> python is actually kind of bad for this task
<whitequark> the usefulness of migen is that it's one meta level removed from verilog
<q3k> yes
<whitequark> you literally can't do with verilog what i'm going to do in migen for glasgow soon
<whitequark> which is to say
<whitequark> take sequential descriptions of processes, like "receive 4 bytes from a FIFO and put them into a register" and generate an FSM that has a state per byte
<whitequark> no amount of shitty verilog generate statements and macros will actually make that usable
<q3k> doing things 'the other way around', ie having your python meta-design generate external data is also super useful
<whitequark> writing verilog is like writing assembly. shitty assembly that's too high level to actually represent your target device yet too low level to express anything useful
<awygle> that... kind of sounds trivial to do in verilog, so i assume i'm not understanding the example
<q3k> you create a config register, and that automatically updates a C header file for you if you use misoc
<q3k> without ever having to manually maintain a memory map of any sort
<whitequark> awygle: so you have a glasgow FIFO
<whitequark> with a dout, readable, and re signals
<whitequark> that gives you a byte at a time
<whitequark> an FSM that reads four bytes from there needs 4 states
<awygle> sure, okay
<whitequark> I can make an abstract operation of "get n bytes (or even bits) from FIFO and put them in this register" and have it generate the FSM states automatically
<whitequark> without any defines, localparams, having to track what state has which number, caring about resets
<awygle> i mean i'd just do this with a counter. which is technically an N-state FSM but is super easy to define and reason about
<whitequark> oh, but now you need to embed this in a much larger protocol engine
<whitequark> think it's a part of the SWD protocol
<whitequark> the engine would be driven by FIFO commands and drive another module
<whitequark> take a look at swd.py I wrote recently and tell me this would be as straightforward in verilog
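The kind of metaprogramming whitequark describes can be sketched in plain Python — this is not migen's actual API, just the shape of the idea: the host language generates one FSM state per byte, with numbering, next-state links, and shift amounts derived automatically instead of hand-maintained localparams.

```python
# Illustrative sketch: turn "read n bytes from the FIFO into a register"
# into n generated FSM states. (Not migen's API -- names are made up.)
def gen_read_fsm(n_bytes, prefix="READ"):
    states = []
    for i in range(n_bytes):
        states.append({
            "name": f"{prefix}_{i}",
            "shift": 8 * i,                                   # where this byte lands in the register
            "next": f"{prefix}_{i+1}" if i + 1 < n_bytes else "DONE",
        })
    return states

for s in gen_read_fsm(4):
    print(s["name"], s["shift"], "->", s["next"])
# READ_0 0 -> READ_1
# READ_1 8 -> READ_2
# READ_2 16 -> READ_3
# READ_3 24 -> DONE
```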
<rqou> what about the classic $SILICON_VENDOR technique of ad-hoc code generators written in perl/bash/c/tcl/php? :P
<awygle> q3k: i can see how that would be useful, although it is impossible to overstate how much i do not care about the "cpu on an fpga" use case (but i realize other people do and that that would be useful for them)
<whitequark> this is migen
<whitequark> it's an ad-hoc code generator written in python
<q3k> awygle: that's just an example off the top of my head
<whitequark> just less crappy than the usual thing
<rqou> lol
<awygle> q3k: right, i get you and i see the value in the general point
<awygle> whitequark: okay, i'll take a look at that sometime soon
<q3k> but yeah, tbh migen does need better docs
<q3k> but I'm still not sure how to actually write better docs
<whitequark> there's also a lot of value in migen's stdlib
<awygle> i just had a hell of a time with the tiny bits of migen i wrote for the FTDI stuff
<awygle> which sort of reinforced my preexisting bias
<whitequark> I literally never want to write a FIFO from scratch
<whitequark> hm
<awygle> whitequark: again, i see the value in this, but my response to that is something like fusesoc, not something like migen or litex (and what even is the difference anyway?)
<q3k> litex is a fork of misoc
<q3k> both are frameworks that build on migen and provide abstractions useful in SoC definition
<q3k> both for the actual logic and automation around it
<whitequark> awygle: I can write Verilog but I can't imagine any reason I actually would want to write it, unless I'm literally forced to
<awygle> i guess the core of my irritation is that absolutely _everything_ in the FPGA ecosystem is _garbage_, and everyone is like "the reason is verilog" whereas i'm like "i can't even evaluate whether verilog is good or bad because i can't see it under all this garbage"
<whitequark> similar to how I can write C but can't imagine any reason I would want to
kuldeep has quit [Ping timeout: 264 seconds]
<whitequark> the reason isn't just verilog
<whitequark> verilog is as garbage as the other things but independently
<whitequark> it's a pattern of bad decisions basically
<awygle> verilog is like writing C if every compiler segfaulted >1 time per day on every platform
<awygle> lol
<whitequark> verilog is like writing C without valgrind or ubsan
<q3k> verilog is the result of the ecosystem having made a series of unfortunate decisions for dozens of years
<whitequark> like yosys would just silently miscompile incorrect verilog and there is literally no way to make it complain
<whitequark> and clifford has not been very happy about me trying to fix that
<whitequark> "just write correct verilog duh"
<awygle> yeah well, that's a whole other thing
<whitequark> no that's a part of this pattern
<awygle> the level of macho in the ecosystem is quite high
<awygle> for no good reason
<q3k> whitequark: I vaguely remember this discussion from way back, if you still have an example I'd like to see it
<whitequark> mind you, yosys is lightyears ahead of the rest of the ecosystem, but even it is tainted with this stupid bullshit
<whitequark> q3k: it's on yosys bugtracker i think
<q3k> whitequark: since I was having a friendly argument with clifford about this recently
<whitequark> basically it resolves driver-driver conflicts to a constant
kuldeep has joined ##openfpga
<whitequark> in some unobvious cases
<whitequark> even when it emits a warning there's no good way to turn that into an error
<whitequark> clifford added an option to do -Werror based on warning text but I'm not going to use that, it's gross
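For reference, a minimal Verilog fragment of the kind of construct being discussed — one reg driven from two always blocks. Whether a tool errors, warns, or silently resolves the conflicting net (per the discussion, possibly to a constant) is tool-dependent:

```verilog
// One reg, two drivers: a driver-driver conflict. The LRM does not
// define a single synthesis behavior here; some tools silently
// resolve such a net rather than raising an error.
module conflict (
    input  wire clk,
    input  wire a,
    input  wire b,
    output reg  y
);
    always @(posedge clk) y <= a;
    always @(posedge clk) y <= b;   // second driver of y
endmodule
```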
<q3k> whitequark: do you know if any of the IEEE standards mandate a particular behaviour in that case
<whitequark> no
<whitequark> it's literally UB
<whitequark> by design
<q3k> ah wonderful
<whitequark> verilog has no fucking reason to have UB
<whitequark> during synthesis
<whitequark> i will die on this hill
<q3k> i think 'making the warning/error system better' is somewhere on the backlog closer to the head than the tail
<pie_> how can a HDL have undefined behaviour
<pie_> *sane HDL
<q3k> ie at least semanticize it so that -W(no-)error does not have to be stringly typed
<awygle> what would be a reasonable answer? i guess just "synthesis terminated"?
<whitequark> to be explicit, i think clifford is doing a great job, it's that my standards are set by the vastly better sequential language compilers
<whitequark> pie_: well migen doesn't have UB*
<whitequark> * not by design, i think you can sort of cause UB with it if you try, but that should be fixed as an implementation bug
<whitequark> q3k: but even -Werror now is not sufficient
<whitequark> there's some cases when that warning isn't emitted at all
<awygle> well anyway, thanks for the lively discussion. i will continue to try to grasp the value of migen
<whitequark> I have a patch somewhere in the queue that fixes it, but I got frustrated after that discussion with Clifford and never finished it
<whitequark> I can give you the diff if you want
<awygle> and continue to not yell "WHY" at everyone who mentions it
<q3k> i wouldn't mind just a pointer to the issue tracker and/or pull request if there is one
<q3k> i honestly won't have time to work on that this month :)
<q3k> i first wanna resolve some nextpnr bugs
<q3k> like the PLL bugs the cr1901_modern keeps finding :P
<q3k> *that
<whitequark> here's the case where no warning is emitted at all
<pie_> quote worthy: <q3k> verilog is the result of the ecosystem having made a series of unfortunate decisions for dozens of years
<awygle> to finish out my thought earlier
<q3k> whitequark: thx
<awygle> it seems to me ill-advised to try to overturn the existing ecosystem without the support of any of the hardware vendors before we have exhausted all other options. which is why i am much more interested in (open source, none of this Verific bullshit) SystemVerilog support for Yosys than i am in Migen
<awygle> thank you all for your time lol
<q3k> awygle: i would love SV support in yosys as well
<q3k> awygle: just that nobody has come up and said that they want to and have time to implement it :P
<awygle> yeah that second half is a real bear :p