tpb has quit [Remote host closed the connection]
tpb has joined #symbiflow
citypw has joined #symbiflow
<mithro> syed.ahemd: At the moment is it pretty quiet with people on holidays, I expect that acomodi and litghost will be back in Jan and able to help then
hansfbaier has joined #symbiflow
<mithro> hansfbaier: You are correct that the 071-dump_all takes a pretty large amount of disk space
<hansfbaier> mithro: I now really understand the reason for the ticket: In order to build the part for the Nitefury, the fully bonded part had to be built first. It creates a TON of JSON files and hung my linux system because create_node_tree consumed all my 32GB of RAM.
<hansfbaier> mithro: I think if we could use sqlite instead of JSON that would save a HUGE amount of disk space and memory
<hansfbaier> mithro: because the data is inherently relational and thus a relational database would be a great fit
<hansfbaier> mithro: Also sqlite has simple python bindings
<hansfbaier> mithro: I will look into moving 071-dump_all to sqlite
<mithro> hansfbaier: Dunno how those work, but it might be a quick solution to put a compression filter on the json output?
<hansfbaier> mithro: it would be the proper way to deduplicate (= normalize) the data, because each item is stored exactly once
<hansfbaier> mithro: But in order to to make a database schema I have to understand the logical structure of the data
<mithro> hansfbaier: Probably - but sometimes starting with an easier solution gets you to the end goal faster :-)
<tpb> Title: compress-json · PyPI (at pypi.org)
<hansfbaier> mithro: The compression does not solve my problem: create_node_tree still would have to uncompress everything and then it would take even more memory
<hansfbaier> mithro: And I am quite reluctant to buy more memory because, now I have 32GB and I even rarely maxed out 16G
<mithro> hansfbaier: Spin up a cloud machine?
<tpb> Title: JSON Compression: Alternative Binary Formats and Compression Methods - Lucidchart (at www.lucidchart.com)
<mithro> I wonder if we should be using ujson in some places too - https://pypi.org/project/ujson/
<tpb> Title: ujson · PyPI (at pypi.org)
<hansfbaier> mithro: saving space is not the only advantage, you also can create indices for oft used tables, this will speed up the processing of the data by orders of magnitude
<mithro> hansfbaier: I'm sure that litghost might have some thoughts on that but is currently on holidays until Jan
<mithro> Anyway, I'm heading out
<hansfbaier> mithro: I will discuss it in the ticket. I would aim at a proof of concept limited to 071-dump_all
andrewb1999 has quit [Ping timeout: 264 seconds]
<hansfbaier> mithro: sqlite seems to handle large datasets quite well: https://stackoverflow.com/questions/1033309/sqlite-for-large-data-sets
<tpb> Title: SQLite for large data sets? - Stack Overflow (at stackoverflow.com)
hansfbaier has quit [Ping timeout: 260 seconds]
hansfbaier has joined #symbiflow
hansfbaier has quit [Quit: WeeChat 2.8]
<umarcor> mithro: what about using yaml and anchors?
Degi_ has joined #symbiflow
Degi has quit [Ping timeout: 256 seconds]
Degi_ is now known as Degi
ASHR has joined #symbiflow
frubbl has quit [Ping timeout: 264 seconds]
hansfbaier has joined #symbiflow
frubbl has joined #symbiflow
kraiskil has joined #symbiflow
frubbl has quit [Ping timeout: 240 seconds]
bjorkintosh has joined #symbiflow
frubbl has joined #symbiflow
kraiskil has quit [Ping timeout: 272 seconds]
hansfbaier has quit [Read error: Connection reset by peer]
frubbl has quit [Ping timeout: 240 seconds]
frubbl has joined #symbiflow
kraiskil has joined #symbiflow
ASHR has quit [Ping timeout: 260 seconds]
kraiskil has quit [Read error: Connection reset by peer]
kraiskil has joined #symbiflow
mkru has joined #symbiflow
mkru has quit [Client Quit]
mkru has joined #symbiflow
mkru has quit [Quit: Leaving]
kraiskil has quit [Ping timeout: 246 seconds]
<mithro> umarcor: I think I would take json+compression over yaml+anchors personally...
frubbl has quit [Ping timeout: 256 seconds]
kraiskil has joined #symbiflow
FFY00 has quit [Remote host closed the connection]
FFY00 has joined #symbiflow
FFY00 has quit [Remote host closed the connection]
FFY00 has joined #symbiflow
FFY00 has quit [Remote host closed the connection]
andrewb1999 has joined #symbiflow
citypw has quit [Ping timeout: 240 seconds]
ASHR has joined #symbiflow
kgugala has joined #symbiflow
kgugala_ has quit [Read error: Connection reset by peer]
andrewb1999 has quit [Ping timeout: 260 seconds]
andrewb1999 has joined #symbiflow
FFY00 has joined #symbiflow
FFY00 has quit [Remote host closed the connection]
FFY00 has joined #symbiflow
rvalles_ has quit [Read error: Connection reset by peer]
rvalles_ has joined #symbiflow
ASHR has quit [Quit: Leaving]
tannewt has quit [Read error: Connection reset by peer]
elms has quit [Read error: Connection reset by peer]
elms has joined #symbiflow
tannewt has joined #symbiflow
kraiskil has quit [Ping timeout: 246 seconds]
ASHR has joined #symbiflow
ASHR has quit [Ping timeout: 264 seconds]