<mithro>
syed.ahemd: At the moment is it pretty quiet with people on holidays, I expect that acomodi and litghost will be back in Jan and able to help then
hansfbaier has joined #symbiflow
<mithro>
hansfbaier: You are correct that the 071-dump_all takes a pretty large amount of disk space
<hansfbaier>
mithro: I now really understand the reason for the ticket: In order to build the part for the Nitefury, the fully bonded part had to be built first. It creates a TON of JSON files and hung my linux system because create_node_tree consumed all my 32GB of RAM.
<hansfbaier>
mithro: I think if we could use sqlite instead of JSON that would save a HUGE amount of disk space and memory
<hansfbaier>
mithro: because the data is inherently relational and thus a relational database would be a great fit
<hansfbaier>
mithro: Also sqlite has simple python bindings
<hansfbaier>
mithro: I will look into moving 071-dump_all to sqlite
<mithro>
hansfbaier: Dunno how those work, but it might be a quick solution to put a compression filter on the json output?
<hansfbaier>
mithro: it would be the proper way to deduplicate (= normalize) the data, because each item is stored exactly once
<hansfbaier>
mithro: But in order to to make a database schema I have to understand the logical structure of the data
<mithro>
hansfbaier: Probably - but sometimes starting with an easier solution gets you to the end goal faster :-)
<hansfbaier>
mithro: The compression does not solve my problem: create_node_tree still would have to uncompress everything and then it would take even more memory
<hansfbaier>
mithro: And I am quite reluctant to buy more memory because, now I have 32GB and I even rarely maxed out 16G
<hansfbaier>
mithro: saving space is not the only advantage, you also can create indices for oft used tables, this will speed up the processing of the data by orders of magnitude
<mithro>
hansfbaier: I'm sure that litghost might have some thoughts on that but is currently on holidays until Jan
<mithro>
Anyway, I'm heading out
<hansfbaier>
mithro: I will discuss it in the ticket. I would aim at a proof of concept limited to 071-dump_all