#picolisp on 2018-11-16 — irc logs at freenode.irclog.whitequark.org

2018-09-14 18:41 ChanServ changed the topic of #picolisp to: PicoLisp language | Channel Log: https://irclog.whitequark.org/picolisp/ | Check also http://www.picolisp.com for more information

00:53 xkapastel has joined #picolisp

02:21 ubLIX has quit [Quit: ubLIX]

04:23 xkapastel has quit [Quit: Connection closed for inactivity]

04:46 xkapastel has joined #picolisp

05:22 alexshendi has quit [Read error: Connection reset by peer]

06:09 orivej has quit [Ping timeout: 244 seconds]

06:27 DKordic has quit [Ping timeout: 250 seconds]

06:53 razzy has quit [Ping timeout: 272 seconds]

07:23 rob_w has joined #picolisp

07:24 razzy has joined #picolisp

07:24 orivej has joined #picolisp

08:53 xkapastel has quit [Quit: Connection closed for inactivity]

10:40 razzy has quit [Ping timeout: 272 seconds]

13:02 <beneroth> hi all

13:02 <beneroth> Regenaxer, yeah the usual term is "premature optimization"

13:03 <Regenaxer> Hi beneroth! Thanks :)

13:05 <beneroth> "We should forget about small efficiencies, say about 97% of the time: premature optimization is the root of all evil. Yet we should not pass up our opportunities in that critical 3%"

13:05 <beneroth> Donald Knuth

13:05 <Regenaxer> Wise man

13:06 <beneroth> I didn't know about skip lists

13:09 <beneroth> it seems they're better for use cases where you 1) do concurrency 2) many writes/changes to the structure. With a skip list modifications are more local to parts of the whole structure where a tree requires modifications to more parts (e.g. for re-balancing), therefore the skip list is better suited for concurrent modifications

13:10 <Regenaxer> I looked at them some time ago, but forgot the details

13:12 <beneroth> as pilDB is normally used with a global full lock, real full serializing of all transactions/actions, skip list makes no sense. it could make sense if one would want to optimize pilDB for high frequency concurrent writes, but this would anyway require some substantial changes to e.g. the whole locking and transaction handling

13:12 <Regenaxer> It has not to do with full locking

13:13 <Regenaxer> When a tree is modified, some nodes may change

13:13 <beneroth> no, skip list hasn't. but skip list makes no sense as long one does full locking anyway, I think.

13:13 <Regenaxer> they are sent by 'tell'

13:13 <Regenaxer> So no global lock is needed

13:13 <beneroth> I refer to the global lock in the normal (dbSync) ... (commit 'upd) pattern

13:13 <Regenaxer> Nodes in b-trees are just like other external syms

13:14 <Regenaxer> yes

13:14 <Regenaxer> but that is not relevant here

13:14 <Regenaxer> you could implement local locks and still use b-trees

13:14 <Regenaxer> like other objects

13:14 <beneroth> the point of skip lists is, as I understand it, that a modification of a skip list would usually involve changes to a smaller number of nodes/external syms than in a btree

13:15 <Regenaxer> yes, but b-trees too

13:15 <beneroth> too what?

13:15 <Regenaxer> normally only a single node is affected

13:15 <Regenaxer> a single sym

13:15 <beneroth> balancing?

13:15 <Regenaxer> yes, when splitting or joining

13:15 <beneroth> e.g. adding new indexed values

13:16 <Regenaxer> then more may be

13:16 <Regenaxer> right

13:16 <Regenaxer> but you add and add for a while

13:16 <beneroth> yeah. for this use cases, the skip list would involve less modifications all in all, that is the argument for skip list.

13:16 <Regenaxer> then the node splits

13:16 <beneroth> aye. no or less node splitting with skip lists, as they are unbalanced.

13:17 <Regenaxer> and you have a lot more nodes

13:17 <beneroth> BTrees remain unbeaten in query speed, afaik

13:17 <Regenaxer> I hope so

13:17 <Regenaxer> the nice point is that they stay balanced

13:18 <beneroth> afaik they're still universally the best thing for all index use cases, except when you have certain index/usage patterns with certain guarantees that you can use a more optimal approach than BTree. but BTree is the best universal optimal approach, still nothing better found (and lot of people tried).

13:19 <Regenaxer> yep

13:20 <beneroth> but yeah, optimizing for high concurrent write throughput requires quite some different structures and algorithms than traditional database setups, which are mostly optimised for querying and some writes (but in general less writes than reads)

13:20 <beneroth> and high write availability are use cases people build databases for this days, e.g. log data stream processing

13:21 <beneroth> mainly to handle website visitor tracking stuff, server/system logging usually does not produce so much data that this is really required ;-)

13:23 <beneroth> btw. Regenaxer, picolisp is pretty unique in that it does actual in-place modifications. most databases (also the old big ones) do only physically add/extend data, producing a stream of snapshots and changes, and then garbage collect regularly (hopefully)

13:24 <beneroth> so they have a lot of overhead to properly keep track of that stuff, but it allows higher concurrent throughput in many situations.

13:24 <Regenaxer> I see, didn't know

13:24 <beneroth> but pilDB is much simpler and for typical applications more than enough :)

13:25 <beneroth> maybe pilDB has more IO operations, many small ones vs. others tend to have less but bigger IO operations.

13:26 <Regenaxer> Not sure, as all stuff is fetched only once usually

13:26 <beneroth> pilDB transactions are always real serializable transactions (well, they're serialized), which is the most safe/secure way to do things (especially for e.g. accounting applications)

13:26 <beneroth> Regenaxer, I mean writes

13:26 <Regenaxer> ok

13:27 <beneroth> for reads you are probally right, there pilDB probably looks like other DBs, likely it even has better IO patterns.

13:27 <beneroth> guesstimate, not benchmarks

13:27 <Regenaxer> Writes in 'commit' are sorted in sequence they are in the file

13:27 <beneroth> ok, so optimized for the OS/disk?

13:27 <beneroth> nice

13:28 <Regenaxer> A little

13:28 <Regenaxer> but they go to the OS disk buffer first anyway

13:28 <beneroth> well, less sorting to do for the buffer

13:28 <beneroth> _=

13:28 <Regenaxer> Modern OSes do a lot of buffering anyway

13:28 <beneroth> :)

13:28 <beneroth> yeah

13:28 <Regenaxer> T

13:28 <beneroth> I think your approach of building/trusting on that is good engineering.

13:28 <Regenaxer> I think other DBs do their own buffering

13:29 <beneroth> most databases try to do everything themselves, but this is a lot of second-inventing/implementing

13:29 <Regenaxer> T

13:29 <beneroth> of course that can give better results on bad OS bad hardware, and it gives more stable/predictable behaviour which might be a pretty important property for proper operation / load estimates

13:30 <beneroth> ah nice link for you, haha

13:30 <Regenaxer> yes, allows finer control eg for block caches

13:31 <beneroth> you will feel confirmed in your insisting on KISS

13:32 <beneroth> ex-oracle DB dev ranting about oracle DB source code / development: https://news.ycombinator.com/item?id=18442941

13:32 <beneroth> ping tankf33der

13:32 <Regenaxer> wow! loc

13:32 <beneroth> and oracle was THE first big leader for relational SQL database business, and remained leader in that space until about 15 years ago

13:33 <Regenaxer> yeah

13:33 <beneroth> and a commenter writes "Even PostgreSQL is 1.3M lines of code"

13:34 <beneroth> I guess PostgreSQL is the most sane implementation of SQL/relational model in widespread use

13:35 <beneroth> pilDB has not all the features and properties of those databases, but for typical business applications and websites etc. it has more than enough, graph and OOP is even better suited to this task than relational model, and pilDB is...

13:36 <beneroth> LOC: 1413 db.l + 545 too.l 512 btree.l ?

13:36 <beneroth> plus that it is based on PLIO and pil IPC and pil datatypes

13:36 <Regenaxer> The relational paradigm by itself introduces a lot of complexity

13:37 <beneroth> aye

13:37 <beneroth> and SQL

13:37 <Regenaxer> yes

13:37 <Regenaxer> a separate language

13:37 <beneroth> especially standard SQL which is a mix of incompatible vendor-specific weird stuff

13:38 <beneroth> "SQLite is just 130k SLoC"

13:39 <beneroth> pilDB is more powerful than SQLite I think

13:39 <Regenaxer> good to hear :)

13:41 <beneroth> Regenaxer, see state of the industry IT/software is horrible insane. That's why we cannot have nice things :)

13:41 <Regenaxer> Well, we can have nice things :)

13:42 <beneroth> :-)

13:42 <Regenaxer> Let's just do them :)

13:42 <Regenaxer> ok, must run

13:42 <Regenaxer> bbl :)

13:42 <beneroth> https://news.ycombinator.com/item?id=18463181

13:42 <beneroth> cu

13:49 rob_w has quit [Quit: Leaving]

13:57 <beneroth> tankf33der, thanks for the link to https://meta.sr.ht/ - will look into it.

15:16 <tankf33der> beneroth: yea

15:43 xkapastel has joined #picolisp

17:27 orivej has quit [Ping timeout: 272 seconds]

18:33 orivej has joined #picolisp

19:26 orivej has quit [Ping timeout: 272 seconds]

19:26 orivej has joined #picolisp

19:37 <rick42> hola gente de pil

20:05 ubLIX has joined #picolisp

20:12 orivej has quit [Ping timeout: 244 seconds]

20:22 razzy has joined #picolisp

20:57 alexshendi has joined #picolisp

21:01 alexshendi has quit [Ping timeout: 252 seconds]

21:02 alexshendi has joined #picolisp