ChanServ changed the topic of #picolisp to: PicoLisp language | Channel Log: https://irclog.whitequark.org/picolisp/ | Check also http://www.picolisp.com for more information
rob_w has quit [Quit: Leaving]
orivej has quit [Read error: Connection reset by peer]
orivej has joined #picolisp
_whitelogger has joined #picolisp
orivej has quit [Ping timeout: 246 seconds]
orivej has joined #picolisp
orivej has quit [Ping timeout: 250 seconds]
alexshendi has quit [Read error: Connection reset by peer]
alexshendi has joined #picolisp
<tankf33der> morning
<tankf33der> checking utf8 4th byte on this file
<tankf33der> comparing with older versions of pil
<tankf33der> i'm not sure what i'm doing
<tankf33der> hm
<Regenaxer> Good morning tankf33der
<Regenaxer> The 4th byte is used for characters >= 65536
<Regenaxer> eg smileys
<Regenaxer> 128543
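[Editor's note] A minimal Python sketch of the point above, not taken from the channel: it asks Python's built-in encoder how many bytes UTF-8 needs around the 65536 boundary and for the example code point 128543. The helper name `utf8_length` is hypothetical.

```python
def utf8_length(code_point: int) -> int:
    """Return how many bytes UTF-8 needs for a single code point."""
    return len(chr(code_point).encode("utf-8"))


# Boundary values for 1, 2, 3 and 4 byte sequences, plus the smiley 128543.
for cp in (0x7F, 0x7FF, 0xFFFF, 0x10000, 128543):
    print(f"U+{cp:04X} -> {utf8_length(cp)} byte(s)")
```

Everything up to 65535 fits in at most three bytes, so a reader limited to three-byte sequences only breaks on code points from 65536 upward.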
<tankf33der> my idea: if i read it with an older version i get corrupted output
<tankf33der> and last char is not 9
<Regenaxer> Yes, you get corrupted data
<tankf33der> i do: in file until eof prinl char
<Regenaxer> The file must contain such characters
<tankf33der> i dont see last 9 on older version
<tankf33der> i will continue
<Regenaxer> What is "last 9"?
<tankf33der> this link above from github
<tankf33der> raw file from repo
<tankf33der> last number is 9
<Regenaxer> Try this test file: http://ix.io/1wOx
<Regenaxer> (in "thisFile" (read))
<tankf33der> ok
<Regenaxer> or (in "thisFile" (line))
<Regenaxer> I can input such characters with Penti directly
<Regenaxer> so it now also works in the REPL
<Regenaxer> Also things like (in "thisFile" (chop (read)))
<Regenaxer> and (pack @) then
<tankf33der> yea
<tankf33der> also different
<tankf33der> seems ok
<tankf33der> two versions of pil on the same file
<tankf33der> older pil on left
<tankf33der> yea
<tankf33der> afk
<Regenaxer> Great :)
<Regenaxer> Thanks for testing tankf33der
rob_w has joined #picolisp
rob_w has quit [Changing host]
rob_w has joined #picolisp
<alexshendi> Merry Christmas (somewhat belatedly).
alexshendi has quit [Read error: Connection reset by peer]
alexshendi has joined #picolisp
<Regenaxer> alexshendi: Merry Christmas to you too!
<tankf33der> doing dummy tests for UTF8
<tankf33der> downloaded utf8 files with the full range in them, sum of char char should be an exactly known number
<tankf33der> already see how old pil breaks after 65k
<tankf33der> will test future releases if required.
<Regenaxer> Cool, very good
alexshendi has quit [Ping timeout: 268 seconds]
libertas has quit [Ping timeout: 272 seconds]
libertas has joined #picolisp
orivej has joined #picolisp
<tankf33der> idea failed. i need something different.
alexshendi has joined #picolisp
alexshendi has quit [Read error: Connection reset by peer]
alexshendi has joined #picolisp
<Regenaxer> Sum char char would give which number?
<tankf33der> huge
<Regenaxer> yes, but how do you know the right result?
<tankf33der> yea, implement it in another language
<Regenaxer> right
<tankf33der> since i can't find a good test file i should generate it first
<Regenaxer> difficult
<Regenaxer> iirc there are unused ranges
<Regenaxer> Better use existing unicode stuff
<Regenaxer> www.unicode.org
<Regenaxer> There are many tables
<Regenaxer> In txt format, with comments etc
<Regenaxer> in @lib/wide.l
<Regenaxer> s/in/for/
<tankf33der> i see
<tankf33der> i know old and new pil understand all this
<Regenaxer> yes, should be. There seems to be nothing above 65536
<Regenaxer> It is mostly emoticons which are normally used in that high range
<tankf33der> i can generate a file with chars in range 65000 .. 1000000
<tankf33der> i know golang says it's all valid
<Regenaxer> It is exactly the range in hex 010000 .. 10FFFF
<Regenaxer> Nothing above that
<Regenaxer> UTF-8 four-byte 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
<Regenaxer> yep
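[Editor's note] The four-byte layout quoted above can be sketched by hand in Python. This is only an illustration of the bit pattern for the range 010000 .. 10FFFF, not PicoLisp's actual implementation; the function name `encode_four_byte` is hypothetical.

```python
def encode_four_byte(cp: int) -> bytes:
    """Encode a code point in 0x10000..0x10FFFF using the pattern
    11110xxx 10xxxxxx 10xxxxxx 10xxxxxx."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point outside the four-byte range")
    return bytes([
        0xF0 | (cp >> 18),           # 11110xxx: top 3 bits
        0x80 | ((cp >> 12) & 0x3F),  # 10xxxxxx: next 6 bits
        0x80 | ((cp >> 6) & 0x3F),   # 10xxxxxx: next 6 bits
        0x80 | (cp & 0x3F),          # 10xxxxxx: low 6 bits
    ])


# Cross-check against Python's built-in encoder for both ends of the range
# and for the smiley 128543 mentioned earlier in the log.
for cp in (0x10000, 128543, 0x10FFFF):
    assert encode_four_byte(cp) == chr(cp).encode("utf-8")
    print(f"U+{cp:06X} -> {encode_four_byte(cp).hex(' ')}")
```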
<tankf33der> afk
orivej has quit [Ping timeout: 250 seconds]
alexshendi has quit [Ping timeout: 246 seconds]
orivej has joined #picolisp
orivej has quit [Ping timeout: 246 seconds]
alexshendi has joined #picolisp
orivej has joined #picolisp
_whitelogger has joined #picolisp
orivej has quit [Ping timeout: 244 seconds]
alexshendi has quit [Ping timeout: 244 seconds]
orivej has joined #picolisp
freemint has joined #picolisp
<tankf33der> so
<tankf33der> python3 vs. picolisp sum char char difference is always 3
<tankf33der> big progress.
<tankf33der> i will continue tomorrow.
<freemint> huh?
<tankf33der> i wrote code in python
<freemint> on?
<tankf33der> so
<tankf33der> if do char char from beginning of file i got:
<tankf33der> 9
<tankf33der> 10
<tankf33der> 10
<tankf33der> 12
<tankf33der> 11
<tankf33der> 32
<tankf33der> 33
<tankf33der> ....
<tankf33der> python3:
<tankf33der> no
<tankf33der> stop
<tankf33der> forget above.
<freemint> the problem is that (char) in picolisp and python can be differently big and you adjusted 'char so it works almost like python's?
<tankf33der> this is correct output from python and picolisp
<tankf33der> on this file: https://i.imgur.com/D8REVNt.jpg
<tankf33der> this is how difference 3 appears !
<tankf33der> i will continue tomorrow.
<tankf33der> afk
<Regenaxer> hmm, so there is no difference?
<tankf33der> no
<tankf33der> picolisp rules
<Regenaxer> Same results?
<tankf33der> same
<Regenaxer> great
<tankf33der> sum char char is equal in python and picolisp
<Regenaxer> ok, the sum of all chars in the file
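[Editor's note] A rough Python sketch of the "sum of all chars" check discussed here. The actual script used in the channel was not posted, so the function name `code_point_sum` and the small in-memory sample are assumptions for illustration only.

```python
def code_point_sum(data: bytes) -> int:
    """Decode UTF-8 bytes and add up the numeric value of every character."""
    return sum(ord(ch) for ch in data.decode("utf-8"))


# Hypothetical sample: sixteen consecutive code points starting at 0x10000,
# all of which need four bytes in UTF-8.
sample = "".join(chr(cp) for cp in range(0x10000, 0x10010)).encode("utf-8")

print(len(sample))             # number of bytes in the sample
print(code_point_sum(sample))  # checksum to compare with another implementation
```

For a real file, the bytes would come from something like `open(path, "rb").read()`. A reader that mishandles four-byte sequences would be expected to produce a different total, which is what makes the sum usable as a cross-implementation check.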
<Regenaxer> Sorry, I can't look at imgur jpegs, they crash my tablet
<freemint> it is a hex dump of the file in question
<Regenaxer> firefox uses too much memory when opening an imgur page, so it kills my terminal on my tablet with only 2 GiB of RAM
<Regenaxer> So good to hear it seems correct
<tankf33der> and pil from jun2018 is not equal to python
<Regenaxer> yeah, until now it supported only up to 3 bytes UTF-8
<tankf33der> and pil32 and pil64 dec2018 are equal to python
<Regenaxer> like original Java
<tankf33der> sleep.
_whitelogger has joined #picolisp
rob_w has quit [Read error: Connection reset by peer]
freemint has quit [Quit: Page closed]