kentonv changed the topic of #sandstorm to: Welcome to #sandstorm: home of all things sandstorm.io. Say hi! | Have a question but no one is here? Try asking in the discussion group: https://groups.google.com/group/sandstorm-dev | Public logs at https://botbot.me/freenode/sandstorm/
<kentonv> yeah they apparently don't consider this to be an outage, you're just expected to know that this can happen, even though they don't document it anywhere AFAICT
<kentonv> I mean I guess it's relatively obvious in retrospect?
<kentonv> but mostly they seem to say "shame on you for not making your app multi-homed, even though that's an enormous engineering effort"
<TimMc> Ugh.
<TimMc> So what happened here, you (or an automated process) tried to roll a cluster using deallocate/allocate and the allocate failed?
<TimMc> (also happy to wait for a postmorterm or whatever)
<kentonv> it's cool, I'm just waiting for disks to move...
<kentonv> the blackrock cluster master automatically recreates instances as needed. Presumably an instance died for one reason or another, and then the master couldn't recreate it
<TimMc> So, attrition.
<TimMc> things die and can't be replaced
<kentonv> the master aborts if gcloud gives it an unexpected error. It restarts immediately and tries to rebuild the cluster. Unfortunately, it always has to recreate all worker machines in this case, because bugs in nbd mean they tend to get into bad states easily.
<kentonv> so it shut down the worker machines and then couldn't start them back up
<kentonv> meanwhile the gateway machine (which receives all incoming traffic) was also destroyed at some point. Maybe it was the first machine to be destroyed, I dunno.
<kentonv> so it was stuck in a loop trying to recreate all these
<TimMc> and you've got a whole datacenter's worth of other customers circling your instances like vultures
<kentonv> probably
xet7 has quit [Remote host closed the connection]
coyotebush has quit [Remote host closed the connection]
iovec has quit [Quit: Connection closed for inactivity]
coyotebush has joined #sandstorm
_whitelogger has joined #sandstorm
_whitelogger has joined #sandstorm
iovec has joined #sandstorm
xet7 has joined #sandstorm
iovec has quit [Quit: Connection closed for inactivity]
ogres has joined #sandstorm
zipppy has joined #sandstorm
iovec has joined #sandstorm
ogres has quit [Quit: Connection closed for inactivity]
sy has quit [Ping timeout: 246 seconds]
sy has joined #sandstorm
sy has quit [Quit: sy]
iovec has quit [Quit: Connection closed for inactivity]
iovec has joined #sandstorm