Hello List,
recently, I've moved /apps to its own device. This invalidates NFS handles on the nodes, of course, so I started to reboot them. To my surprise, they don't come up again. The nodes complain about a time-out, "Failed to request QluMan node config in time", and ask me to check qlumand and qluman-route on the head.
These two processes are indeed running. I've checked the logs, but couldn't fine anything helpful (to me) in there.
qlumand seems to see the node briefly: 2019-03-05 11:45:06,615 [29219] INFO server.admin - Identifying node from '00-25-90-d9-08-86' 2019-03-05 11:45:06,617 [29219] INFO server.admin - Registering Execd 'node31-35' 2019-03-05 11:45:39,645 [29219] INFO server.admin - Execd '00-25-90-d9-08-86' disconnected
I've attached the router log.
There's one thing that's conspicuous, but it doesn't seem to be correlated with the nodes booting: a stack trace when accessing the database.
I'll be grateful for any pointers.
Thanks,
A.