1) Check the hardware sensors and power supply status on the member that froze. Poll them for a few minutes and see whether any reading is borderline or bouncing into a bad range:

cpstat -o 5 -f sensors os...
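
If it helps to line the readings up against the time of the freeze, each line of the polling output can be timestamped as it arrives. The following is only a sketch: `stamp_output` is a hypothetical helper name (not a Check Point tool), and it assumes a POSIX shell with `date` available on the member. Some commands buffer their output when piped, so lines may arrive in bursts.

```shell
# Hypothetical helper (not part of any Check Point tooling):
# run the given command and prefix every output line with a wall-clock timestamp.
stamp_output() {
    "$@" 2>&1 | while IFS= read -r line; do
        printf '%s %s\n' "$(date '+%Y-%m-%d %H:%M:%S')" "$line"
    done
}

# Usage: pass the polling command from this step as the arguments, e.g.
#   stamp_output <polling command and its options> >> <log file of your choice>
```

Redirecting the result to a file gives a record you can compare against the time the member stopped responding.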