Use an OS-level active/passive cluster for the Broker. If the active node fails, the OS promotes the passive node to active, and the clients reconnect to the newly active node. The cutover normally completes within 5-10 minutes, depending on the cluster configuration. The result is a service delay rather than an outage.
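
For the delay to stay a delay, clients have to keep retrying for at least as long as the cutover takes. The following is a minimal sketch of such a retry loop in Python, not the Broker client's own reconnect logic. The function name `connect_with_retry`, the 15-minute retry budget, the backoff values, and the host and port in the usage comment are all assumptions chosen for illustration.

```python
import time


def connect_with_retry(connect, max_wait_s=900.0, delay_s=5.0, max_delay_s=60.0,
                       sleep=time.sleep, clock=time.monotonic):
    """Call connect() until it succeeds or the retry budget is used up.

    max_wait_s is an assumed budget (15 minutes) chosen to exceed the
    5-10 minute cutover window; adjust it to your cluster configuration.
    """
    deadline = clock() + max_wait_s
    attempt = 0
    while True:
        attempt += 1
        try:
            return connect()
        except OSError as exc:
            # Give up if the next wait would run past the deadline.
            if clock() + delay_s > deadline:
                raise TimeoutError(
                    f"broker unreachable after {attempt} attempts"
                ) from exc
            sleep(delay_s)
            # Exponential backoff, capped so retries stay reasonably frequent.
            delay_s = min(delay_s * 2, max_delay_s)


# Hypothetical usage: connect to the cluster's virtual address, which moves
# to whichever node is currently active. Host and port are placeholders.
#
#   import socket
#   sock = connect_with_retry(
#       lambda: socket.create_connection(("broker-vip.example.com", 9000), timeout=10)
#   )
```

Pointing clients at the cluster's virtual address rather than at a physical node means they need no knowledge of which node is active. They simply retry until the address answers again.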