We have active-active cluster environment of UM for messaging.From last few days we are facing intermediate connection issue and it is only happening in particular time stamp.
Automatically it got connected after few disconnectivity.Below are logs:
[249]2017-07-28 07:01:14 CEST [ISS.0153.0049E] UM connection alias IS_UM_CONNECTION was disconnected from its primary UM session. The primary UM session will automatically attempt to reconnect 5 time(s) before stopping the UM connection alias.
[248]2017-07-28 07:01:14 CEST [ISS.0153.0998E] [WMM.UM.ALIAS.03] Alias IS_UM_CONNECTION was disconnected: state = STARTED; clusterTryAgainAttempt = -1
[247]2017-07-28 07:01:14 CEST [ISS.0153.0050I] UM connection alias IS_UM_CONNECTION is now reconnected to its primary session.
[246]2017-07-28 07:01:12 CEST [ISS.0153.0049E] UM connection alias IS_UM_CONNECTION was disconnected from its primary UM session. The primary UM session will automatically attempt to reconnect 5 time(s) before stopping the UM connection alias.
[245]2017-07-28 07:01:12 CEST [ISS.0153.0998E] [WMM.UM.ALIAS.03] Alias IS_UM_CONNECTION was disconnected: state = STARTED; clusterTryAgainAttempt = -1
[244]2017-07-28 07:01:12 CEST [ISS.0153.0050I] UM connection alias IS_UM_CONNECTION is now reconnected to its primary session.
Action was taken,
I have restarted UM cluster services still having the same issue.
I have checked the cluster status it is working fine and connected.
Here are few steps to avoid these kinds of errors, check and confirm if this helps
1> Make sure you have installed the UM server and client levels fixes including IS
2> What are the min and max memory assigned to each node of UM
3> Analyse the um logs while you encounter the issues
4> Change the logging level of UM to trace and capture the log file for further analysis
5> Check your UM A-A cluster configuration if it is correctly configured as per the doc
6> Provide how many nodes you have in A-A cluster I assume it should be more than 3 nodes and do you use sites as part of cluster?