I have issues with the clustering in Universal messaging 9.8 version.
I have configured the cluster of 2 universal messaging server with sites as Primary and Secondary, Primary has IsPrime flag, but i have issue when we restart the servers. below logs are showing in cluster.logs
Cluster Members shows
eaiitg Master Primary eaiitg Local Online
eaiitg1 nothing here Secondary nothing here Disconnected Local
,Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:12 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:12 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:14 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:14 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:14 MDT 2015],Cluster> Cluster State Manager: Failed to establish viable cluster, resetting links
[Mon Jul 27 13:20:16 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:16 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:18 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:19 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:19 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:21 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
[Mon Jul 27 13:20:21 MDT 2015],Cluster> Setting potential master to eaiitg1 yet master count is only 1.0 while we we need more than 1.0 for quorum
we are on latest fix and patch, i tried uninstall and reinstall couple of times but after restart of UM the issue is coming back, i have open a ticket with SAG, waiting on solution. thanks for update.
Auditing for the install location /eaiums/wM9/umserver
i am sure you must have already done this. but i will tell you how i resolved issue.
deleted existing instances using ninstancemanager.bat|sh
deleted data folder under umserver. better delete umserver folder under SAG_HOME/UniversalMessaging/server
then recreate instance using ninstancemanager
and recreated cluster and everything worked fine. i am still actively monitoring UM cluster to see if there are any issues reoccuring but i have seen none in last 3-4 days.
one thing to notice when you reinstall or recreate make sure you delete umserver folder. UM keep reference on file system and even after reinstall it may keep reference to old instance.
Not sure if this is the best way to go. mine is qa environment so i could do it.
We see similar issue on our Production UM servers.
One of the UM is not joining cluster with error.
Cluster State Manager: Failed to establish viable cluster
Cluster> Found existing Master in cluster as ulvwsbms01, setting local state to that of cluster
[Tue Aug 25 15:53:40 BST 2015],Cluster> Found existing Master in cluster as xxxxxxxx, setting local state to that of cluster
[Tue Aug 25 15:53:42 BST 2015],Cluster> Found existing Master in cluster as xxxxxxxx, setting local state to that of cluster
Could you please let me know if have you got any update from SAG on this issue? I cant delete/recreate the instance as this is Production.