2 Replies Latest reply on Aug 20, 2012 9:01 AM by master v

    large cluster - adding removing nodes causes cluster to fall apart

    master v Newbie

      Hi there

       

      Sorry I am part of the operations team supporting one of our applications which is around 50 node infinispan cluster, it is an older version of 5.0.0 and the current issue faced is if a new node is added specially whilst cluster is active, it can make node appear as part of a new multicast even though it is joining the right multicast ip, sometimes stopping a node causes other nodes to fall out too.

       

      The other issue is when starting up certain nodes the values of node id's is returned as hexadecimal values.

       

      Having worked with different size inifispan clusters, it would seem if the cluster consists of less nodes unsure what the best range is but lets assume a cluster containing 10 nodes would work fine and removal, addition of nodes would be less complicated so it would seem.

       

      Trying to find the best solution for this current dilemma since most times this issue causes an outage and would like to find a way of causing less disturbance.

       

       

      1. Would it be possible to run a large cluster like above on lets say 4 different multicast groups but then have them work together or share cache across multicast - if so what would this be configuration or development or a case of both?

       

      2. What if the master node starting up the cluster was to be a much better specification than the rest of the nodes in the cluster and was doing nothing else besides being the master cluster server i.e. not publicly giving out data, would this help addition/removal of cluster nodes without disturbing/breaking entire cluster.