
After running for a while, some instances of certain services fail


Mark123


Hello, everyone,

    The customer's current production environment is as follows: two Spotfire servers form a cluster.

    image.png.566bc587e3132d51db469a6b26917c82.png

    There is a recurring problem: after running for a while, some instances of certain services fail.

    The node logs show an error message like this:

    image.thumb.jpeg.730c7759b600b7ad2d996516dc2a1e9f.jpeg

    Here is what we have confirmed or suspect so far:
    1. The template for the automated tasks does not use maps.
    2. The NM server has about 300 GB of free memory.
    3. The NM server reports that the remaining disk space is less than half of the remaining memory (I am not sure whether this is related to the problem).
    4. Does every NM have a virtual memory limit? Could too many instances cause it to run out of virtual memory?
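    To make the condition in item 3 concrete, here is a minimal sketch of the comparison as I understand it (free disk space below half of free memory). The function names are hypothetical, the 300 GiB memory figure is taken from item 2, and this is not Spotfire's actual check, only an illustration of the arithmetic.

```python
import shutil

GIB = 1024 ** 3  # bytes per GiB


def disk_below_half_of_memory(free_disk_bytes: int, free_memory_bytes: int) -> bool:
    """Return True when free disk space is less than half of free memory.

    Hypothetical helper: it mirrors the wording of the warning in item 3,
    not any documented Spotfire rule.
    """
    return free_disk_bytes < free_memory_bytes / 2


def free_disk_on(path: str = ".") -> int:
    """Free bytes on the volume that contains `path` (standard library only)."""
    return shutil.disk_usage(path).free


if __name__ == "__main__":
    # Assumption: about 300 GiB of free memory, as stated in item 2.
    free_memory = 300 * GIB
    free_disk = free_disk_on(".")
    if disk_below_half_of_memory(free_disk, free_memory):
        print(f"Warning condition met: {free_disk / GIB:.1f} GiB free disk")
    else:
        print(f"Condition not met: {free_disk / GIB:.1f} GiB free disk")
```

    With 300 GiB of free memory, the condition is met whenever less than 150 GiB of disk space is free on the volume being checked.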
