Can someone help me with a large cluster setup? I have 32 servers, each with 64 cores

At first I assigned one free core to the master node, paired with 31 slave nodes, and all slave nodes started fine. When I started the master node, everything worked fine for the first few minutes, then it reported an error: ‘rpc/data_worker_ipc_server.go:125’,‘msg’: ‘parent process not found’’

I then thought it was a config file issue, because it was a configuration file from another different cpu model.

Then I tried 1 master node with two slave nodes, and something magical happened, it started fine.

This made me think that the master node had too few free cores allocated, so I increased the master node’s free cores to 13, and then uploaded all the configuration files to the 32 servers, but it still didn’t work. The slave nodes were all fine, but the master node reported the same error.

I then changed the number of slave nodes to 27 and 10 respectively, but both gave the same error. I was so confused.