Dear All,
I have encountered problems when running TM (6.3.1) with
large number of CPU-cores, namely 256. The calculation I am
trying to make run is big, ~ 4000 basis functions, hence so many
CPUs. The problem occurs in the dscf module. It starts OK, reaches
the following line in the output:
" DSCF restart information will be dumped onto file mos"
and then it stops/hangs up. I noticed that there is an extra process
on one (the first) node (out of 256 nodes). I mean I use nodes=32:ppn=8
but for some reason there turned 9 processes on just only the first node.
When I reduced the number of CPU-cores to 128, i.e. nodes=16,ppn=8
everything got fine, the job is running and there only 8 processes on each
node as it should be.
Any comment on this problem will be greatly appreciated.
Best regards,
Evgeniy