Installation and usage of TURBOMOLE > Parallel Runs

aoforce parallel version problem

(1/2) > >>

gopa:
Hi,

    I am running an aoforce calculation for a molecule with 240 atoms
on a HPC Cluster (24 core with 128GB total RAM). I set the following
parameters in my shell script for running the parallel version of aoforce.


--- Code: ---setenv LD_LIBRARY_PATH /TURBOMOLDE/libso/x86_64-unknown-linux-gnu_smp
setenv PARA_ARCH SMP
setenv PARNODES 24
setenv TURBOMOLE_SYSNAME x86_64-unknown-linux-gnu_smp
setenv TURBOARCH x86_64-unknown-linux-gnu_smp
setenv SMPCPUS 24
setenv TURBOMOLE_SYSNAME `sysname`
--- End code ---

I see that aoforce module starts the parallel version with the following
lines printed in the beginning


--- Code: ---SMPCPUS    set: Shared-memory Parallelization with  24 CPUs.
SMP Parallelization Reference:
C. van Wullen, J. Comput. Chem. 32 (2011) 1195--1201
operating system is UNIX !
--- End code ---

I have set the maxcor value to 84000 MB (which will be roughly around
65% of the total RAM.

However, the aoforce is terminating with the following error
(please see the last part of aoforce output.


 
--- Code: ---CONSTRUCTING S(i,j)xi
      ...terminated. cpu:     217.45       wall:     217.63
 
 
 
 
 CONSTRUCTING <i|x,y,z|j>*S(i,j)xi          -> Dip. deriv.
      ...terminated. cpu:       7.56       wall:       7.57
 
 
 
 
 CONSTRUCTING epsilon(i)*S(i,j)xi*S(i,j)chi -> Hessian
      ...terminated. cpu:      20.71       wall:      20.73
 
 
 
 
 CONSTRUCTING G(a,i)[S(k,l)xi]              -> RHS
              G(i,j)[S(k,l)xi]*S(i,j)chi    -> Hessian
 
         Maximum core memory set to                  84000 MB
         This corresponds to                   971 vectors in CAO basis

========================
 internal module stack:
------------------------
    force
    mkg_sxi
    lpdrc1
========================

 smp_fork: cannot fork
 force ended abnormally
--- End code ---


I have explored the forum and was unlucky.
Could somebody help me telling what is going wrong here?

uwe:
Hi,

which version of Turbomole are you using? In the Turbomole installation directory there should be a file named TURBOMOLE_<version-number>. The default parallelization changed from the shared-memory implementation of C. van Wullen (as given in the output) to the OpenMP version in newer Turbomole releases.

Regards,
Uwe

gopa:
I figured out the problem. It was memory issue.
I was also setting $ricore parameter to a large value along with $maxcor, and it was blowing-up the memory.
I reduced the $ricore value and the calculation went well.
Thanks for lot for all the help and sorry for this trivial post.

gopa:

--- Quote from: uwe on February 11, 2021, 11:07:07 AM ---Hi,

which version of Turbomole are you using? In the Turbomole installation directory there should be a file named TURBOMOLE_<version-number>. The default parallelization changed from the shared-memory implementation of C. van Wullen (as given in the output) to the OpenMP version in newer Turbomole releases.

Regards,
Uwe

--- End quote ---

I was using version 7.3.1
Can you confirm whether default parallelization is SMP or OpenMP for this version?

uwe:
Hi,

Turbomole 7.3.1 already uses the OpenMP version as default. The output of aoforce should contain:

 
--- Quote ---         OpenMP Shared-Memory Parallelization:  8 CPUs.

            By: Erik P. Almaraz and Filipp Furche
            Copyright 2009-2011 by UCI and TURBOMOLE GmbH.
--- End quote ---

Either you are using an older version (check the output, it should start with a line telling you which version you use), or you did set TM_PAR_FORK environment variable which switches back from OpenMP to the fork/shared-memory version.

About memory problems: Note that the settings in $ricore and $maxcor are per thread. So if you set $maxcor 1000 and run the job on 40 cores, it will use 40 times 1000 MB, so roughly 40GB. Setting it to 84 GB using 24 CPUs will need 2 TB memory.

Regards,
Uwe

Navigation

[0] Message Index

[#] Next page

Go to full version