I have an HPC with 40 procs and with almost 380GB RAM in a Master-Slave configuration (one master node and 2 compute nodes).
Now when I ran to compute a GW using PPApprox, I observed that even when I wrote the followings:
y['X_all_q_ROLEs'] = 'q.k.c.v'
y['X_all_q_CPU'] = '4.4.2.1'
y['SE_ROLEs'] = 'q.qp.b'
y['SE_CPU'] = '4.4.2'
y['X_all_q_nCPU_invert'] = '4'
y['X_all_q_nCPU_LinAlg_INV'] = '2'
the 32 CPUs are not used. However, the program runs without any errors and the report-file shows :
* CPU : 1
* THREADS (max): 1
* THREADS TOT(max): 1
* I/O NODES : 1
* Fragmented WFs :yes
CORE databases in .
Additional I/O in .
Communications in reference
Input file is reference.in
Report file is reference/r-reference_em1d_ppa_HF_and_locXC_gw0_rim_cut
Job string(main): reference
Log files in reference/LOG
Am I missing anything to write additionally? I am also wondering, if this scheme of CPU division is an optimized parallelism!

Should I be writing something in this line:
os.system('cd gw_conv; %s -F %s -J %s -C %s 2> %s.log'%(yambo,filename,folder,folder,folder))
With regards,
Sitangshu