I am performing a G0W0 calculation for a large 2D carbon-allotrope system using Yambo-5.2.4. Due to the large system size, I have implemented a parallelization scheme and split the calculations across multiple bands using QPKrange, with the intention of merging the databases afterward. However, the calculations are progressing too slowly and fail to complete within the 72-hour job runtime limit.
Below is the parallelization setup I am currently using:
Code: Select all
PAR_def_mode= "memory" # [PARALLEL] Default distribution mode
X_and_IO_CPU= "1 1 1 16 5" # [PARALLEL] CPUs for each role
X_and_IO_ROLEs= "q g k c v" # [PARALLEL] CPUs roles (q,g,k,c,v)
X_and_IO_nCPU_LinAlg_INV=-1 # [PARALLEL] CPUs for Linear Algebra (automatic)
DIP_CPU= "1 16 5" # [PARALLEL] CPUs for each role
DIP_ROLEs= "k c v" # [PARALLEL] CPUs roles (k,c,v)
SE_CPU= "q qp b" # [PARALLEL] CPUs for each role
SE_ROLEs= "1 1 80" # [PARALLEL] CPUs roles (q,qp,b)
Could you please suggest any strategies to optimize the calculation? For example, are there adjustments to the parallelization scheme, alternative distribution modes, or other parameters that could improve performance for this system?
Best regards,
Tushar Waghmare