bug in BS solve or kernel
Posted: Tue Jan 26, 2016 7:21 pm
Dear Yambo Community,
I think I identified a bug, or perhaps it has to do with incorrect usage of the parallel parameters in the BS part without helpful error traps. I'm using the latest svn code in the development branch (revision 94).
I start with solid Si using an 8x8x8 k-point grid from Abinit. I did a previous G0W0 calculation and use its results for the QP corrections. The inputs are attached. The BSE calculation that fails has
BS_ROLEs= "k eh t"
BS_CPU= "32 1 8"
It hangs when calling section() from yambo_driver() with the "Game Over" message. It is actually waiting for half the processors (128 of 256) at mpi_barrier in PARALLEL_wait. If instead I use
BS_CPU= "32 2 4"
it works ok, I suppose, even though a warning message is produced:
<02s> P0001: [WARNING] n_eh_CPU > 1 in a system with symmetries. This is not efficient, in particular if you have many kpts.
<02s> P0001: [WARNING] It is suggested to first distribute on "k" and "t". Maximum value for both is: 65
In this case (BS_CPU= "32 2 4") the dielectric function looks reasonable, even though it is not converged. In the previous case (BS_CPU= "32 1 8"), the dielectric function does not match.
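For what it's worth, both layouts multiply out to the same 256 MPI tasks, so the hang does not look like a simple task-count mismatch. Below is a minimal Python sketch of that arithmetic. The helper names parse_layout and check_layout are hypothetical and are not part of Yambo; the only inputs taken from this post are the two BS_CPU strings, the BS_ROLEs string, and the total of 256 processors. The "eh" note just mirrors the warning Yambo printed.

```python
from math import prod


def parse_layout(roles: str, cpus: str) -> dict:
    """Pair each role with its CPU count, e.g. "k eh t" with "32 1 8"."""
    names = roles.split()
    counts = [int(c) for c in cpus.split()]
    if len(names) != len(counts):
        raise ValueError("BS_ROLEs and BS_CPU have different lengths")
    return dict(zip(names, counts))


def check_layout(layout: dict, n_tasks: int) -> list:
    """Return a list of notes; an empty list means nothing was flagged."""
    notes = []
    total = prod(layout.values())
    if total != n_tasks:
        notes.append(f"product of BS_CPU is {total}, but {n_tasks} tasks are used")
    # Mirrors the warning in the log; this is not a Yambo-internal rule.
    if layout.get("eh", 1) > 1:
        notes.append("eh > 1: Yambo warns this is inefficient with symmetries")
    return notes


failing = parse_layout("k eh t", "32 1 8")  # the layout that hangs
working = parse_layout("k eh t", "32 2 4")  # the layout that completes

for name, layout in (("failing", failing), ("working", working)):
    print(name, prod(layout.values()), check_layout(layout, 256))
```

Neither layout is flagged for the task count, which is why I suspect the problem is inside the BS part rather than in my choice of totals.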
Let me know if I can do anything to help with solving this.
Thanks,
Gary