memory issue in G0W0 run
Posted: Tue Nov 22, 2011 11:10 pm
Hi Yambo community,
I'm trying to do a G0W0 calculation on a graphene layer with 128 electrons (input file attached). I'm running on 6 processors (1processor per node) and allocating 16Gb memory per processor. Just to give you a sense of the size of the calculation, a colleague has done a very similar calculation using 5Gb memory on the Cineca sp6 machine, on 4 processors in 60 hours.
Even though there should be enough memory, the calculation crashes in a few minutes saying:
<03m-01s> [08] Bare local and non-local Exchange-Correlation
[ERROR] STOP signal received while in :[08] Bare local and non-local Exchange-Correlation
[ERROR]Mem All. failed. Element WF require 0.00000 [Gb]
(l_* and r_* files also attached, together with compilation options used).
We checked that Yambo was only using 3-4% on the memory allocated on each node when it crashed.
I also tried to decrease the RL cutoff to ~10000 : in that case the calculation can get to the next step, but then it crashes with a similar error:
[09] Dynamic Dielectric Matrix (PPA)
[ERROR]Mem All. failed. Element WF require 1.46268 [Gb]
Yambo runs otherwise fine on simple tests on our machine (we tried bulk Si RPA and the SiH4 tutorial), but we installed Yambo very recently and we're new to it, so there might be still a few things left to adjust, and I'm asking for your kind help on this.
Many thanks in advance.
Best,
Marco Bernardi
Ph.D. Candidate
Grossman Group at MIT (zeppola.mit.edu)
e-mail: bmarco@mit.edu
Phone: +1-617-258-0222
I'm trying to do a G0W0 calculation on a graphene layer with 128 electrons (input file attached). I'm running on 6 processors (1processor per node) and allocating 16Gb memory per processor. Just to give you a sense of the size of the calculation, a colleague has done a very similar calculation using 5Gb memory on the Cineca sp6 machine, on 4 processors in 60 hours.
Even though there should be enough memory, the calculation crashes in a few minutes saying:
<03m-01s> [08] Bare local and non-local Exchange-Correlation
[ERROR] STOP signal received while in :[08] Bare local and non-local Exchange-Correlation
[ERROR]Mem All. failed. Element WF require 0.00000 [Gb]
(l_* and r_* files also attached, together with compilation options used).
We checked that Yambo was only using 3-4% on the memory allocated on each node when it crashed.
I also tried to decrease the RL cutoff to ~10000 : in that case the calculation can get to the next step, but then it crashes with a similar error:
[09] Dynamic Dielectric Matrix (PPA)
[ERROR]Mem All. failed. Element WF require 1.46268 [Gb]
Yambo runs otherwise fine on simple tests on our machine (we tried bulk Si RPA and the SiH4 tutorial), but we installed Yambo very recently and we're new to it, so there might be still a few things left to adjust, and I'm asking for your kind help on this.
Many thanks in advance.
Best,
Marco Bernardi
Ph.D. Candidate
Grossman Group at MIT (zeppola.mit.edu)
e-mail: bmarco@mit.edu
Phone: +1-617-258-0222