COHSEX stopping without error messages
Posted: Tue Oct 11, 2011 10:02 am
Dear all,
I have found problems in running COHSEX, a bit different from those mentioned by someone else in a previous post viewtopic.php?f=14&t=323 so i'm not sure the cause of the problem is the same.
Also in my case the system is a molecule, I have only 1 k-point, and i'm using yambo-3.2.4-rev.17, internal rev 855.
i'm attaching a tar containing
the input file I use for COHSEX
the log and report files
l_dbs
report and setup files from yambo installation
LIST_log = list of all the files in the directory
What always happens (in different runs, also varying input parameters such as the energy cutoff, the n. of bands..) is that the run stops at this point (I'm reporting the last lines of the log file):
<03d-02h-17m-49s> P01: G0W0 COHSEX |################### | [095%] 02d-15h-56m-30s(E) 02d-19h-18m-25s(X)
<03d-02h-19m-11s> P02: G0W0 COHSEX |################### | [095%] 02d-15h-57m-53s(E) 02d-19h-19m-52s(X)
<03d-05h-38m-27s> P01: G0W0 COHSEX |####################| [100%] 02d-19h-17m-08s(E) 02d-19h-17m-08s(X)
<03d-05h-40m-40s> [M 5.350 Gb] Free ISC-GAMP ( 1.194)
<03d-05h-40m-40s> [M 4.158 Gb] Free X ( 1.192)
<03d-05h-40m-40s> [M 1.775 Gb] Free PPaR ( 2.384)
without indicating any “ERR” in the report file, which ends like this:
[09] Dyson equation: Newton solver
==================================
[Newton] Sc step [ev]: 0.100000
[Newton] Sc steps : 1
[Newton] SC iterations :0
[09.01] G0W0 : COHSEX
=====================
The situation is the same both if I run the calculation on a local cluster, or on CINECA SP6, with the same yambo version (the one on cineca with netcdf, the one on our cluster without netcdf): the only difference is that on cineca I have an empty report file (due to different file handling , input/output procedures etc on the two different machines, I guess), and the system indicates “Segmentation fault”, but the point where the run stops, as seen from the log file, is the same.
I'm running other calculations (TDDFT and BSE) on the same system from the same /SAVE without any errors etc.
Does anyone have ideas on how to solve this problem?
Thanks a lot in advance
cheers
Elena
Elena Molteni
Department of Physics
University of Milan
via Celoria, 16
I-20133, Milan, Italy
and European Theoretical Spectroscopy Facility
(ETSF) http://www.etsf.eu
PS: I know this is a big system, and that you suggest to post “small” examples, but at the moment I have some problems with abinit (which I normally use for creating KSS files, to create then the /SAVE dir etc), so creating a “small” test system at the moment would not be so easy and quick. (for yambo calculations i'm using the /SAVE directory I created some time ago, before the start of abinit problems)
I have found problems in running COHSEX, a bit different from those mentioned by someone else in a previous post viewtopic.php?f=14&t=323 so i'm not sure the cause of the problem is the same.
Also in my case the system is a molecule, I have only 1 k-point, and i'm using yambo-3.2.4-rev.17, internal rev 855.
i'm attaching a tar containing
the input file I use for COHSEX
the log and report files
l_dbs
report and setup files from yambo installation
LIST_log = list of all the files in the directory
What always happens (in different runs, also varying input parameters such as the energy cutoff, the n. of bands..) is that the run stops at this point (I'm reporting the last lines of the log file):
<03d-02h-17m-49s> P01: G0W0 COHSEX |################### | [095%] 02d-15h-56m-30s(E) 02d-19h-18m-25s(X)
<03d-02h-19m-11s> P02: G0W0 COHSEX |################### | [095%] 02d-15h-57m-53s(E) 02d-19h-19m-52s(X)
<03d-05h-38m-27s> P01: G0W0 COHSEX |####################| [100%] 02d-19h-17m-08s(E) 02d-19h-17m-08s(X)
<03d-05h-40m-40s> [M 5.350 Gb] Free ISC-GAMP ( 1.194)
<03d-05h-40m-40s> [M 4.158 Gb] Free X ( 1.192)
<03d-05h-40m-40s> [M 1.775 Gb] Free PPaR ( 2.384)
without indicating any “ERR” in the report file, which ends like this:
[09] Dyson equation: Newton solver
==================================
[Newton] Sc step [ev]: 0.100000
[Newton] Sc steps : 1
[Newton] SC iterations :0
[09.01] G0W0 : COHSEX
=====================
The situation is the same both if I run the calculation on a local cluster, or on CINECA SP6, with the same yambo version (the one on cineca with netcdf, the one on our cluster without netcdf): the only difference is that on cineca I have an empty report file (due to different file handling , input/output procedures etc on the two different machines, I guess), and the system indicates “Segmentation fault”, but the point where the run stops, as seen from the log file, is the same.
I'm running other calculations (TDDFT and BSE) on the same system from the same /SAVE without any errors etc.
Does anyone have ideas on how to solve this problem?
Thanks a lot in advance
cheers
Elena
Elena Molteni
Department of Physics
University of Milan
via Celoria, 16
I-20133, Milan, Italy
and European Theoretical Spectroscopy Facility
(ETSF) http://www.etsf.eu
PS: I know this is a big system, and that you suggest to post “small” examples, but at the moment I have some problems with abinit (which I normally use for creating KSS files, to create then the /SAVE dir etc), so creating a “small” test system at the moment would not be so easy and quick. (for yambo calculations i'm using the /SAVE directory I created some time ago, before the start of abinit problems)