I am trying to run optical absorption spectrum calculation using Yambo 5.0.4. As its a big system with 332 atoms and number of bands = 3600. With the modest parameters, we need around 224.2684 [Gb] [per MPI task; please help me know if my understanding is correct]
SInce, its a big system, the calculations will take quite long time, so we decided to resort to the parallelization strategies. And we use them in our input files [as shown here]
Code: Select all
#
# __ __ ________ ___ __ __ _______ ______
# /_/\/_/\ /_______/\ /__//_//_/\ /_______/\ /_____/\
# \ \ \ \ \\::: _ \ \\::\| \| \ \\::: _ \ \\:::_ \ \
# \:\_\ \ \\::(_) \ \\:. \ \\::(_) \/_\:\ \ \ \
# \::::_\/ \:: __ \ \\:.\-/\ \ \\:: _ \ \\:\ \ \ \
# \::\ \ \:.\ \ \ \\. \ \ \ \\::(_) \ \\:\_\ \ \
# \__\/ \__\/\__\/ \__\/ \__\/ \_______\/ \_____\/
#
#
# Version 5.0.4 Revision 19598 Hash 20b2ffa04
# Branch is 5.0
# MPI+OpenMP+SLK+HDF5_IO Build
# http://www.yambo-code.org
#
optics # [R] Linear Response optical properties
chi # [R][CHI] Dyson equation for Chi.
tddft # [R][K] Use TDDFT kernel
X_Threads=0 # [OPENMP/X] Number of threads for response functions
DIP_Threads=0 # [OPENMP/X] Number of threads for dipoles
Chimod= "ALDA" # [X] IP/Hartree/ALDA/LRC/PF/BSfxc
FxcGRLc= 1 Ry # [TDDFT] XC-kernel RL size
NGsBlkXd= 1 Ry # [Xd] Response block size
% QpntsRXd
1 | 1 | # [Xd] Transferred momenta
%
% BndsRnXd
1 | 3600 | # [Xd] Polarization function bands
%
% EnRngeXd
0.00000 | 6.00000 | eV # [Xd] Energy range
%
% DmRngeXd
0.100000 | 0.100000 | eV # [Xd] Damping range
%
ETStpsXd= 100 # [Xd] Total Energy steps
% LongDrXd
1.000000 | 1.000000 | 1.000000 | # [Xd] [cc] Electric Field
%
X_CPU= "1.4.8.1.1" # [PARALLEL] CPUs for each role
X_ROLEs= "g.v.c.k.q" # [PARALLEL] CPUs roles (q,g,k,c,v)
X_nCPU_LinAlg_INV= 32 # [PARALLEL] CPUs for Linear Algebra (if -1 it is automatically set)
DIP_CPU= "1 8 4" # [PARALLEL] CPUs for each role
DIP_ROLEs= "k c v" # [PARALLEL] CPUs roles (k,c,v)
PAR_def_mode= "memory"
X_all_q_ROLEs= "q k c v"
X_all_q_CPU= "1 1 8 4"
~
Code: Select all
#!/bin/bash -l
#SBATCH --job-name=E_top_3600
#SBATCH --nodes=8
#SBATCH --ntasks-per-node=4
#SBATCH --cpus-per-task=8
#SBATCH --time=24:00:00
#SBATCH --export=NONE
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.err
###SBATCH --mem=0
#SBATCH -p spr1tb
unset SLURM_EXPORT_ENV
module purge
module load intel/2021.4.0
module load intelmpi/2021.6.0
module load mkl/2021.4.0
module load hdf5/1.10.7-impi-intel
module load netcdf-c/4.8.1
module load netcdf-fortran/4.5.3-intel
export PATH=/home/woody/bccc/bccc128h/software/yambo_5.0_fritz_cpu/bin:$PATH
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
export MKL_NUM_THREADS=$SLURM_CPUS_PER_TASK
srun yambo -F yambo.in_ALDA
Code: Select all
___ __ _____ __ __ _____ _____
| Y || _ || Y || _ \ | _ |
| | ||. | ||. ||. | / |. | |
\_ _/ |. _ ||.\_/ ||. _ \ |. | |
|: | |: | ||: | ||: | \|: | |
|::| |:.|:.||:.|:.||::. /|::. |
`--" `-- --"`-- --"`-----" `-----"
<---> P20: [01] MPI/OPENMP structure, Files & I/O Directories
<---> P20-f2274.nhr.fau.de: MPI Cores-Threads : 32(CPU)-8(threads)
<---> P20-f2274.nhr.fau.de: MPI Cores-Threads : DIP(environment)-1 8 4(CPUs)-k c v(ROLEs)
<---> P20-f2274.nhr.fau.de: MPI Cores-Threads : X(environment)-1.4.8.1.1(CPUs)-g.v.c.k.q(ROLEs)
<---> P20-f2274.nhr.fau.de: [02] CORE Variables Setup
<---> P20-f2274.nhr.fau.de: [02.01] Unit cells
<04s> P20-f2274.nhr.fau.de: [02.02] Symmetries
<04s> P20-f2274.nhr.fau.de: [02.03] Reciprocal space
<04s> P20-f2274.nhr.fau.de: [02.04] K-grid lattice
<04s> P20-f2274.nhr.fau.de: [02.05] Energies & Occupations
<04s> P20-f2274.nhr.fau.de: [WARNING][X] Metallic system
<04s> P20-f2274.nhr.fau.de: [03] Transferred momenta grid and indexing
<04s> P20-f2274.nhr.fau.de: [MEMORY] Alloc bare_qpg( 58.86900 [Mb]) TOTAL: 243.3530 [Mb] (traced) 96.28400 [Mb] (memstat)
<04s> P20-f2274.nhr.fau.de: [04] Dipoles
<04s> P20-f2274.nhr.fau.de: [PARALLEL DIPOLES for K(ibz) on 1 CPU] Loaded/Total (Percentual):1/1(100%)
<04s> P20-f2274.nhr.fau.de: [PARALLEL DIPOLES for CON bands on 8 CPU] Loaded/Total (Percentual):108/862(13%)
<04s> P20-f2274.nhr.fau.de: [PARALLEL DIPOLES for VAL bands on 4 CPU] Loaded/Total (Percentual):688/2752(25%)
<04s> P20-f2274.nhr.fau.de: [DIP] Checking dipoles header
<05s> P20-f2274.nhr.fau.de: [WARNING][DIP] Database not correct or missing. To be computed
<05s> P20-f2274.nhr.fau.de: [MEMORY] Alloc DIP_iR( 55.59900 [Mb]) TOTAL: 299.0520 [Mb] (traced) 96.28400 [Mb] (memstat)
<05s> P20-f2274.nhr.fau.de: [MEMORY] Alloc DIP_P( 55.59900 [Mb]) TOTAL: 354.6510 [Mb] (traced) 96.28400 [Mb] (memstat)
<05s> P20-f2274.nhr.fau.de: [MEMORY] Alloc DIP_v( 55.59900 [Mb]) TOTAL: 410.2500 [Mb] (traced) 96.28400 [Mb] (memstat)
<05s> P20-f2274.nhr.fau.de: [x,Vnl] computed using 7398 projectors
<05s> P20-f2274.nhr.fau.de: [WARNING] [x,Vnl] slows the Dipoles computation. To neglect it rename the ns.kb_pp file
<05s> P20-f2274.nhr.fau.de: [MEMORY] Alloc pp_kb( 128.7640 [Mb]) TOTAL: 539.0170 [Mb] (traced) 96.28400 [Mb] (memstat)
<05s> P20-f2274.nhr.fau.de: [MEMORY] Alloc pp_kbd( 128.7640 [Mb]) TOTAL: 667.7810 [Mb] (traced) 96.28400 [Mb] (memstat)
<15s> P20-f2274.nhr.fau.de: [MEMORY] Alloc kbv( 217.7363 [Gb]) TOTAL: 218.4041 [Gb] (traced) 96.28400 [Mb] (memstat)
<15s> P20-f2274.nhr.fau.de: Dipoles: P, V and iR (T) | | [000%] --(E) --(X)
<15s> P20-f2274.nhr.fau.de: [MEMORY] Alloc WF%c( 5.856924 [Gb]) TOTAL: 224.2610 [Gb] (traced) 96.28400 [Mb] (memstat)
<15s> P20-f2274.nhr.fau.de: [PARALLEL distribution for Wave-Function states] Loaded/Total(Percentual):796/3600(22%)
<16s> P20-f2274.nhr.fau.de: [MEMORY] Alloc wf_disk( 5.856924 [Gb]) TOTAL: 230.1327 [Gb] (traced) 96.28400 [Mb] (memstat)
<44s> P20-f2274.nhr.fau.de: [MEMORY] Free wf_disk( 5.856924 [Gb]) TOTAL: 224.2758 [Gb] (traced) 96.28400 [Mb] (memstat)
Code: Select all
____ ____ _ ____ ____ ______ ___
|_ _||_ _| / \ |_ \ / _||_ _ \ ." `.
\ \ / / / _ \ | \/ | | |_) | / .-. \
\ \/ / / ___ \ | |\ /| | | __". | | | |
_| |_ _/ / \ \_ _| |_\/_| |_ _| |__) |\ `-" /
|______||____| |____||_____||_____||_______/ `.___."
Version 5.0.4 Revision 19598 Hash 20b2ffa04
Branch is 5.0
MPI+OpenMP+SLK+HDF5_IO Build
http://www.yambo-code.org
06/02/2026 at 14:55 yambo @ f2268.nhr.fau.de
==================================================
Cores-Threads : 32(CPU)-8(threads)
Cores-Threads : DIP(environment)-1 8 4(CPUs)-k c v(ROLEs)
Cores-Threads : X(environment)-1.4.8.1.1(CPUs)-g.v.c.k.q(ROLEs)
MPI Cores : 32
Threads per core : 8
Threads total : 256
Nodes Computing : 8
Nodes IO : 1
Fragmented WFs : yes
CORE databases : .
Additional I/O : .
Communications : .
Input file : yambo.in_ALDA
Report file : ./r_optics_chi_tddft_19
Verbose log/report : no
Log files : ./LOG
Precision : SINGLE
[RD./SAVE//ns.db1]--------------------------------------------------------------
Bands : 3600
K-points : 1
G-vectors : 7535261 [RL space]
Components : 941817 [wavefunctions]
Symmetries : 2 [spatial+T-reV]
Spinor components : 1
Spin polarizations : 1
Temperature : 0.025852 [eV]
Electrons : 5488.00
WF G-vectors : 941817
Max atoms/species : 280
No. of atom species : 5
Exact exchange fraction in XC : 0.000000
Exact exchange screening in XC : 0.000000
Magnetic symmetries : no
- S/N 002268 ---------------------------------------------- v.05.00.04 r.19598 -
[02] CORE Variables Setup
=========================
[02.01] Unit cells
==================
Cell kind : Unknown
Atoms in the cell : Au C H S O
number of Au atoms : 280
number of C atoms : 32
number of H atoms : 16
number of S atoms : 2
number of O atoms : 2
Alat factors : 47.30090 39.54959 41.66742 [a.u.]
Direct lattice volume : 77948.5 [a.u.]
Direct lattice vectors: A[ 1 ] A[ 2 ] A[ 3 ]
A[ 1 ]: 1.000000 0.000000 0.000000 [iru]
A[ 2 ]: 0.000000 1.000000 0.000000 [iru]
A[ 3 ]: 0.000000 0.000000 1.000000 [iru]
Recip. lattice volume : 0.003182 [a.u.]
Recip. lattice vectors: B[ 1 ] B[ 2 ] B[ 3 ]
B[ 1 ]: 1.000000 0.000000 0.000000 [iku]
B[ 2 ]: 0.000000 1.000000 0.000000 [iku]
B[ 3 ]: 0.000000 0.000000 1.000000 [iku]
[02.02] Symmetries
==================
Inversion symmetry : yes
Spatial inversion : no
Inversion index : 2
K-space Time-reversal : yes
Magnetic symmetries : no
Time-reversal derived K-space symmetries: 2 2
Group table correct : yes
Symmetries units : [cc]
[S 1]: 1.000000 0.000000 0.000000 0.000000 1.000000 0.000000 0.000000 0.000000 1.000000
[S*2]: -1.000000 0.000000 0.000000 0.000000 -1.000000 0.000000 0.000000 0.000000 -1.000000
[02.03] Reciprocal space
========================
nG shells : 514990
nG charge : 7535261
nG WFs : 941817
nC WFs : 941817
G-vecs. in first 80 shells: [ Number ]
1 3 5 7 11 15 19 27 29
31 35 39 41 45 49 57 61 65
73 81 83 87 91 95 99 107 111
119 121 129 137 141 143 147 151 159
163 167 171 179 187 195 199 201 209
213 221 225 229 233 237 245 253 261
269 277 281 283 287 295 299 303 307
323 327 335 337 345 353 361 365 369
373 377 379 387 395 403 407 415
...
Shell energy in first 80 shells: [ mHa ]
0.00000 8.82249 11.36938 12.61961 20.19186 21.44209 23.98899 32.81147 35.28995
45.47751 46.65932 47.90955 50.47843 54.30000 58.09712 59.27893 59.30093 61.84781
66.91961 70.67030 79.40240 80.76746 85.76838 90.77177 92.02200 93.38707 95.95594
97.1378 102.3244 103.3914 104.7784 111.1469 113.5765 114.9440 122.3989 123.7665
124.8799 124.9458 129.8808 131.2459 133.7683 137.4995 137.6143 141.1598 141.2502
148.8664 150.2339 152.5292 152.8028 153.7794 159.0540 160.2358 161.6253 165.1488
167.8765 175.3584 181.7268 181.9100 186.6373 188.0928 190.7325 191.6382 192.9789
194.3439 194.5297 199.2569 201.9137 203.0076 203.3521 204.3482 210.7362 213.2831
215.9009 217.2000 220.5622 222.1056 224.7233 229.8196 231.9316 232.2052
...
[02.04] K-grid lattice
======================
Compatible Grid is : 0D
K lattice UC volume : 0.003182 [a.u.]
[02.05] Energies & Occupations
==============================
[X] === General ===
[X] Electronic Temperature : 0.258606E-1 300.100 [eV K]
[X] Bosonic Temperature : 0.258606E-1 300.100 [eV K]
[X] Finite Temperature mode : yes
[X] El. density : 0.47512E+24 [cm-3]
[X] Fermi Level : 6.085646 [eV]
[X] === Gaps and Widths ===
[X] Conduction Band Min : 6.085646 [eV]
[X] Valence Band Max : 6.085646 [eV]
[X] Filled Bands : 2738
[X] Metallic Bands : 2739 2752
[X] Empty Bands : 2753 3600
[X] === Metallic Characters ===
[X] N of el / N of met el : 5488.00 12.0000
[X] Average metallic occ. : 0.428572
[WARNING][X] Metallic system
Timing [Min/Max/Average]: 03s/03s/03s
[03] Transferred momenta grid and indexing
==========================================
[RD./SAVE//ndb.kindx]-----------------------------------------------------------
Fragmentation : no
Polarization last K : 1
QP states : 1 1
X grid is uniform : yes
Grids : X S
BS scattering : no
COLL scattering : no
Sigma scattering : yes
X scattering : yes
- S/N 002268 ---------------------------------------------- v.05.00.04 r.19598 -
IBZ Q-points : 1
BZ Q-points : 1
K/Q-points units:
rlu = crystal or reduced units; cc = cartesian coordinates; iku = interal k-units
Q [1]: 0.000000 0.000000 0.000000 [rlu]
[04] Dipoles
============
[WARNING][DIP] Database not correct or missing. To be computed
[RD./SAVE//ns.kb_pp_pwscf]------------------------------------------------------
Fragmentation : yes
- S/N 002268 ---------------------------------------------- v.05.00.04 r.19598 -
[WARNING] [x,Vnl] slows the Dipoles computation. To neglect it rename the ns.kb_pp file
[WF-Oscillators/G space/Transverse up loader] Normalization (few states) min/max : 0.18709E-10 1.0000
Thanks a lot in advance.
--
Best regards,
Vipul