Dear Davide,
I appreciate your super quick response. That give me time to try out the options.
My system is not super big [572 occupied bands] and I am able to run the full diagonalization for small bands window like 500|600.
But for the band window like 500|750 [I did this, to confirm that the spectra does not change with the increasing bands], but I get the error
Code: Select all
Abort(594434) on node 11 (rank 11 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x15315429a700, rbuf=0x15379036b860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(738791938) on node 23 (rank 23 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14e25d768440, rbuf=0x14e898817860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(403247618) on node 12 (rank 12 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14998b69b740, rbuf=0x149fc0bc7860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(134812162) on node 21 (rank 21 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14c0d1a60440, rbuf=0x14c7061c7860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(134812162) on node 13 (rank 13 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14d70ca9b700, rbuf=0x14dd48337860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(604574210) on node 28 (rank 28 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x152103360480, rbuf=0x152736dc3860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(269029890) on node 15 (rank 15 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x151bfff99700, rbuf=0x1522361c77c0, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(738791938) on node 31 (rank 31 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14ec5abdf440, rbuf=0x14f29499b860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(671683074) on node 14 (rank 14 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14d51fca1700, rbuf=0x14db5c6df860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(67703298) on node 29 (rank 29 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x14c890b46440, rbuf=0x14cecd163860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 Abort(537465346) on node 30 (rank 30 in comm 0): Fatal error in PMPI_Allreduce: Invalid count, error stack: PMPI_Allreduce(390): MPI_Allreduce(sbuf=0x149000845440, rbuf=0x14963ce6f860, count=-1593462720, datatype=MPI_COMPLEX, op=MPI_SUM, comm=MPI_COMM_WORLD) failed PMPI_Allreduce(332): Negative count, value is -1593462720 [2026-06-15T18:11:08.565] error: *** STEP 3682133.0 ON f0701 CANCELLED AT 2026-06-15T18:11:08 DUE to SIGNAL Killed *** srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
This was the input file used by me
Code: Select all
optics # [R] Linear Response optical properties
bss # [R] BSE solver
bse # [R][BSE] Bethe Salpeter Equation.
tddft # [R][K] Use TDDFT kernel
X_Threads=0 # [OPENMP/X] Number of threads for response functions
DIP_Threads=0 # [OPENMP/X] Number of threads for dipoles
K_Threads=0 # [OPENMP/BSK] Number of threads for response functions
BSEmod= "coupling" # [BSE] resonant/retarded/coupling
BSKmod= "ALDA" # [BSE] IP/Hartree/HF/ALDA/SEX/BSfxc
BSSmod= "d" # [BSS] (h)aydock/(d)iagonalization/(s)lepc/(i)nversion/(t)ddft`
BSENGexx= 3 Ry # [BSK] Exchange components
BSEprop= "abs" # [BSS] abs/kerr/magn/dichr trace
% BSEQptR
1 | 1 | # [BSK] Transferred momenta range
%
% BSEBands
500 | 750 | # [BSK] Bands range
%
% BEnRange
0.00000 | 6.00000 | eV # [BSS] Energy range
%
% BDmRange
0.100000 | 0.100000 | eV # [BSS] Damping range
%
BEnSteps= 100 # [BSS] Energy steps
% BLongDir
1.000000 | 1.000000 | 1.000000 | # [BSS] [cc] Electric Field
%
BS_ROLEs= "k eh t"
BS_CPU= "1 4 4" # 1 * 4 * 4 = 16 total MPI tasks
BS_nCPU_LinAlg_DIAGO= 4 # Perfectly structures an efficient 4x4 ScaLAPACK matrix grid
BS_nCPU_LinAlg_INV= 4
Can this be due to the fact that my system is large for running full diagonaliation? If yes, I also looked into using the Lanczos-Haydock solver [for the spectra} followed by SLEPc solver [for doing excitons analysis] ,and it seems possible to that way [Source: Yambo tutorials] . But I am not sure if thats recommended for the tddft? Can you help me know that?
Your inputs will be of great help.
PS: I use Yambo 5.0.4, I tried to get Yambo 5.3/5.4 running, but they just give me Segmentation error, I am still not able to figure that out.
--
Best regards,
Vipul