I want to calculate dielectric function using IP RPA approximation. The input file (shared below) I have prepared contains total 625 Q points. When I submit the job on parallel, I have seen the job is not finished. Every time I am getting the following errors:
When I check output, I have seen total 142 files ( q points) are being generated instead of 625 (total q points). Here is my input:Fatal error in PMPI_Comm_split:
Other MPI error, error stack:
PMPI_Comm_split(1294)...............: MPI_Comm_split(comm=0xc40032a6, color=2, key=3, new_comm=0xe09157c) failed
PMPI_Comm_split(1276)...............:
MPIR_Comm_split_allgather(1005).....:
MPIR_Get_contextid_sparse_group(613): Too many communicators (24/2048 free on this process; ignore_id=0)
[cli_3]: aborting job:
Fatal error in PMPI_Comm_split:
Other MPI error, error stack:
PMPI_Comm_split(1294)...............: MPI_Comm_split(comm=0xc40032a6, color=2, key=3, new_comm=0xe09157c) failed
PMPI_Comm_split(1276)...............:
MPIR_Comm_split_allgather(1005).....:
MPIR_Get_contextid_sparse_group(613): Too many communicators (24/2048 free on this process; ignore_id=0)
Code: Select all
optics # [R] Linear Response optical properties
chi # [R][CHI] Dyson equation for Chi.
dipoles # [R] Oscillator strenghts (or dipoles)
Chimod= "IP" # [X] IP/Hartree/ALDA/LRC/PF/BSfxc
% QpntsRXd
1 | 625 | # [Xd] Transferred momenta
%
% BndsRnXd
1 | 24 | # [Xd] Polarization function bands
%
% EnRngeXd
0.00000 | 1.00000 | eV # [Xd] Energy range
%
% DmRngeXd
0.003000 | 0.003000 | eV # [Xd] Damping range
%
ETStpsXd= 200 # [Xd] Total Energy steps
% LongDrXd
1.000000 | 0.000000 | 0.000000 | # [Xd] [cc] Electric Field
This is the job file, I submitted on parallel mode.
Is there any recommendation I can follow to finish the job smoothly? Thank you in advance.#!/usr/bin/env bash
#SBATCH --job-name=gr_300K
#SBATCH --nodes=3 # node count
#SBATCH --ntasks-per-node=24 # number of tasks per node
#SBATCH --cpus-per-task=1 # cpu-cores per task (>1 if multi-threaded tasks)
#SBATCH --mem-per-cpu=5gb # Job memory request
#SBATCH --time=60:00:00 # Time limit hrs:min:sec
#SBATCH --output=sdc.txt # Standard output and error log
#SBATCH --partition=skylake # MOAB/Torque called these queues
module load yambo
srun yambo -F input.in -J Full_mpi_final
Best
M J Hasan
PhD Student
Mechanical Engineering
University of Maine