Search Now
HPC Software Stack
- Home
- Research & Development
- Products
- HPC Software Stack
Indigenous Software Stack Development:
As part of the NSM, a sophisticated HPC software stack was indigenously developed utilizing open-source technologies. This suite was engineered to meet the diverse requirements of NSM systems, enhancing performance and ensuring seamless integration across HPC environments.

Core Components of the NSM HPC Software Stack are:
- Operating System: CentOS/Alma has been selected as the foundational operating system, providing a stable and robust platform for HPC operations.
- Drivers: The suite includes critical drivers such as CUDA for GPU acceleration, alongside network and storage drivers, to facilitate efficient data management and connectivity.
- File Systems: Support for both local file systems and Lustre, a high-performance distributed file system, is integrated to manage extensive data volumes effectively.
- Provisioning: The xCAT (Extreme Cluster Administration Toolkit) is employed for provisioning, automating the deployment and configuration of cluster resources.
- Resource Management: SLURM (Simple Linux Utility for Resource Management) is utilized for job scheduling and resource management, optimizing the allocation and use of resources.
- Cluster Management: It is achieved through tools such as Ganglia and Nagios, supplemented by C-DAC tools and Osticket for efficient issue tracking and support.
- Communication Libraries: Essential communication libraries, including MPI (Message Passing Interface), Intel MPI, MVAPICH, and PGAS (Partitioned Global Address Space), are integrated to enable effective parallel communication and data exchange.
- Development Tools: The suite features GNU Compiler & Intel oneAPI Compilers to support a broad spectrum of programming languages and optimizing computational performance.
- Application Libraries: Key libraries such as NetCDF for network data, mathematical libraries for numerical computations, GNU libraries, and Deep Learning/Machine Learning (DL/ML) libraries are included to support a wide range of application needs.
- Visualization Tools: Scientific visualization and data analysis are facilitated through tools like GNU Plot, ParaView, and VMD (Visual Molecular Dynamics), enabling detailed examination and interpretation of simulation results.
- Performance Monitoring: The software stack incorporates performance monitoring tools and benchmarks to continuously evaluate and enhance system performance.
- Software Ecosystem: The creation of software products like C-CHAKSHU, CHReME, HPC Tutor PARADE, CAPC, User Creation Portal and Ticketing Support platforms reflects our emphasis on nurturing a robust software ecosystem. These solutions cater to diverse needs ranging from computational analysis and data visualization to project management, thereby augmenting research efficiency and productivity across scientific disciplines.
- C-CHAKSHU: C-Chakshu is an HPC multi-cluster monitoring and Management platform that provides a unified dashboard over the web for all NSM sites with different geographic locations across India. Its web-enabled interface manages clusters of varied configurations and facilitates researchers and scientists of varied domains to run their applications with minimal efforts.
- CHReME: To access Linux-based HPC environments, scientists and researchers require expertise in Linux and HPC, which is something many do not have. CHReME empowers users with an intuitive GUI to exploit HPC resources and provides a layer of abstraction to shield them from the complexity of accessing HPC resources. This enables them to concentrate on their core research/scientific work. It’s Web Interface makes clusters of different magnitudes easy to manage and monitor, which makes things easier for the HPC systems administrator while enabling researchers and scientists of varied domains to carry out their scientific simulations with minimal effort.
- HPC Tutor: HPC Tutor is a revolutionary web based blended learning platform, designed to empower novice users with hands-on experience in the realm of High-Performance Computing (HPC). It seamlessly integrates theoretical knowledge with practical application, offering an immersive and educational journey in HPC Domain. This platform presents a unified web page, merging comprehensive presentation materials seamlessly with direct shell access to HPC clusters. This unique feature ensures a user-friendly and cohesive learning experience.
- User Creation Portal : The User Creation Portal streamlines the account creation process by automating data collection and eliminating manual paperwork. It also enhances transparency by enabling users to monitor the progress of their requests and receive timely updates via email. This centralized platform ensures data accuracy and integrity, as users are required to provide official information verified through secure authentication mechanisms. Moreover, the portal offers a robust access control system, ensuring that sensitive user data is accessible only to authorized administrative personnel. This not only maintains data privacy but also facilitates efficient user management and oversight. By consolidating user information in a single location, the portal simplifies account auditing, reporting, and compliance processes, thereby improving overall operational efficiency and security.
- PARADE: ParaDE is an environment for creating parallel applications on HPC platforms. Being web-based, it helps to access HPC irrespective of geographical location and carry out complete software development from compiling, debugging, automatic job submission, and project management, using multiple parallel paradigms such as OpenMP, MPI, etc. It enables users to just log in to the environment and use all software on the cluster without the hassles of installation or configuration on the client machine
- CAPC: CAPC is an innovative solution that automatically parallelizes sequential code to different types of target parallel hardware without any inputs/contributions from the user. It liberates the application developer from the complexities of learning new parallel paradigms to write parallel programs for multicore and GPU parallel architectures
- OS Ticket: It is a ticketing system that streamlines HPC user support by routing inquiries, allowing for customization of data collection, and preventing duplicate responses from agents. It empowers agents with fast responses, SLA management, and collaboration tools. Users benefit from a quick response portal. This Desk provides valuable reporting and scales with the NSM vision, making it a powerful asset for managing user support. This HPC software stack ensures optimal performance and compatibility with NSM systems. It also aligns with India’s strategic goals of fostering technological self-reliance. By leveraging open-source technologies, the stack provides a flexible and cost-effective solution, driving innovation and bolstering capabilities within the national research and academic communities.
This HPC software stack ensures optimal performance and compatibility with NSM systems. It also aligns with India’s strategic goals of fostering technological self-reliance. By leveraging open-source technologies, the stack provides a flexible and cost-effective solution, driving innovation and bolstering capabilities within the national research and academic communities.
The deployment of this software suite has significantly enhanced the computational capabilities of NSM systems, empowering researchers and scientists to conduct high-impact research and achieve substantial advancements across various scientific and engineering disciplines.
This suit was deployed across all NSM HPC Facilities