3,126 HPC Professionals jobs in the United States

HPC Systems Engineer (Advanced Research Computing) - #Staff

21217 Baltimore, Maryland Johns Hopkins University

Posted 2 days ago

Job Description

The Advanced Research Computing at Hopkins (ARCH) group is seeking a highly qualified and motivated **_HPC Systems Engineer_** to join the systems team. The ARCH system (ROCKFISH), with over 45,000 cores and several petabytes of storage, serves the HPC and data-intensive science needs of researchers at Johns Hopkins University. The Systems Engineer contributes to the strategic planning, design, testing, organization and implementation of cutting-edge technology projects for the facility. The systems team is responsible for the day-to-day administration of HPC clusters, high-performance storage systems, backups, networking, security and any other services related to the operation of a large HPC center. The successful candidate will have experience in similar roles in high performance computing (HPC) labs or university settings.
**Specific Duties & Responsibilities**
_70% Systems Engineering, Administration, Security, and Oversight_
+ Work with senior staff to design, organize, plan, test and implement cutting-edge hardware designs for an HPC environment.
+ Extensively document systems processes so that users can easily find useful information and other IT staff can perform routine tasks and provide backup.
+ Provide stable solutions for HPC resources.
+ Maintain job scheduling and storage allocation systems and policies to ensure fair allocation of shared resources.
+ Maintain extensive monitoring systems to facilitate quick, proactive responses to routine failures, and to provide comprehensive performance data logging.
+ Provide general system administration backup and escalation for other staff.
+ Assist with facilities-related issues that directly affect MARCC.
+ Ensure resources meet the community's needs and are highly available to the group with limited interruption.
+ Manage inventory of resources in coordination with respective vendors.
+ Automate user account creation, management, and purging.
+ Contribute to planning sessions on network and security issues for MARCC. Work closely with the central networking group.
+ Implement network configuration and security measures to assure effective utilization of resources.
+ Understand HPC technical needs. Work closely with the facility's director and oversight groups to successfully implement policies and procedures.
+ Create and maintain a stable, secure operating system and software environment, which continues to meet users' evolving research needs.
+ Implement and maintain secure measures to protect data subject to restrictions.
+ Manage data access restrictions on a per user and group basis.
+ Implement and maintain monitoring measures for data and system access.
+ Other Systems Tasks as assigned by supervisor.
_20% Technological Research_
+ Offer technical advice on new projects that directly involve HPC computing at Hopkins.
+ Develop custom tools where necessary and contribute useful creations back to open-source development efforts where appropriate.
+ Implement and test new technologies that could be beneficial to HPC.
_10% Training/Education_
+ Continuously evaluate new tools and technologies for use in existing and future clusters.
+ Attend department and University-sponsored training to increase knowledge, improve skills, and learn new skills. May substitute University training for supervisor-approved commercial job-related course offerings.
**Special Knowledge, Skills, & Abilities:**
+ Proven experience deploying large-scale, complex projects.
+ Proven experience across multiple technologies with background in applications, databases, middleware, etc.
+ In-depth knowledge of the design and organization of cutting-edge technology in HPC environments.
+ In-depth understanding of HPC Cluster hardware and management software.
+ Understanding of massive high performance parallel storage and methodologies.
+ Expert knowledge of Unix/Linux systems administration, including all aspects of management, monitoring, performance analysis, and integration in potentially complex heterogeneous environments.
+ Knowledge of networking, high speed interconnects, and network security principles in an HPC environment.
+ Use of configuration management tools (e.g. Bright, xCAT, Puppet, IPMI, ROCKS) to help maintain large-scale Linux clusters, supercomputers, storage systems, and smaller systems.
+ The ability to interact with peer institutions to support HPC directives effectively, furthering the goals of the MARCC facility.
+ Understand, implement, troubleshoot, and support job scheduling, resource management and workload management systems, including diagnosis of failed jobs, implementation of policies, and investigations of new features and services.
+ Understand and support hierarchical file system infrastructure, software and services, including high performance parallel storage, backup systems, and robotic tape libraries.
+ Develop reports and customize tools that automate the monitoring process of critical systems and alert team of issues automatically.
+ Evaluate, implement and manage appropriate high level complex software and hardware solutions by using best practices for the environment to ensure system integrity.
+ Install and configure infrastructure applications by following the industry's best practices to deliver effective solutions.
+ Maintain an effective schedule for systems backups and archive operations for mission critical systems.
+ Audit and maintain user access, authorization and authentication.
+ Generate periodic reports on resource utilization.
+ Maintain resource inventory using best practice applications.
+ Advanced knowledge of Linux, Apache, SQL, PHP/Python/Perl (LAMP) technology/toolkits.
+ Ability to handle high priority escalations whenever necessary.
+ Ability to multitask while managing time and priorities.
+ Troubleshoot and solve difficult system issues as they arise.
+ Must be adaptable and able to meet conflicting deadlines.
+ Exceptional organizational skills.
+ Maintain effective and thorough documentation of all configuration and tasks performed.
+ Ability to automate systems administration tasks wherever possible.
+ Excellent oral and written interpersonal skills.
+ Ability to meet the physical requirements of the position.
+ Keep up to date on emerging technologies.
+ Research, recommend, and implement new technologies based on their value to the research facility.
+ Ability to maintain confidentiality.
+ Excellent customer service skills.
+ Excellent communication skills.
+ Must demonstrate strong critical thinking and analytical reasoning.
_Internal and External Contacts_
+ This position will interact with an array of departmental and central administrative offices, faculty, staff, researchers, and students, and with numerous external constituents (i.e. other college administrators and faculty, private businesses, industry partners, officials of federal and local agencies and research foundations) for the purpose of accomplishing HPC technology goals.
+ This includes providing instruction on protocol, regulations and guidelines pertinent to the agency and/or University.
+ Works routinely with JHU and UMCP faculty, administrators, students, and researchers.
+ Collaborates regularly with professional colleagues from the central organization, and from other academic departments.
+ Collaborates regularly with colleagues in industry and at other peer institutions.
**Minimum Qualifications**
+ Bachelor's Degree.
+ Five years related experience.
+ Additional education may substitute for required experience and additional related experience may substitute for required education, to the extent permitted by the JHU equivalency formula.
**Preferred Qualifications**
+ Seven (7) years experience managing Linux servers, with direct experience managing HPC clusters.
+ Experience as a high-level Linux system administrator.
+ Experience managing mission critical services.
+ Familiarity with configuration of the HPC software stack, including MPI, OpenMP, Intel and GNU compilers, and math libraries.
+ Experience with open-source software compilation.
+ In-depth knowledge of TCP/IP networking and related protocols, InfiniBand, etc.
+ Experience with scientific application management packages like pymodules, modules.
+ Excellent scripting skills (Python, Perl, shell).
+ Programming skills in C, C++, or a scientific language, desired but not required.
+ Experience with MySQL or MariaDB database programming, desired but not required.
+ Expert-level knowledge of configuration management and monitoring tools (Puppet, Nagios, etc.).
+ Experience configuring resource manager applications (like SLURM).
+ Experience with Apache administration.
+ Knowledge of scientific software applications in academic supercomputing environments.
+ Familiarity or experience with data subject to restrictions, desired but not required.
Classified Title: Systems Engineer
Job Posting Title (Working Title): HPC Systems Engineer (Advanced Research Computing)
Role/Level/Range: ATP/04/PE
Starting Salary Range: $73,300 - $128,300 Annually (Commensurate w/exp.)
Employee group: Full Time
Schedule: 37.5 hrs/wk, M-F
FLSA Status: Exempt
Location: Hybrid/Homewood Campus
Department name: Research Computing
Personnel area: University Administration
The listed salary range represents the minimum and maximum Johns Hopkins University offers for this position, based on a good faith estimate at the time of posting. Actual compensation will vary depending on factors such as location, skills, experience, market conditions, education, and internal equity. Not all candidates will qualify for the highest salary in the range.
Johns Hopkins provides a comprehensive benefits package supporting health, career, and retirement.
Equal Opportunity Employer
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
EEO is the Law

High-Performance Computing (HPC) Engineer

21705 Frederick, Maryland CACI International

Posted 2 days ago

Job Description

High-Performance Computing (HPC) Engineer
Job Category: Information Technology
Time Type: Full time
Minimum Clearance Required to Start: TS/SCI with Polygraph
Employee Type: Regular
Percentage of Travel Required: Up to 10%
Type of Travel: Local
* * *
**The Opportunity**
+ The position is to support a Special Project for an Intelligence Community customer in Frederick, Maryland.
+ The project is to maintain and evolve an existing set of Modeling and Simulation applications that run on a High-Performance Computing (HPC) System.
**Responsibilities**
+ Provide backend software development in an Agile development environment.
+ Develop and run simulations on a High-Performance Computing (HPC) System to forecast Chemical and Radiological hazard areas.
+ Use modeling and simulation tools, including the Operational Multi-Scale Environment Model with Grid Adaptivity (OMEGA).
+ Support the Chemical Hazard Area Modeling Program (CHAMPS).
+ Switch the existing schedule and resource management from Torque qsub to Slurm batch.
+ Provide backend architecture, system integration, program logic, database integration, ETL, and security controls.
+ Update and incorporate new backend systems to support technical improvements.
**Qualifications**
_Required:_
+ Top Secret SCI security clearance preferably with a recent polygraph.
+ Expertise in software development tasks and High-Performance Computing (HPC)
+ Experience with schedule and resource management with Torque qsub and Slurm batch
+ Strong programming skills with multiple languages to include C Shell
+ Experience with Linux operating systems
+ Bachelor's degree in computer science, data science, math or a medical field with 7+ years of experience.
_Desired:_
+ Previous experience in a medical organization.
+ Previous experience with an intelligence organization.
+ Strong verbal and written communication skills.
+ Familiarity with DoD Instruction DoDI 8500.1, DoDI 8500.2, DoDI 10.01, ICD 503 and the National Institute of Standards and Technology (NIST) Special Publications 800-144 and 800-145.
**What You Can Expect:**
**A culture of integrity.**
At CACI, we place character and innovation at the center of everything we do. As a valued team member, you'll be part of a high-performing group dedicated to our customer's missions and driven by a higher purpose - to ensure the safety of our nation.
**An environment of trust.**
CACI values the unique contributions that every employee brings to our company and our customers - every day. You'll have the autonomy to take the time you need through a unique flexible time off benefit and have access to robust learning resources to make your ambitions a reality.
**A focus on continuous growth.**
Together, we will advance our nation's most critical missions, build on our lengthy track record of business success, and find opportunities to break new ground - in your career and in our legacy.
**Your potential is limitless.** So is ours.
Learn more about CACI here.
**Range**: There are a host of factors that can influence final salary including, but not limited to, geographic location, Federal Government contract labor categories and contract wage rates, relevant prior work experience, specific skills and competencies, education, and certifications. Our employees value the flexibility at CACI that allows them to balance quality work and their personal lives. We offer competitive compensation, benefits and learning and development opportunities. Our broad and competitive mix of benefits options is designed to support and protect employees and their families. At CACI, you will receive comprehensive benefits such as healthcare, wellness, financial, retirement, family support, continuing education, and time off benefits. Learn more here.
The proposed salary range for this position is:
$113,200 - $237,800
_CACI is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any other protected characteristic._

Sr. HPC System Administrator

60290 Chicago, Illinois The University of Chicago

Posted 3 days ago

Job Description

Department

Provost Research Computing Center

About the Department

The University of Chicago Research Computing Center (RCC), a unit in the Office of Research, provides high-end research computing resources to researchers at the University of Chicago. It is dedicated to enabling research by providing access to centrally managed High-Performance Computing (HPC), storage, and visualization resources. These resources include hardware, software, high-level scientific and technical user support, and the education and training required to help researchers make full use of modern HPC technology and local and national supercomputing resources. The Office of Research oversees the conduct of sponsored research, research program development, and contract management functions.

Job Summary

The job uses specialized knowledge and breadth of expertise to design automated, scalable, and rapidly deployable solutions to infrastructure development and server configuration. Leads installation, configuration, and maintenance of operating systems. Uses best practices and systems knowledge to monitor and alert systems, utility software, and firewalls. Guides maintenance for production servers as well as Windows and Linux servers.

The University of Chicago is seeking a highly qualified Senior HPC System Administrator to join the system and operation team that builds and manages RCC HPC systems and facility operations. The individual in this position will be involved in the procurement and management of HPC hardware and software.

This is a hybrid position requiring 3 days onsite.

Responsibilities

Installing, configuring, and maintaining large computer clusters/servers and software.
Day-to-day operations of the systems including systems administration, monitoring and storage performance up to and including network components. Management of the systems network switch, parallel file system and HPC software stack and tools.
Configuration of the scheduling and queuing system.
Diagnosing and resolving system operational problems quickly and effectively. Coordinating with vendors to resolve hardware and software problems. Assist users with access and other help desk ticket requests or issues.
Use scripting/programming skills to enable system-level automation, problem detection, security maintenance and patch management.
Building and deploying open-source software and software from vendors/partners.
Providing reliable and efficient backups/restores for all managed systems.
Documenting system administration procedures for routine and complex tasks.
Maintaining and monitoring the security of the HPC systems and servers.
Plans and installs necessary patches and upgrades for servers and their associated storage, network, communications, and peripheral sub-systems. Installs and maintains an appropriate level of intrusion detection, monitoring, and auditing software as required.
Tracks compliance and maintains documentation for hardware, software, and service inventories for management reports.
Performs other related work as needed.

Minimum Qualifications

Education:

Minimum requirements include a college or university degree in related field.

Work Experience:

Minimum requirements include knowledge and skills developed through 5-7 years of work experience in a related job discipline.

Certifications:

---

Preferred Qualifications

Education:

Master's degree in Computer Science or closely related field.

Experience:

Full time Linux system administration experience in a large distributed computing environment.
Previous experience in providing support for Linux HPC cluster used for scientific research.

Technical Skills or Knowledge:

Experience with installing, configuring, and maintaining job management tools (such as SLURM, Moab, TORQUE, PBS, etc.).
Experience configuring, installing and troubleshooting MPI and OpenMP.
Experience with operating system deployment tools (e.g. xCAT, ROCKS).
Experience configuring, administering, and supporting network storage subsystems (e.g. IBM, NetApp, DataDirect Networks, LSI, etc.).
Hands-on experience with at least one distributed file system (Spectrum Scale-GPFS, Lustre, BeeGFS, Gluster, IMRIX, PVFS, etc.).
Direct experience working with InfiniBand (must at least be able to demonstrate a working knowledge of InfiniBand concepts, OFED layers, subnet managers).
Experience configuring, installing, tuning and maintaining scientific application software on large-scale systems.
Experience supporting HPC compilers and libraries.
Experience with systems automation tools such as Ansible or Puppet.
Experience configuring, installing, maintaining and/or using performance monitoring and optimization tools.

Preferred Competencies

Ability to work well with faculty and researchers.
Ability to identify and gain expertise in appropriate new technologies and/or software tools.
Ability to function as part of an interactive team while demonstrating self-initiative to achieve project's goals and Research Computing Center's mission.
Strong analytical skills and problem-solving ability.

Application Documents

Cover letter (preferred)
Resume (required)

When applying, the document(s) MUST be uploaded via the My Experience page, in the section titled Application Documents of the application.

Job Family

Information Technology

Role Impact

Individual Contributor

Scheduled Weekly Hours

37.5

Drug Test Required

Health Screen Required

Motor Vehicle Record Inquiry Required

Pay Rate Type

Salary

FLSA Status

Exempt

Pay Range

$100,300.00 - $129,800.00

The included pay rate or range represents the University's good faith estimate of the possible compensation offer for this role at the time of posting.

Benefits Eligible

Yes

The University of Chicago offers a wide range of benefits programs and resources for eligible employees, including health, retirement, and paid time off. Information about the benefit offerings can be found in the Benefits Guidebook.

Posting Statement

The University of Chicago is an equal opportunity employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender, gender identity, or expression, national or ethnic origin, shared ancestry, age, status as an individual with a disability, military or veteran status, genetic information, or other protected classes under the law. For additional information please see the University's Notice of Nondiscrimination.

Job seekers in need of a reasonable accommodation to complete the application process should call or submit a request via Applicant Inquiry Form.

All offers of employment are contingent upon a background check that includes a review of conviction history. A conviction does not automatically preclude University employment. Rather, the University considers conviction information on a case-by-case basis and assesses the nature of the offense, the circumstances surrounding it, the proximity in time of the conviction, and its relevance to the position.

The University of Chicago's Annual Security & Fire Safety Report (Report) provides information about University offices and programs that provide safety support, crime and fire statistics, emergency response and communications plans, and other policies and information. The Report can be accessed online. Paper copies of the Report are available, upon request, from the University of Chicago Police Department, 850 E. 61st Street, Chicago, IL 60637.

System Administrator IV - HPC

84045 Saratoga Springs, Utah Leidos

Posted 16 days ago

Job Description

**Description**
The Digital Modernization Sector at Leidos currently has an immediate need for an experienced **System Administrator IV - HPC** for a new customer on a strategic High-Performance Computing (HPC) program. The Senior System Administrator will need to be a self-starter with excellent analytical and problem-solving skills, flexibility, good judgment, and the ability to work within a team to mature the HPC capabilities of our customer.
**Locations:**
These positions will be onsite. Candidates need to be located near Saratoga Springs, UT, to be considered.
**Primary Responsibilities:**
Manage essential infrastructure services, ensuring high availability and performance of data center services, physical and virtual server-class systems, and storage. Lead medium-to-large scale projects, design and implement system policies, conduct advanced troubleshooting, and play a pivotal role in direct recovery efforts during critical system failures. Mentor Tier I/II/III staff members. Work alone and as part of a larger team to complete projects on time.
**Basic Qualifications:**
+ Must possess a TS/SCI clearance with polygraph
+ IAT Level II Certification Required. Accepted professional IAT Level II certifications include RHCSA or higher Red Hat certification, and/or VMware certification.
+ Candidates shall have a bachelor's degree in computer science or related field and twelve (12) years of experience in a large and complex IT environment providing industry and government recognized functional expertise, or a master's degree with ten (10) years of experience. In lieu of a bachelor's degree, the individual shall have five (5) years of full-time computer science experience and at least ten (10) years in a large and complex IT environment providing industry and government recognized functional expertise. An industry recognized professional certification as listed below may substitute as one (1) year experience.
+ Experience with installation, configuration, tuning and support of:
  + Multi-vendor servers running a plethora of COTS, open source, and in-house applications to accommodate HPC division IT support requirements.
  + Multi-vendor servers running Red Hat or SUSE with direct attached, FC SAN storage or SSDs.
  + Distributed computing tools such as RES, LSF and SLURM.
  + HPC farm systems, HPC MPP clustered systems, Front End servers of SPDs.
  + IBM or HP blade servers with FC/SAS/Network back end.
  + Multi-vendor file systems such as XFS, GPFS and Lustre.
+ Pre-Factory testing, factory testing, system integration and acceptance testing during the purchase process of HPC systems.
**Original Posting:**
June 18, 2025
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
**Pay Range:**
Pay Range $112,450.00 - $203,275.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
REQNUMBER: R-
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status. Leidos will consider qualified applicants with criminal histories for employment in accordance with relevant Laws. Leidos is an equal opportunity employer/disability/vet.

High Performance Computing (HPC) Engineering Technician

84045 Saratoga Springs, Utah BAE Systems USA

Posted 6 days ago

Job Description

Unlock the Power of Supercomputing: Join BAE Systems, One of the Leading Service Providers of HPCs

Contribute to one of our longest running programs where we orchestrate the support and sustainment of some of the world’s largest and most advanced supercomputers.

If you have the skills required for an HPC Engineering Technician, we are interested in speaking with you.

The ideal candidate in this position conducts technical analysis of product implementations, modifications and enhancements to product in accordance with specific customer specifications and implementations. Troubleshoots technical problems and issues, determines technical solution in accordance with product and customer specifications, and recommends actions to company or customer representatives. Assesses product needs in accordance with customer specifications.

Shift work required.

Required Education, Experience, & Skills

Must hold a TS/SCI Clearance with appropriate poly
Candidates must have experience with an understanding of the concepts, procedures and guidelines to solve highly complex problems in the maintenance and hardware/software network infrastructure.
Experience performing system set-up, experiments and diagnostics to evaluate printed circuit board exchanges, and troubleshoot and make component repairs based on test results.
Knowledge of and experience with Linux operating systems.
Knowledge and experience in electronics component repair.
Ability to communicate and work well in an effective team environment.
SEC+ Certification
Ability to lift 50 lbs.

Pay Information
Full-Time Salary Range: $70735 - $

Please note: This range is based on our market pay structures. However, individual salaries are determined by a variety of factors including, but not limited to: business considerations, local market conditions, and internal equity, as well as candidate qualifications, such as skills, education, and experience.

Employee Benefits: At BAE Systems, we support our employees in all aspects of their life, including their health and financial well-being. Regular employees scheduled to work 20+ hours per week are offered: health, dental, and vision insurance; health savings accounts; a 401(k) savings plan; disability coverage; and life and accident insurance. We also have an employee assistance program, a legal plan, and other perks including discounts on things like home, auto, and pet insurance. Our leave programs include paid time off, paid holidays, as well as other types of leave, including paid parental, military, bereavement, and any applicable federal and state sick leave. Employees may participate in the company recognition program to receive monetary or non-monetary recognition awards. Other incentives may be available based on position level and/or job specifics.
About BAE Systems Intelligence & Security

BAE Systems, Inc. is the U.S. subsidiary of BAE Systems plc, an international defense, aerospace and security company which delivers a full range of products and services for air, land and naval forces, as well as advanced electronics, security, information technology solutions and customer support services. Improving the future and protecting lives is an ambitious mission, but it’s what we do at BAE Systems. Working here means using your passion and ingenuity where it counts – defending national security with breakthrough technology, superior products, and intelligence solutions. As you develop the latest technology and defend national security, you will continually hone your skills on a team, making a big impact on a global scale. At BAE Systems, you’ll find a rewarding career that truly makes a difference.

Intelligence & Security (I&S), based in McLean, Virginia, designs and delivers advanced defense, intelligence, and security solutions that support the important missions of our customers. Our pride and dedication shows in everything we do—from intelligence analysis, cyber operations and IT expertise to systems development, systems integration, and operations and maintenance services. Knowing that our work enables the U.S. military and government to recognize, manage and defeat threats inspires us to push ourselves and our technologies to new levels.

This position will be posted for at least 5 calendar days. The posting will remain active until the position is filled, or a qualified pool of candidates is identified.

High Performance Computing (HPC) Engineering Technician

84045 Saratoga Springs, Utah BAE Systems

Posted 6 days ago

Job Description

**Job Description**
Unlock the Power of Supercomputing: Join BAE Systems, One of the Leading Service Providers of HPCs
Contribute to one of our longest running programs where we orchestrate the support and sustainment of some of the world's largest and most advanced supercomputers.
If you have the skills required for an **HPC Engineering Technician**, we are interested in speaking with you.
The ideal candidate in this position conducts technical analysis of product implementations, modifications and enhancements to product in accordance with specific customer specifications and implementations. Troubleshoots technical problems and issues, determines technical solution in accordance with product and customer specifications, and recommends actions to company or customer representatives. Assesses product needs in accordance with customer specifications.
**Shift work required.**
**Required Education, Experience, & Skills**
+ Must hold a TS/SCI Clearance with appropriate poly
+ Candidates must have experience with, and an understanding of, the concepts, procedures and guidelines used to solve highly complex problems in the maintenance of hardware/software network infrastructure.
+ Experience performing system set-up, experiments and diagnostics to evaluate printed circuit board exchanges, and to troubleshoot and make component repairs based on test results.
+ Knowledge of and experience with Linux operating systems.
+ Knowledge and experience in electronics component repair.
+ Ability to communicate and work well in an effective team environment.
+ SEC Certification
+ Ability to lift ~50 lbs.
**Pay Information**
Full-Time Salary Range: $70735 - $
Please note: This range is based on our market pay structures. However, individual salaries are determined by a variety of factors including, but not limited to: business considerations, local market conditions, and internal equity, as well as candidate qualifications, such as skills, education, and experience.
Employee Benefits: At BAE Systems, we support our employees in all aspects of their life, including their health and financial well-being. Regular employees scheduled to work 20 hours per week are offered: health, dental, and vision insurance; health savings accounts; a 401(k) savings plan; disability coverage; and life and accident insurance. We also have an employee assistance program, a legal plan, and other perks including discounts on things like home, auto, and pet insurance. Our leave programs include paid time off, paid holidays, as well as other types of leave, including paid parental, military, bereavement, and any applicable federal and state sick leave. Employees may participate in the company recognition program to receive monetary or non-monetary recognition awards. Other incentives may be available based on position level and/or job specifics.
Equal Opportunity Employer. Minorities / females / veterans / individuals with disabilities / sexual orientation / gender identity / gender expression

High Performance Computing (HPC) Technical Expert

84045 Saratoga Springs, Utah BAE Systems

Posted 6 days ago

Job Description

Unlock the Power of Supercomputing: Join BAE Systems-One of the Leading Service Providers of HPCs.
At BAE Systems, we promote a strong, collaborative culture and provide our employees with the tools, skills and training they need to succeed.
We offer a flexible work environment to support the balance in your life and keep you performing at your best.
We are in search of a High Performance Computing (HPC) Technical Expert.
The High Performance Computing (HPC) Technical Expert (TE) provides leadership in developing cutting-edge solutions, strategies and protocols for the HPC Infrastructure environment.
+ They handle the most complex challenges, lead research and development initiatives, architect and design initiatives, and provide expert consultations to Program Managers and IT staff. Technical Experts are expected to set educational performance standards for the technical team.
+ TEs are also responsible for maintaining technical working aids, creating and developing standard operating procedures, conducting technical exchanges, and leading brown bag sessions.
**Required Education, Experience, & Skills**
+ Must possess a TS/SCI clearance with appropriate poly
+ Must work on site 100% of the time
+ 15 years of experience in a system architecture or administration role
In addition to the above qualifications, must have experience in:
+ Configuration, tuning, testing, and advanced level troubleshooting and support of high performance filesystems such as XFS, GPFS and Lustre
+ Advanced level troubleshooting and support of HPC farm systems and associated applications such as Nagios, xCAT, failover software, and compilers
+ Working knowledge of HPC MPP systems
+ Configuration, tuning, testing, and advanced level troubleshooting and support of distributed computing tools such as RES, LSF and SLURM
+ Configuration, tuning, testing, and advanced level troubleshooting of Red Hat and SUSE operating systems
+ **Accepted professional certifications:**
+ Valid RHCSA or higher Red Hat certification
+ Valid VMware certification
**IAT Level II Certification Required** (Sec )
**Pay Information**
Full-Time Salary Range: $ - $
Please note: This range is based on our market pay structures. However, individual salaries are determined by a variety of factors including, but not limited to: business considerations, local market conditions, and internal equity, as well as candidate qualifications, such as skills, education, and experience.
Employee Benefits: At BAE Systems, we support our employees in all aspects of their life, including their health and financial well-being. Regular employees scheduled to work 20 hours per week are offered: health, dental, and vision insurance; health savings accounts; a 401(k) savings plan; disability coverage; and life and accident insurance. We also have an employee assistance program, a legal plan, and other perks including discounts on things like home, auto, and pet insurance. Our leave programs include paid time off, paid holidays, as well as other types of leave, including paid parental, military, bereavement, and any applicable federal and state sick leave. Employees may participate in the company recognition program to receive monetary or non-monetary recognition awards. Other incentives may be available based on position level and/or job specifics.
Equal Opportunity Employer. Minorities / females / veterans / individuals with disabilities / sexual orientation / gender identity / gender expression

Sr. Principal HPC System Administrator

21090 Linthicum Heights, Maryland Northrop Grumman

Posted 2 days ago

Job Description

RELOCATION ASSISTANCE: No relocation assistance available
CLEARANCE TYPE: A US Government security clearance per customer's requirements.
TRAVEL: Yes, 10% of the Time
**Description**
At Northrop Grumman, our employees have incredible opportunities to work on revolutionary systems that impact people's lives around the world today, and for generations to come. Our pioneering and inventive spirit has enabled us to be at the forefront of many technological advancements in our nation's history - from the first flight across the Atlantic Ocean, to stealth bombers, to landing on the moon. We look for people who have bold new ideas, courage and a pioneering spirit to join forces to invent the future, and have fun along the way. Our culture thrives on intellectual curiosity, cognitive diversity and bringing your whole self to work - and we have an insatiable drive to do what others think is impossible. Our employees are not only part of history, they're making history.
Northrop Grumman Mission Systems is a trusted provider of mission-enabling solutions for global security. Our Engineering and Sciences (E&S) organization pushes the boundaries of innovation, redefines engineering capabilities, and drives advances in various sciences. Our team is chartered with providing the skills and innovative technologies to develop, design, produce and sustain optimized product lines across the sector while providing a decisive advantage to the warfighter. Come be a part of our mission!
Northrop Grumman Mission Systems (NGMS) is seeking a **Principal or Sr. Principal HPC System Administrator**. This position is located in **Baltimore, MD**.
**Responsibilities**
+ Support operation of a high-performance compute cluster
+ Investigate, diagnose, and resolve acute system faults
+ Maintain software deployments
+ Maintain security compliance
+ Monitor and maintain hardware
+ Interface with user support staff
**This is a dual level req and can be filled as a Principal or Sr. Principal HPC System Administrator**
**Basic Qualifications for Principal HPC System Administrator:**
+ Bachelor's degree with 5 years' relevant experience; 3 years' experience with a Master's degree; 0 years with a PhD. Will consider 4 years' additional experience in lieu of a degree.
+ Linux systems administration proficiency
+ Basic knowledge and experience with networking concepts
+ Basic knowledge and experience with file systems
+ Basic knowledge and experience with user management
+ Basic knowledge and experience with system security
+ Strong written and verbal communication skills
+ Must be a U.S. citizen
+ Active US Government security clearance per customers' requirements
**Basic Qualifications for Sr. Principal HPC System Administrator:**
+ Bachelor's degree with 8 years' relevant experience; 6 years' experience with a Master's degree; 4 years with a PhD. Will consider 4 years' additional experience in lieu of a degree.
+ Linux systems administration proficiency
+ Basic knowledge and experience with networking concepts
+ Basic knowledge and experience with file systems
+ Basic knowledge and experience with user management
+ Basic knowledge and experience with system security
+ Strong written and verbal communication skills
+ Must be a U.S. citizen
+ Active US Government security clearance per customers' requirements
**Preferred Qualifications**
+ IAT Level II certification
+ Red Hat Linux systems administration experience
+ Knowledge and experience with concepts of high-performance computing system operations, including cluster management, multi-user login environments, job scheduling, and networked file systems
+ Knowledge and experience maintaining compliance with Security Technical Implementation Guides (STIGs)
+ Knowledge and experience with compiling software
+ Knowledge and experience monitoring and maintaining high-performance compute cluster hardware
+ Experience with MPI, high-speed low-latency network fabrics, parallel file systems, and/or GPUs
+ Exhibited ability to contribute to a team of technical professionals
Primary Level Salary Range: $100,300.00 - $50,500.00
Secondary Level Salary Range: $124,900.00 - $187,300.00
The above salary range represents a general guideline; however, Northrop Grumman considers a number of factors when determining base salary offers such as the scope and responsibilities of the position and the candidate's experience, education, skills and current market conditions.
Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay. Annual bonuses are designed to reward individual contributions as well as allow employees to share in company results. Employees in Vice President or Director positions may be eligible for Long Term Incentives. In addition, Northrop Grumman provides a variety of benefits including health insurance coverage, life and disability insurance, savings plan, Company paid holidays and paid time off (PTO) for vacation and/or personal business.
The application period for the job is estimated to be 20 days from the job posting date. However, this timeline may be shortened or extended depending on business needs and the availability of qualified candidates.
Northrop Grumman is an Equal Opportunity Employer, making decisions without regard to race, color, religion, creed, sex, sexual orientation, gender identity, marital status, national origin, age, veteran status, disability, or any other protected class. For our complete EEO and pay transparency statement, please visit U.S. Citizenship is required for all positions with a government clearance and certain other restricted positions.

Sr. Principal HPC System Administrator

20701 Annapolis Junction, Maryland Northrop Grumman

Posted 2 days ago

Senior HPC Linux System Administrator

30309 Midtown Atlanta, Georgia Leidos

Posted 2 days ago

Job Description

**Description**
The Public Health and Human Services Operation of Leidos is seeking a **Senior HPC Linux System Administrator** to lead a team of system administrator professionals in managing a high-performance computing (HPC) infrastructure used by public health researchers and scientists. This senior-level position requires extensive Linux expertise combined with a deep understanding of the specialized hardware, software, and networking required for scientific research and large-scale data analysis.
**Candidate MUST:**
+ Be located in the Atlanta, GA area for partial onsite work
+ Be a US Citizen with the ability to obtain a Public Trust Clearance
The candidate provides secure, always-on infrastructure services that give researchers access to customer-sponsored data hosted on-premises and in the cloud, along with secure access to high-performance computing resources for scientific research.
+ High-performance computing infrastructure management: Deploy, administer, and monitor HPC clusters. Manage multiple petabytes of data using Pure Storage flash memory storage and AWS S3 Glacier.
+ Software and resource management: Install, maintain, and upgrade scientific software, libraries, and batch schedulers such as GridEngine and Slurm. The role also involves developing effective processes and solutions for sharing resources across multiple research teams.
+ VMware: Manage the VMware vSphere Foundation for virtual server provisioning, deployment, and configuration, as well as hardware and software implementation and maintenance.
+ System operations: System monitoring, routine and ad hoc security patch management, troubleshooting, and performance tuning.
+ Project planning and coordination: Advise the customer and Project Manager in designing and documenting technical solutions. Support infrastructure projects, from planning and coordinating team activities to executing planned activities and providing status updates. Communicate and work collaboratively with internal and client team members across the program, and provide technical counsel and/or alternative designs, solutions, and processes to leadership.
+ Automation and scripting: Lead automation efforts to streamline system management tasks using scripting languages (Bash, Python) and configuration management tools (Puppet, Ansible).
+ Research collaboration: Work closely with scientists, bioinformatics developers, and principal investigators to understand their computational needs and translate scientific goals into technical configurations. This includes providing technical support to help optimize workflows.
+ System architecture and deployment: Lead the technical design, integration, and optimization of on-site HPC and cloud resources.
+ Mentorship and team coordination: Guide and mentor other system administrators on best practices for system administration and troubleshooting. Some roles involve managing a team of system administrators.
+ Security and compliance: Implement robust security measures, manage access controls, and design architectures that meet compliance standards such as HIPAA or NIST. Support the SA&A process.
+ Disaster recovery and monitoring: Design and implement backup and disaster recovery plans. Integrate monitoring and alerting systems to ensure system availability and reliability.
**REQUIRED EDUCATION AND EXPERIENCE**
+ A Bachelor's degree in computer science or a related field, plus 10 years of System Administration experience.
+ Extensive experience (7+ years) designing and operating **high-performance computing (HPC) infrastructure**.
+ Linux expertise: Mastery of Linux systems and administration, including troubleshooting, security, performance monitoring, and various distributions (e.g., Red Hat, Ubuntu) to support scientific computing.
+ Soft skills: Strong problem-solving and communication skills are critical for collaborating with customers, bioinformatics developers, and researchers, and for leading a team. Experience working with a team to introduce and integrate new technologies and processes into existing production environments.
+ Network: Proficiency in working with applicable network devices, including routers, switches, gateways, and hubs.
+ Security: Develop the infrastructure deliverables, continuous diagnostics and mitigation, threat mitigation and incident response, security architecture support, critical infrastructure protection, patch management, vulnerability management, risk management, information assurance, and Security Assessment and Authorization (SA&A) documentation.
+ VMware: Experienced in managing VM infrastructure.
+ Leadership: Proven leadership in planning, coordinating infrastructure support activities, leading and mentoring system administrators
+ HPC and cluster management: Proven experience with HPC clusters, job schedulers (Slurm), and high-speed networking (10/40/100Gb)
+ Other technical skills: Proficiency in Bash and Python scripting for automation is essential. Experience with cloud technologies (hybrid-cloud integration) and container environments (e.g., Docker, Singularity, Kubernetes).
**DESIRED QUALIFICATIONS:**
+ A Master's Degree in IT, engineering, or other relevant fields.
+ Experience working at a federal government agency or a research organization
+ Large scale infrastructure design and implementation project experience
+ Red Hat Certified Engineer (RHCE), Red Hat Certified Architect (RHCA), or equivalent certifications.
+ Experience with computer networking protocols including, but not limited to TCP, IP, UDP, HTTP, DHCP, and DNS. Understanding of network design and management - LAN, WAN, and VPN.
+ Experience optimizing cloud utilization patterns and supporting development, validation, operations, and security, including migration from an on-premises model to a hybrid model.
+ AWS or Azure Cloud engineer certification
If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo - because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 - and moving faster than anyone else dares.
**Original Posting:**
September 29, 2025
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
**Pay Range:**
Pay Range $89,700.00 - $162,150.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status. Leidos will consider qualified applicants with criminal histories for employment in accordance with relevant Laws. Leidos is an equal opportunity employer/disability/vet.
