Explore opportunities in high performance computing (HPC), a field experiencing substantial growth. HPC jobs involve designing, developing, and maintaining advanced computing systems used for complex simulations, data analysis, and scientific research. Professionals in this area work with cutting-edge technologies, including parallel processing, distributed computing, and specialized hardware architectures.

The demand for HPC experts spans various industries, from academia and government to finance and healthcare. Key roles include HPC system administrators, computational scientists, software engineers, and research scientists. These positions require a strong background in computer science, mathematics, and relevant domain expertise.

Job seekers can find HPC roles in research institutions, technology companies, and organizations with significant data processing needs. Career paths often involve continuous learning and adaptation to new technologies. The field offers opportunities for innovation and contribution to solving some of the world's most challenging problems.

What People Ask

Essential skills include proficiency in programming languages like C++, Python, and Fortran, along with experience in parallel computing frameworks such as MPI and CUDA. A solid understanding of computer architecture, operating systems, and networking is needed. Strong problem-solving and analytical abilities are valuable.

Responsibilities vary depending on the specific role, but often include system administration, performance tuning, software development, and user support. HPC professionals may also be involved in research, algorithm design, and data analysis. Collaboration with other scientists and engineers is common.

Top employers include national laboratories like Oak Ridge and Argonne, technology companies such as Intel and NVIDIA, and research universities with advanced computing facilities. These organizations offer diverse opportunities for HPC professionals.

The average salary for HPC roles in the US ranges from $90,000 to $160,000 per year, depending on experience, location, and specific job title. Senior-level positions and specialized roles may command higher salaries. Compensation packages often include benefits such as health insurance, retirement plans, and paid time off.

Career paths can lead to roles such as HPC system architect, computational scientist, research scientist, or software engineer specializing in parallel computing. Advancement opportunities may involve leading research projects, managing HPC infrastructure, or developing new algorithms and applications. Continuous learning and professional development are important for career growth.

Industry

View All High Performance Computing Jobs

138 High Performance Computing jobs in the United States

Q: What is the average salary for high performance computing jobs in the US?

The average salary for HPC roles in the US ranges from $90,000 to $160,000 per year, depending on experience, location, and specific job title. Senior-level positions and specialized roles may command higher salaries. Compensation packages often include benefits such as health insurance, retirement plans, and paid time off.

High Performance Computing Graduate Student

90079 Los Angeles, California New Mexico Staffing

Posted today

Tap Again To Close

Job Description

Internship Opportunity In The High Performance Computing Division

The High Performance Computing Division (HPC) managers world-class Supercomputing Centers. Our employees engage in research, development, and state-of-the-art software engineering, supporting development, design and effective use of large scale data and computational environments. Delivering computational capability and data management at petascale (i.e., across tens of thousands of processors and some the largest, fastest, and most complex data movement and storage systems in the world) gives rise to cutting edge technical challenges. HPC Division innovates to meet those challenges.

HPC is seeking recent graduates looking for a challenging paid post graduate internships at the post-baccalaureate level. All selected candidates will be provided with a mentor, a challenging project for their appointment, and an opportunity to present their project work and progress to colleagues. The projects we have to offer will vary depending on the skills and interests of the candidate and our current needs.

Minimum Job Requirements:

We are seeking a wide variety of computational skills employed in an HPC environment. To receive full consideration, candidates must have received a degree in one of the following:

Computer Science
Computer Engineering
Electrical and Computer Engineering
Systems Administration
Computer Networking
Computer Systems Analysis
Software Engineer
Electrical Engineering
Information Systems Security Management
Mathematics or Physics

AND, knowledge and introductory experience in one or more of the following:

Cluster Computing and System Administration
Programming skills (C++, Python, SQL, Perl, etc.)
Linux System Management
Linux-based Computer Security
Parallel Programming
FPGA Programming
HPC System Software
Data Storage
Computer Networking

Note to Applicants: A cover letter expressing the candidate's strengths and interests will help create the best match between applicant and internship. It is highly recommended to submit a cover letter if you are interested in the Supercomputing Institute. All applications received prior to the first Tuesday in January will be considered for our Supercomputer Institute.

Required Application Materials (for all intern levels - do not delete):

Current resume
Current official transcripts
Personal statement of interest (not to exceed one page)

Due to federal restrictions, citizens of the People's Republic of China, the Islamic Republic of Iran, the Democratic People's Republic of North Korea, and the Russian Federation, who are not Lawful Permanent Residents ("green card" holders) are prohibited from accessing facilities that support the mission, functions, and operations of national security laboratories and nuclear weapons production facilities, which includes Los Alamos National Laboratory.

Located in Northern New Mexico, Los Alamos National Laboratory (LANL) is a multidisciplinary research institution engaged in strategic science on behalf of national security. LANL enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns.

Our generous benefits package includes:

PPO or High Deductible medical insurance with the same large nationwide network
Dental and vision insurance
Free basic life and disability insurance
Paid childbirth and parental leave
Award-winning 401(k) (6% matching plus 3.5% annually)
Learning opportunities and tuition assistance
Flexible schedules and time off (PTO and holidays)
Onsite gyms and wellness programs
Extensive relocation packages (outside a 50 mile radius)

Equal Opportunity: Los Alamos National Laboratory is an equal opportunity employer. All employment practices are based on qualification and merit, without regard to protected categories such as race, color, national origin, ancestry, religion, age, sex, gender identity, sexual orientation, marital status or spousal affiliation, physical or mental disability, medical conditions, pregnancy, status as a protected veteran, genetic information, or citizenship within the limits imposed by applicable federal, state and local laws and regulations. The Laboratory is also committed to making our workplace accessible to individuals with disabilities and will provide reasonable accommodations, upon request, for individuals to participate in the application and hiring process.

View Now

Software Engineer, High Performance Computing

90250 Federal, California SpaceX

Posted 4 days ago

Tap Again To Close

Job Description

Software Engineer, High Performance Computing

Hawthorne, CA

Apply

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SOFTWARE ENGINEER, HIGH PERFORMANCE COMPUTING

Starshield leverages SpaceX’s Starlink technology and launch capability to support national security efforts. While Starlink is designed for consumer and commercial use, Starshield is designed for government use, with an initial focus on earth observation, communications, and hosted payloads.

The Starshield software team is building highly reliable in-space mesh networks, designing secure systems to guarantee access to space, designing next-gen communication and sensing software, and more. Aerospace experience is not required to be successful here - we want our engineers to bring fresh ideas from all areas. We look for engineers who love solving problems and seek to make an impact on an inspiring mission. As we expand this team, we're looking for versatile, motivated, and collaborative engineers with hands-on experience developing C++ software for real world systems.

Our team is involved in designing the vehicle systems at every phase of development. We build tools that enable us to work more efficiently, and that help us build software systems that are secure, reliable, and autonomous. Our software engineers are responsible for the life cycle of the software they create, including development, testing, and operational support.

RESPONSIBILITIES:

Create highly reliable software systems that control hundreds of satellites in low earth orbit
Leverage software design to improve satellite constellation performance, security, and availability to meet the needs of a wide range of users
See your software through from start to finish: from figuring out the core needs to prototyping, developing, and testing; to on-orbit rollout and beyond
Work with interdisciplinary teams to brainstorm, design, and build the next generation of satellite capabilities, from cutting-edge sensors and inter-satellite lasers to space-based cloud compute

There are several roles within the Starshield software team with different focus areas. Applicants will interview for specific focus areas based on hiring needs and qualifications. Specific role responsibilities may include:

Write high quality Linux-based C++ software for common processors and micro controllers (e.g. ARM, PowerPC, x86, etc.)
Implement networking technologies to direct data across a variety of satellites, ground operations centers, and users
Build automated ground-based software systems that integrate smart data processing with command and control of the satellites
Develop models and simulations for flight-like vehicle software testing, network performance analysis, or research & development projects
Develop tools that allow for test execution across multiple environments: virtualized hardware, real hardware-in-the-loop, and even vehicle-in-the-loop testing
Invent new systems that enable more frequent and reliable software deployment, test execution, and data analysis as part of a continuous integration and release system

BASIC QUALIFICATIONS:

Bachelor's degree in computer science, engineering, math, or engineering discipline; OR 2+ years of professional experience in software development in lieu of a degree
Development experience in C, C++, or Python or full stack software development experience

PREFERRED SKILLS AND EXPERIENCE:

Experience in C++ for high performance systems
Developed and deployed software that has been used real-world applications and projects
Solid fundamental knowledge of computer architecture and networks
Strong skills in debugging, performance optimization and unit testing
Ability to work effectively and creatively in a dynamic environment with changing needs and requirements
Ability to work independently and in a team, take initiative, and communicate effectively
Ability to obtain and maintain a Top Secret or Top Secret SCI clearance

Some preferred skills and experience depend on the specific team within flight software and may include:

Experience with networking protocols (TCP, UDP, etc)
Experience developing in the Linux kernel
Experience with image data processing and machine learning
Strong background in math and physics

ADDITIONAL REQUIREMENTS:

Note that an active clearance may provide the opportunity for you to work on sensitive SpaceX missions; if so, you will be subject to pre-employment drug and random drug and alcohol testing
Must be willing to work extended hours and weekends as needed

COMPENSATION AND BENEFITS:

Pay Range:

Software Engineer/Level I: $120,000.00 - $45,000.00/per year

Software Engineer/Level II: 140,000.00 - 170,000.00/per year

Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.

Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation & will be eligible for 10 or more paid holidays per year. Exempt employees are eligible for 5 days of sick leave per year.

ITAR REQUIREMENTS:

To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITARhere ( .

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process should reach out to

View Now

Senior High Performance Computing Engineer

94025 Menlo Park, California SLAC National Accelerator Laboratory

Posted 1 day ago

Tap Again To Close

Job Description

Senior High Performance Computing Engineer
Job ID
6383
Location
SLAC - Menlo Park, CA
Full-Time
Regular
**SLAC Job Postings**
**About SLAC:**
The SLAC National Accelerator Laboratory, operated by Stanford University, is a premier national laboratory at the forefront of advancing the frontiers of scientific research and innovation.
SLAC is home to groundbreaking facilities such as the Linac Coherent Light Source - which generates incredibly brief bursts of X-rays to capture stunning movies of atomic and molecular processes in real time with rates of up to 1 million pulses per second. This capability generates an astounding amount of data, with expected data rates reaching up to 1 petabyte per week during full operations. This immense volume of data is essential for researchers to understand dynamic processes in areas such as materials science, chemistry, and biology.
Another prominent endeavor at SLAC is the Rubin Observatory, which is set to conduct the Legacy Survey of Space and Time (LSST) - an unprecedented project to map the entire southern night sky every few days for over a decade. As the data facility for all of the Rubin data, the Rubin Observatory is projected to generate about half an exabyte of data, providing unprecedented insights into dark matter and dark energy, along with discovering and cataloging and transient astronomical events like supernovae and near-Earth asteroids.
SLAC¿s commitment to exploring fundamental questions about our universe is embodied in its collaborative and multidisciplinary research culture, enabling scientists and engineers to delve into the interactions of light, matter, and the foundational principles governing the world we live in. Our lab equips its teams with innovative technologies and unparalleled expertise, fostering a dynamic environment where scientific inquiry can thrive.
**Given the nature of this position, SLAC is open to on-site and hybrid work options.**
**Position Overview:**
As a Senior High Performance Computing Engineer in the Scientific Computing Services Division of the Technology and Innovation Directorate (TID) at SLAC, you will play a critical role in managing and optimizing our High Performance Computing (HPC) environment in support of these groundbreaking scientific projects. You will be responsible for the advanced administration of our Slurm batch system, alongside deploying, optimizing, and debugging applications, scientific libraries and software environments. Additionally, you will contribute to the management and planning of our scientific software catalog to ensure it meets the diverse needs of our research community. This position offers the opportunity to work on challenges that push technological boundaries while mentoring junior staff and guiding the evolution of our HPC capabilities.
**Your specific responsibilities will be to:**
+ Administer, optimize and maintain Slurm for effective job scheduling and resource management in a multi-user HPC environment.
+ Provide implementation, debugging and performance tuning of parallel applications, ensuring high levels of efficiency and reliability.
+ Manage and plan a comprehensive scientific software catalog, ensuring that software tools are current, properly configured, and aligned with users¿ research objectives.
+ Collaborate with multidisciplinary teams to identify performance bottlenecks and software needs, devising innovative solutions to enhance computational workflows.
+ Spearhead initiatives for the design, scaling, and deployment of advanced computing infrastructure to support evolving research and operational demands.
+ Conduct performance analysis and benchmarking of HPC applications, effectively communicating results and recommendations to stakeholders.
+ Stay attuned to emerging trends and technologies in HPC, proposing strategic enhancements to maintain our competitive advantage.
**To be successful in this position you will bring:**
+ Bachelor's degree in computer science, computer engineering, or a related field and 5 years of relevant experience below or Master's degree and 3 years of relevant experience below:
+ Proficiency in debugging and profiling tools for high-performance parallel applications (e.g., gdb, Valgrind, Nvidia Nsight).
+ In-depth knowledge of Linux operating systems and advanced shell scripting.
+ Proven expertise in programming with C, C++, and Fortran, Python, along with deep experience in OpenMPI.
+ Strong problem-solving abilities complemented by exceptional communication skills to bridge technical concepts with non-technical stakeholders.
**Preferred Qualifications:**
+ Experience working in scientific or academic environments, collaborating closely with researchers and understanding their computational needs.
+ Familiarity with the scientific research process and the ability to translate research requirements into technical solutions.
+ Prior exposure to scientific computing applications and tools commonly used in fields such as physics, astrophysics, biophysics, and materials science.
+ Previous roles as a consultant or technical liaison between researchers and IT departments will be advantageous.
**Why Join Us?**
+ Innovative Environment: Work at the forefront of cutting-edge science and technology, contributing to revolutionary projects like LCLS and the Rubin Observatory that will redefine our understanding of the universe.
+ Collaborative Culture: Join a vibrant team of experts in a multidisciplinary environment, fostering collaboration across various scientific disciplines.
+ Professional Growth: Benefit from continuous learning and development opportunities, including access to training programs, workshops, and conferences.
+ Work-Life Balance: Enjoy a supportive work environment that values your well-being, with flexible working arrangements to support a healthy work-life balance.
+ Comprehensive Benefits: SLAC offers a competitive salary and a generous benefits package, including health, dental, and vision insurance, retirement savings plans, and tuition assistance for continued education.
**SLAC Employee Competencies:**
+ **Effective Decisions** : Uses job knowledge and solid judgment to make quality decisions in a timely manner.
+ **Self-Development** : Pursues a variety of venues and opportunities to continue learning and developing.
+ **Dependability** : Can be counted on to deliver results with a sense of personal responsibility for expected outcomes.
+ **Initiative** : Pursues work and interactions proactively with optimism, positive energy, and motivation to move things forward.
+ **Adaptability** : Flexes as needed when change occurs, maintains an open outlook while adjusting and accommodating changes.
+ **Communication** : Ensures effective information flow to various audiences and creates and delivers clear, appropriate written, spoken, presented messages.
+ **Relationships** : Builds relationships to foster trust, collaboration, and a positive climate to achieve common goals.
**Physical Requirements and Working Conditions:**
+ Consistent with its obligations under the law, the University will provide reasonable accommodation to any employee with a disability who requires accommodation to perform the essential functions of the job. May work extended hours during peak business cycles.
**Work Standards** :
+ Interpersonal Skills: Demonstrates the ability to work well with Stanford colleagues and clients and with external organizations.
+ Promote Culture of Safety: Demonstrates commitment to personal responsibility and value for environment, safety and security; communicates related concerns; uses and promotes safe behaviors based on training and lessons learned.Meets the applicable roles and responsibilities as described in the ESH Manual, Chapter 1¿General Policy and Responsibilities: Subject to and expected to comply with all applicable University policies and procedures, including but not limited to the personnel policies and other policies found in the University's Administrative Guide, Title: System Administrator 2
Grade: I
Job code: 4832
Duration: Regular Continuing
_The expected pay range for this position is $_ _$22,024 to 147,076 per annum. SLAC National Accelerator Laboratory/Stanford University provides pay ranges representing its good faith estimate of what the university reasonably expects to pay for a position. The pay offered to a selected candidate will be determined based on factors such as (but not limited to) the scope and responsibilities of the position, the qualifications of the selected candidate, departmental budget availability, internal equity, geographic location and external market pay for comparable jobs._
SLAC National Accelerator Laboratory is an Affirmative Action / Equal Opportunity Employer and supports diversity in the workplace. All employment decisions are made without regard to race, color, religion, sex, national origin, age, disability, veteran status, marital or family status, sexual orientation, gender identity, or genetic information. All staff at SLAC National Accelerator Laboratory must be able to demonstrate the legal right to work in the United States. SLAC is an E-Verify employer.

View Now

High-Performance Computing (HPC) Engineer

21705 Frederick, Maryland CACI International

Posted 1 day ago

Tap Again To Close

Job Description

High-Performance Computing (HPC) Engineer
Job Category: Information Technology
Time Type: Full time
Minimum Clearance Required to Start: TS/SCI with Polygraph
Employee Type: Regular
Percentage of Travel Required: Up to 10%
Type of Travel: Local
* * *
**The Opportunity**
+ The position is to support a Special Project for an Intelligence Community customer in Frederick, Maryland.
+ The project is to maintain and evolve an existing set of Modeling and Simulation applications that run on a High Performance Computing (HPC) System
**Responsibilities**
+ Provides backend software development inside an AGILE development environment.
+ Develop and run simulations on a High-Performance Computing (HPC) System to forecast Chemical and Radiological hazard areas.
+ Use modeling and simulation tools to include the Operational Multi-Scale Environment Model with grid Adaptivity (OMEGA).
+ Support the Chemical Hazard Area Modeling Program (CHAMPS)
+ Switch the existing schedule and resource management from Torque qsub to Slurm batch.
+ Provides backend architecture, system integration, program logic, database integration, ETL, and security controls.
+ Updates and incorporates new backend systems to support technical improvements.
**Qualifications**
_Required:_
+ Top Secret SCI security clearance preferably with a recent polygraph.
+ Expertise in software development tasks and High-Performance Computing (HPC)
+ Experience with schedule and resource management with Torque qsub and Slurm batch
+ Strong programming skills with multiple languages to include C Shell
+ Experience with Linux operating systems
+ Bachelor's degree in computer science, data science, math or a medical field with 7+ years of experience.
_Desired:_
+ Previous experience in a medical organization.
+ Previous experience with an intelligence organization.
+ Strong verbal and written communication skills.
+ Familiarity with DoD Instruction DoDI 8500.1, DoDI8500.2, DoDI 10.01, ICD 503 and the national Institute of Standards and Technology and Technology Special Publications 800-144 and 145.
-
**___**
**What You Can Expect:**
**A culture of integrity.**
At CACI, we place character and innovation at the center of everything we do. As a valued team member, you'll be part of a high-performing group dedicated to our customer's missions and driven by a higher purpose - to ensure the safety of our nation.
**An environment of trust.**
CACI values the unique contributions that every employee brings to our company and our customers - every day. You'll have the autonomy to take the time you need through a unique flexible time off benefit and have access to robust learning resources to make your ambitions a reality.
**A focus on continuous growth.**
Together, we will advance our nation's most critical missions, build on our lengthy track record of business success, and find opportunities to break new ground - in your career and in our legacy.
**Your potential is limitless.** So is ours.
Learn more about CACI here. ( Range** : There are a host of factors that can influence final salary including, but not limited to, geographic location, Federal Government contract labor categories and contract wage rates, relevant prior work experience, specific skills and competencies, education, and certifications. Our employees value the flexibility at CACI that allows them to balance quality work and their personal lives. We offer competitive compensation, benefits and learning and development opportunities. Our broad and competitive mix of benefits options is designed to support and protect employees and their families. At CACI, you will receive comprehensive benefits such as; healthcare, wellness, financial, retirement, family support, continuing education, and time off benefits. Learn more here ( .
The proposed salary range for this position is:
$113,200 - $237,800
_CACI is_ _an Equal Opportunity Employer._ _All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any_ _other protected characteristic._

View Now

Software Engineer, High Performance Computing

98033 Kirkland, Washington Google

Posted today

Tap Again To Close

Job Description

Software Engineer, High Performance Computing
_corporate_fare_ Google _place_ Kirkland, WA, USA
**Mid**
Experience driving progress, solving problems, and mentoring more junior team members; deeper expertise and applied knowledge within relevant area.
**Minimum qualifications:**
+ Bachelor's degree or equivalent practical experience.
+ 2 years of experience in high performance computing (HPC) system architecture and applications.
+ 2 years of experience testing, and launching software products, and experience with software design and architecture.
**Preferred qualifications:**
+ Advanced degree in physics, mathematics, life sciences engineering, computer science, engineering, or a similar technical field.
+ 4 years of experience in software development in C++, Python, Julia or similar programming languages used for technical/scientific/engineering computing.
+ 4 years of experience in scientific computing (workflows, applications) from one or more domains (health care/life science, manufacturing (CAE, EDA, energy), financial services industry).
**About the job**
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
Our mission is to enable our customers to run their most demanding workloads for technical, scientific and engineering issues on our Google Cloud Platform (GCP). This High Performance Computing (HPC) role offers supercomputer-class infrastructure (CPUs, GPUs or TPUs) that interoperates with other cloud services from storage to AI.
We offer a range of Virtual Machine (VM) families tailored for HPC use and innovative control plane constructs to build scalable systems. We enable our customers to create tailor-made HPC environments from cloud building blocks or derived from use case focused reference architectures. We help our customers navigate the integration of AI into computational workflows and workloads.
The AI and Infrastructure team works on the world's toughest problems, redefining what's possible and the possible easy. We empower Google customers by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Googler Cloud customers, and billions of Google users worldwide. We're at the center of amazing work at Google by being the "flywheel" that enables our advanced AI models, delivers computing power across global services, and offers platforms that developers use to build services.
In AI and Infrastructure, we shape the future of hyperscale computing by inventing and creating world-leading future technology, and drive global impact by contributing to Google infrastructure, from software to hardware (including building Vertex AI for Google Cloud). We work on complex technologies at a global scale with key players in the AI and systems space. Join a team of talented individuals who not only work together to keep data centers operating efficiently but also create a legacy of driving innovation by building some of the most complex systems technologies.
The US base salary range for this full-time position is $141,000-$202,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more aboutbenefits at Google ( .
**Responsibilities**
+ Understand our customers computing goals and generalize them into repeatable usage patterns and adequate cloud architecture.
+ Implement specific HPC solutions as Infrastructure-as-Code and necessary deployment tooling functionality.
+ Work closely with technical leads, product managers and partner service engineering teams to get high-quality features through the software project life-cycle.
+ Manage project schedules, identify technical risks and clearly communicate them to project stakeholders.
+ Collaborate with Program Manager (PM) and Go-to-Market (GTM) teams to develop solution collateral (guides, whitepapers, blog posts, etc.) to onboard customers and drive adoption.
Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google'sApplicant and Candidate Privacy Policy (./privacy-policy) .
Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See alsoGoogle's EEO Policy ( ,Know your rights: workplace discrimination is illegal ( ,Belonging at Google ( , andHow we hire ( .
If you have a need that requires accommodation, please let us know by completing ourAccommodations for Applicants form ( .
Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.
To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also and If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form:

View Now

Program Director - High Performance Computing

22037 Fairfax, Virginia General Dynamics Information Technology

Posted 7 days ago

Tap Again To Close

Job Description

**Req ID:** RQ
**Type of Requisition:** Regular
**Clearance Level Must Be Able to Obtain:** None
**Public Trust/Other Required:** None
**Job Family:** Program Delivery and Execution
**Skills:**
Employee Management,High-Performance Computing (HPC) Systems,Leadership,Program Management,Project Management
**Experience:**
15 + years of related experience
**US Citizenship Required:**
Yes
**Job Description:**
At GDIT, people are our differentiator. Our work depends on a Program Director joining our team to support the National Oceanic and Atmospheric Administration (NOAA), Weather and Climate Operational Supercomputer System (WCOSS).
WCOSS provides NOAA the operational High Performance Computing (HPC) resources essential to process sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather and climate operational use. Operating 24/7, the 10-year WCOSS program will deliver significant computational capability that will evolve over time to keep pace with NOAA's growing environmental modeling needs.
We are looking for individuals to join GDIT's team to operate and support leading-edge technology for WCOSS and support the deployment of a new cutting edge HPC system.
**Responsibilities:**
+ Responsible for all aspects of the development and implementation of assigned projects, and provides a single point of contact for those projects
+ Serves as the Contractor interface for the Contracting Officer's Representative (COR)
+ Support WCOSS HPC operational systems 24x7 support team and processes
+ Supports implementation of new HPC operational system in concert with maintaining legacy system operations
+ Provides leadership and mentoring to senior HPC professionals
+ Ensures compliance with NIST and FISMA security control requirements
+ Takes projects from original concept through to final implementation
+ Interfaces with all areas affected by the project, including end users, computer services, and client services
+ Develops detailed work plans, schedules, project estimates, resource plans, and status reports
+ Conducts project meetings, and is responsible for project tracking and analysis
+ Ensures adherence to quality standards and reviews project deliverables
+ Leads the integration of vendor tasks, and tracks and reviews vendor deliverables
+ Provides technical and analytical guidance to project team
+ Recommends and takes action to direct the analysis and solutions of problems
**Required Qualifications:**
+ Bachelor's degree required; Master's degree preferred and minimum of 5 years of experience as a Program Manager for large ($500M), high-visibility federal government contract(s)
+ Project Management Professional Certification
+ Possesses project management experience in a High Performance Computing (HPC) environment
+ Must be able to communicate clearly and succinctly to Government stakeholders (e.g. Program Offices, HPC end users, etc.), including senior level management
+ Demonstrated technical experience architecting or maintaining high-performance computers
+ Demonstrated experience managing teams of geographically dispersed staff
+ Demonstrated experience developing and sustaining productive relationships with demanding clients in a mission-critical operational environment
Critical capabilities:
+ Understanding of high-performance computing architecture and engineering, including, but not limited to, compute, storage, and interconnect
+ Understanding of high-performance computing software, including schedulers and file systems
+ Understanding of data center operations, including electrical power, cooling, and fit up and management basics
+ Understanding of all aspects of government contract program management, including overall program execution oversight and strategy, client relationship management, staff management, financial management, and schedule management
**Desired Ideal additional capabilities and experience:**
+ Informed understanding of the role that high-performance computing plays in weather forecasting
+ Experience with Top500 Cray EX systems, including Slingshot high-speed fabric, liquid cooling, Lustre file systems, and/or HPCM cluster management software
+ Experience with mission-critical high-performance computing programs, with challenging operational SLAs and low tolerance limits for system outages or issues
+ Graduate degree in computer or physical sciences
+ PMP certification
The likely salary range for this position is $82,750 - 247,250. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. GDIT typically provides new employees with 15 days of paid leave per calendar year to be used for vacations, personal business, and illness and an additional 10 paid holidays per year. Paid leave and paid holidays are prorated based on the employee's date of hire. The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12 month period for eligible employees. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.
We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.
Join our Talent Community to stay up to date on our career opportunities and events at Opportunity Employer / Individuals with Disabilities / Protected Veterans

View Now

High Performance Computing Undergraduate Student

87544 Los Alamos, New Mexico Los Alamos National Laboratory

Posted 15 days ago

Tap Again To Close

Job Description

**What You Will Do**
**The High Performance Computing Division (HPC) manages world-class Supercomputing Centers. Our employees engage in research, development, and state-of the-art software engineering, supporting development, design and effective use of largescale data and computational environments.**
**Delivering computational capability and data management at petascale (i.e., across tens of thousands of processors and some of the largest, fastest, and most complex data movement and storage systems in the world) gives rise to cutting edge technical challenges. HPC Division innovates to meet those challenges.** ** **.**
**HPC is seeking undergraduate level students looking for a challenging paid internship for summer 2026. All selected candidates will be provided with a mentor, a challenging project for their appointment, and an opportunity to present their project work and progress to colleagues. The student projects we have to offer will vary depending on the skills and interests of the candidate and our current needs.**
**What You Need**
**Minimum Job Requirements:**
+ **Currently enrolled (12 semester credit hours or full-time equivalent) in an accredited undergraduate degree-granting program (or international equivalent)**
+ **Must have and maintain a cumulative GPA of at least 2.75 on a 4.0 scale (or equivalent).**
+ **Entering first-year students must provide documentation indicating matriculation into an accredited undergraduate degree program.**
**We are seeking a wide variety of computational skills employed in an HPC environment. To receive full consideration, candidates must be pursuing one of the following undergraduate degrees (or a substantially equivalent degree), with a minimum of 60 hours completed:**
+ **Computer Science**
+ **Computer Engineering**
+ **Electrical and Computer Engineering**
+ **Systems Administration**
+ **Computer Networking**
+ **Computer Systems Analysis**
+ **Software Engineer**
+ **Electrical Engineering**
+ **Information Systems Security Management**
+ **Mathematics or Physics**
**AND, knowledge and introductory experience in one or more of the following:**
+ **Cluster Computing and System Administration**
+ **Programming skills (C++, Python, SQL, Perl, etc.)**
+ **Linux system management**
+ **Linux-based computer security**
+ **Parallel Programming**
+ **FPGA Programming**
+ **FPGA Programming**
+ **HPC System Software**
+ **Data Storage**
+ **Computer Networking**
**Note to Applicants:**
Applicants may send any questions to
Due to federal restrictions contained in the current National Defense Authorization Act, citizens of the People's Republic of China, the Islamic Republic of Iran, the Democratic People's Republic of North Korea, and the Russian Federation, who are not Lawful Permanent
Residents ("green card" holders) are prohibited from accessing facilities that support the mission, functions, and operations of national security laboratories and nuclear weapons production facilities, which includes Los Alamos National Laboratory.
**Where You Will Work**
Located in Northern New Mexico, Los Alamos National Laboratory (LANL) is a multidisciplinary research institution engaged in strategic science on behalf of national security. LANL enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns. Our generous benefits package includes:
+ PPO or High Deductible medical insurance with the same large nationwide network
+ Dental and vision insurance
+ Free basic life and disability insurance
+ Paid childbirth and parental leave
+ Award-winning 401(k) (6% matching plus 3.5% annually)
+ Learning opportunities and tuition assistance
+ Flexible schedules and time off (PTO and holidays)
+ Onsite gyms and wellness programs
+ Extensive relocation packages (outside a 50 mile radius)
**Additional Details**
Directive 206.2 - Employment with Triad requires a favorable decision by NNSA indicating employee is suitable under NNSA Supplemental Directive 206.2 ( . Please note that this requirement applies only to citizens of the United States. Foreign nationals are subject to a similar requirement under DOE Order 142.3A.
No Clearance: Position does not require a security clearance. Selected candidates will be subject to drug testing and other pre-employment background checks.
New-Employment Drug Test: The Laboratory requires successful applicants to complete a new-employment drug test and maintains a substance abuse policy that includes random drug testing. Although New Mexico and other states have legalized the use of marijuana, use and possession of marijuana remain illegal under federal law. A positive drug test for marijuana will result in termination of employment, even if the use was pre-offer.
Internal Applicants: Regular appointment employees who have served the required period of continuous service in their current position are eligible to apply for posted jobs throughout the Laboratory. If an employee has not served the required period of continuous service, they may only apply for Laboratory jobs with the documented approval of their Division Leader. Please refer to Policy Policy P701 ( for applicant eligibility requirements.
Incentive Compensation Program: For general program information refer to the Student Programs web page: Opportunity: Los Alamos National Laboratory is an equal opportunity employer. All employment practices are based on qualification and merit, without regard to protected categories such as race, color, national origin, ancestry, religion, age, sex, gender identity, sexual orientation, marital status or spousal affiliation, physical or mental disability, medical conditions, pregnancy, status as a protected veteran, genetic information, or citizenship within the limits imposed by applicable federal, state and local laws and regulations.
The Laboratory is also committed to making our workplace accessible to individuals with disabilities and will provide reasonable accommodations, upon request, for individuals to participate in the application and hiring process. To request a disability accommodation, email or call , opt. 3.
Instructions on How to Activate/Create a LANL Jobs Account:
Follow the instructions below if you have ever had an employee Z number, been a contractor, or received Los Alamos Lab insurance coverage to activate your account:
+ Select the Click Here button if you have been employed with the Lab or received insurance coverage.
+ Please enter only your first and last name and current email address (an email with your validation code will be sent to you) to activate the account currently in our system.
+ Enter your validation code as described in the email you receive and complete the 3-page registration form. Your account is now active, and you can apply for jobs or save to your basket. Important: Enter the validation code within 15 days to activate your account or your account will be deactivated.
Follow the instructions below if you if you have never been employed with the Lab or received insurance coverage to create an account:
+ Select the Register button if you have never been employed with the Lab or received insurance coverage to Create an Account.
+ From here, you will establish an account with username and password.
How to Apply: Login to Your Account to Complete the Application Process
+ Click the Vacancy Name number (in blue) to view any job's details.
+ Click Apply or Add to Basket to apply later. Tip: To apply for a job or save your basket, you must have a LANL jobs account.
If you experience any technical issues, please email for assistance.

View Now

Be The First To Know

About the latest High performance computing Jobs in United States !

Set Email Alert:

Enter your email

Job title

Location

High Performance Computing Graduate Student

87544 Los Alamos, New Mexico Los Alamos National Laboratory

Posted 15 days ago

Tap Again To Close

Job Description

**What You Will Do**
**The High Performance Computing Division (HPC) managers world-class Supercomputing Centers. Our employees engage in research, development, and state-of-the-art software engineering, supporting development, design and effective use of large scale data and computational environments. Delivering computational capability and data management at petascale (i.e., across tens of thousands of processors and some of the largest, fastest, and most complex data movement and storage systems in the world) gives rise to cutting edge technical challenges. HPC Division innovates to meet those challenges.**
**HPC is seeking recent graduates looking for a challenging paid post graduate internships at the post-baccalaureate level. All selected candidates will be provided with a mentor, a challenging project for their appointment, and an opportunity to present their project work and progress to colleagues. The projects we have to offer will vary depending on the skills and interests of the candidate and our current needs. ** You Need**
**Minimum Job Requirements:**
We are seeking a wide variety of computational skills employed in an HPC environment. To receive full consideration, candidates must have received a degree in one of the following
+ Computer Science
+ Computer Engineering
+ Electrical and Computer Engineering
+ Systems Administration
+ Computer Networking
+ Computer Systems Analysis
+ Software Engineer
+ Electrical Engineering
+ Information Systems Security Management
+ Mathematics or Physics
AND, knowledge and introductory experience in one or more of the following:
+ Cluster Computing and System Administration
+ Programming skills (C++, Python, SQL, Perl, etc.)
+ Linux system management
+ Linux-based computer security
+ Parallel Programming
+ FPGA Programming
+ FPGA Programming
+ HPC System Software
+ Data Storage
+ Computer Networking
**Note to Applicants:**
A cover letter expressing the candidate's strengths and interests will help create the best match between applicant and internship. It is highly recommended to submit a cover letter if you are interested in the Supercomputing Institute.
All applications received prior to the first Tuesday in January will be considered for our Supercomputer Institute. You can find information on that program here: may send any questions to
Required Application Materials (for all intern levels - do not delete):
+ Current resume
+ Current official transcripts
+ Personal statement of interest (not to exceed one page)
Due to federal restrictions contained in the current National Defense Authorization Act, citizens of the People's Republic of China, the Islamic Republic of Iran, the Democratic People's Republic of North Korea, and the Russian Federation, who are not Lawful Permanent
Residents ("green card" holders) are prohibited from accessing facilities that support the mission, functions, and operations of national security laboratories and nuclear weapons production facilities, which includes Los Alamos National Laboratory.
**Where You Will Work**
Located in Northern New Mexico, Los Alamos National Laboratory (LANL) is a multidisciplinary research institution engaged in strategic science on behalf of national security. LANL enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns. Our generous benefits package includes:
+ PPO or High Deductible medical insurance with the same large nationwide network
+ Dental and vision insurance
+ Free basic life and disability insurance
+ Paid childbirth and parental leave
+ Award-winning 401(k) (6% matching plus 3.5% annually)
+ Learning opportunities and tuition assistance
+ Flexible schedules and time off (PTO and holidays)
+ Onsite gyms and wellness programs
+ Extensive relocation packages (outside a 50 mile radius)
**Additional Details**
Directive 206.2 - Employment with Triad requires a favorable decision by NNSA indicating employee is suitable under NNSA Supplemental Directive 206.2 ( . Please note that this requirement applies only to citizens of the United States. Foreign nationals are subject to a similar requirement under DOE Order 142.3A.
No Clearance: Position does not require a security clearance. Selected candidates will be subject to drug testing and other pre-employment background checks.
New-Employment Drug Test: The Laboratory requires successful applicants to complete a new-employment drug test and maintains a substance abuse policy that includes random drug testing. Although New Mexico and other states have legalized the use of marijuana, use and possession of marijuana remain illegal under federal law. A positive drug test for marijuana will result in termination of employment, even if the use was pre-offer.
Internal Applicants: Regular appointment employees who have served the required period of continuous service in their current position are eligible to apply for posted jobs throughout the Laboratory. If an employee has not served the required period of continuous service, they may only apply for Laboratory jobs with the documented approval of their Division Leader. Please refer to Policy Policy P701 ( for applicant eligibility requirements.
Incentive Compensation Program: For general program information refer to the Student Programs web page: Opportunity: Los Alamos National Laboratory is an equal opportunity employer. All employment practices are based on qualification and merit, without regard to protected categories such as race, color, national origin, ancestry, religion, age, sex, gender identity, sexual orientation, marital status or spousal affiliation, physical or mental disability, medical conditions, pregnancy, status as a protected veteran, genetic information, or citizenship within the limits imposed by applicable federal, state and local laws and regulations.
The Laboratory is also committed to making our workplace accessible to individuals with disabilities and will provide reasonable accommodations, upon request, for individuals to participate in the application and hiring process. To request a disability accommodation, email or call , opt. 3.
Instructions on How to Activate/Create a LANL Jobs Account:
Follow the instructions below if you have ever had an employee Z number, been a contractor, or received Los Alamos Lab insurance coverage to activate your account:
+ Select the Click Here button if you have been employed with the Lab or received insurance coverage.
+ Please enter only your first and last name and current email address (an email with your validation code will be sent to you) to activate the account currently in our system.
+ Enter your validation code as described in the email you receive and complete the 3-page registration form. Your account is now active, and you can apply for jobs or save to your basket. Important: Enter the validation code within 15 days to activate your account or your account will be deactivated.
Follow the instructions below if you if you have never been employed with the Lab or received insurance coverage to create an account:
+ Select the Register button if you have never been employed with the Lab or received insurance coverage to Create an Account.
+ From here, you will establish an account with username and password.
How to Apply: Login to Your Account to Complete the Application Process
+ Click the Vacancy Name number (in blue) to view any job's details.
+ Click Apply or Add to Basket to apply later. Tip: To apply for a job or save your basket, you must have a LANL jobs account.
If you experience any technical issues, please email for assistance.

View Now

Senior High Performance Computing System Administrator

10261 New York, New York Icahn School of Medicine at Mount Sinai

Posted today

Tap Again To Close

Job Description

Roles & Responsibilities:

The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD/PhD-level support for researchers. The group is composed of a high-performance computing team, the research clinical data warehouse team and a research data services team.

The Senior HPC Administrator, High Performance Computational and Data Ecosystem , is responsible for a computational and data science ecosystem for researchers at Mount Sinai. This ecosystem includes high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. To meet Sinai’s scientific and clinical goals, the Senior Administrator has a good technical understanding for computational, data and software development systems along with a strong focus on customer service for researchers. The HPC Senior Administrator is an expert troubleshooter and productive team member and leads projects to effective and efficient completion independently under little to no supervision. This position reports to the Director for Computational & Data Ecosystem in Scientific Computing. Specific responsibilities are listed below.

Responsibilities

Design, deploy and maintain Scientific Computing’s computational and data science ecosystem including ~30,000 cores with high bandwidth, low latency interconnects, GPUs, large shared memory nodes, databases, scientific workflows and 30+ petabytes of storage in production, clinical data warehouse and software development environment.
Lead the troubleshooting, isolation and resolution of all technical issues including application, system, hardware, software, and network). Actively monitors the systems.
Maintains, tunes and manages computational, data, cloud technologies and workflow systems for ISMMS researchers, scientists and their external collaborators. Defines and deploys a comprehensive computational and data vision. Identifies and communicates system advantages/disadvantages and tradeoffs.
Designs, develops, implements system administration tasks, including hardware and software configuration, configuration management, system monitoring (including the development and maintenance of regression tests), usage reporting, system performance (file systems, scheduler, interconnect, high availability, etc.), security, networking and metrics, etc.
Collaborates effectively with research and hospital system IT, compliance, HIPAA, security and other departments to ensure compliance with all regulations and Sinai policies.
Participates in the integration of HPC resources with laboratory equipment such as sequencers, clinical and research data resources and systems, etc. Incorporate and link data and compute resources.
Researches, deploys and optimizes resource management and scheduling software and policies and actively monitoring. Designs, tunes, manages and upgrades parallel file systems, storage and data-oriented resources.
Researches, deploys and manages security infrastructure, including development of policies and procedures.
Maintain all necessary aspects of HPC in accordance with best practices. Develops and implements backup policies.
Prepares and manages budgets for hardware, software and maintenance. Participates in chargeback/fee recovery analysis and provides suggestions to make operations sustainable.
Assists in developing and writing system design for research proposals. Creates and provides clear documentation.
Works effectively and productively with other team members within the group and across Mount Sinai.
Performs related duties as assigned or requested.
Provides after hours support for critical system and production issues.
Answers and resolves user tickets.

Qualifications:

Bachelor's degree in computer science, engineering or another scientific field. Master's or PhD preferred
8+ years (higher preferred) of progressive HPC system administration and operations (preferably in a Redhat/CentOS Linux administration, Batch HPC cluster environment)
Must be an expert troubleshooter; Must be a team player and customer focused
Experience with job scheduler such as LSF or Slurm and parallel file systems and storage
Experience with networking and security
Experience with configuration management systems such as xCAT, Puppet and/or Ansible
Experience of databases and web services
Experience in Infiniband, Gigabit Ethernet
Experience in an academic or research community environment
Script and programming experience
Experience with Cloud Computing
Ability to multitask effectively in a dynamic environment
Excellent communication skills, analytical ability, strong judgment and management skills, and the ability to work effectively as a liaison between both research and technology teams.
Strong written, oral, and interpersonal communication skills

Preferred Experience

Advanced degree
Experience with GPFS, LSF, TSM, IB and ethernet networking
Experience with databases and web services is highly preferred

Strength through Unity and Inclusion

The Mount Sinai Health System is committed to fostering an environment where everyone can contribute to excellence. We share a common dedication to delivering outstanding patient care. When you join us, you become part of Mount Sinai’s unparalleled legacy of achievement, education, and innovation as we work together to transform healthcare. We encourage all team members to actively participate in creating a culture that ensures fair access to opportunities, promotes inclusive practices, and supports the success of every individual.

At Mount Sinai, our leaders are committed to fostering a workplace where all employees feel valued, respected, and empowered to grow. We strive to create an environment where collaboration, fairness, and continuous learning drive positive change, improving the well-being of our staff, patients, and organization. Our leaders are expected to challenge outdated practices, promote a culture of respect, and work toward meaningful improvements that enhance patient care and workplace experiences. We are dedicated to building a supportive and welcoming environment where everyone has the opportunity to thrive and advance professionally. Explore this opportunity and be part of the next chapter in our history.

About the Mount Sinai Health System:

Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 48,000 employees working across eight hospitals, more than 400 outpatient practices, more than 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time — discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it. Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients’ medical and emotional needs at the center of all treatment. The Health System includes more than 9,000 primary and specialty care physicians; 13 joint-venture outpatient surgery centers throughout the five boroughs of New York City, Westchester, Long Island, and Florida; and more than 30 affiliated community health centers. We are consistently ranked by U.S. News & World Report's Best Hospitals, receiving high "Honor Roll" status, and are highly ranked: No. 1 in Geriatrics, top 5 in Cardiology/Heart Surgery, and top 20 in Diabetes/Endocrinology, Gastroenterology/GI Surgery, Neurology/Neurosurgery, Orthopedics, Pulmonology/Lung Surgery, Rehabilitation, and Urology. New York Eye and Ear Infirmary of Mount Sinai is ranked No. 12 in Ophthalmology. U.S. News & World Report’s “Best Children’s Hospitals” ranks Mount Sinai Kravis Children's Hospital among the country’s best in several pediatric specialties. The Icahn School of Medicine at Mount Sinai is ranked No. 11 nationwide in National Institutes of Health funding and in the 99th percentile in research dollars per investigator according to the Association of American Medical Colleges. Newsweek’s “The World’s Best Smart Hospitals” ranks The Mount Sinai Hospital as No. 1 in New York and in the top five globally, and Mount Sinai Morningside in the top 20 globally.

Equal Opportunity Employer

The Mount Sinai Health System is an equal opportunity employer, complying with all applicable federal civil rights laws. We do not discriminate, exclude, or treat individuals differently based on race, color, national origin, age, religion, disability, sex, sexual orientation, gender, veteran status, or any other characteristic protected by law. We are deeply committed to fostering an environment where all faculty, staff, students, trainees, patients, visitors, and the communities we serve feel respected and supported. Our goal is to create a healthcare and learning institution that actively works to remove barriers, address challenges, and promote fairness in all aspects of our organization.

View Now

Senior High Performance Computing System Administrator

10261 New York, New York Icahn School of Medicine at Mount Sinai

Posted 3 days ago

Tap Again To Close

Job Description

Senior HPC Administrator

The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai supports a cutting-edge high-performance computing and data ecosystem for researchers, including the HPC systems, clinical research databases, and software development infrastructure. The Senior HPC Administrator is responsible for a computational and data science ecosystem that enables Mount Sinais scientific and clinical goals, combining technical expertise with strong customer service for researchers. This role reports to the Director for Computational & Data Ecosystem in Scientific Computing and leads projects to completion with little to no supervision.

Base pay range

$120,000.00/yr - $180,060.00/yr

Responsibilities

Design, deploy and maintain the scientific computing ecosystem, including ~30,000 cores with high bandwidth, low latency interconnects, GPUs, large shared memory nodes, databases, scientific workflows and 30+ petabytes of storage in production, clinical data warehouse and software development environment.
Lead troubleshooting, isolation and resolution of technical issues (application, system, hardware, software, and network); actively monitor systems.
Maintain, tune and manage computational, data, cloud technologies and workflow systems; define and deploy a comprehensive computational and data vision. Communicate system advantages/disadvantages and tradeoffs.
Design, develop and implement system administration tasks including hardware/software configuration, configuration management, system monitoring (regression tests), usage reporting, performance, security, networking and metrics.
Collaborate with research and hospital IT, compliance, HIPAA, security and other departments to ensure regulatory and policy compliance.
Integrate HPC resources with laboratory equipment and data resources; link data and compute resources.
Research, deploy and optimize resource management and scheduling software; design, tune, manage and upgrade parallel file systems, storage and data-oriented resources.
Develop and manage security infrastructure, including policies and procedures.
Maintain HPC operations following best practices; develop and implement backup policies.
Prepare and manage budgets for hardware, software and maintenance; participate in chargeback/fee recovery analyses and provide operational sustainability recommendations.
Contribute to system design for research proposals; create and maintain clear documentation.
Work effectively with team members and across Mount Sinai; provide after-hours support for critical system and production issues; respond to user tickets.

Qualifications

Bachelor's degree in computer science, engineering or a related scientific field; Masters or PhD preferred
8+ years (higher preferred) of progressive HPC system administration and operations (Redhat/CentOS Linux, Batch HPC cluster experience)
Expert troubleshooter; strong teamwork and customer-focused mindset
Experience with job schedulers such as LSF or Slurm and with parallel file systems and storage
Experience with networking and security
Experience with configuration management systems (xCAT, Puppet and/or Ansible)
Experience with databases and web services; Infiniband and Ethernet networking
Experience in an academic or research environment
Scripting and programming experience; experience with cloud computing
Ability to multitask in a dynamic environment; strong communication, analytical and leadership skills

Preferred Experience

Advanced degree
GPFS, LSF, TSM, IB and Ethernet networking experience
Databases and web services experience

Equal Opportunity Employer

Mt. Sinai Health System is committed to fostering an environment where all faculty, staff, students, trainees, patients, and communities feel respected and supported, with opportunities for growth and development for all.

#J-18808-Ljbffr

View Now

Industry

View All High Performance Computing Jobs

Menu

Search Suggestions

Recent Searches

Popular Searches

Location Suggestions

Popular Locations

What People Ask

Nearby Locations

Other Jobs Near Me

Industry

138 High Performance Computing jobs in the United States

High Performance Computing Graduate Student

Job Description

Software Engineer, High Performance Computing

Job Description

Senior High Performance Computing Engineer

Job Description

High-Performance Computing (HPC) Engineer

Job Description

Software Engineer, High Performance Computing

Job Description

Program Director - High Performance Computing

Job Description

High Performance Computing Undergraduate Student

Job Description

Be The First To Know

High Performance Computing Graduate Student

Job Description

Senior High Performance Computing System Administrator

Job Description

Senior High Performance Computing System Administrator

Job Description

Nearby Locations

Other Jobs Near Me

Industry

Search Suggestions

Recent Searches

Popular Searches

Location Suggestions

Popular Locations

What People Ask

What skills are needed for high performance computing jobs? expand_more

What are the typical responsibilities in high performance computing roles? expand_more

Who are the top employers for high performance computing in the US? expand_more

What is the average salary for high performance computing jobs in the US? expand_more

What career paths are available in high performance computing? expand_more

Nearby Locations

Other Jobs Near Me

Industry

138 High Performance Computing jobs in the United States

High Performance Computing Graduate Student

Job Description

Software Engineer, High Performance Computing

Job Description

Senior High Performance Computing Engineer

Job Description

High-Performance Computing (HPC) Engineer

Job Description

Software Engineer, High Performance Computing

Job Description

Program Director - High Performance Computing

Job Description

High Performance Computing Undergraduate Student

Job Description

Be The First To Know

High Performance Computing Graduate Student

Job Description

Senior High Performance Computing System Administrator

Job Description

Senior High Performance Computing System Administrator

Job Description

Nearby Locations

Other Jobs Near Me

Industry