HPC Programming and Runtime Environment (HPC Engineer 1/2)
- Req. Number: IRC140710
- Organization : HPC-ENV/HPC Environments
- City, State: Los Alamos, New Mexico
Join the High Performance Computing Environments Group (HPC-ENV) at Los Alamos National Laboratory (LANL), where we operate some of the world's most advanced computing systems. The Programming & Runtime Environments (PRE) Team ensures reliable system software and testing environments that support:
- Large-scale modeling & simulation (e.g. physics, engineering, climate)
- Artificial Intelligence & Machine Learning (AI/ML)
- Emerging Software for next-generation computing needs
Team Responsibilities
The PRE team works on the LANL HPC clusters as non-admin users to ensure that the computing resources are available and performant for the scientists and programs. The team's efforts include:
- Providing and supporting stable programming environments that enable the building, debugging, and efficient execution of scientific applications on our platforms.
- Deploying and running test suites to ensure that the clusters are fully operational from a user perspective, as well as measuring and monitoring cluster performance.
- Providing additional software services for users through dedicated Linux servers, largely to enable users to more easily build and maintain their applications.
We emphasize automation, modern tools, and close collaboration with users and other HPC groups.
HPC Engineer 1 ($94,500 - $154,600)
Responsibilities:
- Install, support and maintain software used on HPC & AI resources at the LANL, largely using a containerized, CI/CD driven workflow.
- Develop and maintain the tests and infrastructure to ensure the reliability and performance of HPC clusters.
- Configure and maintain Linux-based support servers using configuration management tools.
- Contribute to technical projects - typically with the goal of providing improved support, automation, and capabilities in the above areas.
- Collaborate across HPC teams and groups to accomplish tasks.
- Contribute to technical presentations and reports for conferences and meetings.
- Document technical contributions.
- Provide technical support to users - address user issues and requests via tickets.
- Participate in on-call rotation (during normal business hours), and occasional after hours support to ensure cluster health and uptime.
- Develop and enhance workflows enabling programming environment deployments and maintenance.
HPC Engineer 2 (104,100 - $172,200)
Additional responsibilities at level HPC Engineer 2:
- Work closely with software vendors to provide support for third-party packages.
- Provide technical leadership of projects to improve support, automation and capabilities for fulfilling team responsibilities.
- Actively communicate with users to find and provide centralized solutions to issues.
- Research, evaluate, and recommend new software tools and packages.
- Provide tool education opportunities to users of LANL's HPC platforms.
- May contribute to the development of novel custom tools, especially in the areas of software deployment and testing.
What You Need
Minimum Job Requirements:
- Linux knowledge and experience
- Programming experience (primarily Python, Bash, C/C++)
- Experience using open source build systems (make, autotools, cmake)
- Experience using containers in Linux
- Experience using CI/CD pipelines and jobs.
Additional Requirements for HPC Engineer 2:
- Significant experience in most of the minimum job requirements area.
- Experience with application compilation and linking
- Experience working with HPC clusters, MPI, and parallel programming models.
- Linux system administration experience, particularly using configuration management tools such as Ansible, Puppet or Chef.
Desired Qualifications:
- AI/ML Experience - AI/ML toolkits and workflows.
- Debugging & profiling tools such as GDB, NVIDIA NSIGHT, Totalview or Forge.
- Experience using Spack - Building and troubleshooting software builds via Spack.
- Experience with GPU programming
Education/Experience for HPC Engineer 1: Position Requires a bachelor's in Computer Science or Computer Engineering or a related field or equivalent combination of education and experience in a related field.
Education/Experience for HPC Engineer 2: Position Requires a bachelor's in Computer Science, Computer Engineering, or a related field, and 3 years of relevant experience in high performance computing or scalable AI computing, or data center environments or equivalent combination of education and experience in related field.
Work Location:
The work location for this position is hybrid and is located in Los Alamos. Hybrid is defined as working partially onsite/partially offsite but within 2 hours ground commute of this location. All work locations are at the discretion of management and can change at any time with appropriate notice. Current departmental policy requires a minimum of 60% on-site hours.
Position commitment:
Regular appointment employees are required to serve a period of continuous service in their current position in order to be eligible to apply for posted jobs throughout the Laboratory. If an employee has not served the time required, they may only apply for Laboratory jobs with the documented approval of their Division Leader. The position commitment for this position is 1 year.
Note to Applicants:
Due to federal restrictions contained in the current National Defense Authorization Act, citizens of the People's Republic of China-including the special administrative regions of Hong Kong and Macau-as well as citizens of the Islamic Republic of Iran, the Democratic People's Republic of Korea (North Korea), and the Russian Federation, who are not Lawful Permanent Residents ("green card" holders) are prohibited from accessing facilities that support the mission, functions, and operations of national security laboratories and nuclear weapons production facilities, which includes Los Alamos National Laboratory.
Where You Will Work
Located in beautiful northern New Mexico, Los Alamos National Laboratory (LANL) is a multidisciplinary research institution engaged in strategic science on behalf of national security. Our generous benefits package includes:
§ PPO or High Deductible medical insurance with the same large nationwide network
§ Dental and vision insurance
§ Free basic life and disability insurance
§ Paid childbirth and parental leave
§ Award-winning 401(k) (6% matching plus 3.5% annually)
§ Learning opportunities and tuition assistance
§ Flexible schedules and time off (PTO and holidays)
§ Onsite gyms and wellness programs
§ Extensive relocation packages (outside a 50 mile radius)
Additional Details
Directive 206.2 - Employment with Triad requires a favorable decision by NNSA indicating employee is suitable under NNSA Supplemental Directive 206.2. Please note that this requirement applies only to citizens of the United States. Foreign nationals are subject to a similar requirement under DOE Order 142.3A.
Clearance: Q (Position will be cleared to this level). Selected applicants will be subject to a background investigation conducted by or on behalf of the Federal Government, and must meet eligibility requirements* for access to classified matter. This position requires a Q clearance. and obtaining such clearance requires US Citizenship except in extremely rare circumstances. Dependent upon the position, additional authorization to access classified information may be required, which may or may not be available to dual citizens. Receipt of a Q clearance and additional access authorization ultimately is a decision of the Federal Government and not of Triad.
*Eligibility requirements: To obtain a clearance, an individual must be at least 18 years of age; U.S. citizenship is required except in very limited circumstances. See DOE Order 472.2 for additional information.
New-Employment Drug Test: The Laboratory requires successful applicants to complete a new-employment drug test and maintains a substance abuse policy that includes random drug testing. Although New Mexico and other states have legalized the use of marijuana, use and possession of marijuana remain illegal under federal law. A positive drug test for marijuana will result in termination of employment, even if the use was pre-offer.
Regular position: Term status Laboratory employees applying for regular-status positions are converted to regular status.
Internal Applicants: Regular appointment employees who have served the required period of continuous service in their current position are eligible to apply for posted jobs throughout the Laboratory. If an employee has not served the required period of continuous service, they may only apply for Laboratory jobs with the documented approval of their Division Leader. Please refer to Policy Policy P701 for applicant eligibility requirements.
Equal Opportunity: Los Alamos National Laboratory is an equal opportunity employer. All employment practices are based on qualification and merit, without regard to protected categories such as race, color, national origin, ancestry, religion, age, sex, gender identity, sexual orientation, marital status or spousal affiliation, physical or mental disability, medical conditions, pregnancy, status as a protected veteran, genetic information, or citizenship within the limits imposed by federal, state, and local laws and regulations. The Laboratory is also committed to making our workplace accessible to individuals with disabilities and will provide reasonable accommodations, upon request, for individuals to participate in the application and hiring process. To request such an accommodation, please send an email to applyhelp@lanl.gov or call (505)-664-6947 opt. 3.