Los Alamos National Laboratory Computing Systems Tec 3/4/5 in Los Alamos, New Mexico

What You Will Do

This position will be filled at the CST-3, CST-4, or CST-5 level, depending on the skills of the selected candidate. Additional job responsibilities (outlined below) will be assigned if the candidate is hired at a higher level.

The High-Performance Computing Division (HPC) provides production high performance computing systems services to the Laboratory. The High Performance Computing Systems group has responsibility for the broad range of HPC platforms and infrastructure deployed within Laboratory HPC Data Centers.

HPC Systems receives, tests, integrates, supports, improves, and decommissions the full range of HPC equipment in support of a range of LANL supercomputing missions. The Division’s goal is to create an effective HPC environment in which scientists can be as productive as possible. Additionally, we support selected research activities that we deem important to our mission.

The High Performance Computing Support Team (HPCST) within HPC-SYS operates on a 24-hours-a-day, seven-days-a-week, 365-days-a-year work schedule. The selected candidate may work off shifts (graveyard and swing), weekends, and holidays as assigned, with minimum supervision. This position is considered essential personnel, and as such the incumbent will be required to report to work during inclement weather and winter closures.

The successful candidate should have a comprehensive understanding and wide application of technical principles, theories, and concepts in the field; will work independently and interactively with other technicians when participating in day-to-day operations and problem solving on various supercomputing systems; will provide technical solutions to a wide range of difficult problems; and will have work occasionally reviewed for soundness of judgment, overall adequacy, and accuracy.

Computing Systems Tec 3 (CST-3) ($49,200 - $76,000)

The successful candidate will perform the full spectrum of duties, including but not limited to:

Duties for the Computing Systems Tec 3 level position include:

  • Work off hours as assigned, an 8-hour shift every other weekend, holidays as assigned, on-call, and on short notice as needed.

  • Primary system administration on systems to address routine software issues.

  • Hardware support, which includes troubleshooting, diagnosis, and preventive maintenance of equipment in a multi-system environment on both secure and open networks. Assignments are usually focused while working within established priorities, procedures, processes, and requirements or specifications. Impact of work is usually limited to a well-defined area of a project or specific assignment.

  • Set up and monitor the operation of computer consoles and peripheral equipment.

  • Identify areas that need diagnostic tests to detect equipment malfunctions.

  • Monitor systems utilizing diagnostic tests, system messages, scripts and monitoring tools.

  • Respond to error messages, system crashes, equipment failures, and user concerns.

  • Identify failures and initiate proper recovery procedures.

  • Monitor all production application jobs.

  • Repair hardware failures to component level with minimal to no supervision.

  • Perform preventive maintenance tasks on assigned equipment.

  • Participate in the installation and integration of new systems.

  • Work closely with on-call system administrators to develop and implement innovative technical solutions to complex software problems.

  • Perform logistic duties, which include inventory control, shipping and receiving of spare parts.

  • Report hardware failures on vendor supported systems.

  • Maintain required status reports, reboot and metrics databases.

  • Perform trend analysis on hardware failures and repairs.

  • Collaborate with Group members, other HPC Group personnel and vendor representatives as required.

  • Join the 24x7 on-call rotation.

Computing Systems Tec 4 (CST-4) ($59,100 - $93,800)

In addition to the duties outlined above, the CST-4 will be required to:

  • Work closely with on-call system administrators to develop and implement innovative technical solutions to complex software problems.

  • Perform trend analysis on hardware failures and repairs.

  • Provide training and support to other Technicians on the team.

Computing Systems Tec 5 (CST-5) ($65,200 - $103,500)

In addition to the duties outlined above, the CST-5 will be required to:

  • Develop methods to streamline, improve, and automate processes and procedures.

  • Direct and evaluate the work of other members of the team.

  • Provide second-level technical on-call support during off hours, weekends, and holidays for High Performance Computing and Storage systems.

What You Need

Minimum Job Requirements:

  • Knowledge of complex heterogeneous computing systems.

  • Basic knowledge of UNIX, Linux, and/or Microsoft computer operating systems.

  • Basic experience in troubleshooting, diagnosing and repairing hardware failures to component level on servers.

  • Experience in troubleshooting and maintaining redundant array of independent disks (RAID) systems.

  • Experience with midrange and mainframe computers and use of job control languages to determine software or hardware issues and system performance.

  • Demonstrated experience working in a team environment.

  • Familiarity with ticketing and issue-tracking systems.

  • Ability to work on multiple projects at a time.

  • Effective communication (verbal and written).

  • Ability to partner with customers.

Additional Job Requirements for CST-4:

In addition to the Job Requirements outlined above, qualification at the CST-4 level requires:

  • Moderate knowledge of UNIX, Linux, and/or Microsoft computer operating systems.

  • Moderate experience in troubleshooting, diagnosing and repairing hardware failures to component level on servers.

  • Demonstrated experience in modifying scripts in various languages such as shell, Python, and Perl.

  • Considerable experience with midrange and mainframe computers and use of job control languages to determine software or hardware issues and system performance.

Additional Job Requirements for CST-5:

In addition to the Job Requirements outlined above, qualification at the CST-5 level requires:

  • Extensive knowledge of UNIX, Linux, and/or Microsoft computer operating systems.

  • Extensive experience in troubleshooting, diagnosing and repairing hardware failures to component level on servers.

  • Demonstrated project management and leadership skills.

  • Demonstrated ability to develop, implement, and maintain policies and procedures to troubleshoot, diagnose, and repair computer hardware.

  • Advanced skills in identifying the need for diagnostic tools to monitor and diagnose system failures.

  • Ability to translate technical terms into layperson's terms and present them to audiences outside the organization.

Desired Skills:

  • Experience working in a production HPC environment.

  • Familiarity with classified electronic media handling.

  • Ability to interpret facility monitoring error messages.

  • Experience in configuring, troubleshooting and repairing network switches.

  • An active Q clearance

  • An active SCI

Education:

CST-3: Position typically requires a high school diploma and two-to-four years of related experience, or an equivalent combination of education and experience. At this level, additional training, certification, and/or education may be desirable.

CST-4: Position typically requires a high school diploma and four-to-six years of related experience, or an equivalent combination of education and experience. At this level, additional training, certification, and/or education may be expected, including associate’s degrees and graduation from technical institutions.

CST-5: Position typically requires a high school diploma and four-to-six years of related experience, or an equivalent combination of education and experience. At this level, additional training, certification, and/or education may be expected, including associate’s degrees and graduation from technical institutions.

Additional Details:

Clearance: Q (Position will be cleared to this level). Applicants selected will be subject to a Federal background investigation and must meet eligibility requirements* for access to classified matter.

*Eligibility requirements: To obtain a clearance, an individual must be at least 18 years of age; U.S. citizenship is required except in very limited circumstances. See DOE Order 472.2 for additional information.

New-Employment Drug Test: The Laboratory requires successful applicants to complete a new-employment drug test and maintains a substance abuse policy that includes random drug testing.

Regular position: Term-status Laboratory employees applying for regular-status positions are converted to regular status.

Equal Opportunity:

Los Alamos National Laboratory is an equal opportunity employer and supports a diverse and inclusive workforce. All employment practices are based on qualification and merit, without regard to race, color, national origin, ancestry, religion, age, sex, gender identity, sexual orientation or preference, marital status or spousal affiliation, physical or mental disability, medical conditions, pregnancy, status as a protected veteran, genetic information, or citizenship within the limits imposed by federal laws and regulations. The Laboratory is also committed to making our workplace accessible to individuals with disabilities and will provide reasonable accommodations, upon request, for individuals to participate in the application and hiring process. To request such an accommodation, please send an email to applyhelp@lanl.gov or call 1-505-665-4444 option 1.

Where You Will Work

Located in northern New Mexico, Los Alamos National Laboratory (LANL) is a multidisciplinary research institution engaged in strategic science on behalf of national security. LANL enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns.

Location: Los Alamos, NM, US

Organization Name: HPC-SYS/High Performance Computing Systems

Job Title: Computing Systems Tec 3/4/5

Appointment Type: Regular

Req ID: IRC57704