Position Classification Description

Position Class Code / Title: E3017 / HPC Systems Splst 3
Recruitment Tier: Tier 1
FLSA: Exempt
Grade: 15

This is a description of a Staff Position Classification. It is not an announcement of a position opening. To view positions that are currently accepting applications, please go to UNMJobs and Search Postings.

The following statements are intended to describe, in broad terms, the general functions and responsibility levels characteristic of positions assigned to this classification. They should not be viewed as an exhaustive list of the specific duties and prerequisites applicable to individual positions that have been so classified.

Summary

Provides senior-level systems programming and systems management functions for large-scale, high-performance computing systems. Responsibilities include the administration, integration, and maintenance of parallel high-performance computing systems, as well as the integration of other systems and peripherals, including advanced filesystems, enterprise storage systems, visualization environments, and networks. Independently accomplishes complex systems integration, deployment, and administration projects, advanced system performance analyses, problem resolution, and system security initiatives. Develops system management strategies, architectural assessments, and system tools and other software for the administration of production systems. Maintains current knowledge of state-of-the-art supercomputing architectures in order to provide guidance and consultation on future system procurements. Provides technical oversight and consultation for faculty and student researchers and technical staff users on the use of the high-performance computing platforms. Works closely with other staff and departmental entities to provide a comprehensive support infrastructure for a wide range of academic, commercial, and government users.

Duties and Responsibilities

  1. Provides advanced systems support for an academic supercomputing center, to include the installation, integration, and management of high performance computer systems, operating systems, peripherals, and system interfaces; monitors system usage; assures that the high performance computing complex is operating at optimal performance and reliability levels; additional duties include consulting, training, and the development and maintenance of systems documentation.
  2. Assumes a lead role in managing the hardware and systems software infrastructure and provides an effective, reliable, high performance, scalable computing environment.
  3. Oversees the configuration and tuning of batch queuing systems in a massively parallel production environment; collects and maintains parallel system utilization statistics; identifies and resolves computer system anomalies and operational problems; and provides systems support for electronic mail, name resolution, and file sharing services.
  4. Supervises junior-level systems personnel, to include assigning projects, reviewing procedures and results, and providing systems-level training; maintains effective problem resolution procedures.
  5. Periodically serves as lead analyst in a departmental subgroup when appropriate, integrating the work of staff members, each responsible for multiple projects.
  6. Maintains a comprehensive knowledge of state-of-the-art computing systems and peripherals; computer operating systems; and scalable, parallel architectures.
  7. Works with users and other computational professionals in evaluating user requirements, and in the configuration and deployment of computational resources.
  8. Works closely with computer hardware and software vendors to maintain a comprehensive understanding of industry trends and evolving technology.
  9. Provides consulting and technical support for marketing and outreach activities.
  10. Performs miscellaneous job-related duties as assigned.

Minimum Job Requirements

  • Bachelor's degree in a related Technical, Scientific, or Engineering discipline; at least 5 years of experience directly related to the duties and responsibilities specified.
  • Completed degree(s) from an accredited institution that are above the minimum education requirement may be substituted for experience on a year-for-year basis.

Knowledge, Skills and Abilities Required

  • Knowledge of high performance computing systems; scalable, parallel architectures; and basic aspects of the UNIX operating system.
  • Knowledge of advanced data storage technologies and high-speed network interfaces.
  • Ability to make complex technical design decisions involving software or hardware implementation strategies.
  • Ability to monitor system usage and performance statistics and to understand the impacts of operating system tuning parameters.
  • Working knowledge of one or more high-level programming languages such as C, C++, or Fortran.
  • Expert knowledge of one or more scripting languages such as csh, Bash, Perl, or Python.
  • Skill in the installation and configuration of operating systems and application software.
  • A comprehensive knowledge of Linux operating system internals and multiple high performance computing architectures.
  • Knowledge of advanced problem resolution procedures, testing and evaluation methods, programming tools, and system network security.
  • Ability to assist technical management and the Director in gathering user requirements and in planning and designing computer systems.
  • Ability to define proper methods and procedures for the integration, testing, and installation of system modifications.
  • Ability to work under pressure and meet firm deadlines when evaluating technical requirements and providing recommendations and solutions.
  • Ability to analyze complex problems, interpret operational needs, and develop integrated, creative solutions.
  • Effective verbal and written communication skills.

Distinguishing Characteristics

    Position requires: a) using independent judgment on tasks and problem solving of a complex nature; b) serving as project leader on sizeable projects; c) deploying a large parallel configuration and system software components; d) demonstrated depth of understanding of specialty area; e) developing new ideas and guiding the organization in technology; f) serving as mentor to other high-performance computing systems engineers and technical staff.

Working Conditions and Physical Effort

  • Light physical effort. Requires handling of average-weight objects up to 10 pounds or some standing or walking. On average, effort applies to no more than two (2) hours per day.
  • No or very limited exposure to physical risk.
  • Work is normally performed in a typical interior/office work environment.

The University of New Mexico provides all training required by OSHA to ensure employee safety.

Revised Date: 03/20/2017