The CMS “Data Management” team is looking for a highly motivated and skilled DevOps Engineer to join our team. In this role, you will help develop, operate, and evolve the data management system of the Compact Muon Solenoid (CMS) experiment.
Come help us manage more than half an exabyte of tape storage, 250 PB of disk, and about half a million CPU cores, provided by a variety of data centers including High Performance Computing facilities. We support the central operations of CMS. We are looking for one Research Intern, employed by UW-Madison but located at Fermilab, to operate the CMS transfer system, which moves about 10 PB between CMS sites in an average week. The successful candidate would monitor the system, maintain its health, and debug transfer problems together with site admins and transfer system developers, and would be responsible for ensuring smooth operation by developing and optimizing tools and by monitoring transfers with the Rucio data management system. They would receive training and continuous technical advice and would work with the development teams of other parts of the CMS computing infrastructure.
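As a rough, illustrative sketch of what such transfer monitoring can look like in practice (assuming the standard Rucio Python client and a configured Rucio environment; the account name and the printed fields are placeholders for this example, not CMS specifics), a check for stuck transfer rules might be as simple as:

    # Illustrative sketch only: list stuck Rucio replication rules for one account
    # and print their lock counts. Assumes a configured Rucio client environment
    # (rucio.cfg and valid credentials); the account name "transfer_ops" is a
    # placeholder.
    from rucio.client import Client

    client = Client()

    for rule in client.list_replication_rules(filters={"account": "transfer_ops"}):
        if rule["state"] != "STUCK":
            continue
        print(
            "rule {id}: {scope}:{name} -> {rse_expression} "
            "(ok={locks_ok_cnt}, replicating={locks_replicating_cnt}, "
            "stuck={locks_stuck_cnt})".format(**rule)
        )

In day-to-day operations such a check would feed into dashboards and follow-up with site admins rather than stand alone.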
The Compact Muon Solenoid (CMS) is an experiment that investigates a wide range of physics using the Large Hadron Collider (LHC) at CERN in Geneva, Switzerland. The CMS collaboration uses a large, distributed computing infrastructure to record, process, and analyze CMS data and simulation at petabyte scales. Centrally, we ensure that data processing jobs run correctly and that their output is safely stored. CMS Computing Operations manages these central workflows, from processing freshly recorded data at CERN to re-processing data and producing Monte Carlo simulations at computing centers distributed worldwide. It also maintains and manages large-scale data movements between these centers and collaborates with the admins of more than 70 computing sites across the world to sustain the functional services and resources used by central workflows and by physicists analyzing detector data and simulations.
Computing systems used at and between the different sites are based on Grid technologies and use sophisticated software to achieve the required high transfer volumes and processing throughput. State-of-the-art programming technologies are used for the different systems. The processing infrastructure is based on distributed state machines that handle job submission and tracking. The data transfer system is based on a high-performance agent infrastructure, both centrally and at the individual computing centers. All systems use the latest database technologies, local to the systems as well as distributed, relational as well as NoSQL. Central services use a hierarchical HTTP cache infrastructure to provide all jobs running on the Grid with access to the alignment and calibration constants of the detector.
To check the performance of the sites, a sophisticated test suite regularly probes the services of each Grid site, allowing early discovery and mitigation of problems. Uninterrupted operation of all sites and central services is the basis for delivering all centrally produced samples to the CMS collaboration and for guaranteeing analysis access so that all physicists can produce high-quality physics results. Expert operators are needed to continually monitor all central workflows, transfers, and site performance, because most of the workflows are time-critical and very important for the overall success of the CMS experiment. A good overview is shown in the following animation:
http://cds.cern.ch/record/1541893?ln=en
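As a rough, hypothetical illustration of the kind of regular service probe mentioned above (this is not the actual CMS test suite, and the endpoint URLs are placeholders), a minimal health check could look like this:

    # Hypothetical illustration only: probe a few HTTP service endpoints and flag
    # failures. The real CMS site tests are far more sophisticated, and the URLs
    # below are placeholders, not real CMS services.
    import urllib.error
    import urllib.request

    ENDPOINTS = {
        "storage-frontend": "https://storage.example-site.org/health",
        "http-cache": "https://cache.example-site.org/health",
    }

    def probe(name, url, timeout=10):
        """Return True if the endpoint responds successfully within the timeout."""
        try:
            with urllib.request.urlopen(url, timeout=timeout) as response:
                return 200 <= response.status < 400
        except (urllib.error.URLError, OSError) as exc:
            print(f"{name}: FAILED ({exc})")
            return False

    if __name__ == "__main__":
        results = {name: probe(name, url) for name, url in ENDPOINTS.items()}
        print("all probed services healthy" if all(results.values()) else "site degraded")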