Fermilab, CMS
Internship ID:
Internship Title:
CMS Computing Internship
Internship Type:
Other
Location:
Batavia, Illinois 60510-5011, United States of America
Subject Areas:
Appl Deadline:
2024/09/16 11:59PM (posted 2024/07/26, listed until 2025/01/26)
Description:
Description
Job DescriptionThe CMS “Data Management” team is looking for a highly motivated and skilled DevOps Engineer to join our team. In this role, you will play a crucial role in developing, operating and evolving the data management system of the Compact Muon Solenoid (CMS).
Come help us manage more than half an Exabyte of tape storage, 250 PB of disk, and about half a million CPU cores, provided by various data centers including High Performance Computers. We support the central operations of CMS. We are looking for one Research Intern employed by UW-Madison but located at Fermilab to operate the transfer system of CMS. On average, weekly about 10 PB is transferred between all CMS sites. The successful candidate would monitor the system, maintain its health and debug transfer problems together with site admins and transfer system developers. The successful candidate would be responsible for ensuring a smooth operation by developing and optimizing tools and monitoring the transfers using the RUCIO data management system. The successful candidate would receive training and constant technical advice and would be working with development teams of other parts of the CMS computing infrastructure.
The Compact Muon Solenoid (CMS) is an experiment that investigates a wide range of physics using the Large Hadron Collider (LHC) at CERN in Geneva Switzerland. The CMS collaboration uses a large and distributed computing infrastructure to record, process and analyze CMS data and simulation at petabyte scales. Centrally, we ensure that data processing jobs are run correctly and their output safely stored. CMS Computing Operations manages these central workflows starting with processing freshly recorded data at CERN, to re-processing data and producing Monte Carlo simulations at worldwide distributed computing centers. CMS Computing Operations also maintains and manages large scale data movements between these different centers and collaborates with the admins of the more than 70 computing sites across the world to maintain and sustain functional services and resources used by central workflows and physicists performing analysis of detector data and simulations.
Computing systems used at and between the different sites are based on Grid technologies and use sophisticated software to achieve the required high scales of transfer volumes and processing throughput. State of the art programming technologies are used for the different systems. The processing infrastructure is based on distributed state-machines that handle job submission and tracking. The data transfer system is based on a highly performant agent infrastructure both centrally and at the individual computing centers. All systems use the latest database technologies, both local to the systems and distributed, both relational and NoSQL database systems. Central services use a hierarchical http cache infrastructure to provide access to alignment and calibration constants of the detector to all jobs running on the Grid. To check the performance of the sites, a sophisticated test suite is probing the different services of each Grid site regularly and allows for early discovery and mitigation of problems. The uninterrupted service of all sites and central services is the basis for successful delivery of all centrally produced samples to the CMS collaboration and to guarantee analysis access for all physicists to produce high quality physics results. Expert operators are needed to continually monitor all central workflows, transfers and site performance because most of the workflows are time critical and very important for the overall success of the CMS experiment. A good overview is shown in the following animation: http://cds.cern.ch/record/1541893?ln=en
- Experience with python and bash scripting
- Experience with Linux and shells
- Experience with software development and code design using git
- (Basic) Experience with SQL databases (Oracle knowledge is a plus)
- Knowledge of batch systems and mass storage systems is recommended
- Knowledge of GRID infrastructure and GRID technologies is recommended
- Experience of monitoring tools like Prometheus, Grafana and Opensearch
- Testing and troubleshooting distributed systems
Behavioral Requirements
- Good Communications Skills
- Working knowledge of English, both oral and written
- Ability to work independently
- Demonstrating flexibility: adapting quickly and resourcefully to shifting priorities and requirements.
- Working in teams: working well in groups and readily fitting into a team; participating fully and taking an active role in team activities.
- Learning and sharing knowledge: keeping up-to-date with developments in your own field of expertise and readily absorbing new information.
- Organizational skills
Details
This is an opening for a term Research Intern hired through University of Wisconsin Madison (UW-Madison) and stationed at the Fermi National Accelerator Laboratory in Batavia, Illinois, USA. The successful candidate is required to have a Bachelor's degree in engineering, computing, mathematics or natural sciences. The stay is limited to 1 year with the possibility for an extension for a second year.
The successful candidate is provided with:
- Salary as determined by experience/qualifications
- Benefits (e.g. health, dental and vision) are included.
- Travel to Fermilab and back to the home institute.
- Paid legal holidays as determined by UW-Madison.
- The UW-Madison visa office provides assistance to acquire a visa, if needed.
- Reasonable accommodations for qualified individuals with disabilities.
The University of Wisconsin is an Equal Opportunity and Affirmative Action Employer. We promote excellence through diversity and encourage all qualified individuals to apply.
Application Materials Required:
Submit the following items online at this website to complete your application:
- Cover letter
- Curriculum Vitae
And anything else requested in the description.
Further Info:
630-840-2929
Fermilab, MS205
PO Box 500
Batavia IL 60510-5011
USA
PO Box 500
Batavia IL 60510-5011
USA