We are looking for candidates with a technical background who will become part of the Operations Department of the Centre.
The funding for these actions/fellowships and contracts comes from the European Union Recovery and Resilience Facility - Next Generation, within the framework of the General Invitation by the public business entity Red.es to participate in the talent attraction and retention programs within Investment 4 of Component 19 of the Recovery, Transformation, and Resilience Plan.
For more information, please check: https://www.bsc.es/join-us/excellence-career-opportunities/ai4s
Key Duties
Installation, maintenance, update and resolution of issues related to IT services of the centre (mail, web, databases, servers, etc.)
Configuration and administration of the different storage subsystems and backup system.
Configuration and administration of the BSC HPC supercomputing resources.
Configuration and administration of BSC cloud platforms (OpenStack, OpenNebula and ovirt ).
Configuration and administration of BSC AI platforms.
Requirements
Education
Degree/Master's degree in Computer Sciences or similar field.
Essential Knowledge and Professional Experience
Knowledge and experience in system administration of HPC Linux platforms (4 years minimum)
knowledge and experience in system administration of distributed file systems like GPFS (IBM Storage Scale) or lustre
Knowledge and experience in system administration of cloud platforms like openstack/opennebula
Additional Knowledge and Professional Experience
Experience with tools like Kubernetes, Docker Swarm, or Apache Mesos for container orchestration and resource management
Experience with GPU clusters, including tools like Nvidia Docker, CUDA, cuDNN, and managing NVIDIA GPUs in a clustered environment
Knowledge of AI/ML frameworks like TensorFlow, PyTornch, Nvidia Megatron. Understanding of its deployment and management in a cluster environment
Familiarity with docker and managing AI/ML containers with docker
Knowledge of object storage systems like Amazon S3, MinIO, or similar technologies
Competences
Initiative, responsibility and good organizational skills
Analytical problem-solving skills
Availability to travel and assist with project events/workshops
Conditions
The position will be located at BSC within the Operations Department
We offer a full-time contract (37.5h/week), a good working environment, a highly stimulating environment with state-of-the-art infrastructure, flexible working hours, extensive training plan, restaurant tickets, private health insurance
Duration: 4 years
Holidays: 23 paid vacation days plus 24th and 31st of December per our collective agreement
Salary: 50.000,00€
Additional Expenses Grant: Each fellowship will be associated with a grant for additional expenses, such as IT equipment, travel, training, stays, etc.
Starting date: asap - the incorporation for this vacancy must be before the 16th of December 2024