Posted on: March 9, 2024

Senior Site Reliability Engineer

Job Description

About us

We are Digital Science and we are advancing the research ecosystem.

We are a pioneering technology company, and our vision is of a future where a trusted and collaborative research ecosystem drives progress for all. We believe in better, open, collaborative and inclusive research. By creating the next generation of tools and working in partnership with the community, we tackle some of the biggest challenges to research. To achieve our vision, we need innovative, inspiring and dynamic people to join our team. Want to join us?

Your new role

As our Senior Site Reliability Engineer, you will be part of our new AI Solutions Development team, within Central Technology.

The AI Solutions Development team supports the DS AI strategy by working closely with the AI Innovations and Product teams to develop AI functionalities centrally that can be plugged into the different Digital Science products.

As our Senior Site Reliability Engineer, you will be focussed on supporting our Lead Software Architect in defining our requirements and in building and maintaining our infrastructure, ensuring scalability, performance and security within a greenfield project.

What you’ll be doing

  • Join a new team within a new area of Digital Science, working on a greenfield project (AWS).
  • Define infrastructure requirements and automation strategies.
  • Build and maintain infrastructure using IaC principles.
  • Set up and manage containerized environments.
  • Establish and maintain CI/CD pipelines for automated deployments.
  • Define, implement and configure monitoring and alerting systems for proactive issue resolution.
  • Lead incident response and post-incident reviews.
  • Collaborate with the Lead Software Architect on infrastructure design.
  • Ensure system scalability, performance, and security.

What you’ll bring to the role

Essential

  • Proficiency in scripting and automation (e.g., Bash, Python, Ansible).
  • Cloud platform experience (AWS, GCP preferred).
  • Infrastructure as Code (IaC) using Terraform.
  • Containerization and orchestration (e.g., Docker, Kubernetes).
  • Continuous integration and deployment (CI/CD) pipelines.
  • Monitoring and logging tools (e.g., Prometheus, ELK stack).
  • Incident management and response.
  • Networking and security fundamentals.
  • Openness and willingness to keep learning and developing, building your AI and technical skills and capabilities.
  • Experience of working on a greenfield project.
  • The ability to work independently, create tasks and discuss these confidently with the team.

Desired

We don’t need you to be an expert in AI, but experience in the following would be advantageous:

  • An understanding of the different kinds of language models and their applications, and of how to interact with LLMs through the APIs of major libraries (Hugging Face, PyTorch, …)
  • A conceptual understanding of the complexities in bringing ML into a product (processing requests, cost estimates, model limitations)
  • Some experience in Natural Language Processing and / or Machine Learning
  • Background in Python
  • Knowledge of the required libraries (Hugging Face Transformers, PyTorch)
  • Knowledge of Vertex AI or SageMaker

If you don’t have experience in AI, but have an openness to learn the required skills and build your AI capabilities, we would still like to hear from you.

Our vision and values

We invest in, nurture and support innovative businesses and technologies that make all parts of the research process more open, efficient and effective.

The talent we secure is fundamental to us achieving our vision and our growth plans. The values we live by are:

We are brave in the pursuit of better

We are collaborative and inclusive

We are always open-minded

We are from and for the community

We're an equal opportunity employer. All applicants will be considered for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.

About Digital Science

Digital Science is a technology company working to make research more efficient.

We invest in, nurture and support innovative businesses and technologies that make all parts of the research process more open and effective.

Our portfolio includes admired brands such as Altmetric, Dimensions, Figshare, ReadCube, Symplectic, IFI Claims, Writefull, and Overleaf.

We believe that together, we can help researchers make a difference.
