Senior Site Reliability Engineer
Job Description
About us
We are Digital Science and we are advancing the research ecosystem.
We are a pioneering technology company, and our vision is of a future where a trusted and collaborative research ecosystem drives progress for all. We believe in better, open, collaborative and inclusive research. In creating the next generation of tools and working in partnership with the community we tackle some of the biggest challenges to research. In order to achieve our vision, we need innovative, inspiring and dynamic people to join our team. Want to join us?
Your new role
As our Senior Site Reliability Engineer, you will be part of our new AI Solutions Development team, within Central Technology.
The AI Solutions Development team supports the DS AI strategy through working closely with the AI Innovations and Product teams to develop AI functionalities centrally that could be plugged into the different Digital Science products.
As our Senior Site Reliability Engineer you will be focussed on supporting our Lead Software Architect to define our requirements and build and maintain our infrastructure, ensuring scalability, performance and security within a greenfield project.
What you’ll be doing
- Joining a new team within a new area of Digital Science, working on a greenfield project (AWS)
- Define infrastructure requirements and automation strategies.
- Build and maintain infrastructure using IaC principles.
- Set up and manage containerized environments.
- Establish and maintain CI/CD pipelines for automated deployments.
- Define, implement and configure monitoring and alerting systems for proactive issue resolution.
- Lead incident response and post-incident reviews.
- Collaborate with the Lead Software Architect on infrastructure design.
- Ensure system scalability, performance, and security.
What you’ll bring to the role
Essential
- Proficiency in scripting and automation (e.g., Bash, Python, Ansible).
- Cloud platform experience (AWS, GCP preferred).
- Infrastructure as Code (IaC) using Terraform.
- Containerization and orchestration (e.g., Docker, Kubernetes).
- Continuous integration and deployment (CI/CD) pipelines.
- Monitoring and logging tools (e.g., Prometheus, ELK stack).
- Incident management and response.
- Networking and security fundamentals
- An openness and willingness for continued learning and development, to build your AI and technical skills and capabilities
- Experience of working on a greenfield project
- The ability to work independently, create tasks and discuss these confidently with the team
Desired
We don’t need you to be an expert in AI but if you have experience in the following, it would be advantageous;
- An understanding of the different kinds of language models and their applications and how to communicate and interact with LLMs through APIs of major libraries (Hugging Face, PyTorch, …)
- A conceptual understanding of the complexities in bringing ML into a product (processing requests, cost estimates, model limitations)
- Some experience in Natural Language Processing and / or Machine Learning
- Background in Python
- Knowledge of the required libraries (Hugging Face Transformers, PyTorch)
- Knowledge in Vertex AI or Sagemaker
If you don’t have experience in AI, but have an openness to learn the required skills and build your AI capabilities, we would still like to hear from you.
Our vision and values
We invest in, nurture and support innovative businesses and technologies that make all parts of the research process more open, efficient and effective.
The talent we secure is fundamental to us achieving our vision and our growth plans. The values we live by are:
We are brave in the pursuit of better
We are collaborative and inclusive
We are always open-minded
We are from and for the community
We're an equal opportunity employer. All applicants will be considered for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status
About Digital Science
Digital Science is a technology company working to make research more efficient.
We invest in, nurture and support innovative businesses and technologies that make all parts of the research process more open and effective.
Our portfolio includes admired brands including Altmetric, Dimensions, Figshare, ReadCube, Symplectic, IFI Claims, Writefull, and Overleaf.
We believe that together, we can help researchers make a difference.