On Thursday 13 March, we hosted Harry Peaker, a senior researcher at Smart Data Foundry. Harry talked about both the challenges and best practices of managing coding environments for scientists, highlighting the importance of creating structured, reproducible environments that allow projects to be easily shared and maintained over time. By taking a proactive approach, scientists can ensure their work remains accessible and functional long after initial development.
Without careful setup, dependencies can become really difficult to manage, leading to issues when revisiting old projects or collaborating with other people. However, there are tools such as Docker for containerization or Conda for package management, both of which greatly help in maintaining consistency across different machines. Appropriate documentation and version control are also crucial in supporting long-term reproducibility. A well-documented code, combined with structured workflows and automation, reduces issues when onboarding new team members or revisiting past research.
However, effective management goes even beyond technical strategies, requiring a shift in mindset toward sustainable coding practices. Rather than treating environment management as an afterthought, scientists should integrate it into their workflow from the very start. The long-term perspective ensures that projects remain manageable and reproducible, preventing unnecessary frustration in the future. By prioritising clarity and structure, researchers can create coding environments that support not only their own work but also the broader scientific community.
Watch the video