Software Engineering for Data Scientists

Feb 20 - 21, 2020 · London, United Kingdom

This is a two-day on-site course held in Central London, near London Bridge Station, in a small group of circa 10 people. We'll focus on developing a standard process from R&D through to production, backed by code reviews, documentation, refactoring, unit tests and a Notebook-based git process. This course is based on Ian's hard-won experience from client engagements and is aimed at getting you more reliably over the line with successful working code.

The photo above is from a sister course - this gives you an idea of the room layout and what students look like after a full day's learning with me.

"Ian's expertise and experience were essential to the development of our fledgling data science team. He played a fundamental role in defining our ways of working and mentoring junior members, as well as leading multiple projects with internal clients. I can't recommend working with him highly enough!" - Cyprian Rangel, Data Science Product Manager at QBE Insurance

This course is aimed at any Pythonic data scientist who:

  • Wants more confidence that their code runs correctly during deployment to avoid downtime and friction with business colleagues
  • Needs to learn more about testing and debugging
  • Wants ideas on routes to deployment and solutions to post-deployment changing data
  • Wants to collaborate with confidence with other technical team members to raise the team's long-term velocity

    During the course we'll:

    • Develop defensive coding practices in our Notebook which double up as documentation
    • Refactor Notebook code into modules for reuse and increased trust
    • Add unit tests to our modules, both to increase trust and to integrate with a Continuous Integration pipeline
    • Review a git process that uses nbdime for collaboration on Jupyter Notebooks
    • Practice code reviews backed by a documented process you can take back to your team
    • Review a standard research-to-deployment process using cookiecutter
    • Discuss how to sell these new techniques to team members and senior staff to get critical buy-in to see change occur after the course
    • Look at how "traditional" software engineering and "data science engineering" differ to highlight process differences that your software engineering colleagues probably haven't seen
    • Write useful documentation in our code to improve future support
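
As a rough illustration of the first three points above (a sketch under assumptions, not material taken from the course), the example below shows Notebook-style assertions acting as executable documentation once the code has been moved into a module, followed by tests written in the style that pytest collects and that a Continuous Integration pipeline could run. The function name `price_per_unit` and its arguments are hypothetical and chosen only for illustration; plain lists are used to keep the sketch self-contained.

```python
from typing import List


def price_per_unit(prices: List[float], units: List[int]) -> List[float]:
    """Divide each price by its unit count (hypothetical example function)."""
    # Defensive checks: fail early and document what the function expects.
    assert len(prices) == len(units), "prices and units must be the same length"
    assert all(u > 0 for u in units), "units must be strictly positive"

    result = [p / u for p, u in zip(prices, units)]

    # Post-condition: one output value per input row.
    assert len(result) == len(prices)
    return result


def test_price_per_unit():
    # pytest-style test: pytest collects functions whose names start with test_.
    assert price_per_unit([10.0, 9.0], [2, 3]) == [5.0, 3.0]


def test_price_per_unit_rejects_zero_units():
    # The defensive check should reject a zero unit count.
    try:
        price_per_unit([10.0], [0])
    except AssertionError:
        return
    raise RuntimeError("expected an AssertionError for zero units")
```

Saved in a module and imported back into the Notebook, a function like this can be reused across analyses, while the tests give reviewers a quick way to confirm the documented behaviour still holds.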

    "One of the highlights from Ian’s Successfully Delivering Data Science Projects course [note - this is the sister course to this one] was being introduced to the concept of a specialised project specification document. This provides a systematic framework to directly tackle numerous problems I have experienced when trying to move a project beyond an initial prototyping stage. I have now applied my own tailored specification document at my organisation and it immediately surfaced critical questions and issues that otherwise would not have been realised for months." - Thomas Brown, Data Scientist at aire.io

    "Ian coached our team when we needed some extra technical firepower, and provided that in spades. He slipped into a role providing technical leadership to a new bunch of people, and energised every project to which he contributed. He also straightened our path towards best practice, with a combination of good sense and business experience, for which generations of my team will be grateful." - Alice Jacques, Senior Data Scientist at Channel 4

    After the course you'll:

    • Have a working cookiecutter layout to demonstrate all of the processes to your team
    • Take home a practical guide for code reviews to significantly improve your team's code quality and overall velocity
    • Have gained answers to the questions you arrived with, so your personal blockers will be resolved
    • Have a plan for new tools and processes to introduce at work to make your team more efficient
    • Have access to our Slack channel to continue the conversation with classmates and to download any shared material. You'll also be able to see conversations from previous courses and collaborate with past and current students
    • Receive a Certificate of Professional Development

    You have some prior experience with Python and data science tools like Jupyter Notebooks and Pandas. You might be a data scientist of any level or a hands-on data science team lead. You might be a junior software engineer in a data science team who needs to understand data science processes.

    This course is not aimed at first-time Python users who want to learn about software engineering and data science. It is aimed at practitioners who have worked on a data science project and who want to get better results, faster and with more confidence. This course is not aimed at non-technical project managers; it is very hands-on.

    You'll need your laptop with Anaconda installed with Python 3.7+. I'll check in with you before the course to make sure you have everything you need installed. You can also arrive early and install before the class starts.

    If you have questions please contact Ian@MorConsulting.com. Ian's blog has a long list of past public talks and videos. Ian also has a training email announce list if you'd like advance notice of future events. Read many glowing testimonials about Ian Ozsvald's work with past clients on LinkedIn.


    Prospero House is a dedicated training facility a 5-minute walk south of London Bridge station, opposite Borough tube station (on the Northern Line). It is easily reached from King's Cross, Paddington and Victoria. The venue has tea, coffee and snacks all day, a light continental breakfast for early arrivals, natural daylight and air conditioning.

    Lunch is included in the price. Options include vegetarian, meat and fish, and other diets may be accommodated (please contact me if you have dietary requirements). I've visited the venue and eaten their lunch - this location is ideal for a great day's training.

    Refund and change policy:

    Please be sensible when booking any accommodation or travel and get refundable tickets. In the unlikely event that we have to cancel this event we are not responsible for any expenses you've incurred. Make sure your hotel or travel is refundable, just in case.

    Eventbrite offers you a 100% refund if you cancel up to 7 days before the event (I'd happily talk to you about why you need to cancel to see if we can figure something out). I can move a ticket to another colleague, just get in touch. In the extremely unlikely event that we have to cancel the course, you will be offered either a 100% refund or the option of moving attendance to another workshop. Mor Consulting Ltd and Ian Ozsvald are not liable for any other costs incurred, including travel and accommodation.

    Event organizers
    • Ian Ozsvald

      Ian is a Chief Data Scientist and Coach. He co-organises the annual PyDataLondon conference with 700+ attendees and the associated 9,000+ member monthly meetup. He runs the established Mor Consulting Data Science consultancy as Chief Data Scientist in London, gives conference talks internationally, often as keynote speaker, and is the author of the bestselling O'Reilly book High Performance Python (second edition in preparation). He has 16 years of experience as a senior technical leader, data scientist and
