Announcing the CD Foundation MLOps SIG

By Animesh Singh (IBM), CD Foundation MLOps SIG Chair

Machine Learning (ML) is quickly becoming a key technology in a wide range of industries and organizations. Across the ML landscape, more and more models are being produced, but are they actually getting deployed?

The story of enterprise Machine Learning: “It took me 3 weeks to develop the model. It’s been >11 months, and it’s still not deployed.” @DineshNirmalIBM #StrataData #strataconf
— ginablaber (@ginablaber) March 7, 2018

The Data and AI lifecycle involves many steps, and in a lot of cases the process remains split across various teams, with manual handoffs at every single step. Besides slowing down the production and deployment of models, this also poses a strong challenge for traceability, governance and risk management.

This suggests that applying Continuous Integration and Continuous Delivery (CI/CD) to ML may increase the speed of deployments, but does it increase the quality? How do you know if you’re deploying models which are giving ethical, fair and unbiased predictions? Are the actions performed by the ML code verifiable? Can the code be maintained and tested rigorously? Are there clear guidelines for lineage tracking, metadata collection, experiment tracking, data versioning, ETL operations, and more?

MLOps SIG

To address these challenges, the MLOps SIG has been formed under the CD Foundation with the following goals:

1. MLOps Definition and Roadmap: Create a vision and roadmap for MLOps, covering what it means and how we envision its role within the CI/CD ecosystem. Terry Cox, a co-leader of the SIG, has done an excellent job of compiling a draft vision and roadmap for this.

2. Reference Architecture and Design Patterns: Create reference architectures, design patterns, implementations and processes for MLOps. One use case is integration between MLOps pipelines and Tekton, e.g. Kubeflow Pipelines working with Tekton as a backend. The SIG has already delivered significant results and has kicked off a project to start enabling some of these capabilities on top of Tekton. Some of the artifacts which have been produced are listed here:

3. AI Governance and Risk Management: Define architecture and guidelines around lineage tracking, metadata collection, experiment tracking, data versioning, ETL operations, etc., which a typical Data and ML pipeline should support to enable Ethical AI.

Communication

MLOps SIG communication happens via a public mailing list: https://lists.cd.foundation/g/sig-mlops

You can also join the discussion with us on Slack to collaborate.

Meetings

The MLOps SIG meets every other Thursday at 9:30 AM Pacific.

Meeting agendas, minutes, and documentation are here: https://github.com/cdfoundation/sig-mlops

Download this invitation to add the meeting to your calendar: https://zoom.us/meeting/u5Iqduutpj8o7fVIT1pePLk5wv4H9XpojQ/ics

Full details on agenda, members, meeting times, mailing list sign up, and more are available here: https://lists.cd.foundation/g/sig-mlops

All are welcome to join the mailing list and attend meetings. Please add your voice to this important new area in CI/CD development.