Pachyderm ❤️ Spark ❤️ MLFlow - Scalable Machine Learning Provenance and Tracking

Published
Author
Enrico Rotundo
Associate Data Scientist

This article shows how you can employ three frameworks to orchestrate a machine learning pipeline composed of an Extract, Transform, and Load step (ETL), and an ML training stage with comprehensive tracking of parameters, results and artifacts such as trained models. Furthermore, it shows how Pachyderm’s lineage integrates with an MLflow’s tracking server to provide artifact provenance.

Read more

Optimising Industrial Processes with Reinforcement Learning

Published
Author
Dr. Phil Winder
CEO

Winder.AI helped CMPC, a large paper milling company, to optimise their production process by using reinforcement learning. CMPC are now able to automate industrial processes that were previously manual. This case study describes our approach and the results.

Read more

Buildpacks - The Ultimate Machine Learning Container

Published
Author
Enrico Rotundo
Associate Data Scientist

Winder.AI worked with Grid.AI (now Lightning AI) to investigate how Buildpacks can minimize the number of base containers required to run a modern platform. A summary of this work includes: Researching Buildpack best practices and adapting to modern machine learning workloads Reduce user burden and reduce maintenance costs by developing Buildpacks ready for production use Reporting and training on how Buildpacks can be leveraged in the future The video below presents this work.

Read more

Save 80% of Your Machine Learning Training Bill on Kubernetes

Published
Author
Dr. Phil Winder
CEO

Winder.AI worked with Grid.AI to stress test managed Kubernetes services with the aim of reducing training time and cost. A summary of this work includes: Stress testing the scaling performance of the big three managed Kubernetes services Reducing the cost of training a 1000-node model by 80% The finding that some cloud vendors are better (cheaper) than others The Problem: How to Minimize the Time and Cost of Training Machine Learning Models Artificial intelligence (AI) workloads are resource hogs.

Read more

Using Reinforcement Learning to Attack Web Application Firewalls

Published
Author
Dr. Phil Winder
CEO

Introduction Ideally, the best way to improve the security of any system is to detect all vulnerabilities and patch them. Unfortunately this is rarely possible due to the extreme complexity of modern systems. One primary threat are payloads arriving from the public internet, with the attacker using them to discover and exploit vulnerabilities. For this reason, web application firewalls (WAF) are introduced to detect suspicious behaviour. These are often rules based and when they detect nefarious activities they significantly reduce the overall damage.

Read more

Helping Modzy Build an ML Platform

Published
Author
Dr. Phil Winder
CEO

Winder.AI collaborated with the Modzy development team and MLOps Consulting to deliver a variety of solutions that make up the Modzy product, a ModelOps and MLOps platform. A summary of this work includes: Developing the Open Model Interface Open-sourcing chassis, the missing link that allows data scientists to build robust ML containers Model monitoring and observability product features MLOps and model management product features The Problem: How to Build An ML Platform Modzy’s goal is to help large organizations orchestrate and manage their machine learning (ML) models.

Read more

How To Build a Robust ML Workflow With Pachyderm and Seldon

Published
Author
Enrico Rotundo
Associate Data Scientist

This article outlines the technical design behind the Pachyderm-Seldon Deploy integration available on GitHub and is intended to highlight the salient features of the demo. For an in depth overview watch the accompanying video on YouTube. Introduction Pachyderm and Seldon run on top of Kubernetes, a scalable orchestration system; here I explain their installation process, then I use an example use case to illustrate how to operate a release, rollback, fix, re-release cycle in a live ML deployment.

Read more

How We Built an MLOps Platform Into Grafana

Published
Author
Dr. Phil Winder
CEO

Winder.AI collaborated with Grafana Labs to help them build a Machine Learning (ML) capability into Grafana Cloud. A summary of this work includes: Product consultancy and positioning - delivering the best product and experience Design and architecture of MLOps backend - highly scalable - capable of running training jobs for thousands of customers Tight integration with Grafana - low integration costs - easy product enablement Grafana’s Need - Machine Learning Consultancy and Development Grafana Cloud is a successful cloud-native monitoring solution developed by Grafana Labs.

Read more

Improving Data Science Strategy at Neste

Published
Author
Dr. Phil Winder
CEO

Winder.AI helped Neste develop their data science strategy to nudge their data scientists to produce more secure, more robust, production ready products. The results of this work were: A unified company-wide data science strategy Simplified product development - “just follow the process” More robust, more secure products Decreased to-market time Our Client Neste is an energy company that focuses on renewables. The efficiency and optimization savings that machine learning, artificial intelligence and data science can provide play a key role in their strategy.

Read more
}