How do we implement machine learning ops within our data tech stack?

ao link

Members

Contact

New to DataIQ?

Take our FREE data literacy indicator now

Unlock the power of data - take our FREE data literacy indicator now

In November, during the 2023 DataIQ Conference, Simon Case, head of data, Equal Experts, hosted a roundtable for data leaders to examine the implementation of machine learning operations within data tech stacks.

“While the excitement around gen AI grows, ‘classic’ AI machine learning (ML) is still creating great value for businesses and the DataIQ community still see it as a critical tool for extracting value from data in their organisations,” said Simon Case, head of data, Equal Experts. “We had full tables eager to discuss their challenges in implementing ML. There were great discussions from representatives of organisations at different stages of the ML journey, from someone starting their first ML team to mature ML developers deploying many models into production.”

Avoiding lock-in

When it comes to technology and new tools, it is easy to be swept up in the excitement and hype cycle of new developments, but there must be considerations about which platform(s) should be selected. For example, some tools are complex and require too many skills to be used effectively across an organisation.

“A challenge with selecting a commercial platform is that there is a risk of getting locked-in to vendors,” explained Simon. “Often, there are no easy ways to transfer models across platforms and the needs of the business will invariably evolve which risks outgrowing or moving away from the specialisms of the selected tool. Participants of the roundtable noted that the use of auto ML capabilities meant the lock-in risk was even higher.”

A few roundtable participants explained to the group that they had found tooling that worked for them, but they noted that the real challenges were the gaps between proof-of-concept and something fit for production. Businesses at different stages of their data maturity journey struggled with different aspects of the production lifecycle and this was partly down to these identifiable gaps that are left by different data platforms and tools. Most of the table agreed that data platforms are far more adept at the proof-of-concept aspect compared to the production portion.

Sourcing the data needed

Part of the issue implementing ML ops into a tech stack is being able to find and utilise the data sets required for success. This was a common hurdle faced by members of the roundtable as this task would usually be required at the start of a project when there was minimal capacity and an eagerness to prove proof of concept to decision makers.

There were calls from the roundtable to create data catalogues and dictionaries for internal use, as well as to improve the storytelling and data literacy capabilities of the team. Some felt they knew already where their data was located, with one business representative explaining how their organisation had been actively migrating from an on-premises system to a cloud platform. This transformation had taken over two years and was difficult, but because the data team were so involved in the process they know exactly where the data sets they require are and the lineage of the data.

A smaller, but by no mean insignificant hurdle faced by the group was that of team member churn. When an established member of the team left, there would often be a large knowledge gap in their absence that made tasks such as collating the right data slower. An issue found with higher churn is that the response times to problems are slower and the ability to fix the problems before they scale is reduced, and this is arguably something that ML will not be able to address without the human skills behind it.

Log in to read the entire article

Gain access to the entire article by logging in or registering for a free account here.

Did you find this content useful?

Thank you for your input

Thank you for your feedback

Next read

Key data leader challenges in 2024: Part one – Foundations

DataIQ’s Research Analyst, Rachael Pimblett, shares the findings on what data leaders feel will be their main challenges in the next year, presented in the first of a four-part article series.

Next read

Key data leader challenges in 2024: Part one – Foundations

30 Apr 2024by Rachael Pimblett

DataIQ’s Research Analyst, Rachael Pimblett, shares the findings on what data leaders feel will be their main challenges in the next year, presented in the first of a four-part article series.

A case of the AI biter bit?

23 Apr 2024by David Reed

DataIQ’s Chief Knowledge Officer and Evangelist, David Reed, examines the hype cycle around generative AI and the actual speed of transformation being seen.

Pioneering AI initiatives revealed: DataIQ Announces 2024 AI Awards Shortlist

15 Apr 2024by Alex Roberts

The shortlist for the 2024 DataIQ AI Awards has been unveiled, with the winners to be announced at the DataIQ Summit on May 21.

You may also be interested in

Data Literacy versus Data Culture – DataIQ’s view

DataIQ is a trading name of IQ Data Group Limited
10 York Road, London, SE1 7ND

We use cookies so we can provide you with the best online experience. By continuing to browse this site you are agreeing to our use of cookies. Click on the banner to find out more.

Cookie Settings