Report: 37% of ML leaders say they don’t have the data needed to improve model performance

Hand of robot touching global network connection on customers

Image Credit: ipopba // Getty Images

We are excited to bring Transform 2022 back in-person July 19 and virtually July 20 - 28. Join AI and data leaders for insightful talks and exciting networking opportunities. Register today!

A new report by Scale AI uncovers what’s working and what’s not working with AI implementation, and the best practices for ML teams to move from just testing to real-world deployment. The report explores every stage of the ML lifecycle – from data collection and annotation to model development, deployment, and monitoring – in order to understand where AI innovation is being bottlenecked, where breakdowns occur, and what approaches are helping companies find success.

The report’s goal is to continue to shed light on the realities of what it takes to unlock the full potential of AI for every business and help empower organizations and ML practitioners to clear their current hurdles, learn and implement best practices, and ultimately use AI as a strategic advantage.

For ML practitioners, data quality is one of the most important factors in their success, and according to respondents, it’s also the most difficult challenge to overcome. In this study, more than one-third (37%) of all respondents said they do not have the variety of data they need to improve model performance. Not only do they not have variety of data, but quality is also an issue — only 9% of respondents indicated their training data is free from noise, bias and gaps.

The majority of respondents have problems with their training data. The top three issues are data noise (67%), data bias (47%) and domain gaps (47%).

Most teams, regardless of industry or level of AI advancement, face similar challenges with data quality and variety. Scale’s data suggests that working closely with annotation partners can help ML teams overcome challenges in data curation and annotation quality, accelerating model deployment. ML teams that are not at all engaged with annotation partners are the most likely to take greater than three months to get annotated data.

Event

Transform 2022

Join us at the leading event on applied AI for enterprise business and technology decision makers in-person July 19 and virtually from July 20-28.

This survey was conducted online within the United States by Scale AI from March 31, 2022, to April 12, 2022. More than 1,300 ML practitioners including those from Meta, Amazon, Spotify and more were surveyed for the report.

Read the full report by Scale AI.

VentureBeat's mission is to be a digital town square for technical decision-makers to gain knowledge about transformative enterprise technology and transact. Learn more about membership.

Report: 37% of ML leaders say they don't have the data needed to improve model p...

Report: 37% of ML leaders say they don’t have the data needed to improve model performance

Event

Recommend

优秀网络安全预测：近三分之一的国家将在三年内规范勒索软件响应

凝心聚力求突破跨越赶超谋复兴丨莲花健康隆重召开2022上半年营销会议

Car crash detection may soon shed its Google Pixel exclusivity

未来可期！飞力达将分红超2000万回馈投资者，高管增持股份显信心

Apple Expected to Increase iPhone 14 Pro Prices

易马达e换电完成C2轮数亿元融资，全面升级换电服务产品矩阵

Microsoft at Cloud Wars Expo: Redefining what’s possible for Industry

Lawrence Livermore’s “El Capitan” To Take AMD’s Instinct APU Mainstream

Project Arctic Means VMware Doesn’t Get Left Out In the Hybrid Cold

Best Jackpot Slot Games around Today

About Joyk