Why Unstructured Data Is the Hidden Brake on Enterprise AI
Credit: Microsoft Copilot

IN THIS ISSUE: Only 27 percent of business leaders say their data, processes, and applications are well-connected enough to support AI—even as 94 percent agree that connected data is essential to making it work. This week’s column examines why unstructured content has become the biggest obstacle to the agentic enterprise, and why solving it is organizational work, not engineering work.

The Prompt

The promise of enterprise AI has never been louder. Agentic systems. Autonomous workflows. Decisions made at machine speed. But a new survey from Harvard Business Review Analytic Services and content management firm Hyland lands like a cold splash of water on that narrative. The real obstacle to AI adoption isn’t the technology—it’s the data underneath it.

In a survey of 325 business leaders, 94 percent of respondents say that well-connected data, processes, and applications are highly important to successful AI adoption. However, more than a quarter (27 percent) say those elements are currently well connected within their organization. That gap is a sign that enterprises have sprinted toward AI deployment without laying the groundwork to support it.

The culprit: unstructured data.

While 65 percent of respondents say their organization’s structured data—databases, spreadsheets, formatted records—is at least prepared for AI use, only 39 percent say the same about their unstructured data—the emails, PDFs, images, videos, and documents that make up the majority of the enterprise’s knowledge corpus. And yet, 81 percent admit that structured and unstructured data make AI systems more intelligent and trustworthy. They know what needs to happen. They just haven’t done it.

How organizations stand in preparing structured and unstructured data for AI adoption (December 2025). Credit: Harvard Business Review Analytic Services/Hyland
How organizations stand in preparing structured and unstructured data for AI adoption (December 2025). Credit: Harvard Business Review Analytic Services/Hyland

“As organizations move into the next phase of AI, the challenge is no longer just access to models, but whether the business is ready to operationalize AI in a way that is governed, contextual, and trusted,” Jitesh Ghai, Hyland’s chief executive, says in a statement. “For many organizations, unstructured data is both the most overlooked asset and the biggest obstacle to scaling AI effectively.”

It’s becoming an urgent problem that companies need to tackle.

According to Amy Machado, IDC’s senior research manager, many organizations are reviewing their content and data and recognizing that a key reason their AI initiatives fail is that their data isn’t AI-ready. “Many are going back to the basics—having accurate contextual content and data, which requires curation, guardrails, and an understanding of your data and content landscape,” she says in the report.

The gap between structured and unstructured readiness reflects years of organizational neglect. Structured data lives in systems purpose-built for querying. Unstructured data—the contracts, customer emails, scanned forms, meeting recordings—can be anywhere. It’s the institutional knowledge AI needs most, and it is the hardest to wrangle. A growing market of infrastructure startups, including Bem and Unstructured.io, has emerged specifically to tackle this pipeline problem, converting messy enterprise content into formats AI systems can actually consume. That a dedicated market now exists for this work is itself a sign of how widespread this problem has become.

The pressure to solve it is only going to intensify. It’s critical for organizations using agentic AI to ensure information flows seamlessly across systems, workflows are orchestrated, and governance is in place. Data can’t be kept in siloes. Yet despite software vendors’ hype about embracing the agentic enterprise and becoming frontier firms, nearly half of respondents say they’re still exploring or piloting agentic AI, with a mere 17 percent reporting that they’ve implemented it. This shows how far most organizations still are from turning the ambition of agentic AI into operational reality.

HBR advises that organizations take a “coordinated approach that strengthens the foundations of AI,” meaning preparing data and content for AI, integrating intelligence into business processes, and aligning leadership, culture, and skills around new ways of working.

Truth be told, more than a third of respondents (39 percent) say their organizations still favor separate, standalone tools instead of having AI embedded throughout their workflows. Only 12 percent report the contrary. When it’s viewed as an “add-on,” disconnected from the data and content it needs, it’s unsurprising to see that AI would underperform.

The report frames this as an operating model challenge as much as a technical one. Bridging the gap requires business leaders and IT to align on a shared data strategy that prioritizes common definitions, governance, and reuse across workflows. That is organizational work, not engineering work—and it’s what stalls in large enterprises because it does not belong to any single team.

Some executives have decided to take action to address this problem. Half of all respondents say their firms intend to prioritize data quality improvements over the next 12 months. Nearly half (48 percent) say enhancing data talent and skills is also a priority, followed by defining a clear data strategy and integrating data sources (45 percent each) and enhancing data management and governance (43 percent).

“Once you have AI-ready data, it opens up a whole new world of possibilities for the future,” Machado says. “It allows you to reimagine your processes. It allows you to optimize them. It allows you to create new products and new ways of working. But if you don’t have that data foundation, all that is out of reach.”

The bottom line: enterprises have invested significant time and resources in AI capabilities but have underinvested in AI readiness. For organizations eyeing agentic AI as the next frontier, the foundation problem can no longer be a side project. It’s now the whole project.


Don’t Miss out on ‘The AI Economy’


Today’s Visual Snapshot

The amount of venture capital invested in voice AI startups (excluding OpenAI and Anthropic) reached record highs in Q1 2026. Credit: Newcomer.co
The amount of venture capital invested in voice AI startups (excluding OpenAI and Anthropic) reached record highs in Q1 2026. Credit: Newcomer.co

This week’s chart comes from Eric Newcomer, founder of the newsletter that bears his name. Tracking voice AI investment since January 2022, it shows more than $7 billion flowing into the category in January 2026—the largest quarter on record. The figure is more notable considering that Q1 2026 doesn’t include funding for OpenAI or Anthropic. Some of the startups that raised large funding rounds include ElevenLabs ($180 million), Synthesia ($180 million), Runway ($315 million), LiveKit ($100 million), Deepgram ($130 million), and Decagon ($250 million)

Investors are betting that voice will become a primary way humans interact with AI, not just a feature within a chatbot like ChatGPT or Siri, but a standalone layer. In fact, according to Newcomer, the workplace is one area where voice recognition technology is gaining traction. Companies are starting to adopt AI to help them with contact centers and customer support, conduct outbound sales and lead qualification, handle collections and account management, power voice-driven copilots, support field service and frontline workers with hands-free tasks, and more. And if you thought $7 billion was big, then you’ll be surprised to learn that the global market is expected to grow to $22 billion this year, according to Mordor Intelligence. And that’s only the start, as it’s predicted to nearly triple in size over the next five years, reaching nearly $62 billion.


Quote This

“I think the limiting factor for agentic commerce is trust. I think when we all think about ourselves as buyers, and we all think about ourselves having agents go out and transact on our behalf, we are going to fall back on payment methods that we, as users, trust.”

— Visa CEO Ryan McInerney, on the company’s Q2 2026 earnings call, arguing that even once AI agents can reliably execute purchases on their own, adoption will hinge on whether consumers are willing to delegate financial authority to them.


This Week’s AI News

Industry Trends

Generative AI and Applications

Enterprise and Workforce

Retail and Commerce

Business, Marketing, and Funding

Hardware, Infrastructure, and Robotics

Science and Breakthroughs

Media and Entertainment

Policy, Safety, and Misinformation

Opinions and Analysis


End Output

💡 🤝 Did you like what you read in “The AI Economy?” Share it with your coworkers, friends, and family! Connect with me on LinkedIn and check out my blog for more insights and thoughts on business and technology.

🕵️‍♂️ 📨 Got a great idea for a story or have a juicy scoop? Feel free to shoot me a message here, or reach me securely on Signal at kenyeung.888.

✍️ 🚀 I also help companies translate innovation into storytelling that resonates. If your team needs a seasoned journalist to craft your next big narrative, I’m available for writing and content strategy projects.

🙏 📬 Thanks for reading, and be sure to subscribe to receive future editions. Stay curious!

Featured Image: Credit: Microsoft Copilot