From Messy Documents to Governed Knowledge: What Our Hackathon Revealed About AI Agents

Kamiel Temmerman

27 May 26

5min read

AI & Data Science

Most of an organization's knowledge lives outside structured systems: slide decks, meeting notes, contracts, feedback forms, and old project folders. This knowledge was the focus of a recent Datashift hackathon. We brought consultants together with a practical assignment: take content from previous editions of our annual bootcamp and turn it into structured, searchable, governed knowledge to help shape the next edition.

On paper, the task sounded simple. Feed the material to AI agents, let them scan and classify it, use the output to design the next curriculum faster. In practice, the exercise exposed a harder question: how do you make AI useful when the information it needs is messy and fragmented?

The capability was there, but the quality was not

The agents could store the material and find documents on a given topic. The harder challenge was classifying that material meaningfully and generating output actually useful for curriculum design. Without the right context, an agent may answer a question using an outdated document, reuse a slide that no longer reflects the company's position, or combine correct fragments into a response that sounds convincing but does not match how the organization works. The work does not disappear. It moves into review, correction, and risk management.

The missing piece: a semantic layer

Access to documents is not enough. Agents need a semantic layer that tells them how to interpret information, how to store it, and how to use it. That means defining the concepts that matter: learning objectives, trainers, target audiences, curriculum versions, and the rules that govern them. Which content is approved? Which is outdated, sensitive, or still under review?

This layer is used twice: when agents process the archive, and when they reason over that structured knowledge to generate new output. Documents stop being isolated files. They become part of an active knowledge base, connected to business context and governed by clear rules.

What we learned

Models and prompts are not enough. Agents need structure when they store information and that same structure when they reason over it later. Organizations do not need systems that generate fast answers from messy data. They need systems that turn scattered content into governed knowledge and use that knowledge to support better decisions and reliable output.

Subscribe to our newsletter

Read, learn, adapt, grow.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Blog

Keep reading

Eager to learn more? No worries: we’ve got you covered.

View all

Let’s make your data move

Unlock the full power of your data with Datashift’s end-to-end Data & AI expertise. Contact us and we’ll show you how your data can fuel smarter decisions, reduce costs, and grow your business.

Get in touch

Join our newsletter

Subscribe, read, learn, adapt, grow.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Our approach

Case studies

Unlocking Viewer Insights with a Scalable Data Platform

Predicting Platelet Demand: Transforming Healthcare Logistics with Time Series Forecasting

Building Practical AI Governance at KBC with Collibra

Data Products Need Funding. Not Projects.

dataMinds Connect 2026

From Messy Documents to Governed Knowledge: What Our Hackathon Revealed About AI Agents

The capability was there, but the quality was not

The missing piece: a semantic layer

What we learned

Subscribe to our newsletter

Keep reading

Let’s make your data move

Join our newsletter

Our approach

Case studies

Unlocking Viewer Insights with a Scalable Data Platform

Predicting Platelet Demand: Transforming Healthcare Logistics with Time Series Forecasting

Building Practical AI Governance at KBC with Collibra

Data Products Need Funding. Not Projects.

dataMinds Connect 2026

Subscribe to our newsletter

Keep reading

Data Products Need Funding. Not Projects.