FutureScot
Cloud, Data & AI

From safe experiments to scaled impact: bridging the AI delivery gap 

Photograph: 3rdtimeluckystudio/Shutterstock.com

Across public services, AI activity is everywhere. Pilots are running, prototypes are being built and teams are exploring what’s possible. Importantly, this isn’t about replacing people, it’s about supporting them, helping teams work more efficiently, make better decisions, and focus on higher-value work. 

But very little of it is making the leap into production – and the reason for this isn’t that the tools aren’t good enough; it’s that the conditions surrounding it aren’t quite right.  

The organisations that are doing this effectively aren’t just experimenting more, they’re designing their experimentation so that it can scale. They’re operating with one eye on the now, and one on the future. 

The comfort, and risk, of the experimentation phase 

Experimentation is something that all organisation need to do more of in order to progress. Without it, innovation wouldn’t happen, services won’t improve, employees and citizens won’t have better jobs or lives. 

And when done well, experimentation creates a safe space that allows teams to: 

But that same safety can become a trap if there isn’t a clear purpose or the right conditions surrounding it. 

We’ve seen teams stuck in environments that are: 

The result? Promising pilots that can’t be deployed into operational services. 

Designing experiments that are built to scale 

If scaling is the goal, meaning moving experiments into live operational processes, it needs to shape decisions from the start. This doesn’t mean slowing down; it means being more intentional about how you move fast. 

We find that there are three shifts that make this difference: 

1. Start with a real opportunity 

Don’t just ask “Where can we try AI?” 

Ask: 

If you can’t describe the production and wider business context, you’re not testing the right thing. 

2. Build with real constraints in mind 

 In the public sector, experiments need to reflect the environments they are intended to operate in. This means designing with real operational constraints in mind from the start, rather than treating them as considerations to address later. 

That includes: 

The most valuable experiments aren’t the most impressive; they’re the ones that can realistically transition into live services. 

3. Measure what matters at scale 

Many pilots prove something works but far fewer prove it’s worth scaling. 

Shift your metrics from: 

Creating a pathway out of experimentation 

Moving into production isn’t a single step but a joined-up model that brings together business, technology and governance functions.  

Some of the key things to consider include: 

Cross-functional ownership 

Scaling AI isn’t just a technical exercise. This is something we talk about over and over again, but for good reason.  

The biggest failure mode for AI is lack of adoption. To get the best outcomes, AI requires product led multi-disciplinary teams: 

If these groups only engage at the end, scaling will stall.  Cross-functional ownership helps business stakeholders take responsibility, builds user trust, and integrates AI into daily operations instead of isolating it in labs. 

AI succeeds when it is owned by the business, not just built by technologists. 

A platform mindset 

Rather than rebuilding from scratch each time, we find that leading organisations create repeatable foundations: 

This is what turns isolated wins into scalable capability. 

One eye on the now, one on the future. 

This balance is the hard part. Move too fast and you create risk you can’t manage, but move too cautiously and you never realise value. 

The answer isn’t choosing one or the other. It’s finding the point where progress and control work together.  

This looks like: 

Not every new AI capability is ready for real use. Some will still be evolving, unproven, or driven by hype rather than value. The challenge is knowing when a technology is mature enough to test with intent, and when to wait. 

AI success doesn’t come from a single tool, pilot or agent demonstrator, it comes from the ability to learn quickly, test responsibly, and scale with confidence. 

Taking the next step 

Moving beyond pilots and experimentation isn’t just a simple task, especially when organisations bring individual nuances and ways of working. And for those in the public sector turning pilots into real, operational impact is harder than ever.  

We explored this further during our masterclass, From Pilots to Production – Scaling AI Safely in Public Services, at the FutureScot Public Sector conference on the 21st May. 

We covered: 


If you want to find out more about what we discussed and how it could apply to your organisation, drop me an email at gary.craven@soprasteria.com and we can explore how your next steps. 

Contributing authors: Neil Anderson, Data AI Practice, Chief Technology Officer, Sopra Steria
Gary Craven, Head of AI Strategy and Transformation, Sopra Steria

Related posts

Powering smarter public services: how LapSafe® supports Scotland’s forthcoming Digital Strategy 

Kay Tilbury
November 7, 2025

Starmer welcomes Microsoft’s ‘landmark’ $30 billion investment to ‘power UK’s AI future’

Kevin O'Sullivan
September 17, 2025

Glasgow researchers pioneer AI-powered ‘robo guide dogs’ to help blind and partially sighted people

Kevin O'Sullivan
February 8, 2024
Exit mobile version