Why Public Sector AI Projects Fail: They Are Designed for Committees, Not Citizens

Public Sector AI Insight

Why Public Sector AI Projects Fail: They Are Designed for Committees, Not Citizens

Artificial intelligence can improve public services, but many government AI projects stall because they are shaped around internal governance, consensus and risk avoidance rather than the citizen experience they are meant to improve.

The promise

Public sector AI should make services faster, fairer and more responsive.

In the public sector, artificial intelligence holds enormous promise: faster processing of citizen applications, more accurate risk assessments in social services, predictive maintenance for infrastructure and more efficient allocation of limited resources.

Yet across governments worldwide, including in New Zealand, many AI initiatives stall, deliver underwhelming results or quietly disappear after substantial investment.

The core problem is not usually the algorithm. More often, the project has been designed around internal machinery rather than the person who needs the service.

Projects are shaped around steering groups, approvals and internal consensus.
Citizen outcomes become diluted as each agency, department or function adds requirements.
Risk management becomes more visible than service improvement.
The project delivers something defensible internally, but weak in the hands of the public.
The structural problem

Public sector AI failure is rarely just a technology failure.

The failure rate for AI projects is high across enterprise and public contexts. Research and industry analysis frequently point to weak problem definition, poor data quality, unclear value, governance friction and implementation barriers as repeated causes of AI project failure.

In public sector environments, those risks are amplified. AI projects must work inside fragmented legacy systems, inter-agency boundaries, privacy obligations, procurement rules, political sensitivity and public accountability expectations.

That makes decision quality essential. Before a public agency asks whether the AI model can work, it should ask whether the decision environment around the project has been properly tested.

Failure pattern 01

Misaligned or diluted objectives from the start.

Public sector projects often begin with broad, aspirational goals shaped in steering groups, workshops and inter-agency consultations. By the time requirements are documented, the original citizen problem can become diluted.

A problem such as reducing wait times for benefit applications may slowly become a specification that satisfies policy, legal, operational, reporting and political requirements, while losing focus on the person waiting for help.

RAND Corporation research on AI failure has highlighted misunderstanding or miscommunication of the core problem as a major risk. In government, that risk is often magnified by the number of actors involved.

Original need Improve the citizen experience.

The public-facing problem is often clear at the beginning: faster decisions, better access, clearer support or less friction.

Committee effect Requirements become diluted.

Every internal group adds constraints, preferences and controls until the original problem becomes secondary.

Outcome risk The system works internally, but not publicly.

The project may satisfy sign-off requirements while failing to meaningfully improve the citizen outcome.

Failure pattern 02

Endless layers of governance and approval.

Public sector AI projects can accumulate committees quickly: project boards, risk registers, ethics panels, procurement teams, privacy impact assessments, technical reviews and senior oversight. Each layer may add scrutiny, but not necessarily better decision insight.

When approval processes stretch for months, pilots lose urgency and the original service problem fades into governance procedure. Risk-averse cultures can end up prioritising “no surprises” over practical learning.

Pilots become slower than the problems they were meant to solve.
Project teams optimise for approval rather than adoption.
Governance bodies add controls without always improving technical or citizen insight.
The result is often a safe but shallow solution that struggles to scale.
Failure pattern 03

Internal compliance becomes more important than citizen outcomes.

Public sector AI is often optimised for audit trails, explainability to internal reviewers and defensibility in information requests. These things matter, but they can overwhelm the actual service experience if they become the dominant design logic.

Interfaces become clunky to accommodate every possible edge case. Features are stripped out to minimise perceived risk. The system technically works, but citizens experience it as another bureaucratic layer.

Governance should protect citizens, not displace them from the centre of the design.

Failure pattern 04

Data and integration problems are amplified by silos.

Public data is often fragmented across legacy systems, departmental boundaries and inconsistent formats. AI needs clean, integrated and high-quality data to be useful, but public sector projects often defer the difficult work of data sharing, governance and modernisation.

Instead, they settle for narrow pilots using whatever data is easiest to access. That may be enough to demonstrate a concept, but not enough to support a reliable public service.

Data quality AI cannot fix weak foundations.

If the underlying data is inconsistent, incomplete or poorly governed, AI outputs will inherit those weaknesses.

Integration Silos reduce usefulness.

When departments cannot share or connect data effectively, AI projects become narrow and fragile.

Privacy Fear replaces design.

Privacy concerns should be designed into the work early, not used late as a reason to avoid difficult decisions.

Failure pattern 05

Projects resist iteration and real user feedback.

Private-sector AI often improves through rapid iteration: build, test with users, learn and refine. Public sector projects shaped by committee consensus often resist change once scoped.

Citizen testing can become tokenistic or too late in the process. Negative feedback triggers more review, rather than fast improvement. The outcome is a system designed for sign-off, not adoption.

Citizens are consulted after the project direction is already locked in.
User feedback is treated as risk rather than evidence.
Scope becomes difficult to change because too many committees have approved it.
The project becomes mediocre by design.
The insight

The project has to be designed around the citizen problem, not the committee process.

Fixing public sector AI failure does not require abandoning governance. It requires reorienting governance around outcomes, evidence and citizen value.

New Zealand already has useful guidance through the Public Service AI Framework, MBIE Responsible AI Guidance and the broader New Zealand AI Strategy. The challenge is turning guidance into working delivery conditions.

A better path

Citizen-centred AI needs lighter, sharper and more outcome-focused assurance.

The answer is not reckless experimentation. It is disciplined, citizen-centred testing before the project becomes too large, slow or politically exposed to change.

Step 01
Start with the citizen problem.

Define the specific service friction, delay, risk or unfairness the AI project is meant to improve. Keep that problem visible throughout the project.

Step 02
Use small, empowered teams.

Cross-functional teams should be able to define narrow, high-impact scopes without every decision being diluted through committee consensus.

Step 03
Test with real users early.

Citizen and frontline feedback should be treated as evidence, not a threat to the project plan.

Step 04
Invest in data foundations.

Data quality, privacy, integration and governance need to be solved as core project conditions, not left as late-stage blockers.

Step 05
Shift governance from sign-off to assurance.

Governance should test whether the project is useful, safe, lawful, explainable and improving the citizen outcome it was created for.

Where MOI fits

Public sector AI needs decision assurance before procurement and implementation.

Ministry of Insights is built for this kind of pre-commitment testing. The MOI Lab system helps leaders test the decision environment before money, people, data, reputation or public trust are placed at risk.

Civic Lab tests trust, legitimacy, public consequence and community confidence.
Insights Lab tests operational reality, data quality, constraints and failure loops.
Engage Lab tests stakeholder power, resistance, alignment and influence.
Change Lab tests adoption, behaviour change and implementation friction.
Consult Lab provides independent challenge before the recommendation becomes policy.
Decision Assurance Lab stress-tests high-stakes AI decisions before commitment.
The takeaway

The cost is not just failed technology. It is lost public trust.

When public sector AI projects fail, the cost is not only financial. It is the lost opportunity to make government services faster, fairer and more responsive at a time when public trust is fragile.

New Zealand’s public sector has strengths: pragmatism, a relatively small scale for testing and growing AI guidance from MBIE and the Government Chief Digital Office. But until AI projects are genuinely designed around citizens rather than committees, many will continue to underperform.

Talk to MOI Explore Decision Assurance Lab Explore the Lab system
Related reading

Decision support, responsible AI and public-sector assurance.

For broader context, review MOI’s Lab system, responsible AI position and relevant public-sector AI guidance.

MOI Lab system MOI Ethical AI Policy RAND research OECD AI Policy Observatory