Portfolio·Enterprise lane

Blue Prism Insight

A multi-tab operational intelligence dashboard built for Blue Prism Centres of Excellence (CoEs). It surfaces fleet health, queue status, per-process RAG scoring, exception trend analysis, infrastructure monitoring, and ROI reporting in one place, a unified view that Control Room does not provide natively.

The problem

Blue Prism's native tooling — Control Room and Process Analytics — gives you individual bot statuses and session logs. It doesn't give a CoE the unified operational picture it needs to run the programme confidently. Exception logs are flat lists with no trend analysis. There's no ROI reporting, no infrastructure visibility, and no way to answer "are we healthy right now?" without clicking through multiple screens.

For a mature CoE running 200+ processes, that gap is a real operational risk. Problems surface late, effort is spent manually compiling MI for leadership, and infrastructure degradation often isn't noticed until bots start failing.

Approach

Built a Streamlit dashboard that plugs directly into the BP v7 REST API and Zabbix, designed for three distinct audiences: automation teams who need to operate the estate day-to-day, leadership who need ROI and MI, and admins who manage access.

The dashboard ships with a full mock data provider — JSON fixtures covering every view — so it runs out of the box for demos and UAT without any BP access. Switching to live data is a single environment variable change.

What Insight answers

Are we healthy right now?

Control Room shows individual bot statuses, but there's no single view correlating fleet health, queue backlogs, infrastructure metrics, and schedule adherence. Insight's Overview tab puts all of this on one screen with live status pills, fleet-wide KPIs, and clearance ETAs for every monitored queue.
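To make the clearance ETA idea concrete, here is a minimal sketch of one way such an estimate could be derived. It assumes the ETA is simply pending items divided by recent throughput; the `QueueSnapshot` type, its fields, and the one-hour throughput window are hypothetical and not taken from Insight's actual code.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class QueueSnapshot:
    """Hypothetical point-in-time view of one work queue."""
    name: str
    pending: int
    completed_last_hour: int  # assumed throughput measure


def clearance_eta(queue: QueueSnapshot, now: datetime) -> Optional[datetime]:
    """Estimate when the queue will be empty at the current throughput.

    Returns None when nothing is being completed, because no ETA can be
    projected from zero throughput.
    """
    if queue.pending <= 0:
        return now  # nothing left to clear
    if queue.completed_last_hour <= 0:
        return None  # stalled queue: no meaningful ETA
    hours_to_clear = queue.pending / queue.completed_last_hour
    return now + timedelta(hours=hours_to_clear)
```

Returning `None` for a stalled queue, rather than a far-future timestamp, lets the UI show an explicit "no ETA" state instead of a misleading date.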

Which exceptions actually need attention?

BP's exception logs are flat lists. Insight adds trend analysis (spiking vs normal), velocity tracking, severity ranking, and Business vs System classification — so you can prioritise by impact, not just recency.
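The sketch below illustrates one plausible way to turn a flat list of daily exception counts into trend, velocity, and a ranking. The spike rule (latest day compared with the mean of earlier days), the default parameters, and the name-prefix test for Business vs System are all assumptions made for illustration, not Insight's actual rules.

```python
from statistics import mean


def exception_class(exception_type: str) -> str:
    """Assumed rule: types starting with 'Business' are Business, the rest System."""
    if exception_type.strip().lower().startswith("business"):
        return "Business"
    return "System"


def velocity(daily_counts: list) -> float:
    """Average change in exceptions per day across the window."""
    if len(daily_counts) < 2:
        return 0.0
    return (daily_counts[-1] - daily_counts[0]) / (len(daily_counts) - 1)


def classify_trend(daily_counts: list, spike_factor: float = 2.0, min_count: int = 5) -> str:
    """Label the latest day 'spiking' when it clearly exceeds the prior baseline."""
    if len(daily_counts) < 2:
        return "normal"
    *history, latest = daily_counts
    baseline = mean(history)
    if latest >= min_count and latest > spike_factor * baseline:
        return "spiking"
    return "normal"


def rank_exceptions(counts_by_name: dict) -> list:
    """Order exception names: spiking first, then by descending velocity."""
    return sorted(
        counts_by_name,
        key=lambda name: (
            classify_trend(counts_by_name[name]) != "spiking",
            -velocity(counts_by_name[name]),
        ),
    )
```

The `min_count` floor is there so that a jump from one exception to three does not register as a spike.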

What's the ROI of our automation programme?

There's no native ROI reporting in Blue Prism. Insight models cost avoidance against your annual programme cost, with per-process breakdowns, contribution analysis, and plain-English interpretation of ROI ratios. Exportable as CSV.
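As a rough illustration of the ROI model, the sketch below assumes cost avoidance is the manual handling time an automation replaces, priced at an hourly rate, and that the ROI ratio is total cost avoidance divided by annual programme cost. The formula, field names, and wording of the interpretation are assumptions; the real model may weight things differently.

```python
from dataclasses import dataclass


@dataclass
class ProcessVolume:
    """Hypothetical per-process inputs to the ROI model."""
    name: str
    items_per_year: int
    manual_minutes_per_item: float


def cost_avoidance(process: ProcessVolume, hourly_rate: float) -> float:
    """Manual effort avoided per year, priced at the given hourly rate."""
    hours_avoided = process.items_per_year * process.manual_minutes_per_item / 60
    return hours_avoided * hourly_rate


def roi_summary(processes: list, hourly_rate: float, annual_programme_cost: float) -> dict:
    """Total cost avoidance, ROI ratio, and each process's share of the total."""
    if annual_programme_cost <= 0:
        raise ValueError("annual_programme_cost must be positive")
    per_process = {p.name: cost_avoidance(p, hourly_rate) for p in processes}
    total = sum(per_process.values())
    contribution = {
        name: (value / total if total else 0.0) for name, value in per_process.items()
    }
    return {
        "total_cost_avoidance": total,
        "roi_ratio": total / annual_programme_cost,
        "contribution": contribution,
    }


def interpret(ratio: float) -> str:
    """Plain-English reading of an ROI ratio."""
    base = f"The programme avoids {ratio:.2f} in cost for every 1.00 spent"
    if ratio < 1:
        return base + ", so it has not yet covered its own cost."
    return base + "."
```

Keeping the per-process figures in the result makes the contribution table and the CSV export a straightforward dump of the same dictionary.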

Is our infrastructure about to cause problems?

BP doesn't monitor its own infrastructure. Insight pulls server metrics from Zabbix and derives health status from real thresholds — CPU, RAM, disk, and Windows service states. It highlights degradation before bots start failing.

Overview

Fleet-wide KPI cards, a per-pool digital worker grid with status indicators, work queue cards with progress bars and clearance ETAs, and an upcoming schedule table.

Digital worker fleet grouped by pool. Status indicators and fleet-wide KPIs at the top.
Work queue cards showing items processed, pending, exceptioned, and clearance ETAs.

Process Intelligence

Per-process RAG health scoring against configurable STP and exception rate thresholds, AHT and volume trends, and an Exception Intelligence section with trend-aware KPIs and Business vs System classification.
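A minimal sketch of how RAG scoring against configurable thresholds might look. It assumes each metric is banded separately and the process takes the worse of the two bands; the threshold values and the `RagThresholds` type are illustrative placeholders, not the defaults Insight ships with.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RagThresholds:
    """Hypothetical configurable limits, expressed as fractions."""
    stp_green: float = 0.90  # STP rate at or above this is Green
    stp_amber: float = 0.75  # at or above this (but below green) is Amber
    exc_green: float = 0.05  # exception rate at or below this is Green
    exc_amber: float = 0.15  # at or below this (but above green) is Amber


def _band(value: float, green: float, amber: float, higher_is_better: bool) -> int:
    """Return 0 (Green), 1 (Amber) or 2 (Red) for a single metric."""
    if higher_is_better:
        return 0 if value >= green else 1 if value >= amber else 2
    return 0 if value <= green else 1 if value <= amber else 2


def rag_status(stp_rate: float, exception_rate: float,
               thresholds: RagThresholds = RagThresholds()) -> str:
    """Overall status is the worse of the STP band and the exception band."""
    stp = _band(stp_rate, thresholds.stp_green, thresholds.stp_amber, True)
    exc = _band(exception_rate, thresholds.exc_green, thresholds.exc_amber, False)
    return ("Green", "Amber", "Red")[max(stp, exc)]
```

Taking the worse band means a process with an excellent STP rate still shows Amber or Red if its exception rate is out of bounds.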

Per-process health status derived from STP rate and exception rate against configurable thresholds.
Exception Intelligence: type split, time window selector, and severity-ranked detail table.

ROI & Management Information

Projected annual cost avoidance, ROI ratio with plain-English interpretation, per-process contribution analysis, cumulative savings over time, and CSV export.

Hero cards showing projected annual cost avoidance and ROI ratio.
Process summary table sorted by cost avoidance contribution.

Infrastructure

Server health cards with derived status from real CPU, RAM, disk, and service metrics — all pulled from Zabbix.

Server status derived from live metrics. Status is Healthy, Warning, or Critical based on configurable thresholds.

Utilisation

Bot utilisation bar chart, hourly activity heatmap, and daily trend lines. Fleet-wide KPIs with configurable thresholds.
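The data behind an hourly heatmap can be sketched as below: each session is split at hour boundaries and the busy minutes are accumulated per hour of day. This assumes sessions arrive as `(start, end)` datetime pairs that do not overlap for a given bot; the function names are hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def hourly_busy_minutes(sessions: list) -> dict:
    """Map hour of day (0-23) to minutes the bot spent in sessions."""
    busy = defaultdict(float)
    for start, end in sessions:
        cursor = start
        while cursor < end:
            # End of the clock hour that contains the cursor.
            next_hour = cursor.replace(minute=0, second=0, microsecond=0) + timedelta(hours=1)
            slice_end = min(end, next_hour)
            busy[cursor.hour] += (slice_end - cursor).total_seconds() / 60
            cursor = slice_end
    return dict(busy)


def utilisation(sessions: list, window_hours: float) -> float:
    """Fraction of the reporting window the bot was busy."""
    busy_minutes = sum(hourly_busy_minutes(sessions).values())
    return busy_minutes / (window_hours * 60)
```

Because the result is keyed by hour of day, sessions from different days fall into the same buckets; a per-day heatmap would key on `(date, hour)` instead.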

Hourly activity heatmap per bot and daily utilisation trend lines.

Architecture decisions

Mock/live provider swap

A factory pattern switches between static JSON fixtures and the live BP API + Zabbix via a single environment variable. Fully runnable for demos and UAT — one config change and it's pointing at production.
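A minimal sketch of the provider swap, under stated assumptions: the environment variable names (`INSIGHT_DATA_MODE`, `INSIGHT_BP_API_URL`, `INSIGHT_FIXTURE_DIR`), the class names, the `queues.json` fixture file, and the single `get_queues` method are all hypothetical stand-ins for whatever the real interface exposes.

```python
import json
import os
from pathlib import Path
from typing import Mapping, Protocol


class DataProvider(Protocol):
    """Interface that both providers satisfy, so views never know the source."""
    def get_queues(self) -> list: ...


class MockProvider:
    """Serves static JSON fixtures for demos and UAT."""
    def __init__(self, fixture_dir: Path) -> None:
        self.fixture_dir = fixture_dir

    def get_queues(self) -> list:
        return json.loads((self.fixture_dir / "queues.json").read_text())


class LiveProvider:
    """Placeholder for the provider that calls the live APIs."""
    def __init__(self, base_url: str) -> None:
        self.base_url = base_url

    def get_queues(self) -> list:
        raise NotImplementedError("live API calls are omitted from this sketch")


def make_provider(env: Mapping = os.environ) -> DataProvider:
    """Factory: choose the provider from a single environment variable."""
    mode = env.get("INSIGHT_DATA_MODE", "mock").lower()
    if mode == "live":
        return LiveProvider(env["INSIGHT_BP_API_URL"])
    if mode == "mock":
        return MockProvider(Path(env.get("INSIGHT_FIXTURE_DIR", "fixtures")))
    raise ValueError(f"unknown data mode: {mode!r}")
```

Defaulting to mock mode means a fresh checkout runs without credentials, and an unrecognised mode fails loudly rather than silently falling back.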

Derived status, not declared status

Server health cards compute status from real CPU, RAM, disk, and Windows service metrics against configurable thresholds. A server isn't 'healthy' because no one reported a problem — it's healthy because the numbers are within bounds.
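The idea of derived status can be sketched as a pure function over the latest metrics. The threshold values, the `ServerMetrics` type, and the rule that any stopped service is Critical are assumptions for illustration; the real thresholds are configurable and may differ.

```python
from dataclasses import dataclass

# Hypothetical (warning, critical) limits, as percentages.
DEFAULT_THRESHOLDS = {
    "cpu_pct": (80.0, 95.0),
    "ram_pct": (85.0, 95.0),
    "disk_pct": (80.0, 90.0),
}


@dataclass
class ServerMetrics:
    """Latest readings for one server."""
    cpu_pct: float
    ram_pct: float
    disk_pct: float
    services: dict  # Windows service name -> True if running


def derive_status(metrics: ServerMetrics, thresholds: dict = DEFAULT_THRESHOLDS) -> str:
    """Compute Healthy / Warning / Critical from the numbers alone."""
    # Assumed rule: a stopped monitored service is always Critical.
    if not all(metrics.services.values()):
        return "Critical"
    status = "Healthy"
    for name, (warning, critical) in thresholds.items():
        value = getattr(metrics, name)
        if value >= critical:
            return "Critical"
        if value >= warning:
            status = "Warning"
    return status
```

Because the function has no side effects and takes thresholds as an argument, it can be unit-tested against fixture data without a monitoring server.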

Security by design

bcrypt password hashing, JWT session cookies, brute-force protection (5 attempts), server-side session invalidation, role-based tab access, and structured audit logging of every auth event.
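As one illustration of the brute-force protection, the sketch below locks an account after five consecutive failed logins. Only the five-attempt limit comes from the description above; the 15-minute lockout window, the in-memory storage, and the `LoginThrottle` class are assumptions, and password hashing and JWT handling are deliberately left out.

```python
import time
from typing import Callable


class LoginThrottle:
    """Tracks failed logins per user and locks out after too many."""

    def __init__(self, max_attempts: int = 5, lockout_seconds: float = 900.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.max_attempts = max_attempts
        self.lockout_seconds = lockout_seconds  # assumed duration
        self._clock = clock
        self._failures = {}      # username -> consecutive failure count
        self._locked_until = {}  # username -> clock value when lock expires

    def is_locked(self, username: str) -> bool:
        until = self._locked_until.get(username)
        if until is None:
            return False
        if self._clock() >= until:
            # Lock has expired: clear state so the user starts fresh.
            del self._locked_until[username]
            self._failures.pop(username, None)
            return False
        return True

    def record_failure(self, username: str) -> None:
        count = self._failures.get(username, 0) + 1
        self._failures[username] = count
        if count >= self.max_attempts:
            self._locked_until[username] = self._clock() + self.lockout_seconds

    def record_success(self, username: str) -> None:
        # A successful login resets the counter.
        self._failures.pop(username, None)
        self._locked_until.pop(username, None)
```

The injectable `clock` exists so tests can advance time without sleeping; each call to `record_failure` is also a natural place to emit an audit log entry.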

Stack

Python / Streamlit: Dashboard framework
Plotly: Charting (dark theme)
Blue Prism v7 REST API: Automation estate data
Zabbix JSON-RPC: Infrastructure metrics
IIS + ARR: Production reverse proxy
pytest: 128 tests, 92% coverage
GitHub Actions: CI (lint + test)