CoreGrid Operator Surface

Production Control Room

Health: ok. This screen exposes host containment, job reroute and host policy controls with operator confirmations and filtered audit visibility.

Live Summary
Active hosts4
Running jobs0
Completed jobs13
Queue pressure3
Hosts
5
Jobs
18
Queued Tasks
3
Quarantined
1
Audit Events
8

Host Operations

Update policy and isolate unhealthy nodes without shell access.

Host

dashboard-test-host

quarantined
Available GPU
55%
Last Seen
25.03.2026 18:47:49
Reputation
0.50
Policy
72C / 55%
Policy writes immediately to the live host record.
Available GPU
90%
Last Seen
26.03.2026 02:18:19
Reputation
0.50
Policy
80C / 90%
Policy writes immediately to the live host record.
Available GPU
90%
Last Seen
24.03.2026 02:37:33
Reputation
0.50
Policy
80C / 90%
Policy writes immediately to the live host record.
Available GPU
90%
Last Seen
23.03.2026 20:10:20
Reputation
0.50
Policy
80C / 90%
Policy writes immediately to the live host record.
Available GPU
90%
Last Seen
23.03.2026 20:09:55
Reputation
0.50
Policy
80C / 90%
Policy writes immediately to the live host record.

Jobs

Recent workload and reroute controls.

JobTypeStatusAcceptedAction
job_4bf55e98ab34eadeembedding_batchcancelled24.03.2026 06:13:20
job_e2aaf20b4f2c808cembedding_batchcancelled24.03.2026 06:13:50
job_3543851d4af05acfembedding_batchcompleted23.03.2026 20:10:45
job_f044deafafc34553embedding_batchcompleted23.03.2026 20:29:38
job_97f43d7a54aceef5embedding_batchcompleted24.03.2026 04:00:14
job_e79d795f31fb28eaembedding_batchcompleted24.03.2026 04:53:15
job_700062b4f326239dembedding_batchcompleted25.03.2026 22:43:40
job_30b9f1bebde3422dembedding_batchcompleted25.03.2026 18:57:32
job_28c8ef6eda7b0af7embedding_batchcompleted24.03.2026 06:14:39
job_cdce752c7df0acc1embedding_batchcompleted25.03.2026 19:11:29
job_f8126f6e1a9233abembedding_batchcompleted25.03.2026 18:47:49
job_11496fb5b1776515embedding_batchcompleted25.03.2026 20:18:38
job_50a34efac4f0a30eembedding_batchcompleted25.03.2026 19:10:44
job_1ea8d1ae033a9f94embedding_batchcompleted25.03.2026 19:10:44
job_82032ce5e0c4d534embedding_batchcompleted25.03.2026 19:22:44
job_1ea6db7a5bbb29c2embedding_batchqueued25.03.2026 22:45:19
job_be98d29b0e0a1cbdembedding_batchqueued25.03.2026 22:49:24
job_89c30c2db42e30b9embedding_batchqueued25.03.2026 22:47:12

Queue

Tasks waiting for lease

TaskJobVRAMAttempts
task_4ec8263c5a729e27embedding_batch2048 MB1
task_3367a8dbc8979cc7embedding_batch2048 MB1
task_d7e984543034471bembedding_batch2048 MB1

Host Detail

Selected from the host list

Select a host id from the list below.

Task Detail

Selected from the queue table

Select a queued task for detail view.

Audit Trail

Latest operator and runtime events

Reset
EventActorObjectAt
job.reroutedadmin:adminjob:job_f8126f6e1a9233ab25.03.2026 18:47:49
host.registeredhost:host_20260324023735.835443960host:host_20260324023735.83544396024.03.2026 02:37:35
host.quarantinedadmin:adminhost:host_20260325184749.10625020925.03.2026 18:47:49
host.registeredhost:host_20260325184749.106250209host:host_20260325184749.10625020925.03.2026 18:47:49
host.registeredhost:host_20260323201023.068080049host:host_20260323201023.06808004923.03.2026 20:10:23
host.registeredhost:host_20260323200940.930365485host:host_20260323200940.93036548523.03.2026 20:09:40
host.registeredhost:host_20260323201000.305046056host:host_20260323201000.30504605623.03.2026 20:10:00
host.policy_updatedadmin:adminhost:host_20260325184749.10625020925.03.2026 18:47:49