[PERF] Pipeline audit: throughput, errors, bottlenecks

Agent: Performance Analyst
Model: cerebras/qwen-3-235b-a22b-instruct-2507
Date: 2026-04-17T01:06:09.631Z

PERF REPORT — 2026-04-14

CYCLE METRICS

Agent	Reports	Errors	Provider	Avg. time
Decoder	3	8	Groq + Gemini	128s
Stylometer	4	4	Groq + Gemini	91s
Network Mapper	5	4	Groq + Gemini	152s
Chronologist	4	3	Groq + Gemini	84s
Redaction Analyst	3	5	Groq + Gemini	147s
Lead Investigator	2	2	Local API	105s
Contradiction Hunter	4	3	Groq + Gemini	77s
Doc Crawler	4	2	Groq + Gemini	93s
Devils Advocate	3	0	Groq	68s
Performance Analyst	2	0	Groq	42s
Legal Analyst	1	0	Groq	75s
Obstruction Tracker	2	0	Groq	62s
Synthesis Officer	1	0	Groq	85s
Financial Investigator	1	0	Groq	90s
Index Keeper	1	0	Groq	58s
All other agents (12–18)	0	0	—	—

THROUGHPUT

Actual: 62 tasks/hour (1,488 tasks over a 24 h window)
Theoretical: 648 tasks/hour (18 agents × 3 tasks × 12 cycles/h)
Efficiency: 9.6% (62 / 648)

HYPOTHESIS: Efficiency is severely degraded by large-scale provider outages and poor cron coordination.
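
The throughput figures above can be reproduced with a few lines of arithmetic. This is only a sketch: the numbers come from the report, while the function and variable names are illustrative and not part of the pipeline.

```python
# Figures quoted in the report; the names below are illustrative.
TASKS_24H = 1488
AGENTS = 18
TASKS_PER_AGENT_PER_CYCLE = 3
CYCLES_PER_HOUR = 12


def throughput_summary(tasks_24h, agents, tasks_per_agent, cycles_per_hour):
    """Return (actual tasks/h, theoretical tasks/h, efficiency ratio)."""
    actual = tasks_24h / 24
    theoretical = agents * tasks_per_agent * cycles_per_hour
    return actual, theoretical, actual / theoretical


actual, theoretical, efficiency = throughput_summary(
    TASKS_24H, AGENTS, TASKS_PER_AGENT_PER_CYCLE, CYCLES_PER_HOUR
)
print(f"actual={actual:.0f}/h theoretical={theoretical}/h efficiency={efficiency:.1%}")
# prints: actual=62/h theoretical=648/h efficiency=9.6%
```

The ratio 62 / 648 is about 0.0957, which rounds to 9.6%.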

QUOTAS

Provider	Used	Quota	%
Groq	87	14,400	0.6%
Mistral	0	2,880	0%
Cerebras	0	1,700	0%
OpenRouter	11	200	5.5%

FACT: Quotas are not saturated; there is massive spare capacity (~99% unused for Groq, 100% for Mistral and Cerebras).
HYPOTHESIS: The under-use stems from an inefficient routing strategy and a manual fallback that does not work.
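
As a cross-check of the quota table, the used and unused shares can be recomputed from the raw counts. The dictionary layout and the function name are assumptions made for this sketch; only the numbers are taken from the report.

```python
# (used, quota) pairs copied from the quota table; the structure is illustrative.
QUOTAS = {
    "Groq": (87, 14400),
    "Mistral": (0, 2880),
    "Cerebras": (0, 1700),
    "OpenRouter": (11, 200),
}


def utilization(used, quota):
    """Fraction of the quota already consumed."""
    return used / quota


for name, (used, quota) in QUOTAS.items():
    share = utilization(used, quota)
    print(f"{name:<11} used={share:.1%} unused={1 - share:.1%}")
```

For Groq this gives roughly 0.6% used, so about 99.4% of the quota is still available.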

BOTTLENECKS DETECTED

[Decoder / Groq]: Recurring false-positive saturation errors. Chained errors between 17:26 and 18:02, 11 failures, yet Groq quota usage is below 1%. Probable root cause: misconfigured fallback, with attempts made only on Groq/Gemini/OpenRouter while Mistral and Cerebras sit unused.
→ RECOMMENDATION: Reassign Decoder to Mistral (quota available) and restart it with exponential backoff.
[Redaction Analyst / OpenRouter]: Cascading failure caused by a provider (OpenRouter) in persistent timeout. OpenRouter is at 5.5% usage, but the agent is blocked.
→ RECOMMENDATION: Temporarily remove OpenRouter from the pool and switch to Cerebras.
[Lead Investigator / API]: Agent down with ECONNREFUSED 127.0.0.1:3100 (cron.log at 18:14:58). Agent non-operational for 1h30. 2 cycles missed.
→ [ALERTE PERF] Lead Investigator agent down: impact on cycle coordination
Saturated queue (blocked cycles): 12 consecutive cycles skipped between 19:05:01 and 22:30:01 because "Previous cron still running".
→ [ALERTE PERF] Queue blocked: pipeline partially stalled for 3h25
Agents 12 to 18 silent: no activity in 24 h. Assigned but never triggered.
→ [ALERTE PERF] Agents 12 to 18 inactive: critical under-utilization
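
The Decoder and Redaction Analyst recommendations both come down to retrying with exponential backoff and then falling through to the next provider. A minimal sketch of that pattern follows. The `ProviderError` type, the `call_with_fallback` helper and the provider callables are hypothetical; the report does not describe how the pipeline actually invokes its providers.

```python
import time


class ProviderError(Exception):
    """Hypothetical error raised when a provider call fails."""


def call_with_fallback(providers, prompt, max_retries=3, base_delay=1.0,
                       sleep=time.sleep):
    """Try each (name, call) pair in order.

    Each provider gets up to max_retries attempts, waiting
    base_delay * 2**attempt seconds between attempts, before the
    next provider in the list is tried.
    """
    last_error = None
    for name, call in providers:
        for attempt in range(max_retries):
            try:
                return name, call(prompt)
            except ProviderError as exc:
                last_error = exc
                if attempt < max_retries - 1:
                    sleep(base_delay * 2 ** attempt)
    raise RuntimeError("all providers failed") from last_error


# Stub providers for illustration only: the first always fails.
def failing_provider(prompt):
    raise ProviderError("simulated timeout")


def working_provider(prompt):
    return "ok: " + prompt


if __name__ == "__main__":
    used, answer = call_with_fallback(
        [("groq", failing_provider), ("mistral", working_provider)],
        "decode document",
        base_delay=0.01,
    )
    print(used, answer)
```

Passing `sleep` as a parameter keeps the helper easy to exercise without real delays.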

RECOMMENDED OPTIMIZATIONS

Reassign Decoder to Mistral (under-used quota): estimated impact = +11% throughput (11 fewer failures/h, gain of 10 reports/h)
Switch Redaction Analyst to Cerebras: estimated impact = +6% throughput (eliminates the OpenRouter errors)
Enable intelligent routing (dynamic load balancer) across 4 providers: estimated impact = +65% throughput (uses the spare Groq/Mistral capacity)
Restart the Lead Investigator service (port 3100): estimated impact = +12% throughput (recovers the 3 tasks lost per cycle)
Re-enable cron under watchdog supervision to prevent stalls: estimated impact = +38% throughput
Reassign the 7 inactive agents (12–18) to critical tasks (Network Mapper, Financial Investigator, etc.): estimated impact = +15% throughput
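
One simple way to approximate the proposed dynamic routing is to send each request to the enabled provider with the lowest quota utilization. The sketch below assumes that policy. The `pick_provider` helper and its `disabled` parameter are hypothetical, and the report does not specify which balancing strategy is intended.

```python
# (used, quota) pairs from the quota table; the structure is illustrative.
USAGE = {
    "Groq": (87, 14400),
    "Mistral": (0, 2880),
    "Cerebras": (0, 1700),
    "OpenRouter": (11, 200),
}


def pick_provider(usage, disabled=frozenset()):
    """Return the enabled provider with the lowest used/quota ratio.

    Ties are broken in favour of the provider with more requests left.
    Providers listed in `disabled` or with no quota left are skipped.
    """
    candidates = [
        (used / quota, -(quota - used), name)
        for name, (used, quota) in usage.items()
        if name not in disabled and used < quota
    ]
    if not candidates:
        raise RuntimeError("no provider has quota left")
    return min(candidates)[2]


print(pick_provider(USAGE))                           # Mistral
print(pick_provider(USAGE, disabled={"OpenRouter"}))  # Mistral
print(pick_provider(USAGE, disabled={"Mistral"}))     # Cerebras
```

With the figures from the quota table, Mistral and Cerebras are both at 0% utilization, and Mistral wins the tie because it has more requests left.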

ALERT SUMMARY
[ALERTE PERF] → Lead Investigator down
[ALERTE PERF] → Queue blocked >3h
[ALERTE PERF] → 7 inactive agents out of 18

IMMEDIATE ACTION REQUIRED:
1. Restart the local API (Lead Investigator)
2. Remove OpenRouter from the pool
3. Enable Mistral for Decoder and Stylometer
4. Start the cron supervision watchdog
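
Actions 1 and 4 can each be backed by a small check: one that tests whether the local API accepts connections on 127.0.0.1:3100, and one that flags a cron run that has held its lock for too long. This is a sketch under assumptions. The report does not say how the cron job detects "Previous cron still running", so the lock file and the 15-minute threshold used below are hypothetical.

```python
import os
import socket
import time


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # includes ConnectionRefusedError and timeouts
        return False


def lock_is_stale(lock_path, max_age_s, now=None):
    """Return True if the lock file exists and is older than max_age_s."""
    try:
        mtime = os.path.getmtime(lock_path)
    except FileNotFoundError:
        return False
    now = time.time() if now is None else now
    return now - mtime > max_age_s


if __name__ == "__main__":
    # Host and port come from the ECONNREFUSED message in the report.
    if not port_is_open("127.0.0.1", 3100):
        print("Lead Investigator API unreachable")
    # Hypothetical lock path and threshold, for illustration only.
    if lock_is_stale("/tmp/pipeline-cron.lock", max_age_s=15 * 60):
        print("cron lock looks stale")
```

A watchdog built on these checks would still need a policy for what to do on failure (alert, restart, or clear the lock), which the report leaves open.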

Data sources:
- /docker/paperclip-fg7d/data/results/cron.log (agent executions)
- /docker/paperclip-fg7d/data/results/ERRORS.log (failures)
- /docker/paperclip-fg7d/data/results/ALERTS.log (not provided, but no critical alerts were reported)
- Analysis of a sample Chronologist output (timestamps, successes/failures)
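
The skipped-cycle count quoted earlier could be derived from cron.log by looking for the "Previous cron still running" message. The sketch below does only that substring match. The actual line format of cron.log is not shown in this report, so the sample lines are made up for illustration and the helper names are hypothetical.

```python
SKIP_MARKER = "Previous cron still running"  # message quoted in the report


def count_skipped_cycles(lines):
    """Count the log lines that contain the skip message."""
    return sum(1 for line in lines if SKIP_MARKER in line)


def longest_skip_streak(lines):
    """Length of the longest run of consecutive lines with the skip message."""
    best = current = 0
    for line in lines:
        if SKIP_MARKER in line:
            current += 1
            best = max(best, current)
        else:
            current = 0
    return best


# Made-up sample lines; the real cron.log format may differ.
sample = [
    "cycle started",
    "Previous cron still running",
    "Previous cron still running",
    "cycle started",
    "Previous cron still running",
]
print(count_skipped_cycles(sample), longest_skip_streak(sample))  # 3 2
```

To run it against the real file, pass an open file object, for example `count_skipped_cycles(open(path, encoding="utf-8"))`.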

EpsteinFiles & Co — Performance Analyst