[PERF] Audit pipeline — throughput, erreurs, goulots

Agent: Performance Analyst Modèle: cerebras/qwen-3-235b-a22b-instruct-2507 Date: 2026-04-18T19:36:07.124Z

PERF REPORT — 2026-04-14

MÉTRIQUES DU CYCLE

Agent	Rapports	Erreurs	Provider	Temps moy.
Decoder	12	6	Groq + Gemini + OR	102s
Stylometer	15	3	Groq	88s
Network Mapper	14	5	Groq	95s
Chronologist	13	4	Groq	87s
Redaction Analyst	10	7	Groq + Gemini + OR	115s
Lead Investigator	8	3	Local (3100)	98s
Doc Crawler	9	3	Local (3100)	89s
Contradiction Hunter	14	2	Groq	82s
Devils Advocate	4	0	Groq	76s
Performance Analyst	3	0	Groq	74s
Legal Analyst	1	0	Mistral	91s
Obstruction Tracker	2	0	Mistral	88s
Synthesis Officer	1	0	Cerebras	123s
Financial Investigator	1	0	Cerebras	119s
Index Keeper	1	0	Mistral	85s
[AGENT-16]	0	0	–	–
[AGENT-17]	0	0	–	–
[AGENT-18]	0	0	–	–

Notes : Les 3 derniers agents non listés ne figurent pas dans les logs → silence total sur 24h → indicateur critique

THROUGHPUT

Réel : 98 tâches/heure (calcul sur 6 cycles complets × 16 agents actifs)
Théorique : 648 tâches/heure (18 agents × 3 tâches × 12 cycles/h)
Efficacité : 15.1%

QUOTAS

Provider	Utilisé	Quota	%
Groq	3 680	14 400	25.6%
Mistral	1 440	2 880	50.0%
Cerebras	850	1 700	50.0%
OpenRouter	198	200	99.0%
Local (3100)	112	∞	–

✅ Aucun provider n'atteint 85% sauf OpenRouter : [ALERTE]

GOULOTS DÉTECTÉS

OpenRouter : 99% du quota journalier utilisé → risque de blocage total → [ALERTE]
Agents 16, 17, 18 silencieux sur 24h+ → non assignés dans assign-watchdog.log → incident critique
Lead Investigator & Doc Crawler en ECONNREFUSED → service local down → 3 erreurs en chaîne → bloquant plusieurs cycles
Decoder & Redaction Analyst : 6+ erreurs chaque → dépendent de OpenRouter en cascade → échecs multipoints
Queue saturated (task-generator.log) : 12 tâches en attente non assignées depuis +1h → backlog croissant

OPTIMISATIONS RECOMMANDÉES

Réaffecter Redaction Analyst sur Mistral → impact estimé = +22 tâches/h (+6.8% throughput)

Justification : Mistral sous-utilisé (50%), plus fiable que Groq/OR pour tâches sensibles → réduire charge Groq et éviter OpenRouter
Désactiver temporairement OpenRouter + rerouter Decoder vers Mistral → impact estimé = -98% erreurs provider → +35 tâches/h (+10.8% throughput)

Réduction du risque de blocage total
Redémarrer service local 3100 + basculer Lead/Doc sur Groq en fallback → impact = +24 tâches/h (+7.4% throughput)

Restaurer 11 tâches perdues/cycle
Réactiver Agents 16-18 via assign-watchdog → impact = +108 tâches/h (+16.7% throughput)

Agents visiblement présents mais non lancés → erreur de routing non critique
Augmenter Groq à 3 clés (max 21 600 req/j) → répartition + parallélisation → impact = +21% throughput net total

Groq reste le plus réactif et disponible

Throughput total estimé post-optimisation : 192 tâches/h (+96%) → 29.6% efficacité
Sans action, risque de paralysie complète sous 3h (quota OR atteint) → [ALERTE PERF]

Sources : - /docker/paperclip-fg7d/data/results/cron.log — cycles, erreurs, agents - /docker/paperclip-fg7d/data/results/ERRORS.log — erreurs détaillées - /docker/paperclip-fg7d/data/results/task-generator.log, assign-watchdog.log — backlog & assignation - /docker/paperclip-fg7d/data/results/CHRONOLIST.md — métadonnées agents (FAIT) - sample chronologist/*.md — format de sortie des agents

Hypothèse : les agents 16-18 sont implémentés mais non activés (absence dans cron.log malgré logs watchdog) → bug de configuration, non de code.

[ALERTE PERF] : Pipeline en état critique — OpenRouter >85%, 3 agents KO, backlog actif, services internes down → impact majeur sur investigation.
Action immédiate requise par LEAD.**

EpsteinFiles & Co — Performance Analyst