  ____                            _        _   _                   _ 
 / ___|___  _ __ ___  _ __  _   _| |_ __ _| |_(_) ___  _ __   __ _| |
| |   / _ \| '_ ` _ \| '_ \| | | | __/ _` | __| |/ _ \| '_ \ / _` | |
| |__| (_) | | | | | | |_) | |_| | || (_| | |_| | (_) | | | | (_| | |
 \____\___/|_| |_| |_| .__/ \__,_|\__\__,_|\__|_|\___/|_| |_|\__,_|_|
                     |_|                                             
 ____  _               _          
|  _ \| |__  _   _ ___(_) ___ ___ 
| |_) | '_ \| | | / __| |/ __/ __|
|  __/| | | | |_| \__ \ | (__\__ \
|_|   |_| |_|\__, |___/_|\___|___/
             |___/                
#  Model              Agent Harness  Runner  Overall Progress  Total Tokens  Latency    Cost      Score / $  Status
1  gpt-5.4            Codex CLI      Harbor  77%               933,458       531s wall  $0.7109   108.3      PARTIAL
2  Qwen3.5-397B-A17B  Aider          Harbor  76%               ~98,900       482s wall  ~$0.1270  ~598.6     PARTIAL
3  gpt-5.5            Codex CLI      Harbor  75.8%             694,603       350s wall  $1.0889   69.6       PARTIAL
4  deepseek-v4-pro    OpenCode       Harbor  75%               947,427       202s wall  $1.6794   44.7       PARTIAL
5  Gemini 3.5 Flash   OpenCode       Harbor  73.8%             1,264,424     228s wall  $0.0532   1387.4     PARTIAL
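
The page does not say how the Score / $ column is computed. For the rows with exact figures, the values are consistent with overall progress (in percentage points) divided by cost in US dollars. That reading is an assumption inferred from the displayed numbers, not a documented formula. A minimal sketch under that assumption, using the hypothetical helper name `score_per_dollar`:

```python
def score_per_dollar(progress_pct: float, cost_usd: float) -> float:
    """Assumed metric: overall progress (percentage points) per US dollar spent."""
    if cost_usd <= 0:
        raise ValueError("cost_usd must be positive")
    return progress_pct / cost_usd


# Rows copied from the leaderboard that have exact (non-approximate) values.
rows = [
    ("gpt-5.4", 77.0, 0.7109),
    ("gpt-5.5", 75.8, 1.0889),
    ("deepseek-v4-pro", 75.0, 1.6794),
]

for model, progress_pct, cost_usd in rows:
    # Round to one decimal place, matching the precision shown in the table.
    print(model, round(score_per_dollar(progress_pct, cost_usd), 1))
```

Rows whose inputs are approximate (marked with ~) or whose cost is very small may not reproduce exactly from the displayed figures, presumably because the displayed values are rounded.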

Capability Radar

Multi-dimensional fingerprint of model strengths across the evaluation rubric.