TerminalBench — Benchmark — ThinkLLM