HippoCamp: Benchmarking Contextual Agents on Personal Computers — ThinkLLM