Evaluating Multi-Agent Architectures: A Performance Benchmark