Exploring the frontier of LLMs and AI Agents in biological protocol understanding, reasoning, and automated physical execution. BioProBench is the first large-scale, integrated multi-task benchmark ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results