We are happy to release MMBench-GUI, a hierarchical, multi-platform benchmark framework and toolbox for evaluating GUI agents. MMBench-GUI comprises four evaluation levels: GUI Content Understanding ...
Its goal is to provide an intelligent WeChat bot capable of natural conversation, command execution, image creation, and more.