1.
grounding split
:作者把以下数据集中的 Meta 信息都统一成 pyautogui 命令格式的数据
2.
planning & reasoning split
"Thanks to our detailed inner monologue trajectory data, we implement a
reasoning mixture approach
, where the model is exposed to
various levels of cognitive complexity
,
from straightforward low-level action instructions to full inner monologues
that include observation descriptions, thoughts, and detailed action plans. By dynamically adjusting the complexity of these trajectories, we train the model to be adaptable, fostering step-by-step reasoning and high-level decision-making abilities. This
diversity
in reasoning ensures that the model can handle a wide range of tasks with nuanced understanding and precision."