Motor Agent

Overview

Motor Agent is similar to Language Agent but instead of returning a string, it always returns a Motion. Motor Agent is generally powered by robotic transformer models, i.e., OpenVLA, RT1, Octo, etc. Some small models, like RT1, can run on edge devices. However, some, like OpenVLA, are too large to run on edge devices. See OpenVLA Agent and an example OpenVLA server

motor_agent = OpenVlaAgent(model_src="https://api.mbodi.ai/community-models/") # OpenVLA model
hand_control = motor_agent.act("move left", image)
hardware_interface.do(hand_control)