ACT-1: Transformer for Actions

ACT-1 is a large-scale Transformer trained to use digital tools. Among other things, we recently taught it how to use a web browser. Right now, it’s hooked up to a Chrome extension which allows ACT-1 to observe what’s happening in the browser and take certain actions, such as clicking, typing, and scrolling. The observation is a custom “rendering” of the browser viewport that’s meant to generalize across websites, and the action space is the UI elements available on the page.
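To make the observation/action framing a bit more concrete, here is a rough Python sketch of what such an interface might look like. This is not Adept's code: `UIElement`, `Action`, `render_viewport`, `valid_actions`, and `toy_policy` are all hypothetical names, and the hard-coded rule in `toy_policy` only stands in for where a trained model would make the choice.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class UIElement:
    """One element visible in the viewport (hypothetical schema)."""
    element_id: int
    role: str   # e.g. "button", "textbox", "link"
    label: str


@dataclass
class Action:
    """An action on the page: "click", "type", or "scroll"."""
    kind: str
    element_id: Optional[int] = None
    text: Optional[str] = None


def render_viewport(elements: List[UIElement]) -> str:
    """Flatten the visible elements into a site-agnostic text observation."""
    return "\n".join(f"[{e.element_id}] {e.role}: {e.label}" for e in elements)


def valid_actions(elements: List[UIElement]) -> List[Action]:
    """Build the action space from the UI elements available on the page."""
    actions = [Action("scroll")]
    for e in elements:
        actions.append(Action("click", e.element_id))
        if e.role == "textbox":
            actions.append(Action("type", e.element_id))
    return actions


def toy_policy(goal: str, observation: str, actions: List[Action]) -> Action:
    """Stand-in for the model: type the goal into the first textbox, else scroll.

    The observation is unused here; a real model would condition on it.
    """
    for a in actions:
        if a.kind == "type":
            return Action("type", a.element_id, goal)
    return Action("scroll")
```

Usage would be something like building a list of `UIElement`s from the page, passing `render_viewport(...)` and `valid_actions(...)` to the policy, and having the extension execute whatever `Action` comes back.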

This is a Transformer model trained to use a web browser.

Neat!

Now we just need an action transformer that can train a better action transformer that can train a better action transformer […] that can train the perfect AI. :laughing:
