MCPHub LabRegistrybytedance/UI-TARS-desktop
bytedance

bytedance/UI TARS desktop

Built by bytedance โ€ข 29,130 stars

What is bytedance/UI TARS desktop?

A high-performance MCP server implementation for bytedance/UI-TARS-desktop that bridges the gap between the Model Context Protocol and external services. It allows Large Language Models to interact with your data and tools with low latency and native support.

How to use bytedance/UI TARS desktop?

1. Ensure you have an MCP-compatible client (like Claude Desktop) installed. 2. Configure your server with: npx @modelcontextprotocol/bytedance/UI-TARS-desktop 3. Restart your client and verify the new tools are available in the system catalog.
๐Ÿ›ก๏ธ Scoped (Restricted)
npx @modelcontextprotocol/bytedance-ui-tars-desktop --scope restricted
๐Ÿ”“ Unrestricted Access
npx @modelcontextprotocol/bytedance-ui-tars-desktop

Key Features

Native MCP Protocol Support
High-performance Data Streaming
Type-safe Tool Definitions
Seamless Environment Integration

Optimized Use Cases

Connecting external databases to AI workflows
Automating workflow tasks via natural language
Augmenting developer productivity with specialized tools

bytedance/UI TARS desktop FAQ

Q

What is an MCP server?

MCP is an open standard that enables developers to build secure, two-way integrations between AI models and local or remote data sources.

Q

Is this server ready for production?

Yes, this server follows standard MCP security patterns and tool-restricted access.

Official Documentation

View on GitHub
<picture> <img alt="Agent TARS Banner" src="./images/tars.png"> </picture> <br/>

Introduction

English | ็ฎ€ไฝ“ไธญๆ–‡

<b>TARS<sup>*</sup></b> is a Multimodal AI Agent stack, currently shipping two projects: Agent TARS and UI-TARS-desktop:

<table> <thead> <tr> <th width="50%" align="center"><a href="#agent-tars">Agent TARS</a></th> <th width="50%" align="center"><a href="#ui-tars-desktop">UI-TARS-desktop</a></th> </tr> </thead> <tbody> <tr> <td align="center"> <video src="https://github.com/user-attachments/assets/c9489936-afdc-4d12-adda-d4b90d2a869d" width="50%"></video> </td> <td align="center"> <video src="https://github.com/user-attachments/assets/e0914ce9-ad33-494b-bdec-0c25c1b01a27" width="50%"></video> </td> </tr> <tr> <td align="left"> <b>Agent TARS</b> is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product. <br> <br> It primarily ships with a <a href="https://agent-tars.com/guide/basic/cli.html" target="_blank">CLI</a> and <a href="https://agent-tars.com/guide/basic/web-ui.html" target="_blank">Web UI</a> for usage. It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world <a href="https://agent-tars.com/guide/basic/mcp.html" target="_blank">MCP</a> tools. </td> <td align="left"> <b>UI-TARS Desktop</b> is a desktop application that provides a native GUI Agent based on the <a href="https://github.com/bytedance/UI-TARS" target="_blank">UI-TARS</a> model. <br> <br> It primarily ships a <a href="https://github.com/bytedance/UI-TARS-desktop/blob/main/docs/quick-start.md#get-model-and-run-local-operator" target="_blank">local</a> and <a href="https://github.com/bytedance/UI-TARS-desktop/blob/main/docs/quick-start.md#run-remote-operator" target="_blank">remote</a> computer as well as browser operators. </td> </tr> </tbody> </table>

Table of Contents

<!-- START doctoc generated TOC please keep comment here to allow auto update --> <!-- DON'T EDIT THIS SECTION, INSTEAD RE-RUN doctoc TO UPDATE --> <!-- END doctoc generated TOC please keep comment here to allow auto update -->

News

  • [2025-11-05] ๐ŸŽ‰ We're excited to announce the release of Agent TARS CLI v0.3.0! This version brings streaming support for multiple tools (shell commands, multi-file structured display), runtime settings with timing statistics for tool calls and deep thinking, Event Stream Viewer for data flow tracking and debugging. Additionally, it features exclusive support for AIO agent Sandbox as isolated all-in-one tools execution environment.
  • [2025-06-25] We released an Agent TARS Beta and Agent TARS CLI - Introducing Agent TARS Beta, a multimodal AI agent that aims to explore a work form that is closer to human-like task completion through rich multimodal capabilities (such as GUI Agent, Vision) and seamless integration with various real-world tools.
  • [2025-06-12] - ๐ŸŽ We are thrilled to announce the release of UI-TARS Desktop v0.2.0! This update introduces two powerful new features: Remote Computer Operator and Remote Browser Operatorโ€”both completely free. No configuration required: simply click to remotely control any computer or browser, and experience a new level of convenience and intelligence.
  • [2025-04-17] - ๐ŸŽ‰ We're thrilled to announce the release of new UI-TARS Desktop application v0.1.0, featuring a redesigned Agent UI. The application enhances the computer using experience, introduces new browser operation features, and supports the advanced UI-TARS-1.5 model for improved performance and precise control.
  • [2025-02-20] - ๐Ÿ“ฆ Introduced UI TARS SDK, is a powerful cross-platform toolkit for building GUI automation agents.
  • [2025-01-23] - ๐Ÿš€ We updated the Cloud Deployment section in the ไธญๆ–‡็‰ˆ: GUIๆจกๅž‹้ƒจ็ฝฒๆ•™็จ‹ with new information related to the ModelScope platform. You can now use the ModelScope platform for deployment.
<br>

Agent TARS

<p> <a href="https://npmjs.com/package/@agent-tars/cli?activeTab=readme"><img src="https://img.shields.io/npm/v/@agent-tars/cli?style=for-the-badge&colorA=1a1a2e&colorB=3B82F6&logo=npm&logoColor=white" alt="npm version" /></a> <a href="https://npmcharts.com/compare/@agent-tars/cli?minimal=true"><img src="https://img.shields.io/npm/dm/@agent-tars/cli.svg?style=for-the-badge&colorA=1a1a2e&colorB=0EA5E9&logo=npm&logoColor=white" alt="downloads" /></a> <a href="https://nodejs.org/en/about/previous-releases"><img src="https://img.shields.io/node/v/@agent-tars/cli.svg?style=for-the-badge&colorA=1a1a2e&colorB=06B6D4&logo=node.js&logoColor=white" alt="node version"></a> <a href="https://discord.gg/HnKcSBgTVx"><img src="https://img.shields.io/badge/Discord-Join%20Community-5865F2?style=for-the-badge&logo=discord&logoColor=white" alt="Discord Community" /></a> <a href="https://twitter.com/agent_tars"><img src="https://img.shields.io/badge/Twitter-Follow%20%40agent__tars-1DA1F2?style=for-the-badge&logo=twitter&logoColor=white" alt="Official Twitter" /></a> <a href="https://applink.larkoffice.com/client/chat/chatter/add_by_link?link_token=deen76f4-ea3c-4964-93a3-78f126f39651"><img src="https://img.shields.io/badge/้ฃžไนฆ็พค-ๅŠ ๅ…ฅไบคๆต็พค-00D4AA?style=for-the-badge&logo=lark&logoColor=white" alt="้ฃžไนฆไบคๆต็พค" /></a> <a href="https://deepwiki.com/bytedance/UI-TARS-desktop"><img src="https://img.shields.io/badge/DeepWiki-Ask%20AI-8B5CF6?style=for-the-badge&logo=gitbook&logoColor=white" alt="Ask DeepWiki" /></a> </p>

<b>Agent TARS</b> is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product. <br> <br> It primarily ships with a <a href="https://agent-tars.com/guide/basic/cli.html" target="_blank">CLI</a> and <a href="https://agent-tars.com/guide/basic/web-ui.html" target="_blank">Web UI</a> for usage. It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world <a href="https://agent-tars.com/guide/basic/mcp.html" target="_blank">MCP</a> tools.

Showcase

Please help me book the earliest flight from San Jose to New York on September 1st and the last return flight on September 6th on Priceline

https://github.com/user-attachments/assets/772b0eef-aef7-4ab9-8cb0-9611820539d8

<br> <table> <thead> <tr> <th width="50%" align="center">Booking Hotel</th> <th width="50%" align="center">Generate Chart with extra MCP Servers</th> </tr> </thead> <tbody> <tr> <td align="center"> <video src="https://github.com/user-attachments/assets/c9489936-afdc-4d12-adda-d4b90d2a869d" width="50%"></video> </td> <td align="center"> <video src="https://github.com/user-attachments/assets/a9fd72d0-01bb-4233-aa27-ca95194bbce9" width="50%"></video> </td> </tr> <tr> <td align="left"> <b>Instruction:</b> <i>I am in Los Angeles from September 1st to September 6th, with a budget of $5,000. Please help me book a Ritz-Carlton hotel closest to the airport on booking.com and compile a transportation guide for me</i> </td> <td align="left"> <b>Instruction:</b> <i>Draw me a chart of Hangzhou's weather for one month</i> </td> </tr> </tbody> </table>

For more use cases, please check out #842.

Core Features

  • ๐Ÿ–ฑ๏ธ One-Click Out-of-the-box CLI - Supports both headful Web UI and headless server execution.
  • ๐ŸŒ Hybrid Browser Agent - Control browsers using GUI Agent, DOM, or a hybrid strategy.
  • ๐Ÿ”„ Event Stream - Protocol-driven Event Stream drives Context Engineering and Agent UI.
  • ๐Ÿงฐ MCP Integration - The kernel is built on MCP and also supports mounting MCP Servers to connect to real-world tools.

Quick Start

<img alt="Agent TARS CLI" src="https://agent-tars.com/agent-tars-cli.png">
# Launch with `npx`.
npx @agent-tars/cli@latest

# Install globally, required Node.js >= 22
npm install @agent-tars/cli@latest -g

# Run with your preferred model provider
agent-tars --provider volcengine --model doubao-1-5-thinking-vision-pro-250428 --apiKey your-api-key
agent-tars --provider anthropic --model claude-3-7-sonnet-latest --apiKey your-api-key

Visit the comprehensive Quick Start guide for detailed setup instructions.

Documentation

๐ŸŒŸ Explore Agent TARS Universe ๐ŸŒŸ

<table> <thead> <tr> <th width="20%" align="center">Category</th> <th width="30%" align="center">Resource Link</th> <th width="50%" align="left">Description</th> </tr> </thead> <tbody> <tr> <td align="center">๐Ÿ  <strong>Central Hub</strong></td> <td align="center"> <a href="https://agent-tars.com"> <img src="https://img.shields.io/badge/Visit-Website-4F46E5?style=for-the-badge&logo=globe&logoColor=white" alt="Website" /> </a> </td> <td align="left">Your gateway to Agent TARS ecosystem</td> </tr> <tr> <td align="center">๐Ÿ“š <strong>Quick Start</strong></td> <td align="center"> <a href="https://agent-tars.com/guide/get-started/quick-start.html"> <img src="https://img.shields.io/badge/Get-Started-06B6D4?style=for-the-badge&logo=rocket&logoColor=white" alt="Quick Start" /> </a> </td> <td align="left">Zero to hero in 5 minutes</td> </tr> <tr> <td align="center">๐Ÿš€ <strong>What's New</strong></td> <td align="center"> <a href="https://agent-tars.com/beta"> <img src="https://img.shields.io/badge/Read-Blog-F59E0B?style=for-the-badge&logo=rss&logoColor=white" alt="Blog" /> </a> </td> <td align="left">Discover cutting-edge features & vision</td> </tr> <tr> <td align="center">๐Ÿ› ๏ธ <strong>Developer Zone</strong></td> <td align="center"> <a href="https://agent-tars.com/guide/get-started/introduction.html"> <img src="https://img.shields.io/badge/View-Docs-10B981?style=for-the-badge&logo=gitbook&logoColor=white" alt="Docs" /> </a> </td> <td align="left">Master every command & features</td> </tr> <tr> <td align="center">๐ŸŽฏ <strong>Showcase</strong></td> <td align="center"> <a href="https://github.com/bytedance/UI-TARS-desktop/issues/842"> <img src="https://img.shields.io/badge/View-Examples-8B5CF6?style=for-the-badge&logo=github&logoColor=white" alt="Examples" /> </a> </td> <td align="left">View use cases built by the official and community</td> </tr> <tr> <td align="center">๐Ÿ”ง <strong>Reference</strong></td> <td align="center"> <a href="https://agent-tars.com/api/"> <img src="https://img.shields.io/badge/API-Reference-EF4444?style=for-the-badge&logo=book&logoColor=white" alt="API" /> </a> </td> <td align="left">Complete technical reference</td> </tr> </tbody> </table> <br/> <br/> <br/>

UI-TARS Desktop

<p align="center"> <img alt="UI-TARS" width="260" src="./apps/ui-tars/resources/icon.png"> </p>

UI-TARS Desktop is a native GUI agent for your local computer, driven by UI-TARS and Seed-1.5-VL/1.6 series models.

<div align="center"> <p> &nbsp&nbsp ๐Ÿ“‘ <a href="https://arxiv.org/abs/2501.12326">Paper</a> &nbsp&nbsp | ๐Ÿค— <a href="https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B">Hugging Face Models</a>&nbsp&nbsp | &nbsp&nbsp๐Ÿซจ <a href="https://discord.gg/pTXwYVjfcs">Discord</a>&nbsp&nbsp | &nbsp&nbsp๐Ÿค– <a href="https://www.modelscope.cn/collections/UI-TARS-bccb56fa1ef640">ModelScope</a>&nbsp&nbsp <br> ๐Ÿ–ฅ๏ธ Desktop Application &nbsp&nbsp | &nbsp&nbsp ๐Ÿ‘“ <a href="https://github.com/web-infra-dev/midscene">Midscene (use in browser)</a> &nbsp&nbsp </p> </div>

Showcase

<!-- // FIXME: Choose only two demo, one local computer and one remote computer showcase. -->
InstructionLocal OperatorRemote Operator
Please help me open the autosave feature of VS Code and delay AutoSave operations for 500 milliseconds in the VS Code setting.<video src="https://github.com/user-attachments/assets/e0914ce9-ad33-494b-bdec-0c25c1b01a27" height="300" /><video src="https://github.com/user-attachments/assets/01e49b69-7070-46c8-b3e3-2aaaaec71800" height="300" />
Could you help me check the latest open issue of the UI-TARS-Desktop project on GitHub?<video src="https://github.com/user-attachments/assets/3d159f54-d24a-4268-96c0-e149607e9199" height="300" /><video src="https://github.com/user-attachments/assets/072fb72d-7394-4bfa-95f5-4736e29f7e58" height="300" />

Features

  • ๐Ÿค– Natural language control powered by Vision-Language Model
  • ๐Ÿ–ฅ๏ธ Screenshot and visual recognition support
  • ๐ŸŽฏ Precise mouse and keyboard control
  • ๐Ÿ’ป Cross-platform support (Windows/MacOS/Browser)
  • ๐Ÿ”„ Real-time feedback and status display
  • ๐Ÿ” Private and secure - fully local processing

Quick Start

See Quick Start

Contributing

See CONTRIBUTING.md.

License

This project is licensed under the Apache License 2.0.

Citation

If you find our paper and code useful in your research, please consider giving a star :star: and citation :pencil:

@article{qin2025ui,
  title={UI-TARS: Pioneering Automated GUI Interaction with Native Agents},
  author={Qin, Yujia and Ye, Yining and Fang, Junjie and Wang, Haoming and Liang, Shihao and Tian, Shizuo and Zhang, Junda and Li, Jiahao and Li, Yunxin and Huang, Shijue and others},
  journal={arXiv preprint arXiv:2501.12326},
  year={2025}
}

Global Ranking

8.5
Trust ScoreMCPHub Index

Based on codebase health & activity.

Manual Config

{ "mcpServers": { "bytedance-ui-tars-desktop": { "command": "npx", "args": ["bytedance-ui-tars-desktop"] } } }