MCPHub LabRegistryalibaba/page-agent
alibaba

alibaba/page agent

Built by alibaba β€’ 14,237 stars

What is alibaba/page agent?

JavaScript in-page GUI agent. Control web interfaces with natural language.

How to use alibaba/page agent?

1. Install a compatible MCP client (like Claude Desktop). 2. Open your configuration settings. 3. Add alibaba/page agent using the following command: npx @modelcontextprotocol/alibaba-page-agent 4. Restart the client and verify the new tools are active.
πŸ›‘οΈ Scoped (Restricted)
npx @modelcontextprotocol/alibaba-page-agent --scope restricted
πŸ”“ Unrestricted Access
npx @modelcontextprotocol/alibaba-page-agent

Key Features

Native MCP Protocol Support
Real-time Tool Activation & Execution
Verified High-performance Implementation
Secure Resource & Context Handling

Optimized Use Cases

Extending AI models with custom local capabilities
Automating system workflows via natural language
Connecting external data sources to LLM context windows

alibaba/page agent FAQ

Q

Is alibaba/page agent safe?

Yes, alibaba/page agent follows the standardized Model Context Protocol security patterns and only executes tools with explicit user-granted permissions.

Q

Is alibaba/page agent up to date?

alibaba/page agent is currently active in the registry with 14,237 stars on GitHub, indicating its reliability and community support.

Q

Are there any limits for alibaba/page agent?

Usage limits depend on the specific implementation of the MCP server and your system resources. Refer to the official documentation below for technical details.

Official Documentation

View on GitHub

Page Agent

<picture> <source media="(prefers-color-scheme: dark)" srcset="https://img.alicdn.com/imgextra/i4/O1CN01qKig1P1FnhpFKNdi6_!!6000000000532-2-tps-1280-256.png"> <img alt="Page Agent Banner" src="https://img.alicdn.com/imgextra/i1/O1CN01NCMKXj1Gn4tkFTsxf_!!6000000000666-2-tps-1280-256.png"> </picture>

License: MIT TypeScript Bundle Size Downloads GitHub stars

The GUI Agent Living in Your Webpage. Control web interfaces with natural language.

🌐 English | δΈ­ζ–‡

<a href="https://alibaba.github.io/page-agent/" target="_blank"><b>πŸš€ Demo</b></a> | <a href="https://alibaba.github.io/page-agent/docs/introduction/overview" target="_blank"><b>πŸ“– Docs</b></a> | <a href="https://news.ycombinator.com/item?id=47264138" target="_blank"><b>πŸ“’ HN Discussion</b></a> | <a href="https://x.com/simonluvramen" target="_blank"><b>𝕏 Follow on X</b></a>

<video id="demo-video" src="https://github.com/user-attachments/assets/a1f2eae2-13fb-4aae-98cf-a3fc1620a6c2" controls crossorigin muted></video>


✨ Features

  • 🎯 Easy integration
    • No need for browser extension / python / headless browser.
    • Just in-page javascript. Everything happens in your web page.
  • πŸ“– Text-based DOM manipulation
    • No screenshots. No multi-modal LLMs or special permissions needed.
  • 🧠 Bring your own LLMs
  • πŸ™ Optional chrome extension for multi-page tasks.

πŸ’‘ Use Cases

  • SaaS AI Copilot β€” Ship an AI copilot in your product in lines of code. No backend rewrite.
  • Smart Form Filling β€” Turn 20-click workflows into one sentence. Perfect for ERP, CRM, and admin systems.
  • Accessibility β€” Make any web app accessible through natural language. Voice commands, screen readers, zero barrier.
  • Multi-page Agent β€” Extend your own web agent's reach across browser tabs chrome extension.
  • MCP - Allow your agent clients to control your browser.

πŸš€ Quick Start

One-line integration

Fastest way to try PageAgent with our free Demo LLM:

<script src="{URL}" crossorigin="true"></script>

⚠️ For technical evaluation only. This demo CDN uses our free testing LLM API. By using it, you agree to its terms.

MirrorsURL
Globalhttps://cdn.jsdelivr.net/npm/page-agent@1.7.0/dist/iife/page-agent.demo.js
Chinahttps://registry.npmmirror.com/page-agent/1.7.0/files/dist/iife/page-agent.demo.js

NPM Installation

npm install page-agent
import { PageAgent } from 'page-agent'

const agent = new PageAgent({
    model: 'qwen3.5-plus',
    baseURL: 'https://dashscope.aliyuncs.com/compatible-mode/v1',
    apiKey: 'YOUR_API_KEY',
    language: 'en-US',
})

await agent.execute('Click the login button')

For more programmatic usage, see πŸ“– Documentations.

🀝 Contributing

We welcome contributions from the community! See CONTRIBUTING.md for guidelines and docs/developer-guide.md for local development workflows.

Please read the maintainer's note on principles and current state.

Contributions generated entirely by bots or AI without substantial human involvement will not be accepted.

βš–οΈ License

MIT License

πŸ‘ Acknowledgments

This project builds upon the excellent work of browser-use.

PageAgent is designed for client-side web enhancement, not server-side automation.

DOM processing components and prompt are derived from browser-use:

Browser Use <https://github.com/browser-use/browser-use>
Copyright (c) 2024 Gregor Zunic
Licensed under the MIT License

We gratefully acknowledge the browser-use project and its contributors for their
excellent work on web automation and DOM interaction patterns that helped make
this project possible.

🌟 Awesome Page Agent

Built something cool with PageAgent? Add it here! Open a PR to share your project.

These are community projects β€” not maintained or endorsed by us. Use at your own discretion.

ProjectDescription
Yours?Open a PR πŸ™Œ

⭐ Star this repo if you find PageAgent helpful!

Global Ranking

8.5
Trust ScoreMCPHub Index

Based on codebase health & activity.

Manual Config

{ "mcpServers": { "alibaba-page-agent": { "command": "npx", "args": ["alibaba-page-agent"] } } }