Skip to content

OpenAdapt.AI

AI-First Process Automation for the Desktop Era

OpenAdapt.AI is an open-source Generative Process Automation platform that transforms desktop automation through machine learning. Record human demonstrations, train vision-language models, and deploy agents that adapt to any software environment.

🎯 Core Capabilities

  • Learn by Demonstration: Automatically learns from user interactions to generate automation scripts
  • Adaptive Intelligence: Adapts to software changes and complex environments using multimodal models
  • Privacy-First: Built-in data protection with PII/PHI detection and redaction
  • Cross-Platform: Works with browsers and desktop applications across macOS and Windows
  • Multi-Model Support: Integrates with OpenAI, Anthropic, Google, Ollama, and vLLM
  • Open Source: Community-driven development with MIT licensing

🏗️ Ecosystem

Core Platform

Repository Description Status
OpenAdapt Main platform orchestrating GUI automation with LMMs. Meta-package integrating all ecosystem components 🟢 Active

Modular Components

Data Capture & Processing

Repository Description Status
openadapt-capture Production-ready event streams with time-aligned media for GUI interaction recording 🟢 Active
openadapt-viewer HTML viewer components for visualization dashboards and benchmark results 🟢 Active
openadapt-web Web interface components for browser-based interaction 🟢 Active

Machine Learning & Intelligence

Repository Description Status
openadapt-ml ML training toolkit for multimodal GUI-action models 🟢 Active
openadapt-grounding UI element detection and localization with OmniParser integration 🟢 Active
openadapt-retrieval Multimodal demo retrieval system for similarity search in GUI automation 🟢 Active

Evaluation & Quality

Repository Description Status
openadapt-evals Evaluation infrastructure and benchmarks for GUI agent performance testing 🟢 Active

Privacy & Security

Repository Description Status
openadapt-privacy PII/PHI detection and redaction for GUI automation data with Presidio integration 🟢 Active

Execution & Runtime

Repository Description Status
openadapt-agent Production execution engine with safety gates, audit logging, and human-in-the-loop confirmation 🟢 Active

Observability

Repository Description Status
openadapt-telemetry Error tracking and usage analytics with privacy filtering for platform monitoring 🟢 Active

Integration & Deployment Tools

Repository Description Status
OpenAdapter Effortless deployment platform for screenshot parsing and action models on AWS EC2 🟢 Active
OmniMCP Model Context Protocol integration with OmniParser for AI UI interaction (68 stars) 🟢 Active
OmniMCP.web Web interface for OmniMCP server 🟢 Active

🤝 Community Contributions

OpenAdapt has contributed deployment infrastructure to major open-source projects:

Microsoft OmniParser

  • PR #52: Add Dockerfile and client.py; deploy to EC2 on AWS via Github Actions
  • Status: Open (most commented PR in the repository)
  • Contribution: Production deployment infrastructure with Docker, client library, and AWS automated deployment
  • Impact: Enables production deployment of OmniParser for screen parsing tasks

Microsoft Set-of-Mark (SoM)

  • PR #19: Add Dockerfile and client.py; deploy to EC2 on AWS via Github Actions
  • Status: Merged (first PR to the repository)
  • Contribution: deploy.py, Gradio interface, client library, and AWS deployment automation
  • Impact: Provided the foundation for SoM's production deployment capabilities

Research Forks & Foundations

Repository Description Status
OpenCUA Fork of Computer-Use Agents framework that cites OpenAdapt as foundational 🟡 Fork
OmniParser Fork of Microsoft's OmniParser with deployment enhancements 🟡 Fork
SoM Fork of Set-of-Mark visual prompting with deployment infrastructure 🟡 Fork

Development Tools & Libraries

Repository Description
PydanticPrompt Library for documenting Pydantic models to generate structured LLM outputs

Critical Dependencies

Repository Description
atomacos macOS automation library (fork) - critical dependency for macOS support
pynput Input control library (fork) - critical dependency for cross-platform input handling

Research & Experimentation

Repository Description
OmniParser Fork of Microsoft's OmniParser for screen parsing with OpenAdapt-specific modifications
SoM Set-of-Mark visual prompting technique for precise UI element targeting

Configuration & Infrastructure

Repository Description
.github Organization-wide GitHub configuration, templates, and this profile README

🚀 Quick Start

# Install core platform
pip install openadapt

# Or use uv for faster installation
uv pip install openadapt

# Record a demonstration
openadapt record

# Train a model
openadapt train

# Evaluate performance
openadapt eval

📚 Documentation

🤝 Contributing

We welcome contributions across all repositories:

  • Frontend Development: openadapt-web, openadapt-viewer
  • Machine Learning: openadapt-ml, openadapt-grounding, openadapt-retrieval
  • Data Capture: openadapt-capture
  • Privacy & Security: openadapt-privacy
  • Agent Runtime: openadapt-agent
  • Evaluation: openadapt-evals
  • Documentation: All repositories

See individual repository CONTRIBUTING.md files for specific guidelines.

📊 Project Status

OpenAdapt.AI is currently in alpha. While functional, some features are still under development. We're actively working towards production-ready releases across the ecosystem.

💼 Enterprise & Consulting

Professional implementation services and organizational consulting are available. Contact info@openadapt.ai for details.

💖 Sponsorship

Support OpenAdapt.AI development through GitHub Sponsors. Your contributions help maintain and expand this open-source ecosystem.

📄 License

All OpenAdapt.AI repositories are licensed under the MIT License unless otherwise specified. See individual repository LICENSE files for details.

Pinned Loading

  1. OpenAdapt OpenAdapt Public

    Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

    Python 1.5k 217

  2. openadapt-retrieval openadapt-retrieval Public

    Multimodal demo retrieval for GUI automation

    Python

  3. openadapt-evals openadapt-evals Public

    Evaluation infrastructure for GUI agent benchmarks

    Python

  4. openadapt-grounding openadapt-grounding Public

    Temporal smoothing for UI element detection with OmniParser integration

    Python

  5. openadapt-capture openadapt-capture Public

    GUI interaction capture -- production-ready event streams with time-aligned media

    Python

  6. openadapt-privacy openadapt-privacy Public

    PII/PHI detection and redaction for GUI automation data (text, images, dicts)

    Python 2

Repositories

Showing 10 of 46 repositories
  • openadapt-evals Public

    Evaluation infrastructure for GUI agent benchmarks

    OpenAdaptAI/openadapt-evals’s past year of commit activity
    Python 0 MIT 0 2 0 Updated Jan 19, 2026
  • openadapt-ml Public

    OpenAdapt’s open-source ML toolkit for training and evaluating general multimodal GUI-action models.

    OpenAdaptAI/openadapt-ml’s past year of commit activity
    Python 2 MIT 0 1 0 Updated Jan 19, 2026
  • openadapt-capture Public

    GUI interaction capture -- production-ready event streams with time-aligned media

    OpenAdaptAI/openadapt-capture’s past year of commit activity
    Python 0 0 0 0 Updated Jan 19, 2026
  • openadapt-viewer Public

    HTML viewer components for ML dashboards and benchmarks

    OpenAdaptAI/openadapt-viewer’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Jan 19, 2026
  • openadapt-web Public
    OpenAdaptAI/openadapt-web’s past year of commit activity
    JavaScript 7 12 5 1 Updated Jan 18, 2026
  • openadapt-bootstrap Public

    Self-hosting infrastructure for OpenAdapt recursive development

    OpenAdaptAI/openadapt-bootstrap’s past year of commit activity
    Python 0 MIT 0 5 (1 issue needs help) 0 Updated Jan 18, 2026
  • .github Public
    OpenAdaptAI/.github’s past year of commit activity
    0 0 0 0 Updated Jan 18, 2026
  • openadapt-tray Public

    System tray application for OpenAdapt

    OpenAdaptAI/openadapt-tray’s past year of commit activity
    Python 0 0 1 (1 issue needs help) 0 Updated Jan 17, 2026
  • OpenAdapt Public

    Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

    OpenAdaptAI/OpenAdapt’s past year of commit activity
    Python 1,471 MIT 217 0 0 Updated Jan 17, 2026
  • openadapt-retrieval Public

    Multimodal demo retrieval for GUI automation

    OpenAdaptAI/openadapt-retrieval’s past year of commit activity
    Python 0 MIT 0 0 2 Updated Jan 17, 2026

Sponsors

  • @damoahdominic

Top languages

Loading…