Personal Assistant Agent
Research and Code Agency
An AI-driven multi-agent setup built on Agency Swarm. A manager orchestrates specialists for web research, Python execution, and interactive desktop control via E2B’s desktop sandbox to complete end-to-end tasks with clear delegation and status updates.
Agents and strengths
- ManagerAgent: Intake, planning, and delegation. Confirms goals, constraints, priorities, and risk tolerance; assigns work to the right specialist and keeps updates concise.
- WebSearchAgent: Live web research with hosted search. Pulls current facts, citations, competitor intel, pricing snapshots, news, and quick summaries with source callouts.
- CodeAgent: Runs Python in an IPython interpreter. Handles data cleaning, CSV transforms, feature engineering, analytics, validation checks, and small utilities.
- ComputerUseAgent: Operates an E2B desktop sandbox (browser and GUI). Navigates sites, clicks, types, scrolls, launches apps, and captures screenshots or stream URLs for visual feedback.
How they collaborate
- Intake → Plan: ManagerAgent confirms objectives, constraints, deadlines, and approval points.
- Research → Synthesis: WebSearchAgent gathers current evidence; ManagerAgent reviews and aligns with goals.
- Compute → Validate: CodeAgent processes data, checks quality, and summarizes results for decision points.
- Act on UI: ComputerUseAgent performs on-screen steps (form fills, uploads, confirmations) with progress notes and optional streams/screenshots.
- Loop and verify: ManagerAgent requests clarifications, re-runs steps if needed, and confirms completion criteria.
Example use cases
- Research→Code→Action pipelines: collect live info, compute metrics, then submit forms or updates in a browser.
- Competitive or market sweeps: gather sources, extract pricing/features, compute deltas, and export CSVs with highlights.
- Data wrangling and QA: clean/reshape CSVs, compute aggregates, validate distributions, and flag anomalies before handoff.
- Ops assistance: fill web forms, upload documents, capture proof via screenshots/streams, and confirm results.
- Content preparation: collect references, draft concise outputs, and place text into web editors when asked.
- Light automation with human-like steps: navigate portals, click through flows, and report any blockers or authentication prompts.
Interaction norms
- ManagerAgent is the entry point: it clarifies intent, sets plan, and routes tasks to specialists.
- WebSearchAgent cites sources and asks for clarifications on ambiguous queries.
- CodeAgent keeps snippets short, reports outputs succinctly, and explains errors with fixes.
- ComputerUseAgent narrates intended actions, avoids risky downloads, and surfaces stream URLs or screenshots when helpful.
- The system requests confirmation before high-impact actions (sending data, irreversible changes).
Inputs and outputs
- Inputs: natural language requests plus optional constraints (time ranges, targets, formats), and environment variables for keys.
- Outputs: concise summaries, citations for research, CSVs or computed metrics from CodeAgent, and screenshots/stream URLs from ComputerUseAgent runs.
Environment and credentials
- Required:
OPENAI_API_KEY. - For desktop control:
E2B_API_KEY. - Provide additional context (targets, sample data, URLs) to reduce clarification cycles.
How to run
- Set environment variables (at minimum
OPENAI_API_KEY; addE2B_API_KEYfor desktop control). - Install dependencies:
pip install -r requirements.txt. - Run locally:
python agency.py(terminal demo) orpython main.py(FastAPI entry).
Other
GitHub Status
Inactive
Date Posted
01 Jan 2026
Last Update
28 Feb 2026
Github Stars
3
License
MIT
GitHub Repo