AITW Logo

Agents in the Wild

We track and analyze the activity and performance of autonomous code agents in the wild (on GitHub).

Logicstar Logo

📊 Overview

We have been tracking all pull requests created since 2025-05-15. Our pull request database is updated hourly, capturing all newly created and closed pull requests since the last update. The table below provides an initial overview of the agents we track and their activity. Detailed breakdowns by specific PR attributes are presented in the following charts. More information on how we scrape, identify, and analyze this activity can be found at the bottom of the page or on our GitHub.

AgentPRs OpenedPRs ClosedPRs MergedMerge Rate
HumanHuman
77444476885121553097080.33 %
BotBot
40183253138791245988678.37 %
OpenAI CodexOpenAI Codex
68543063498359540193.77 %
Google JulesGoogle Jules
83424778187220192.78 %
Github CopilotGithub Copilot
31930247861816373.28 %
Cursor AgentCursor Agent
13249110701006090.88 %
DevinDevin
89258231608573.93 %
ℹ️ We calculate Merge Rate = Merged / Closed. Other trackers use Merged / Total, but we believe open PRs shouldn't count—they are either still under review or forgotten, neither of which indicates success or failure.
ℹ️ Different merge rates are heavily influenced by the level of human approval involved in the loop. For example, GitHub Copilot opens draft PRs without any prior approval, whereas OpenAI Codex only creates a PR after receiving human approval. Therefore, one should be very cautious when using merge rates as an indicator of agent quality.

📈 Daily Trends

Number of PRs
Number of PRs created on each day.
💡 Agents (primarily OpenAI Codex) contribute a significant volume, currently accounting for around 5–10% of public PRs. Merge rates vary significantly between agents, but this shouldn't be interpreted as a direct measure of agent quality, as the level of human input differs considerably.

⭐ Repo Popularity

We measure repo popularity by the number of stars the base repository of the PR has.

Portion of PRs
The percentage of PRs in each bucket.
💡 OpenAI Codex, which generates the most volume, seems to be primarily used on low-star (≤ 10 stars) repositories. This isn't generally true for all agents. Agent merge rates tend to drop significantly for more popular repositories, whereas human PRs tend to have more stable merge rates.

📐 Change Complexity

As a proxy for change complexity, we use the number of additions and deletions made in the PR. Each addition or deletion refers to one line.

Portion of PRs
The percentage of PRs in each bucket.

📂 Files Changed

Portion of PRs
The percentage of PRs in each bucket.
💡 Merge rates tend to decline slightly as more files are modified. Agents are not afraid of editing multiple files.

± Additions/Deletions

Here we analyze the ratio between additions and deletions. We define AD = additions / (additions + deletions). A high AD suggests a lot of new code, while an AD around 0.5 indicates refactoring. An AD close to zero means a lot of code was deleted and very little was added.

Portion of PRs
The percentage of PRs in each bucket.
💡 Agents tend to add more code than they delete (potentially adding features), while humans are more likely to refactor. Merge rates remain stable across cleanup, refactoring, and additions.

🔣 Repository Language

Here we analyze the primary language of the base repository. GitHub assigns each repository one primary language that makes up most of the code in the repository.

Portion of PRs
The percentage of PRs in each bucket.

🔣 Change Language

Here we analyze the primary language of the change. Since GitHub does not provide this, we determine the primary language by the file extensions of changed files. We associate each file extension with a language and determine the most changed language. Certain filenames (e.g., package-lock.json, *.lock) are excluded.

Portion of PRs
The percentage of PRs in each bucket.

📦 Data

We track all opened and closed GitHub pull requests through the GitHub API, and analyze each pull request to determine if it was authored by an autonomous code agent.

To classify each pull request, we use the following rules:


✨ Ideas?

Whether you have ideas for new autonomous agents, additional insights, or anything else, please create an issue on our GitHub repository or email insights@logicstar.ai.


🌎 Open Source

The source code and documentation explaining how we scrape data, identify agents, and more can be found on our GitHub repository. Like the project? Give it a star!