GitHub Agentic PR Dataset
~1.96M GitHub pull requests from AI coding agents (Claude Code, Cursor, Copilot, Devin) and humans, joined to 6.7M commits and 55M file-level diffs. Open data (CC-BY-4.0) now used for agentic-AI and mining software repositories (MSR) research.
The research has a build side. These are the data products, tools, and applications I’ve shipped — each tracked with its real state, the way you’d read a pull request.
A stdlib-only Python job, run by a daily GitHub Action, that pulls my latest Medium articles and YouTube videos into versioned JSON and auto-commits — so the site stays current with zero manual upkeep.
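A minimal sketch of what the core of such a job could look like, using only the standard library. It is an illustration, not the actual script: the function names, the `FEED_URL` placeholder, and the JSON field names are assumptions, and it handles RSS 2.0 `<item>` elements only (Atom feeds, which use namespaced `<entry>` elements, would need a separate parser).

```python
import json
import urllib.request
import xml.etree.ElementTree as ET
from pathlib import Path

# Hypothetical placeholder; substitute a real RSS 2.0 feed URL.
FEED_URL = "https://example.com/feed"


def fetch(url: str) -> str:
    """Download a feed and return its body as text."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return resp.read().decode("utf-8")


def parse_rss(xml_text: str, limit: int = 10) -> list[dict]:
    """Extract title, link, and publication date from RSS 2.0 <item> elements."""
    root = ET.fromstring(xml_text)
    records = []
    for item in root.iter("item"):
        records.append({
            "title": item.findtext("title", default="").strip(),
            "link": item.findtext("link", default="").strip(),
            "published": item.findtext("pubDate", default="").strip(),
        })
    return records[:limit]


def write_if_changed(path: Path, records: list[dict]) -> bool:
    """Write records as stable JSON; return True only if the file content changed."""
    payload = json.dumps(records, indent=2, ensure_ascii=False, sort_keys=True) + "\n"
    if path.exists() and path.read_text(encoding="utf-8") == payload:
        return False  # nothing new, so a scheduled run has nothing to commit
    path.write_text(payload, encoding="utf-8")
    return True
```

A scheduled workflow could call `write_if_changed(Path("articles.json"), parse_rss(fetch(FEED_URL)))` and commit only when it returns `True`. Serializing with sorted keys and fixed indentation keeps the diffs small, so the commit history records only real content changes.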
Hands-on Android Studio builds that wire AI coding agents and on-device AI into real app workflows — published as walkthroughs for students and developers.
As Head of the Computer Center at UCAS Gaza, I directed the migration of a legacy Oracle system to a modern web platform serving 500+ users, covering infrastructure, IT policy, and rollout.
More to come: presentations, talks, and notes are next. In the meantime, the dataset and publications go deeper.