Generative AI and Changing Inputs

In previous posts I've written about Natural Language Programming, and Intent/Realization Toolchains as two grounding concepts in AI productivity tools, particularly AI for Code. In this post I want to explore another angle that comes up in features in AI productivity tools, particularly those that generate documentation, synthesize specifications, discover build rules or other "extractive" … Continue reading Generative AI and Changing Inputs

Towards Semi-automatic Agentic Performance Engineering

(This blog post is written in personal tone, but relates to our work at GitHub Next and may be moved to https://githubnext.com in future. A huge thank you to Peli de Halleux, Joe Zhou, Eddie Aftandilian, Russell Horton, Idan Gazit and many others at GitHub Next, and the GitHub platform leadership of Mario Rodriguez. I'm … Continue reading Towards Semi-automatic Agentic Performance Engineering

What Kind of Programming is Natural Language Programming?

In previous posts I've written about Natural Language Programming, Dijkstra's Ghost - the End of The Symbolic Supremacy and Ephemeral Editable Specifications (aka Extract, Edit, Apply). These touched on the topics of Natural Language Programming and the role of Specifications in AI-native programming. Today I'd like to step back and address an underlying question: what … Continue reading What Kind of Programming is Natural Language Programming?

On Continuous AI for Test Improvement

Ever since we started working on "task-oriented programming" (aka vibe coding) in 2023, our group at GitHub Next have been throwing around ideas related to "continuous" tasks in software repositories: Continuous Code Cleanup, or Continuous Documentation and so on. This finally bubbled up as the Continuous AI project, locating it within the tradition of Continuous … Continue reading On Continuous AI for Test Improvement

GitHub Agentic Workflows

I'm excited to share our latest research demonstrator from GitHub Next - "GitHub Agentic Workflows - Natural Language Programming for GitHub Actions" .https://githubnext.com/projects/agentic-workflows/Agentic Workflows focuses on expressing repository‑level behaviors in natural language and running them on GitHub. Agentic Workflows is not a product and not even a technical preview; it's a vehicle for exploring the agentic design space, … Continue reading GitHub Agentic Workflows

Extract, Edit, Apply – a design pattern for AI

Sharing a write-up of one of our investigations at GitHub Next: Extract, Edit, Apply. Spec-oriented programming is usually seen as "Spec-first", with a compilation step to turn specs into code: Specs are permanent, and Code is ephemeral. This has many obvious problems, including: The instability of LLM code-generation under otherwise small or unimportant changes to … Continue reading Extract, Edit, Apply – a design pattern for AI

Augmenting GPT-4 with Calculational Code

GPT-4 and other LLMs (Large Language Models) are driving a tidal wave of innovation in applied AI. However used without augmentation they have very limited calculational capabilities and make mistakes calculating with numbers. In this project, we describe a simple, general technique to address this, apply it to some widely reported real-world failures of GPT-4-based … Continue reading Augmenting GPT-4 with Calculational Code