🎤 Speaking at KubeCon EU 2026 Lessons Learned Orchestrating Multi-Tenant GPUs on OpenShift AI View Session
Luca Berton
Developer Experience

GitHub Copilot Business is great — until Claude Opus 4.5 turns your month into a request-shaped bill

Luca Berton
#copilot#github#billing#quotas#vibe-coding#claude#opus-4-5#gpt-5-1-codex#ai-foundry#developer-experience

I wanted the cleanest possible “vibe coding” setup: Copilot in the IDE, premium model on tap, minimal friction.

I got exactly that… right up until Copilot showed me this:

Premium requests: 418%

No outage. No broken pipeline. No failing tests. Just a meter I wasn’t watching, quietly converting my “premium vibes” into a line item.

This is my write-up, anchored on the numbers in my screenshots, plus the alternatives I’d pick if I wanted the same results for roughly the same budget.


1) What actually happened (numbers, not vibes)

I’m on GitHub Copilot Business, 1 seat, and GitHub shows $19/month for the seat.

But my month wasn’t just the seat.

GitHub Seats

From Copilot’s Premium request analytics (Dec 1–31, 2025), I can see:

So I used about:

Then in the billing overview, GitHub shows:

Translation: this month, I didn’t just buy “Copilot.” I bought Copilot + a lot of premium overage.

GitHub Metered


2) Why this happened: Opus 4.5 is a multiplier machine

The part I underestimated is that Copilot premium usage isn’t “tokens.” It’s premium requests, and models have multipliers.

In GitHub’s model table:

So every time I used Opus 4.5, I was basically spending 3 premium requests per prompt.

That’s also why my “usage grouped by models” chart is basically the Opus 4.5 line doing cardio while everything else barely moves.

GitHub Analytics


3) What “418%” really means for my workflow

GitHub Copilot Business

If I simplify the math:

On Business, once I burn through the included allowance, every extra premium request costs money. Business overage is $0.04 per premium request.

So post-allowance, an Opus 4.5 interaction effectively costs:

It’s not “expensive” per prompt… but it’s perfectly designed to become expensive when I’m in the flow.


4) The part I actually care about: alternatives for the same budget

My net Copilot spend in the billing view is about $55.41 this month.

So I asked myself: What else could I get for ~$55/month if the goal is premium coding output?

Here are the options I’d seriously consider.


Alternative A: Stay on Copilot Business, switch my default to GPT-5.1-Codex-Max (or GPT-5.1-Codex)

Both GPT-5.1-Codex and GPT-5.1-Codex-Max are premium models in Copilot.

If I replay my month’s behavior but change only the multiplier:

Same IDE convenience. Same flow. A fraction of the “418%” panic.

This is probably the single highest ROI change I could make.


Alternative B: Use cheap premium models for most prompts (0.33×), then Opus only when it matters

If I still want “premium” but I don’t want to finance Opus as my daily driver:

At 0.33×, my 300 included requests behave like ~900 prompts of runway.

My ideal split would be:

That keeps my brain in flow and my bill out of the red zone.


Alternative C: Use included (0×) models for routine work, and treat premium models like a power tool

Copilot also offers “included” models on paid plans that don’t consume premium requests (0×).

So I can do:

This is “cost-aware by default” without switching products.


Alternative D: Upgrade to Copilot Enterprise if I’m going to keep using Opus a lot

Copilot Enterprise is $39/seat/month and includes 1000 premium requests.

If I repeat my month (~1254 requests):

That’s actually lower than my ~$55 month on Business, and it buys me headroom.

If I’m not willing to change my prompting habits, Enterprise is the cleanest “don’t change my workflow” answer.


5) The AI Foundry option: token billing instead of request billing

What I really like about AI Foundry / API usage is that it’s token-shaped rather than request-shaped.

For OpenAI’s gpt-5.1-codex and gpt-5.1-codex-max, the published pricing is:

For a rough feel, assume a typical coding interaction is:

That’s about $0.0125 per interaction.

With a ~$55 budget:

The point isn’t the exact number—it’s that Codex on tokens can be shockingly cost-efficient if I keep prompts reasonably sized.

And the flip side is also true: if I routinely paste massive files/logs, token bills climb fast. Token billing is predictable, but it makes my “context habits” visible.


6) What I’m actually going to do next month

If I want premium results without another “418%” moment, my plan is:

  1. Switch my default Copilot premium model to GPT-5.1-Codex-Max (1×)
  2. Use included 0× models for casual prompts
  3. Reserve Opus 4.5 for the rare “I need brilliance” prompts
  4. If I still blow past the allowance, I’ll stop pretending and move to Copilot Enterprise
  5. If I want maximum predictability, I’ll route my “big brain” prompts through AI Foundry / API and keep Copilot for IDE ergonomics

That’s the real lesson for me: Copilot is the best IDE UX, but Opus 4.5 is a premium tool—if I use it like a default, the meter will punish me.

If I stay intentional about when I use 3× models, I get the same quality and keep my budget boring.

← Back to Blog