I wanted the cleanest possible “vibe coding” setup: Copilot in the IDE, premium model on tap, minimal friction.
I got exactly that… right up until Copilot showed me this:
Premium requests: 418%
No outage. No broken pipeline. No failing tests. Just a meter I wasn’t watching, quietly converting my “premium vibes” into a line item.
This is my write-up, anchored on the numbers in my screenshots, plus the alternatives I’d pick if I wanted the same results for roughly the same budget.
I’m on GitHub Copilot Business, 1 seat, and GitHub shows $19/month for the seat.
But my month wasn’t just the seat.

From Copilot’s Premium request analytics (Dec 1–31, 2025), I can see:
So I used about:
Then in the billing overview, GitHub shows:
Translation: this month, I didn’t just buy “Copilot.” I bought Copilot + a lot of premium overage.

The part I underestimated is that Copilot premium usage isn’t “tokens.” It’s premium requests, and models have multipliers.
In GitHub’s model table:
So every time I used Opus 4.5, I was basically spending 3 premium requests per prompt.
That’s also why my “usage grouped by models” chart is basically the Opus 4.5 line doing cardio while everything else barely moves.


If I simplify the math:
I used ~1254 premium requests total.
If those were mostly Opus 4.5 prompts (3×), that’s roughly:
On Business, once I burn through the included allowance, every extra premium request costs money. Business overage is $0.04 per premium request.
So post-allowance, an Opus 4.5 interaction effectively costs:
It’s not “expensive” per prompt… but it’s perfectly designed to become expensive when I’m in the flow.
My net Copilot spend in the billing view is about $55.41 this month.
So I asked myself: What else could I get for ~$55/month if the goal is premium coding output?
Here are the options I’d seriously consider.
Both GPT-5.1-Codex and GPT-5.1-Codex-Max are 1× premium models in Copilot.
If I replay my month’s behavior but change only the multiplier:
Same IDE convenience. Same flow. A fraction of the “418%” panic.
This is probably the single highest ROI change I could make.
If I still want “premium” but I don’t want to finance Opus as my daily driver:
At 0.33×, my 300 included requests behave like ~900 prompts of runway.
My ideal split would be:
That keeps my brain in flow and my bill out of the red zone.
Copilot also offers “included” models on paid plans that don’t consume premium requests (0×).
So I can do:
This is “cost-aware by default” without switching products.
Copilot Enterprise is $39/seat/month and includes 1000 premium requests.
If I repeat my month (~1254 requests):
That’s actually lower than my ~$55 month on Business, and it buys me headroom.
If I’m not willing to change my prompting habits, Enterprise is the cleanest “don’t change my workflow” answer.
What I really like about AI Foundry / API usage is that it’s token-shaped rather than request-shaped.
For OpenAI’s gpt-5.1-codex and gpt-5.1-codex-max, the published pricing is:
For a rough feel, assume a typical coding interaction is:
That’s about $0.0125 per interaction.
With a ~$55 budget:
The point isn’t the exact number—it’s that Codex on tokens can be shockingly cost-efficient if I keep prompts reasonably sized.
And the flip side is also true: if I routinely paste massive files/logs, token bills climb fast. Token billing is predictable, but it makes my “context habits” visible.
If I want premium results without another “418%” moment, my plan is:
That’s the real lesson for me: Copilot is the best IDE UX, but Opus 4.5 is a premium tool—if I use it like a default, the meter will punish me.
If I stay intentional about when I use 3× models, I get the same quality and keep my budget boring.