RFC-117: clarification on Forge LLMs compute usage and pricing

Hi & Happy New Year 2026 everyone :tada:

I know RFC-117 is now closed, but I still had a question related to the Pricing section.

While looking into Forge LLMs, we noticed a very significant increase in Forge Functions compute usage (GB-seconds) when invoking LLMs. This seems logical given the current execution model, but it raised a question for us.

With the upcoming token-based pricing for Forge LLMs mentioned in RFC-117, should we expect this current Forge Functions compute impact to be absorbed by the dedicated LLM pricing, or will GB-seconds remain a significant part of the cost when using LLMs?

Thanks in advance for any clarification :slightly_smiling_face:

Sorry to ping you, @AdamMoore, but do you have any idea?

Hi @FabienPenchenat, I’ve raised this question internally to try to help expedite an answer for you.

Thank you very much for raising this internally @DanielleLarregui.
I really appreciate your help.

@FabienPenchenat This is the answer that I received from the Forge LLM team:

Compute usage (GB-seconds) is a separate cost element from LLM usage (which is token-based).
Since LLM calls take longer to complete, the running costs are therefore twofold:

  • compute usage (longer running times)

  • LLM usage itself

Thanks a lot for taking the time to raise this topic and for the very quick response.

I have to say I’m quite surprised by the potential impact on pricing, which could be both significant and hard to control, as it will strongly depend on how much the LLM-based features are actually used. In that context, I’m not sure this is the most effective approach to encourage vendors to adopt LLM usage.

In any case, thanks again for the clarifications and the transparency on this topic.

You’re welcome @FabienPenchenat. I’m not sure that this can be changed at this time, but I will surface this feedback to the Product Manager and the LLM team.

Hey Fabien,

I understand the concern, and agree that it will be important to keep usage in check, because there are two aspects to the cost you need to consider.

It’s the nature of Forge’s serverless architecture that we need to account for the cost of the underlying Lambda, which waits for the response, as well as the tokens used by the model.
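To make the two cost components concrete, here is a back-of-the-envelope sketch. The memory size, durations, and rates are illustrative assumptions only, not published Forge figures; the point is simply that compute is billed as memory times wall-clock time, so a function that sits waiting on a model still accrues GB-seconds.

```python
def gb_seconds(memory_mb: float, duration_s: float) -> float:
    """Compute usage billed as memory (GB) x wall-clock duration (s)."""
    return (memory_mb / 1024) * duration_s

# A typical non-LLM invocation: 256 MB running for 0.5 s (assumed values)
baseline = gb_seconds(256, 0.5)   # 0.125 GB-s

# The same function blocked ~20 s waiting on an LLM response (assumed)
llm_call = gb_seconds(256, 20.0)  # 5.0 GB-s

print(llm_call / baseline)  # the wait alone inflates compute usage 40x
```

On top of this compute usage, the token-based LLM charge applies separately.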

When Forge Containers are available, they will also support Forge LLMs, but whether that turns out to be more cost-effective will depend on a number of factors.

Did you have any other thoughts on how we could make things more efficient cost-wise?

Hi @AdamMoore

Thanks for the explanation and for taking the time to clarify.

To be fully transparent, I haven’t had the chance yet to dig into Forge Containers, so it’s hard for me at this stage to assess how interesting or cost-effective they might be in our case.

That said, in the current state, even when reducing memory to the minimum, a single LLM request significantly increases compute usage, by a factor of 15x to 27x compared to what our apps usually consume.

Initially, I expected LLM-related execution to be handled differently from classic function compute, to reflect the fact that on the Forge side there isn’t much actual processing happening during that time: the function is mostly waiting for the model response.

It might be worth considering a more LLM-specific architecture rather than the current synchronous model: for example, an asynchronous approach where the LLM request is triggered and the response is processed in the background, without keeping a function active for the entire waiting period. This would better reflect the nature of these workloads and help make costs more predictable and manageable for vendors.
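To illustrate why the asynchronous idea could matter for cost, here is a simple comparison of billed compute under the two models. All durations, the memory size, and the billing formula are assumptions for illustration only, not how Forge actually meters compute.

```python
MEMORY_GB = 0.25   # 256 MB function (assumed)
LLM_WAIT_S = 20.0  # time the model takes to respond (assumed)
DISPATCH_S = 0.2   # time to send the request (assumed)
HANDLE_S = 0.3     # time for a callback to process the response (assumed)

# Synchronous model: one invocation stays alive for the whole wait.
sync_gb_s = MEMORY_GB * (DISPATCH_S + LLM_WAIT_S + HANDLE_S)

# Asynchronous model: two short invocations (dispatch, then a callback);
# nothing is billed while the model is thinking.
async_gb_s = MEMORY_GB * DISPATCH_S + MEMORY_GB * HANDLE_S

print(f"sync:  {sync_gb_s:.3f} GB-s")
print(f"async: {async_gb_s:.3f} GB-s")
```

Under these assumed numbers, nearly all of the synchronous cost comes from the idle wait, which is exactly the part an asynchronous design would avoid billing for.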
