Saturday, May 16, 2026

If an LLM is too expensive it won't be next year

AI can cost more than human workers now

This is an Axios headline but the tldr is that once an LLM can do your job it will undercut you the next year.

This is the article and it focuses on token and infrastructure costs. Not cleaning up messes or other times a humna needs to be in the loop. For example "Uber's chief technology officer already blew through his full 2026 AI budget due to token costs" There is a 7 year and a 4 year trend of the price drop to 1/10th a year for a set quality of token. As in $1 million become $1 hundred cost in 4 years. This article looks at gpt 3.5 level tokens from late 2022 to late 2024


But the deflation keeps going Qwen3.5 1.7B–2B is well above gpt 3.5 level in tests and in usage. This you can run really fast locally on any newish laptop or smartphone. Tokens at this quality level are now just electricity costs. That's less than 4 years data but gpt 2 level has followed the same trend for 7 years. 7 more years of the trend turns $10 million cost into $1. Lindy Effect says a reasonable first guess is if a trend has gone on for X length of time assume it will keep going for another X unless theres a good reason for it to stop. Coding and Video Image generation are still improving so fast there is not great reason to assume they will suddenly plateau yet. And the same for efforts to make smaller models smarter using big models. Teacher->pupil, chain of thought, distillation, pruning, low rank factorisation all keep getting better in a way that suggests that smaller models will keep getting better even if big ones plateau.

This is not an argument that LLMs in general will keep getting better though they will and that will help. Or that GPUs will keep improving though they will and that will help. Or that open weight models will stay about 9 months behind state of the art apis though they will and that will help a lot. Or that methods to condense smart models into smaller less demanding machines will improve though they will. It is that all these and a few other trends combined will work together to make a million tokens with a certain score on metrics and in a users opinion an order of magnitude cheaper every year.


The cost of tokens will come down. And at such a rate that it makes Moore's Law look tame. It won't go on forever but it is on a seven year trend with good reason to see it keeping going for at least 3 more years. At which point todays best current token will be 1/1000th the cost.




No comments: