Actually if you use LLMs sized responsibility to the task it's cheaper than a lot of APIs for the final product.
The expensive LLMs are expensive, but the cheap ones are cheaper than other infrastructure in something like quick answer or quick assistant