Prompt caching vs the long LLM conversation: where your input bill actually hides

· Dev.to