Interesting bit about small vs large LLMs: sometimes large models are cheaper to operate because they figure out stuff w
Interesting bit about small vs large LLMs: sometimes large models are cheaper to operate because they figure out stuff with less iterations!
See this analysis of various models and the raw cost for Sonnet (more expensive to run) and Haiku.
Because Sonnet figures it out more quickly, the total cost is lower!
