Thank you! The Responses API is now implemented and it's coming in RubyLLM 2.0: https://github.com/crmne/ruby_llm/blob/main/lib/ruby_llm/protocols/responses.rb

I found Ruby LLM to be surprisingly good - in terms of usability it's close to Vercel's AI framework. It tries to strike a balance between working out of the box and being flexible, which has its challenges, but it's still nice overall. One big real-life pain I experienced is that caches don't always work, e.g. for xAI, since it only supports the completions API and thought signatures are returned wrong.

RubyLLM author here. I'm not sure where you got that.

`chat.with_temperature(0.2)`
https://rubyllm.com/chat/#controlling-response-behavior

`chat.with_thinking(effort: :high, budget: 8000)`
https://rubyllm.com/thinking/#controlling-extended-thinking

Max tokens is the only one on your list that requires provider-specific params:
https://rubyllm.com/chat/#provider-specific-parameters

I'm one guy doing it for free. Happy to see your contribution!
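
To make the distinction in the comment above concrete, here is a minimal, self-contained sketch of a chainable builder where one setting is provider-neutral and another is passed through as a raw provider key. This is not RubyLLM's code: `SketchChat` is a hypothetical class, `with_params` and the `max_tokens` key are assumptions chosen for illustration (the real key name depends on the provider), and only the `with_temperature` call style is taken from the comment.

```ruby
# Hypothetical sketch, NOT RubyLLM's implementation: a tiny chainable builder
# showing how a provider-neutral setting and raw provider params can coexist.
class SketchChat
  def initialize(model)
    @model = model
    @temperature = nil
    @params = {}
  end

  # Provider-neutral setting; returns self so calls can be chained.
  def with_temperature(value)
    @temperature = value
    self
  end

  # Raw, provider-specific keys (assumed name), merged into the request as-is.
  def with_params(**params)
    @params = @params.merge(params)
    self
  end

  # Build the request body that would be sent to the provider.
  def payload
    body = { model: @model }
    body[:temperature] = @temperature unless @temperature.nil?
    body.merge(@params)
  end
end

chat = SketchChat.new("example-model")
                 .with_temperature(0.2)
                 .with_params(max_tokens: 512) # key name varies by provider

p chat.payload
```

The design point is that the neutral setter hides the provider's field name, while the pass-through leaves it to the caller, which is the trade-off both commenters are describing.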

It is quite nice, but not as nice as you'd want. You still have to set platform specifics when running completions if you want to tune things like temperature, effort, max tokens, etc.