More

minimaxir · 2026-06-01T17:28:39 1780334919

I questioned OP's "there is an answer online" claim so I checked and the only source found for the original question was a 5th grade Russian school for mathematics.

https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...

minimaxir · 2026-06-01T15:32:55 1780327975

If your math does not involve multiplying 20 digit numbers, modern LLMs can "do" math even without a Python tool despite the counterintuition of next token prediction.

DiogenesKynikos · 2026-06-02T07:17:52 1780384672

And if you give your LLM access to a calculator, it will have to problem multiplying 20-digit numbers.

minimaxir · 2026-05-30T21:33:52 1780176832

Hugging Face is most definitely not bootstrapped.

minimaxir · 2026-05-30T20:29:22 1780172962

There's more going on here than just the "routing" part.

codemog · 2026-05-30T20:32:57 1780173177

That's why I gave it a month and not an afternoon.

minimaxir · 2026-05-30T18:42:42 1780166562

Credit is prepaid at a 5% surcharge.

minimaxir · 2026-05-30T18:31:09 1780165869

OpenRouter has stealth models that are indicated as "by OpenRouter" but indicate an external provider.

https://openrouter.ai/openrouter/owl-alpha

minimaxir · 2026-05-30T17:48:39 1780163319

As someone who uses OpenRouter extensively (and wrote an unintentional adjacent PR piece a few days ago: https://news.ycombinator.com/item?id=48317294 ), it's definitely the best way to try out new models without fiddling with each providers distinct APIs which is becoming a recurring concern as of late.

That said, I don't understand the people who use something a full agentic backbone with expensive models like Claude Opus with OpenRouter because that 5% surcharge is meaningful at that level of cost instead of going with the source API providers. But people are clearly doing it, and it's pure revenue.

01100011 · 2026-05-30T22:23:29 1780179809

IDK, but that sounds like something that would be better implemented with an open-source library to which providers supply support patches. Why do I need a company to act as a proxy and not just run a relatively simple shim layer on my machine?

I'm just a stupid systems programmer working in the bowels of AI and I understand there is a lot of seemingly pointless software which exists solely to provide a slight boost to convenience in exchange for money. Is OpenRouter just that? Do they actually host models themselves or centralize billing amongst various providers?

542458 · 2026-05-30T23:16:05 1780182965

A library with a bunch of different providers doesn’t solve the payment/billing problem (which is one of the main openrouter benefits). IMO being able to buy credits and not have them locked to one provider is worth the 5% to me.

bwfan123 · 2026-05-30T17:57:49 1780163869

There is a lot of dumb token spend right now - tokenmaxing and such. Economic cost of token is not being evaluated carefully because there is fomo and no one wants to be left behind. But folks are waking up to it, and dumb token spending is not sustainable and will revert.

furyofantares · 2026-05-30T17:58:11 1780163891

Better uptime? Given it will be routed to one of Anthropic, Amazon Bedrock, Claude Platform on AWS, Google Vertex (Europe) or Google Vertex

yencabulator · 2026-05-31T16:09:42 1780243782

That seems also very possible to do in a client library. (Though that would benefit from an across-my-nodes gossip of backend statuses.)

Oli_dev · 2026-05-30T23:12:23 1780182743

im happy to pay 5% extra for consolidated billing and usage limits. It just makes it easier

nadermx · 2026-05-30T17:53:50 1780163630

Convenience has a markup

enraged_camel · 2026-05-30T18:28:23 1780165703

>> it's definitely the best way to try out new models without fiddling with each providers distinct APIs which is becoming a recurring concern as of late

Why not... Cursor?

827a · 2026-05-30T19:01:28 1780167688

Cursor only supports a single model (Kimi K2.5) not made by the Big 4 labs (OpenAI, Anthropic, Google, xAI). Cursor is actually extremely bad at wide model support.

OpenCode is much better at it.

anon7000 · 2026-05-30T19:12:50 1780168370

And its own model (Composer 2.5)

ascorbic · 2026-05-30T19:45:23 1780170323

Which is finetuned Kimi 2.5

827a · 2026-05-31T03:30:21 1780198221

Cursor is functionally "one of the big 4 labs" (SpaceX).

senordevnyc · 2026-05-31T11:07:31 1780225651

First, calling xAI one of the big labs is pretty funny.

Second, Cursor hasn’t been acquired by SpaceX yet, and there’s a good chance they never will be.

largbae · 2026-05-30T19:07:30 1780168050

I use Cursor with OpenRouter for some projects and it's great. Most of the time I just use Auto and let Cursor use its model or choose. If I run out of quota, or I'm not getting what I want, I switch off Auto and use OpenRouter to pick Opus, Codex, or whoever(all are available). Can continue the same context if you want, type "please continue" in the agent prompt, and on you go.

zenoprax · 2026-05-30T19:11:09 1780168269

Cursor has limits even when using your own key. I was even cut off using a local model. I guess they use some sort of harness that requires non-local resources? I'm not sure I've actually tried to use Cursor in a fully-offline scenario yet. Cline works well enough and doesn't require any sign-up.

ajyoon · 2026-05-30T18:49:13 1780166953

Cursor's coverage on open weight models is very minimal, and it's irrelevant for testing models in your actual application.

minimaxir · 2026-05-29T22:54:54 1780095294

The post goes into that issue. Throughly.

The numbers at the beginning of the post are weekly aggregate values well after the endpoint was paid-only.

segmondy · 2026-05-30T02:57:32 1780109852

The post is wrong, it's still free, see - https://openrouter.ai/tencent/hy3-preview:free it's free in kilo.ai https://kilo.ai/models/tencent-hy3-preview-free It's free in a lot of places.

minimaxir · 2026-05-30T17:24:24 1780161864

The first endpoint was closed. If you actually try and call it from the API you get this response:

> Hy3 preview is no longer available as a free model. It has transitioned to a paid model. Continue using it here: https://openrouter.ai/tencent/hy3-preview

The Kilo Code may have free traffic but if you check the numbers is still inconsequential relative to the trillions of tokens through OpenRouter.

minimaxir · 2026-05-29T22:52:21 1780095141

One idea I had was to count # of distinct API keys that have spent atleast $100 (number's flexible), which would be enough to provide guidance on if the traffic is from a single power-user.

In the Cursor case which is BYOK, that would count as distinct API keys.

minimaxir · 2026-05-28T17:07:16 1779988036

Deception is not ideal for agentic coding.

1attice · 2026-05-28T17:24:13 1779989053

Yet if parent is right, the capacity to deceive might be a strong heuristic for the things you do care about.