Tally Is Open

HTTPS://tallyy.org

Apr 18, 2026

I’m getting tired of watching people talk about AI like it’s a person.

Like it remembers.
Like it “knows” them.
Like there’s continuity behind the interface.

There isn’t.

A large language model is stateless.

Every call is a fresh invocation.
No memory.
No continuity.
No accumulated understanding unless you explicitly pass it in.

That “conversation” you think you’re having is just context reconstruction… over and over again.

So Let’s Be Honest About What That Means

If every call is independent…

Then every call is a decision.

And one of the most important decisions is:

Which model should handle this specific piece of work?

Treating model selection as a one-time architectural choice is just wrong.

Different calls have:

different complexity
different structure
different tolerance for failure
different cost sensitivity

So wiring one model into everything…

isn’t a strategy.

But We Haven’t Solved This

Let’s be clear about that.

We don’t think we’ve nailed this.

What we’ve done is introduce a way to think about the problem.

Right now, that looks like question shapes.

Trying to describe a task in a structured way:

what kind of work is this
how complex is it
what kind of output is expected

And using that to make a decision about which model should handle it.

Maybe that’s right.

Maybe it’s incomplete.

Maybe it’s wrong in ways we haven’t seen yet.

That’s exactly why Tally exists.

This Needs To Be Figured Out Together

You can’t solve this in isolation.

Not across real systems.

Not across real workloads.

Because the moment this touches production:

assumptions break
edge cases show up
costs behave differently than expected

The only way to get this right is through shared usage and feedback over time.

What Tally Does

Tally gives you a way to make model selection a per-call decision…

and learn from it.

You:

describe the task
ask for a routing recommendation
run the call the way you normally would
report back what actually happened

Success. Failure. Cost.

Tally learns from that.

That’s the loop.

No proxy.
No rewrite of your system.
No pretending this is magic.

Just a better way to make a decision… and improve it.

What We Want From You

We’re opening this up because we need signal.

Not interest.
Not compliments.

Signal.

run real workloads
describe tasks honestly
send back real outcomes
push it until it breaks

Watch:

where it saves you money
where it makes bad calls
how quickly it adapts

Then tell us.

How To Get Involved

https://tallyy.org

Install the SDK.

Run it on something real.

Then join the Discord and tell us where it’s wrong… where it’s useful… and where it needs to go next.

One Last Thing

Yes… it’s tallyy.org.

Not tally.

We know.

If this works, maybe we earn the second “y” back… or maybe we can finally afford to buy the one we actually wanted.

Tally is a business.

But if we’re honest… it’s also a tool we built because we needed it.

Now we find out if anyone else does.

Less Wrong Me

Discussion about this post

Ready for more?