Built it myself. The filing structure is consistent enough to parse programmatically, and an LLM handles the unstructured parts (with a lot of trial and error).
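A minimal sketch of what such a hybrid pipeline could look like. The section regex, the figures-only heuristic, and the `llm` callback are all assumptions for illustration, not the commenter's actual code:

```python
import re

# Split on filing-style headings like "Item 1. Business" (assumed format).
ITEM_RE = re.compile(r"^Item\s+(\d+[A-Z]?)\.\s+(.*)$", re.MULTILINE)

def split_items(filing_text):
    """Split a filing into (item_number, heading, body) tuples."""
    matches = list(ITEM_RE.finditer(filing_text))
    items = []
    for i, m in enumerate(matches):
        start = m.end()
        end = matches[i + 1].start() if i + 1 < len(matches) else len(filing_text)
        items.append((m.group(1), m.group(2).strip(), filing_text[start:end].strip()))
    return items

def extract(filing_text, llm=None):
    """Parse structured items deterministically; route free text to an LLM."""
    result = {}
    for number, heading, body in split_items(filing_text):
        if re.fullmatch(r"[\d,.$%\s()-]+", body):
            # Looks like plain figures: handle without the LLM.
            result[number] = {"heading": heading, "value": body}
        else:
            # Unstructured prose: hand off to the (stubbed) LLM call.
            summary = llm(body) if llm else body[:80]
            result[number] = {"heading": heading, "summary": summary}
    return result

filing = """Item 1. Business
We make widgets and sell them through a network of resellers.
Item 8. Financial Data
1,234,567
"""
print(extract(filing))
```

In practice `llm` would wrap an API call with a prompt tuned per section, which is where most of the trial and error lives.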
>If instead of a copy-pasting spree, or setting up a whateverClaw, the user might just click a button in Chrome, it could be actually useful. (Consider a dozen such buttons.)
Isn't this basically just putting a decision tree on top of the LLMs?
We have our doubts about this. Can you share your code or product?
Anecdotally, my mistakes and gaps in understanding compound exponentially the more I try to parallelize.
As I said in the neighboring comment, for vibe coding side projects and prototypes for work I just merge and iterate. It works out more than it doesn’t. For anything bigger at work I cannot share as I’m at Apple.
But you have to keep it all in your head and remember everything at once. How is it possible to track them and do reviews one after another? Or are these pretty long-running agents?
I’m not sure what you mean by keep it in your head? I know all of the parts the agents are working on. It’ll often be a mix between bigger tasks (some large refactor, new feature, etc) and small tasks (little bug fixes).
For prototyping I just merge; I don't bother to review the code. For anything more important, I review the code and go back and forth. Basically there's a queue of stuff demanding my attention, and I just go through it serially.
What’s also been really helpful to me is /simplify and similar code-review skills (I have my own). That alone has an agent spend a while parsing through everything it’s done and reviewing its own work. It catches quite a lot itself this way.
>I’m not sure what you mean by keep it in your head?
If the project I'm working on is large enough, it takes me some time to load everything I need to understand for a review into short-term memory. If it's small enough, it's less of a problem for me.
If you've ever been in a meeting with multiple L8's arguing over features, you should be able to estimate how much each hour of that meeting is costing the org.
How do you justify the compute investment for something like Nemotron? Especially if all the labs are willing to pay for those same GPU clusters for inference or training runs?
> One can envision a world in which OpenAI pays chefs money to cook while ChatGPT watches—narrating their thought process, tasting the dishes, and describing the results. This information could be used for general-purpose training, but it might also be packaged as a “book”, “course”, or “partner” someone could ask for.
So we're speed-running the idea of AI Facebook friends and creating a new para(ai)social relationship.
Because it's a separate context window, it makes the model bigger, and that space is not accessible to the "user".
And the "language understanding" basically had to be done twice, because it's a separate input to the transformer, so you can't just toss a pile of text in there and say "figure it out".
So we are currently in the era of one giant context window.