Posts tagged: llm

Post by @thdxr.com — Bluesky bsky.app

dax (@thdxr.com) so obviously n=1 but anecdotes matter a lot here i gave kimi k3 and sol the same task - simple issue with hovers in the tui being the wrong color sol found and fixed the issue with $0.30 of spend… Bluesky Social · bsky.app [1] So much nuance in model selection its hard to keep up. References: [1]: https://bsky.app/profile/thdxr.com/post/3mqsgohylgd2v

Dumb people lay off - YouTube www.youtube.com

- oof Mark, this does not feel like it is set to age well. These people in power feel so disconnected from regular people with a job trying to do work.

Programming Still Sucks. — Writing www.stvn.sh

Programming Still Sucks. — Writing Sorry Peter. — I'm at a birthday party, and while most people here also work in tech, there's always a Guy with a Real Job. You know, a physical job, building some or other thing people need. And... stvn.sh [1] Absolute banger of a post, this is the time we are living in. Explain “are you afraid AI is going to take your job” to a non tech blue collar worker. Broken over promises, greed, and projects mismanaged by leadership who has no idea what the day to day work actually does and how critical it is. I’m not quite in Sara’s position, but I feel something shielded by half of this working deep inside of a non tech part of a non tech company leading a very small rag tag team with get shit done attitude. But I feel it, I see colleagues hit by these blasts.b I get clipped with shrapnel from some of the largest blasts. But nothing as significant as I see many others hit with References: [1]: https://www.stvn.sh/writing/programming-still-sucks-fqffhyp

Ping 54

I'm regressing back to boomer ai for more plan mode style prompting at home... It does a decent job at ingesting a repo and coming up with plans before I start spending precious tokens.

Agent, Prove Yourself

🌱 This post is still growing b249c794-9411-42c0-be01-07922c3e98da.mp4 [1] a scroll through of https://github.com/WaylonWalker/markata-go/pull/1021 References: [1]: https://dropper.wayl.one/file/b249c794-9411-42c0-be01-07922c3e98da.mp4

"Am I Crazy?" [Wading Through AI - Episode 3] www.youtube.com

- Casey had an interesting point here. I think demitri came back with some sense of sanity that its just not how corporations look at employee cost, but I still thought it was a head scratcher. Roughly translated not quoted If the sellers of ai are telling you that your developers are going to be 10x productive, why are they only spending half their salary in tokens? Why not 9x?

A love letter to Pi | Lucas Meijer www.youtube.com

- I hate how he called out terminal user interfaces as shit… then proved web interfaces to be superior. Damn him. I love working from my terminal, but having ai prove itself through html [1] reports including video, image, metrics, charts, and text is goated. Rethinking yourself has the bottleneck not the orchestrator feels real. Validating the work is hard, theres a shift right now and everyone is trying to figure it out. Lucas’s technique is a little bit of be lazy and tell it to prove itself to you, so as you juggle your 15 agents you have a nice report to read. References: [1]: /html/

How Claude Code’s Creator Starts EVERY Project - YouTube www.youtube.com

- This is a really good guide, with quite a few good nuggets. I need to try deleting my AGENTS.md and rebuilding it from scratch more often. I liked how he talked about having agents prove their work and tell them up front how they will be judged. What I didn’t care for so much was the feeling that a lot of the rules go in markdown, thats not a rule, thats a suggestion. Rules should be deterministic. They should be tests and linters that ensure they are followed. Suggestions are good, but dont trust the agents to always follow them. And don’t trust that they wont change your rules, keep them honest.

Write It First, Then Let AI Drive - Kenneth Reitz kennethreitz.org

Write It First, Then Let AI Drive There's a thing that happens when you start using AI coding tools seriously. You assume the best workflow is obvious: let AI generate the first draft, then... Kenneth Reitz · kennethreitz.org [1] Interesting take by Kenneth Reitz. Not quite sure how I feel about it anymore. It kinda hurts, but I’m not sure if code aesthetics matter as much as the product anymore. I cared when I was the one editing, but at this point I’m not doing a lot of edits by hand. Do these aesthetics affect the final products that users use, Not sure. AI makes me sad. References: [1]: https://kennethreitz.org/essays/2026-04-12-write_it_first_then_let_ai_drive

I am slowly coming around to AI assisted programming. x.com

ThePrimeagen (@ThePrimeagen) on X I am slowly coming around to AI assisted programming. I am genuinely trying to codify every rule about programming that I have and using that + several stages to build out small changes. Not s… X (formerly Twitter) · x.com [1] If agents make prime a bit faster, what does that mean for the rest of us mortals? References: [1]: https://x.com/ThePrimeagen/status/2043861800819761382

AIs aren’t good rule followers x.com

Uncle Bob Martin (@unclebobmartin) on X @ThePrimeagen AIs aren’t good rule followers. The older the rule in the context window, the less priority it is given. So the best way to enforce the rules is with external tools that communicate... X (formerly Twitter) · x.com [1] I’ve gotta agree with bob on this one, the first thing I did to my biggest brownfield project I wanted to use agents on BEFORE they did work was a hardened pre-commit.yaml, ci, hardened type checking and linting. SECOND get rid of bad inconsistent patterns, let them replicate consistency, force them to pass checks. Agents will follow all of your markdown suggestions most of the time, enough for you to become complacent if you let it. They are goal seeking, if you put them to a task you thought was possible that is not given your constraints, they will try to find a way given enough tokens. I dont see this ever changing, its one thing that makes them great, it just needs to be kept in check. References: [1]: https://x.com/unclebobmartin/status/2044065822067282396

A quote from Steve Yegge simonwillison.net

Steve Yegge Steve Yegge: I was chatting with my buddy at Google, who's been a tech director there for about 20 years, about their AI adoption. Craziest convo I've had all year. … Simon Willison’s Weblog · simonwillison.net [1] behind, yet positioned to completely dominate this race by hitting it with some sense. Making trends in what looks like longevity in the race that is not subsidising to simply get users, but to get by until they figure out how to 100x reduce the cost to a reasonable level. They feel like the guy sitting in the back with nothing big or flashy to say that is going to drop the hammer on their competition that overstretched itself taking on too much debt because it was necessary to change the game. There might be something to having a mix of hipsters, boomers, and luddites all trying to balance each other out. References: [1]: https://simonwillison.net/2026/Apr/13/steve-yegge/#atom-everything

An ai model created by Anthropic was announced as a closed preview on April 7, 2026 for critical security research and evaluation with its close partners with critical software such as operating systems and browsers. Anthropic claims that mythos is able to reason through so much more context that any model ever before. This enables it to find bugs that are 25 years old in the BSD, considered one of the most secure operating systems we have. Once it finds these zero day bugs never discovered before its able to use them together in malicious ways never expected. In ways the world is not ready for. At the time of writing these are claims without proof. It remains scary to know the potential this has and that there is only a few companies with this potential that will gatekeep who gets access.

How does Claude Code *actually* work? - YouTube www.youtube.com

- 5 star video, if you are going to watch one video to understand how harnesses and agents work, this is it. This really had my gears spinning on what tools do for agents and how big of a difference they make in their ability to manage context efficiently and accurately create changes. It’s crazy how good bash works, and that gives the agents the ability to do just about everything, but it could be better.

Agents Are Here

🌱 This post is still growing Late last year I started writing I'm Out On Agents [1]. Agents sucked, the models were good, but there was still something missing between the harnesses and the models. They could write good code, they could do some debugging and exploring, but they were too good at fucking up the whole project to be useful. They could crank out Green Field POC’s like nobody’s business, but they created so much mess in brown field projects that it was easier to chat and edit yourself. f91a8893-b1ba-422a-9390-18de5034483c.mp4 [2] The Beautiful Glitch - Gemini The Inflection Point # [3] It’s very well agreed on that the inflection point for most people happened with Anthropic Opus 4.5 in late Nov 2025. Early adopters probably noticed right away and shouted from the rooftops how good it was. But we’ve all heard that developers have 6 months before ai writes all the code for years, so this felt like the rest of the noise. Hitting the December slowdown many of us hit cod...

An AI state of the union: We’ve passed the inflection point & ... www.youtube.com

- A really interesting long form interview with @simonwillison [1]. If you follow him closely most of it is probably not new, but I found some interesting nuggets. Simon is writing most of his code from his phone these days using anthropic hosted platform. He mentioned that a lot of security risks go away when you don’t put secrets on the platform and you let them take the risk of running ai written code with ai chosen supply chain. He talked about the Pelican Riding a Bike benchmark for quite awhile. He was surprised at how well of a proxy it is for how capable a model is at just about everything. He also said that when he runs the benchmark he also runs half a dozen others that he’s never talked about so that He could see if they were to train a model specific to his benchmark he could catch them, but it seems they had caught on and if they were they seem that they would already be doing it on all of his others anyways. TDD is incredibly boring for humans, it strips so much creativity and joy from the process. Who cares if agents are bored they do better when doing TDD. References: [1]: https://simonwillison.net

@seldo.com on Bluesky bsky.app

Laurie Voss (@seldo.com) Project Glasswing is a glimpse at an oncoming future in which agents do things humans could never have accomplished and the results are handled by other agents faster than humans could react and we... Bluesky Social · bsky.app [1] Is Glasswing the next inflection point [2] References: [1]: https://bsky.app/profile/seldo.com/post/3miybjol76p2r [2]: https://dropper.waylonwalker.com/file/00bc13be-32bd-4410-b0c4-2ecc0f2f6b95.webp

What Happens When AI Stops Being Artificially Cheap | Daniel M... danielmiessler.com

What Happens When AI Stops Being Artificially Cheap The subsidy era is ending. Here danielmiessler.com [1] I’ve been thinking about this for awhile and Daniel makes some great arguments here. Interestingly keeping inference cheap removes the incentives to make our tools better, help us choose the right model, lean on local models, open weight models. The frontier models are so affordable through subsidized subscription models why would you deal with anything less intelligent at this point. The tooling we use is not optimized for it, and why should it be. References: [1]: https://danielmiessler.com/blog/ai-stops-being-artificially-cheap

no one read the source x.com

ThePrimeagen (@ThePrimeagen) on X don't forget last time Anthropic, in their infinite PhD level wisdom, leaked their own source code (Feb 25) they DMCA'd all repos that had their code. Careful storing the code because Anthropic w… X (formerly Twitter) · x.com [1] Everyone look away, nothing to see here. [2] References: [1]: https://x.com/ThePrimeagen/status/2038978962089492631 [2]: https://dropper.waylonwalker.com/file/090f03b2-e6f5-4ede-a814-bfbb4e237b54.webp

`j`	Scroll down
`k`	Scroll up
`g` `g`	Scroll to top
`Shift` `G`	Scroll to bottom
`d`	Half-page down
`u`	Half-page up

`j` / `↓`	Next post (in feeds)
`k` / `↑`	Previous post (in feeds)
`Enter` / `o`	Open highlighted post
`Shift` `O`	Open in new tab
`g` `h`	Go to home
`g` `s`	Focus search
`[`	Previous page
`]`	Next page
`b`	Toggle left sidebar
`Shift` `B`	Toggle right sidebar
`s`	Toggle simple/rich feed view

`/`	Focus search input
`⌘CtrlK`	Focus search (alternative)
`y` `y`	Copy URL to clipboard
`?`	Show this help
`Esc`	Close / clear highlight