OpenAI to acquire Astral

samvines@awful.systems · 3 days ago

Google released their new Gemini 3.5 “flash” model at I/O yesterday. For those who aren’t familiar, the “flash” model is typically marketed as the lower end and the “pro” model is the higher end for each given model generation.

The interesting thing here is that the new “flash” model is almost as expensive as the “pro” from the previous generation.

As my favourite “neutral-but-not-really” AI booster Simon Willison says:

This fits a trend: OpenAI’s GPT-5.5 was 2x the price of GPT-5.4, and Claude Opus 4.7 is around 1.46x the price of 4.6 when you take the new tokenizer into account.

It feels like all three of the major AI labs are starting to probe the price tolerance of their API customers.

Speedrunning enshittification, a process that typically only works when people are reliant on your product and have no option other than to pay the inflated price.

samvines@awful.systems · 10 days ago

Yes, although it is probably a reasonable guess at how labs would go about implementing advertising: building partnerships and preferences into the prompt. The other option would be to fine-tune models to favour particular companies, which could become prohibitively expensive if your ads are highly targeted.

The scenario that isn’t accounted for in this paper is taking a general LLM and fine-tuning it to exhibit fairer, more consistent behaviour when prompted about ads/partnerships. But we all know that with non-deterministic systems you’re just increasing the odds that the model regurgitates something more sane rather than providing any strong guarantee.

Edit: another possibility would be to have a gateway/proxy layer between the LLM and the user output that rewrites the vanilla model’s responses to include ads where relevant. That would avoid the need to modify the original LLM but could introduce a lot of latency, especially if the original output is long.
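A rough sketch of why the gateway idea gets slow on long outputs. It assumes the rewriter needs the complete reply before it can start and emits roughly the same number of tokens; the function names and throughput numbers are made up for illustration and do not come from the paper or any vendor.

```python
from dataclasses import dataclass


@dataclass
class Timing:
    first_token_s: float  # when the user sees the first token
    total_s: float        # when the user sees the last token


def direct_stream(n_tokens: int, model_tok_per_s: float) -> Timing:
    """User reads the model's tokens as they are generated."""
    return Timing(1 / model_tok_per_s, n_tokens / model_tok_per_s)


def via_rewriting_gateway(
    n_tokens: int, model_tok_per_s: float, rewriter_tok_per_s: float
) -> Timing:
    """Gateway buffers the whole reply, then streams a rewritten copy.

    Assumption: the rewriter cannot start until the full reply exists,
    so the user waits for the entire original generation first.
    """
    buffered_s = n_tokens / model_tok_per_s
    return Timing(
        buffered_s + 1 / rewriter_tok_per_s,
        buffered_s + n_tokens / rewriter_tok_per_s,
    )


print(direct_stream(1000, 50).total_s)              # 20.0
print(via_rewriting_gateway(1000, 50, 50).total_s)  # 40.0
```

Under those assumptions the time to first token grows with the length of the original reply, which is the part users would notice most.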

samvines@awful.systems · 10 days ago

New (April) preprint provides evidence for something we probably all intuited anyway:

In this paper, we provide a framework for categorizing the ways in which conflicting incentives might lead LLMs to change the way they interact with users, inspired by literature from linguistics and advertising regulation. We then present a suite of evaluations to examine how current models handle these tradeoffs. We find that a majority of LLMs forsake user welfare for company incentives in a multitude of conflict of interest situations, including recommending a sponsored product almost twice as expensive (Grok 4.1 Fast, 83%), surfacing sponsored options to disrupt the purchasing process (GPT 5.1, 94%), and concealing prices in unfavorable comparisons (Qwen 3 Next, 24%). Behaviors also vary strongly with levels of reasoning and users’ inferred socio-economic status. Our results highlight some of the hidden risks to users that can emerge when companies begin to subtly incentivize advertisements in chatbots.

samvines@awful.systems · 13 days ago

A lot of MBA bean counter types lack the intelligence or self-awareness to realise they are part of a big scam and seem to earnestly think they are doing an important and valuable job. That’s how you end up as a mid-level spokesperson at Pitchbook rather than someone making billions by doing ~~investment fraud~~ an AI company

samvines@awful.systems · 19 days ago

Giving Claude or Copilot attribution plays into the narrative that LLMs are more than just random word generators and that they can be ascribed authorship… I think it’s a deliberate strategy so that when there’s inevitably a massive copyright case MisAnthropic et al. can say “but look at all the code co-written by Claude on GitHub” to try and convince the judge.

Just imagine building a house and saying “well I didn’t do it on my own, my concrete mixer, toolbelt and coffee machine all helped!”

samvines@awful.systems · 23 days ago

This guy introduces himself as the first person who will never die on the conference circuit (because he’s super into longevity and anti-aging tech and having young men’s blood injected into him and stuff).

I’m not condoning violence here but rather… consider that even if you never age, you can still get hit by a bus, Bryan!

samvines@awful.systems · 23 days ago

New fun consequence of Claude Code being a pile of cursed regex and spaghetti: keyword blocking on “OpenClaw” makes it refuse to work on Pro or Max subs unless you open your wallet

sO inTelLiGenT

samvines@awful.systems · edit-2 23 days ago

Some gold in this thread over on Reddit about how the cost of compute is far beyond the cost of employees and that’s with the “Uber in 2016” subsidized price

New favourite description for brainless MBAs: “perpetual oven touchers”

samvines@awful.systems · 26 days ago

Jeez that pricing scheme is so confusing. You swap your dollars for credits and then using models to burn tokens consumes some multiple of those credits. It is so abstract and meaningless it almost reminds me of crypto.
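To show how many layers sit between a request and an actual dollar figure, here is a toy sketch. Every rate and multiplier in it is invented for illustration and is not real pricing for any product.

```python
def request_cost_dollars(
    tokens: int,
    credits_per_1k_tokens: float,  # hypothetical base burn rate
    model_multiplier: float,       # hypothetical per-model multiplier
    dollars_per_credit: float,     # hypothetical exchange rate
) -> float:
    """Convert tokens -> credits -> dollars, one indirection at a time."""
    credits = tokens / 1000 * credits_per_1k_tokens * model_multiplier
    return credits * dollars_per_credit


# 10k tokens on a "3x" model at 1 credit per 1k tokens and $0.01 per credit
print(round(request_cost_dollars(10_000, 1.0, 3.0, 0.01), 2))  # 0.3
```

Three separate knobs have to be known before a token count means anything in dollars, and any one of them can change without the headline price moving.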

Once usage billing kicks in, what value does Copilot offer above and beyond what ClosedAI and MisAnthropic offer directly? A clunkier user experience and even worse reliability? Bargain!

samvines@awful.systems · 26 days ago

I wonder if the button colours immediately made US readers pick a side, e.g. Republican vs Democrat. If the buttons had been yellow and purple, would it have made a difference?

samvines@awful.systems · 1 month ago

There is just something so inherently smug and annoying about Mollick. He is one of those low information boosters whose posts sound intellectual until you really think about them.

Tell me more about how the pile of cursed spaghetti that is Claude Code is now viable due to model breakthroughs. All I see are hype men saying “the new model is a team of PhDs in your pocket” and then releasing disappointing updates or saying “the new model is too dangerous” because they have some vaporware powered by human crowdsourcing.

Also coding is not like other areas - you can test for hallucinations by compiling and printing and running tests.

I guess my first mistake this morning was opening LinkedIn

samvines@awful.systems · 1 month ago

Probably a markdown file telling it “you are a l33t h4x0r”

samvines@awful.systems · 1 month ago

Claude Mythos… I’m already sick of hearing about it. The self-imposed critihype is insane.

A friend just pointed out that Anthropic are making all this big noise about having an AI that is “too good” at finding bugs and security problems 1 week after the source code for one of their flagship products was leaked to the public and was found to be riddled with security holes… Why would they not use it themselves?

Same as the ~~vague markdown files~~ skills that are supposedly going to make all SaaS redundant and finally kill off all the COBOL running on mainframes that *checks notes* IBM have spent hundreds of thousands of man-hours trying to kill over the last 3-4 decades

Honestly fuck this shit. Bunch of absolute clowns 🤡 🤡 🤡

samvines@awful.systems · 2 months ago

Does it still count if it turns out that Trump invading Iran was based on Claude or ChatJippity advice and things escalate to global thermonuclear war? AI technically wiped out humanity because our dumb leaders were dumb enough to trust it?

samvines@awful.systems · 2 months ago

Alas, foiled again! Nobody said they had to be leading 9s!

samvines@awful.systems · 2 months ago

GitHub have finally achieved zero 9s stability for the last 90 days. Congratulations to all involved

screenshot showing 89.91% uptime with 95 incidents in the last 90 days
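For anyone who wants to check the "zero 9s" maths, a small sketch. It assumes the percentage in the screenshot is plain downtime over the whole 90-day window, which may not be how the status page actually weights partial outages; the function names are mine.

```python
def leading_nines(uptime_percent: float) -> int:
    """Count the 9s an uptime figure starts with (99.9 -> 3, 89.91 -> 0)."""
    if not 0 <= uptime_percent < 100:
        raise ValueError("uptime_percent must be in [0, 100)")
    # Zero-pad to two integer digits so 9.5% is read as "09.5", not "9.5".
    digits = f"{uptime_percent:09.6f}".replace(".", "")
    return len(digits) - len(digits.lstrip("9"))


def downtime_days(uptime_percent: float, window_days: float) -> float:
    """Total downtime implied by an uptime percentage over a window."""
    return window_days * (1 - uptime_percent / 100)


print(leading_nines(89.91))                # 0
print(round(downtime_days(89.91, 90), 2))  # 9.08
```

Under that assumption, 89.91% over 90 days works out to roughly nine days of cumulative downtime.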

samvines@awful.systems · 2 months ago

Cloudflare casually license-laundering WordPress

While EmDash aims to be compatible with WordPress functionality, no WordPress code was used to create EmDash. That allows us to license the open source project under the more permissive MIT license.

Oh really. So you’re sure your Claude wasn’t trained on WordPress? It’s all irrelevant anyway because AI generated code can’t be copyrighted or licensed.

Silver lining, it might piss off Matt Mullenweg!

samvines@awful.systems · 2 months ago

In my experience “all hands” meetings are very much CEOs and their sycophants cosplaying as podcast hosts for an hour whilst forcing their employees to watch/listen. They are almost never useful and a colossal waste of money, especially in corporations with 10k+ employees. Like the salary cost for 10k people for 1 hour would probably pay off my mortgage.

samvines@awful.systems · 2 months ago

Absolutely, but MeeMaw read in the paper that OpenAI is the future of all work and jobs and they’re gonna make $$$$$ so she’s investing her 401k (sorry I’m bri’ish so I don’t know if that makes sense but the point I’m trying to make is that they will absolutely find bag holders in retail investors once they IPO)

samvines@awful.systems · 2 months ago

Ads in pull requests? Sure, why not *sigh*

samvines@awful.systems · 2 months ago