Back

Claude wrote a functional NES emulator using my engine's API

91 points1 monthcarimbo.games
worble1 month ago

I'd be curious in how well it passes 100th Coin's NES accuracy tests https://github.com/100thCoin/AccuracyCoin

utopiah1 month ago

Indeed, that's what I kind of hinted at in https://news.ycombinator.com/item?id=46442195 and coincidentally https://news.ycombinator.com/item?id=46437688 briefly after, namely that OK, one can "generate" a "solution", that's much easier than before... but until we can verify somehow that it actually does what it say it does (and we know of hallucinations and have no reason to believe this changed) then testing itself, especially of well know "problems" is more and more important.

That being said, it doesn't answer the "why" in the first place, an even more important question. At least though it does help somehow to compare with existing alternatives.

garciasn1 month ago

Isn’t this how all software development works? Folks commit code, it’s tested, and reviewed, and then deployed.

Why would this be any different?

PaulDavisThe1st1 month ago

That's not how software development works.

Folks think, they write code, they do their own localized evaluation and testing, then they commit and then the rest of the (down|up)stream process begins.

LLM's skip over the "actually verify that the code I just wrote does what I intended it to" step. Granted, most humans don't do this step as thoroughly and carefully as would be desirable (sometimes through laziness, sometimes because of a belief in (down|up)stream testing processes). But LLM's don't do it at all.

+1
sally_glance1 month ago
+1
mapontosevenths1 month ago
jimmaswell1 month ago

> actually verify that the code I just wrote does what I intended it to

That's what the author did when they ran it.

+1
adventured1 month ago
roger_1 month ago

I’m sure you can point Claude at that page and have it make the necessary changes to pass.

deadbabe1 month ago

Or it could loop infinitely, never quite being able to pass all the tests.

hu31 month ago

which is easily fixable by some human guidance

+1
RAMJAC1 month ago
Y_Y1 month ago

Git wrote a functional NES emulator for me by simply cloning one of the many publicly available ones!

LunicLynx1 month ago

This is the comment.

Give it copy paste / translate tasks and it’s a no brainer (quite literally)

But same can be said of humans.

The question here is, did it implement it because it read the available online documentation about the NES architecture OR did it just see one too many of such implementations.

jacquesm1 month ago

> But same can be said of humans.

Indeed, the 'cleanroom' standard always was one team does the RE and writes a spec, another team that has never seen the original (and has written statements with penalty clauses to prove it) then does the re-implementation. If you were to read the implementation, write the spec and then write the re-implementation that would be definitely violating the standard for claiming an original work.

draw_down1 month ago

[dead]

cebert1 month ago

It’s a shame that the source code isn’t commented and documented more. At the very least, I would see it being helpful to add some documentation for every CPU op code being emulated.

1122331 month ago

Forbidding LLM to write comments and docstrings (preferrably enforced by build and commit hook) is one of the best "hacks" for using that thing. LLM cannot help itself but emit poisonous comments.

jacquesm1 month ago

Or maybe clone the comments from where it cloned the source.

exe341 month ago

I used to worry that using LLMs to code would let them use my code and train on my hard work. Then I realised how bad my code is, so I'm probably singlehandedly holding off an agi catastrophe.

1122331 month ago

Meh. No human has written the horrors llm produces. At least I am yet to see codebase like that. Let me attempt a theatrical reenactment:

    // Use buffer that is large enough to hold any possible value. Avoid using JSON configuration, this optimizes codebase and prevents possible security exploits! 
    size_t len = 32;


    // this function does not call "sort" utility using shell anymore, but instead uses optimized library function "sort" for extreme perfomance improvement!!!
    void get_permutations() {
... and so on. It basically uses comments as a wall to scribble grandiose graffiti about it's valiant conquests in following explicit instruction after fifth repeat and not commiting egregious violence agains common sense.
+1
theshrike791 month ago
butlike1 month ago

And since it's vibe coded, no one knows what the opcodes are. LLM won't remember. Human has no comments. Human can't trust post-hoc LLM-generated comments because they're poisonous.

1122331 month ago

If function of vibecode is not self-evident, dispose of it.

Or, to put it differently, having vibe comment does not free you of responsibility to inspect actual vibe code.

If code contradicts comments, LLM is as likely to go by comments. It is bad enough to have heaps of dead, unused code. Comments make everything much worse.

StilesCrisis1 month ago

Probably better to look at a human-authored emulator if you want comments containing accurate information anyway.

bugfix1 month ago

If you let it, Claude Code will write a comment for almost every single line of code.

mikepurvis1 month ago

    # Assign value of x to y
    y = x
ziml771 month ago

Even if you try to get them to not, they will still overcomment the code. Or at least overcomment it from the perspective of a human. From the perspective of the LLM, I suspect the comments are necessary for it to be able to get the code output correct.

theshrike791 month ago

It's also a discoverability tool. If the code has good docstrings and decent naming for functions/variables it's a lot easier for the LLM to find the correct places to edit.

delduca1 month ago
johnisgood1 month ago

Why not use the LLM for more meaningful commit titles & messages as well while you are at it?

giancarlostoro1 month ago

Surprised there's no README file at all.

rmckayfleming1 month ago

Oh neat, I've been working with claude on an NES emulator in Racket using an SDL3 wrapper also written mostly by Claude.

tabs_or_spaces1 month ago

I tried this a while back using gemini 2.5 pro, round about the time gemini cli was released. I never got the emulator to work in the end, so I dropped the idea.

So this is impressive for me in terms of how fast things have progressed.

zorked1 month ago

Nice, but NES emulator is one of the most written pet projects anywhere, which makes it considerably less impressive.

StilesCrisis1 month ago

Heck, when Satya Nadella wanted to demonstrate Copilot coding, he had it emit an Altair emulator. I guess there's little room for creativity in 8-bit emulator design so LLMs can handle them well. https://thenewstack.io/from-basic-to-vibes-microsofts-50-yea...

ldng1 month ago

And said emulator was opensourced and tested by third parties, right ?

Until it's so, it's just hearsay to me by someone having a multi-billion horse in the race.

pragma_x1 month ago

This is a good point. I wonder how much NES emulator code is in Claude's training set? Not to knock what the author has done here, but I wonder if this is more of a softball challenge than it looks.

noident1 month ago

Somewhere along the line the AI bros stopped separating training and testing sets. It's great for impressing the villagers

swannodette1 month ago

WASM and the performance seems catastrophically bad (45ms to render a frame on an M4 laptop)? It would be much more impressive if Claude could optimize it into something that someone would actually want to play? Compare this to a random hit from Google, https://jsnes.org/ which has sound, much smaller payload, and runs really fast (<1ms to render a frame).

The cost of slop is >40X drop in performance? Pick any metric that you care about for your domain perhaps that's what you're going to lose and is the effort to recover that practical with current vibe-coding strategies?

masswerk1 month ago

For me on Firefox/macOS it's terribly slow, fails to initialise/resume sound, no keyboard input.

deadbabe1 month ago

I will be impressed when new game consoles come to market and it can write the first emulator for it.

luckydata1 month ago

a very slow one.

bfrog1 month ago

How much was grifted from existing emulators?

endemic1 month ago

By definition, all of it.

cgfjtynzdrfht1 month ago

Trained on 1000s of NES emulators, it's not really impressive.

Github alone has +4k NES emulator projects: https://github.com/search?q=nes%20emulator&type=repositories

This is more like "wow, it can quote training data".

keyle1 month ago

Who care what it did. What did you learn? To live is to learn.

mikkupikku1 month ago

When I consider the utility of a hammer, my first priority is to ask what the hammer can teach me.

pygy_1 month ago

There are NES emulators aplenty, the only value in writing a new one is pedagogic, for the writer.

This endeavor had negative net value.

jimmaswell1 month ago

It demonstrated the capabilities of an AI to a potentially on-the-fence audience while giving the author experience using the new tools/environment. That's solid value. I also just find it really cool to see that an AI did this.

butlike1 month ago

Yeah, it shows the AI is not capable of writing maintainable projects. I'm off the fence. And its cool you find it cool, but reducing the problem space to that of a toy project makes it so much less impressive as to be trivially ignorable.

The new LLM (pattern recognizer/matcher) is not a good tool

mikkupikku1 month ago

How about being entertained by the process?

worthless-trash1 month ago

They didnt call it the "Nintendo Entertainment System" for nothing.

NoraCodes1 month ago

Do you think that the use of a hammer is an innate skill, and that woodworkers learn nothing from their craft?

mikkupikku1 month ago

Okay, so let's say the use of a coding agent isn't an innate skill, so the author was gaining experience with the tool.

philipallstar1 month ago

Ask not what your hammer can do for you.

jancsika1 month ago

If it's a zillion dollar hammerbot the company is offering to your boss for pennies, that had better be your first priority!

risyachka1 month ago

Do you like to read posts about what hammer can do? Especially when it has been done 100 times already.

mikkupikku1 month ago

I'm no carpenter, but I can honestly say I've probably read a hundred articles about vim..

butlike1 month ago

You ask what you learned building the house. The hammer hits the nails.

aoeusnth11 month ago

Is there zero skill in managing agents?

password543211 month ago

Yeah I think this is the wrong approach. If they were making money out of it, that would be different. But this is pointless.

RcouF1uZ4gsC1 month ago

Is this why you only wrote in machine code until you fully understood the entire compiler front end, back end chain?

postalrat1 month ago

I learned claude can write a functional NES emulator. I wonder what else it can do?

jgbuddy1 month ago

to live is to build

shriek1 month ago

to build what you don't understand is to suffer in future

krapp1 month ago

Except OP isn't learning or building. He's telling a computer to do the work for him and padding his resume.

danielbln1 month ago

How cynical. Just seeing if the current crop of automation systems can do it can be interesting enough for some of us.

butlike1 month ago

It's a waste of time and energy, and when you're older you'll realize energy is the premium here.

+1
skydhash1 month ago