I'm surprised that so much of the discussion around Copilot has centered around licensing rather than this.
You're basically asking a robot that stayed up all night reading a billion lines of questionable source code to go on a massive LSD trip and then use the resulting fever dream to fill in your for loops.
Coming from a hardware background, where you often spend 2-8x of your time and money on verification vs. the actual design, it seems obvious to me that Copilot as implemented today will either provide no value (best case), be a net negative (middling case), or be a net negative that you won't realize has surrounded you with a minefield for a few years (worst case).
Having an "autocomplete" that can suggest more lines of code isn't better, it's worse. You still have to read the result, figure out what it's doing, and figure out why it will or will not work. Figuring out that it won't work could be relatively straightforward, as it is today with normal "here's a list of methods" autocomplete. Or it could be spectacularly difficult, as it would be when Copilot decides to regurgitate "fast inverse square root" but with different constants. Do you really think you're going to be able to decipher and debug code like that repeatedly when you're tired? When it's a subtly broken block of code rather than a famous example?
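For reference, here's a Python sketch of that famous trick (the original is C, from Quake III). The point is that if the magic constant 0x5f3759df were off by even a digit, the code would still run and produce plausible-but-wrong numbers, and nothing about reading it tells you which digits are right:

```python
import struct

def fast_inv_sqrt(x: float) -> float:
    # Reinterpret the float's bits as a 32-bit integer.
    i = struct.unpack('>l', struct.pack('>f', x))[0]
    # The famous magic constant. Change one hex digit and the function
    # still runs -- it just quietly returns worse approximations.
    i = 0x5f3759df - (i >> 1)
    # Reinterpret the bits back as a float.
    y = struct.unpack('>f', struct.pack('>l', i))[0]
    # One Newton-Raphson refinement step.
    return y * (1.5 - 0.5 * x * y * y)
```

Would you catch a regurgitated variant of this with a wrong constant in code review at 6 PM? The original only became legible after people wrote papers dissecting it.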
That Easter example looks horrific, but I can absolutely see a tired developer saying "fuck it" and committing it at the end of the day, fully intending to check it later, and then either forgetting or hoping that it won't be a problem rather than ruining the next morning by attempting to look at it again.
I can't imagine ever using it, but I worry about new grads and junior developers thinking that they need to use crap like this because some thought leader praises it as the newest best practice. We already have too much modern development methodology bullshit that takes endless effort to stomp out, but this has the potential to be exceptionally disastrous.
I can't help but think that the product itself must be a PSYOP-like attempt to gaslight the entire industry. It seems so obvious to me that people are going to commit more broken code via Copilot than ever before.
IMHO they built the opposite of what's actually useful for real-world use. Copilot should have been trained to describe what a selected block of code does, not write a block of code from a description. It could be extremely useful when looking at new or under-documented codebases to have an AI that gives you a rough hint as to what some code might be doing. For example if you select some heinous spaghetti code function, press a button, and get a prompt back that says "This code looks like it's parsing HTML using regex (74.2% confidence)" it could be much easier for folks to be productive on big codebases.
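As a toy illustration of that interface (not how a real model would work, and the patterns and output string here are invented for the example), even a crude heuristic version could scan a snippet's AST for telltale signs:

```python
import ast

# Substrings that hint the code is handling HTML. Purely illustrative.
HTML_HINTS = ("<", "</", "href")

def crude_description(source: str) -> str:
    """Guess what a snippet does by scanning its AST for telltale
    patterns. A real tool would use a learned model over a large corpus;
    this just sketches the code-in, description-out interface."""
    tree = ast.parse(source)
    uses_re = any(
        isinstance(n, ast.Attribute)
        and isinstance(n.value, ast.Name)
        and n.value.id == "re"
        for n in ast.walk(tree)
    )
    mentions_html = any(
        isinstance(n, ast.Constant)
        and isinstance(n.value, str)
        and any(h in n.value for h in HTML_HINTS)
        for n in ast.walk(tree)
    )
    if uses_re and mentions_html:
        return "This code looks like it's parsing HTML using regex"
    return "No guess"
```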
No, presumably Copilot skirted that need by just analyzing the AST of the code they host and using nearby comments to identify what a section of code is meant to do. This would use the same dataset but solve the opposite problem: generate a description from a block of code's AST as input.
> copilot skirted that need by just analyzing the AST of code they host and using the nearby comments to identify what a section of code is meant to do.
I'm curious what it spills out for things like "Todo", or "this is probably broken", etc.
I'm not sure I understand how you envision this working, given the underlying technology. You'd have to have a pretty large cache of such analyses to train on, right?
GitHub has a huge amount of source code, and for Copilot they likely already had to transform it into ASTs to look at comments and nearby code. This would use the same dataset but build the opposite model: input a block of code's AST, get a guess at what the description (i.e. the comment) should be.
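The mining step being described could be sketched like this: walk a module's AST and pair each function with its documentation (using docstrings as a stand-in for "nearby comments", since they're the easiest case to extract). This is a minimal sketch, not a claim about how Copilot's pipeline actually works:

```python
import ast
import textwrap

def docstring_pairs(source: str):
    """Yield (code, description) training pairs from a module's functions.

    A toy version of the data-mining step: pair each function's code with
    its docstring, so a model could learn code -> description instead of
    description -> code.
    """
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            doc = ast.get_docstring(node)
            if doc:
                yield ast.unparse(node), doc.strip()  # unparse: Python 3.9+

example = textwrap.dedent('''
    def area(r):
        """Return the area of a circle of radius r."""
        return 3.14159 * r * r
''')
pairs = list(docstring_pairs(example))
```

Real comments (as opposed to docstrings) don't survive `ast.parse`, so an actual pipeline would need the token stream too, but the shape of the dataset is the same.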
This is the thing that made no sense to me about it as a premise. Doing correct program synthesis is really hard even when you have really opinionated and well-defined models of the domain (e.g. the Termite project for generating Linux device drivers). The domain model for Copilot is somewhere between non-existent and so open-ended (i.e. all the diverse code on GitHub, et al.) as to be functionally non-existent.
A bare-minimum baseline validation check for Copilot would be to see whether it gives you code that won't compile in context. If it does, that means it isn't even taking into account the well-specified semantics of your chosen programming language. And even passing that check is still miles away from taking into account the domain of the actual problem you're using software to solve.
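That baseline check is cheap to sketch for a single language. Here's a minimal Python-only version using the built-in `compile()`; a real check would need the project's full build context, imports, and type information, which is exactly why "it parses" is such a low bar:

```python
def compiles_in_context(suggestion: str, preceding_code: str = "") -> bool:
    """Crude baseline: does the suggested snippet even parse when
    appended to the code that precedes it? Catches syntax errors only --
    not wrong logic, wrong APIs, or wrong domain assumptions."""
    try:
        compile(preceding_code + "\n" + suggestion, "<suggestion>", "exec")
        return True
    except SyntaxError:
        return False
```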
The only place where the approach, as-is, makes sense to me is truly rote boilerplate code. But that raises the question: how is this machine-learning approach more effective than the targeted heuristics that existing IDE tooling already uses?
FWIW, I don't think any of this is lost on GitHub. I think Copilot is more likely a tremendously marketable half-step and a small piece of a larger long-term strategy unfolding at Microsoft/GitHub to leverage an incredible asset they're holding, i.e. practically everybody's source code. The combination of detailed changelogs, CI results (e.g. GitHub Actions), Copilot, and a couple of other key pieces makes for a pretty incredible basis for reinforcement learning to multiple ends.