Recent comments in /f/MachineLearning

liquiddandruff t1_j98v6ko wrote

it's an open question and lots of interesting work is happening at a frenetic pace here

A favourite discussed recently:

4

h-dot t1_j98u1gl wrote

To echo the top comment, it is very difficult to implement AI/ML. Beyond the obvious technical challenges (learning how to code, problem framing, data warehousing, identifying signal in data, model fitting, tuning, retraining, etc etc) there’s the entire business implementation/change management required to actually capitalize on the predictions you’re receiving. I’m in the field and would also recommend a consultant to even see if it’s a worthwhile endeavor for your business.

6

Ferocious_Armadillo t1_j98sbqm wrote

I think I’m gonna have to respectfully disagree on a lot of this. You’re right that it largely comes down to training data used. The thing that largely jumps out to me, though, in the examples you give and in your point (1) is that while you want to train using a large amount of data, especially for such large networks as those you suggest, is that while you need that large amount of data, you want to avoid overfitting your model to your data in the pursuit of accuracy or reliability or whatever metric you choose to determine how “good” or accurate your model is against some ground truth.

And while on the surface, NNs can definitely seem like or appear as though they’re “black boxes” or “we can’t accurately describe their structure or how they work”. That’s largely untrue. In fact, I would claim that it’s precisely because we can design and model NN structure and use a structure (both in terms of # of layers, connectedness between them, inputs, weights, biases, activation functions, etc.) that would lend itself best to a given purpose, that has allowed the field to come as far as it has, to generate the NNS in the examples you provide in the first place.

Sorry about the rant… I didn’t realize I get so passionate about NNs.

1

thecodethinker t1_j98puob wrote

Where has chat gpt been rigorously shown to have reasoning ability? I’ve heard that it passed some exams, but that could just be the model regurgitating info in its training data.

Admittedly, I haven’t looked to deeply in the reasoning abilities of LLMs, so any references would be appreciated :)

1

I_will_delete_myself OP t1_j98p0vg wrote

I been running the A100 the entire weekend and so far it’s only costing me under 20 bucks. If you need it around an hour and it would probably cost you between 1-3 dollars

I would recommend you plan a budget before you get started and it will almost always be cheaper on a year basis. Try Colab first and see if you will need it longer than 12 hours.

6

dmart89 t1_j98eltr wrote

I think what you're asking is how to implement ML instead of building something from the ground up. I don't know your industry, but there are lots of suppliers and startups that would happily partner with you to help you adopt these capabilities without you needing to hire a team to build your own infrastructure. Many other industries already do!

1

NotARedditUser3 t1_j98cwel wrote

This is an idiotic and unrealistic ad, that makes sweeping, untrue generalizations about the current state of other LLM's. I think it would be hard for anyone to realistically read all the way through without dismissing it as the rantings of some agitated, smaller competitor that can't distinguish itself without trying to make untrue statements to put down the other players in the market, in order to try and prop up it's own value.

Perhaps you should use an LLM to improve your ad copy.

13

walkingsparrow t1_j98c2qw wrote

I am a bit confused. So overall, we want to make the generated response to be as close as possible to the ground truth. The paper adds a selection loss that distinguishes the generated response from the ground truth, which would make the generated response as different as possible from the ground truth. How could this help the main task of making these two responses as close as possible?

4

SupplyChainPhd t1_j98b9h0 wrote

So many things you need to look into getting something set up. I don’t even know where I’d suggest you start if you want to take that on as a personal project. TBH, might be worthwhile finding someone to set it up for you and teach you to how to monitor.

2