Recent comments in /f/MachineLearning
Disastrous_Elk_6375 t1_jb8y5r2 wrote
GptNeoX should fit with 8bit and low prompt sizes. GptJ-7B should fit as well with 16bit inference. On smaller models you might even be able to do some finetuning for fun.
There's a couple of coding models from salesforce that you could fit comfortably. Check out FauxPilot for a copilot "clone".
enjakuro t1_jb8thcg wrote
Yeah but copying data in a corpus has yielded better results, at least in NLP translation tasks. It's always good to know what's in your data though. Just saying that it might not be a bad thing.
lynnharry t1_jb8n8p0 wrote
Reply to comment by florinandrei in [D] The MMSegmentation library from OpenMMLab appears to return the wrong results when computing basic image segmentation metrics such as the Jaccard index (IoU - intersection-over-union). It appears to compute recall (sensitivity) instead of IoU, which artificially inflates the performance metrics. by florinandrei
Pixels with ignore_index does not mean the model's output should also be ignore_index. It means the groundtruth label is not determined on those pixels and whatever your model's output is, its correctness is undetermined.
For those undetermined pixels, we simply ignore those outputs completely.
ignore_index is not used to ignore a specific category during the metric calculation, which is what you're proposing. ignore_index is simply notifying intersect_and_union some areas of the image have undetermined labels and should be ignored, and those areas are marked by the value of ignore_index.
Top-Avocado-2564 t1_jb8lzcv wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
DM me , f500 company with lots of interesting ML projects. Let's talk
prettyyyyprettyygood t1_jb8lhxo wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
I was thinking a few years back about starting an ML-specific freelancing website with a focus on quality freelancers + clients. Maybe now is the time...
ortegaalfredo OP t1_jb8ksmz wrote
Reply to comment by ortegaalfredo in [R] Created a Discord server with LLaMA 13B by ortegaalfredo
And here is the discord invite (dont know if mods will remove this: https://discord.gg/ry4cNFwN)
ortegaalfredo OP t1_jb8kdzj wrote
Here are the instructions, you need a discord account, that's it. No limits on what you can ask it, nor rules. Please behave as any spam will need to be removed:
https://twitter.com/ortegaalfredo/status/1632903130416308229
Code for the bot is here:
https://github.com/ortegaalfredo/celery-ai/blob/main/discord/bot.py
Melodic_Stomach_2704 t1_jb8iql9 wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
Here with 3 + YOE in ML. If you need some manpower, count me in. I'm also looking forward to such opportunities.
Honest_Performer2301 t1_jb8ile7 wrote
Reply to What is the future of AI in medicine? [D] by adityyya13
How long will it take for the fda to approve it?
zaptrem t1_jb8i4cr wrote
Reply to comment by Jurph in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust
> the author of AUTOMATIC1111
…you mean AUTOMATIC1111? That’s their name.
i-can-sleep-for-days t1_jb8e8ng wrote
What is the state of the art image duplication finder algorithm?
alphasystem t1_jb87u6x wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
Similar background, a group of FAANG ML Engineers, looking for contract work. Have had a few clients before...
aplchian4287 t1_jb84h4w wrote
murrdpirate t1_jb7xg74 wrote
Reply to comment by deephugs in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
Yeah I think the issues you mention are probably pretty true. As a client, I'm often restricted to using US freelancers, so my experience may not be typical. But I have often found that experts are generally worth their higher rates.
deephugs t1_jb7vqn2 wrote
Reply to comment by murrdpirate in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
Having done ML consulting work through Upwork, my experience is the rate on Upwork is really low compared to what you can get through networks, especially remote Bay Area work. Most Upwork seems to be short timelines, small payouts, and competing against low cost international talent. Any tips for Upwork you can suggest?
enn_nafnlaus t1_jb7sxxi wrote
Reply to What is the future of AI in medicine? [D] by adityyya13
I can say this: my mother has struggled for many, many years trying to figure out what's wrong with her and causing her weird, debilitating symptoms. She finally, at long last got a diagnosis that her doctors are pretty confident in: advanced Sjögren's.
Out of curiosity, I punched her symptoms into ChatGPT, and - without access to any test results - Sjögrens was its #2 guess, and it suggested diagnostic tests that she had done and had shown it was Sjögrens. Sjögrens actually isn't super-rare (about a percent or so of the population has it), but usually much milder, and very underdiagnosed.
I think AI tools are seriously underappreciated with respect to proposing new lines of investigation on hard-to-crack cases.
deep-yearning t1_jb7rxhg wrote
Reply to comment by BreakingCiphers in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
What's Upwork?
Nm you?
Gottem
florinandrei OP t1_jb7pefb wrote
Reply to comment by Mediocre-Bullfrog686 in [D] The MMSegmentation library from OpenMMLab appears to return the wrong results when computing basic image segmentation metrics such as the Jaccard index (IoU - intersection-over-union). It appears to compute recall (sensitivity) instead of IoU, which artificially inflates the performance metrics. by florinandrei
> Isn't this what the ignore_index is doing?
No, it is not.
Let me repeat: ignore_index cuts holes in both the ground truth label frames, and in the prediction frames coming out of the model. Any pixels in those holes are ignored.
This includes pixels in the predictions from the model. You are ignoring chunks of the model's output.
> How else should we exclude them from the average metric?
By not computing metrics for that pixel value.
average_metric = sum(metric_index1 + metric_index2 + ... + metric_indexN) / N
Simply do not include it in the sum, and then just divide by N-1 instead.
What you are doing is not equivalent to that. What you are doing is: you discard pixels from both label frame and prediction frame based on the shape of some regions in the label frame alone. That makes no sense. Whatever the model's predictions happen to be in those holes, they are ignored even if they have pixel values different from ignore_index.
You are ignoring all the model's predictions in those holes, regardless of their pixel values.
You are discarding pixels from the model's output even if they have values different from ignore_index.
murrdpirate t1_jb7n8ls wrote
Reply to comment by z_fi in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
As someone who's done just a bit of freelance work on Upwork, and a ton of client work, what don't you like about Upwork?
Mediocre-Bullfrog686 t1_jb7n704 wrote
Reply to comment by florinandrei in [D] The MMSegmentation library from OpenMMLab appears to return the wrong results when computing basic image segmentation metrics such as the Jaccard index (IoU - intersection-over-union). It appears to compute recall (sensitivity) instead of IoU, which artificially inflates the performance metrics. by florinandrei
>If there is some index you want to ignore altogether, because you are not sure about the quality of the labels, it is best to just exclude it from the calculation of the average metric.
Isn't this what the ignore_index is doing? How else should we exclude them from the average metric? By applying ignore_index we effectively ignore those pixels.
>If some users set ignore_index to the value of the background pixels, that will cut very large holes in everything, therefore discarding a lot of pixels from performance evaluation, and will severely skew the results.
Well the users definitely should not do that. This is then a matter of documentation. We cannot just get rid of ignore_index because (I think) it is used in some existing segmentation datasets.
Jurph t1_jb7kym3 wrote
Reply to comment by TikiTDO in [R] We found nearly half a billion duplicated images on LAION-2B-en. by von-hust
I wonder whether the author of AUTOMATIC1111 could allow people to opt-in and send their training folder(s) of image-caption pairs to a central repository for use in a mega fine-tuning data set.
ispeakdatruf t1_jb7k2qi wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
Are you currently employed at a FAANG Co. and looking for side gigs? As in moonlighting? Or are you interested in taking the leap from a 9-to-5 gig to full-time freelancing?
z_fi t1_jb7ihpk wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places looking for freelance / contract work for ML? by doctorjuice
I’m on a career break, but I was as of December running the AI division of a consulting company.
I will say that finding part time or short term work is very hard. Longer term contract work is relatively easy.
most companies are struggling with the basics - data engineering, data analytics… maybe data science, but with data science you have to be able to talk to the c-suite well and without an mba the lingo is a little hard.
Machine learning projects often require a lot more time to deliver (beyond a proof of concept, and pocs don’t make money) and generally a team rather than an individual, and wayy more stakeholder support than you can muster
Usually ML projects require a lot of data which often puts you into a larger sized business which makes it very difficult to navigate as a freelancer…. You probably need to be in their system when it comes to invoicing and such and so you need to have your ducks in a row where most freelancers don’t. Freelancers, in general, succeed with smaller businesses.
Ignore anyone suggesting upwork.
One avenue I’d recommend is having an honest conversation with consulting company recruiters about what you’re looking for. Stay 1099 or do corp 2 corp. they’ll want you to come on as w2 but be a firm no. Generally these recruiters are looking for easy money and so are you. It’s definitely possible to make a meaningful business relationships here though at your level of seniority you might now know how to play the game at first
florinandrei OP t1_jb7cujz wrote
Reply to comment by Mediocre-Bullfrog686 in [D] The MMSegmentation library from OpenMMLab appears to return the wrong results when computing basic image segmentation metrics such as the Jaccard index (IoU - intersection-over-union). It appears to compute recall (sensitivity) instead of IoU, which artificially inflates the performance metrics. by florinandrei
The problem is: the current algorithm cuts holes in the prediction frames, based on ignore_index in the label frames.
Any pixels in the label frames equal to ignore_index will cause pixels in both label frames and prediction frames to be completely ignored from calculations. If some predicted mask pixels fall into those areas, they will be excluded from all calculations. This is the issue that needs to be addressed.
You cannot exclude pixels from the predicted frames based on pixel values in the label frames.
If there is some index you want to ignore altogether, because you are not sure about the quality of the labels, it is best to just exclude it from the calculation of the average metric.
If some users set ignore_index to the value of the background pixels, that will cut very large holes in everything, therefore discarding a lot of pixels from performance evaluation, and will severely skew the results.
etesian_dusk t1_jb8yzec wrote
Reply to [N] tinygrad 0.5.0 released by Balance-
Why would I start using this today?