Recent comments in /f/MachineLearning
FastestLearner t1_j5tzcvr wrote
For minor polishing, I use quillbot.
Also, stay away from grammarly.
mgwizdala t1_j5tyf1f wrote
Reply to [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
If you are willing to trade time for batch size you can try with gradient accumulation
zyl1024 t1_j5ty57a wrote
Are you a native English speaker? Is your advisor a native speaker? If one of you are, just have that person check for English grammar, and have your advisor check for content, flow and argument.
[deleted] t1_j5twcq5 wrote
CKtalon t1_j5tvh2p wrote
Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
Google supposedly has better models based on benchmarks, but few people outside of Google has used them (and those that have used it don't seem to be giving good reviews).
AnthropicAI's Claude model seems promising as a ChatGPT competitor.
BitterAd9531 t1_j5tuehz wrote
Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
https://blog.google/technology/ai/lamda/
It's supposed to be ahead of OpenAI's current GPT. Seems only logical to me since it was Google that invented and open-sourced the Transformer model and they likely have much more and much higher quality data than OpenAI for training.
CasaDeCastello t1_j5tttzi wrote
Reply to comment by [deleted] in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
Yum Lecum
[deleted] t1_j5toz5g wrote
Reply to comment by NadaBrothers in [R] Easiest way to train RNN's in MATLAB or Julia? by NadaBrothers
[deleted]
[deleted] t1_j5tnhkq wrote
Reply to comment by [deleted] in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
[removed]
IntelArtiGen t1_j5tmia0 wrote
Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
chatGPT is probably a very good model on the task it had to solve (to be a great conversational agent based on openAI data), but there are better models regarding the broad task of language understanding. You could adapt these models to be conversational agents, and they could probably beat chatGPT if they had access to the same dataset. But it would still be this specific task of being a great conversational agent. It's not the task of "thinking by itself like humans".
So it depends on what "more advanced" means. There are probably more "advanced" tasks towards AGI. But towards being a great conversational agent perhaps openAI has the best task-dataset combo today. At least I'm quite sure that there aren't systems which would be "significantly" more advanced than that, because I think the current limit is that it's "just" a very good conversational agent.
BalorNG t1_j5tlqsx wrote
Reply to comment by nashtownchang in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
NOW it does, after a massive public beta-test :)
Purple_noise_84 t1_j5tl26u wrote
Reply to [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
How about mocov2? That should work on a single gpu
[deleted] t1_j5tkyhz wrote
Reply to comment by [deleted] in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
[removed]
[deleted] t1_j5tkavc wrote
shingekichan1996 OP t1_j5tjy44 wrote
Reply to comment by IntelArtiGen in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
I think single GPU for SSL contrastive learning is a research direction to pursue, I'm not sure if anyone published papers on it, but if there's none, I'm surprised.
nashtownchang t1_j5tj0w3 wrote
Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
Meh - now OpenAI has the best chat dataset available to mankind. Not sure if saying one is more advanced now will make any difference in a few years.
shingekichan1996 OP t1_j5tiupg wrote
Reply to comment by IntelArtiGen in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
For 224x224 images, sure. But for images with large sizes, for example satellite images, it is hard to get 200+ batch size for a single gpu.
MysteryInc152 t1_j5tits4 wrote
Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle
Google has few systems that would beat current public SOTA models. PALM/Minerva/Med Palm is the best but Flamingo, Chinchilla/Sparrow would also best chatGPT.
Dunno about anything from meta. They have open source GPT models released but they're not as good as Open AI's stuff.
IntelArtiGen t1_j5tijjx wrote
Reply to [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
I managed to use SwAV on 1 GPU (8GB), batch size 240, 224x224 images, FP16, ResNet18.
Of course it works, the problem isn't just the batch size but the accuracy - batchsize trade-off, and the accuracy was quite bad (still usable for my task though). If 50% top5 on imagenet is ok for you, you can do it. But I'm not sure there are many tasks where it makes sense.
Perhaps contrastive learning isn't the best for single GPU. I'm not sure about the current SOTA on this task.
yarasa t1_j5tid8i wrote
Reply to comment by NadaBrothers in [R] Easiest way to train RNN's in MATLAB or Julia? by NadaBrothers
Can you not train in python and dump the results to a file and run analysis on that? Either you have to be an expert in the details of the implementation or you have to use the setup everyone else is using.
[deleted] t1_j5thmwj wrote
Reply to [D] CVPR Reviews are out by banmeyoucoward
[deleted]
[deleted] t1_j5th615 wrote
Reply to [D] CVPR Reviews are out by banmeyoucoward
[removed]
SimonJDPrince OP t1_j5taba8 wrote
Reply to comment by arsenyinfo in [P] New textbook: Understanding Deep Learning by SimonJDPrince
Thanks. This is useful.
limpbizkit4prez t1_j5t7cl5 wrote
Reply to comment by NadaBrothers in [R] Easiest way to train RNN's in MATLAB or Julia? by NadaBrothers
Ok, yeah that's what I was thinking. That totally makes sense. Good luck!
RaptorDotCpp t1_j5u0yxq wrote
Reply to comment by mgwizdala in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996
Gradient accumulation is tricky for contrastive methods that rely on having lots of negatives in a batch.