RaptorDotCpp t1_j5u0yxq wrote on January 25, 2023 at 3:50 PM

Reply to comment by mgwizdala in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996

Gradient accumulation is tricky for contrastive methods that rely on having lots of negatives in a batch.
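
To make the point concrete, here is a minimal sketch in plain Python, assuming an InfoNCE-style loss where each anchor's positive sits on the diagonal of a similarity matrix and every other column is a negative. The similarity values and the `info_nce` helper are made up for illustration. Splitting a batch of 4 into two micro-batches of 2 leaves each anchor with 1 negative instead of 3, so accumulating the micro-batch losses does not reproduce the full-batch loss.

```python
import math

def info_nce(sim, temperature=0.1):
    """Mean InfoNCE loss for a square similarity matrix.

    sim[i][j] is the similarity between anchor i and candidate j.
    The positive for anchor i is on the diagonal; all other columns
    in row i act as negatives.
    """
    total = 0.0
    for i, row in enumerate(sim):
        logits = [s / temperature for s in row]
        m = max(logits)  # subtract the max for numerical stability
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denominator - logits[i]
    return total / len(sim)

# Toy similarities for a batch of 4 (illustrative values only).
sim = [
    [0.9, 0.1, 0.3, 0.2],
    [0.2, 0.8, 0.1, 0.4],
    [0.3, 0.0, 0.7, 0.5],
    [0.1, 0.4, 0.6, 0.9],
]

full_batch_loss = info_nce(sim)

# Two micro-batches of 2: each anchor only sees candidates from its own block.
block_a = [row[:2] for row in sim[:2]]
block_b = [row[2:] for row in sim[2:]]
accumulated_loss = (info_nce(block_a) + info_nce(block_b)) / 2

negatives_full = len(sim) - 1       # negatives per anchor in the full batch
negatives_micro = len(block_a) - 1  # negatives per anchor in a micro-batch
```

The accumulated loss comes out lower than the full-batch loss because each anchor is contrasted against fewer candidates, which is exactly the signal these methods depend on.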

FastestLearner t1_j5tzcvr wrote on January 25, 2023 at 3:40 PM

Reply to [R] Best service for scientific paper correction by Meddhouib10

For minor polishing, I use quillbot.

Also, stay away from grammarly.

mgwizdala t1_j5tyf1f wrote on January 25, 2023 at 3:34 PM

Reply to [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996

If you are willing to trade time for batch size, you can try gradient accumulation.
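
A minimal sketch of the idea in plain Python, assuming a toy one-parameter model `y = w * x` with mean squared error (the data, the model, and the `grad_mse` helper are illustrative, not from any library). For a loss that is a plain average over samples, summing scaled micro-batch gradients gives the same gradient as one large batch, at the cost of more forward/backward passes per update.

```python
def grad_mse(w, xs, ys):
    """Gradient of mean squared error with respect to w for y = w * x."""
    n = len(xs)
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n

# Illustrative data, roughly following y = 2x.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
ys = [2.0, 4.1, 5.9, 8.2, 9.8, 12.1, 14.0, 16.2]

w = 0.0
micro_batch_size = 2
accumulation_steps = len(xs) // micro_batch_size

accumulated = 0.0
for step in range(accumulation_steps):
    lo = step * micro_batch_size
    hi = lo + micro_batch_size
    # Scale each micro-batch gradient so the sum equals the full-batch mean.
    accumulated += grad_mse(w, xs[lo:hi], ys[lo:hi]) / accumulation_steps

full = grad_mse(w, xs, ys)

# One optimizer step using the accumulated gradient (learning rate is arbitrary).
learning_rate = 0.01
w_next = w - learning_rate * accumulated
```

Here `accumulated` matches `full` up to floating-point error, because the micro-batches are equal in size and the loss decomposes per sample.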

zyl1024 t1_j5ty57a wrote on January 25, 2023 at 3:32 PM

Reply to [R] Best service for scientific paper correction by Meddhouib10

Are you a native English speaker? Is your advisor a native speaker? If one of you is, just have that person check the English grammar, and have your advisor check for content, flow and argument.

[deleted] t1_j5twcq5 wrote on January 25, 2023 at 3:20 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

[deleted]

CKtalon t1_j5tvh2p wrote on January 25, 2023 at 3:14 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

Google supposedly has better models based on benchmarks, but few people outside of Google have used them (and those who have used them don't seem to be giving good reviews).

AnthropicAI's Claude model seems promising as a ChatGPT competitor.

BitterAd9531 t1_j5tuehz wrote on January 25, 2023 at 3:07 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

https://blog.google/technology/ai/lamda/

It's supposed to be ahead of OpenAI's current GPT. Seems only logical to me since it was Google that invented and open-sourced the Transformer model and they likely have much more and much higher quality data than OpenAI for training.

CasaDeCastello t1_j5tttzi wrote on January 25, 2023 at 3:04 PM

Reply to comment by [deleted] in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

Yum Lecum

[deleted] t1_j5toz5g wrote on January 25, 2023 at 2:30 PM

Reply to comment by NadaBrothers in [R] Easiest way to train RNN's in MATLAB or Julia? by NadaBrothers

[deleted]

[deleted] t1_j5tnhkq wrote on January 25, 2023 at 2:19 PM

Reply to comment by [deleted] in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

[removed]

IntelArtiGen t1_j5tmia0 wrote on January 25, 2023 at 2:12 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

chatGPT is probably a very good model on the task it had to solve (to be a great conversational agent based on openAI data), but there are better models regarding the broad task of language understanding. You could adapt these models to be conversational agents, and they could probably beat chatGPT if they had access to the same dataset. But it would still be this specific task of being a great conversational agent. It's not the task of "thinking by itself like humans".

So it depends on what "more advanced" means. There are probably more "advanced" tasks towards AGI. But towards being a great conversational agent perhaps openAI has the best task-dataset combo today. At least I'm quite sure that there aren't systems which would be "significantly" more advanced than that, because I think the current limit is that it's "just" a very good conversational agent.

BalorNG t1_j5tlqsx wrote on January 25, 2023 at 2:06 PM

Reply to comment by nashtownchang in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

NOW it does, after a massive public beta-test :)

Purple_noise_84 t1_j5tl26u wrote on January 25, 2023 at 2:01 PM

Reply to [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996

How about MoCo v2? That should work on a single GPU.
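
The reason MoCo-style methods fit on one GPU is that negatives come from a fixed-size queue of keys encoded in earlier steps rather than from the current batch, while the key encoder is updated as a moving average of the query encoder. Below is a rough plain-Python sketch of those two pieces. `KeyQueue`, `momentum_update`, and the tuple "keys" are hypothetical stand-ins, not the reference implementation.

```python
from collections import deque

class KeyQueue:
    """Fixed-size FIFO of encoded keys used as negatives."""

    def __init__(self, max_size):
        # deque with maxlen drops the oldest keys automatically.
        self.keys = deque(maxlen=max_size)

    def enqueue(self, batch_keys):
        self.keys.extend(batch_keys)

    def negatives(self):
        return list(self.keys)

def momentum_update(key_params, query_params, m=0.999):
    """Move key-encoder parameters slowly toward the query encoder."""
    return [m * k + (1.0 - m) * q for k, q in zip(key_params, query_params)]

queue = KeyQueue(max_size=8)
batch_size = 2
negatives_seen = []
for step in range(6):
    # Negatives for this step come from keys of previous steps.
    negatives_seen.append(len(queue.negatives()))
    # Stand-in for key-encoder outputs of the current batch.
    batch_keys = [(step, i) for i in range(batch_size)]
    queue.enqueue(batch_keys)

num_negatives = len(queue.negatives())
updated = momentum_update([1.0], [0.0], m=0.9)
```

With a batch of 2 the queue still holds 8 negatives once it fills up, so the number of negatives is set by the queue size rather than by what fits in GPU memory at once.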

[deleted] t1_j5tkyhz wrote on January 25, 2023 at 2:00 PM

Reply to comment by [deleted] in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

[removed]

[deleted] t1_j5tkavc wrote on January 25, 2023 at 1:55 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

[removed]

shingekichan1996 OP t1_j5tjy44 wrote on January 25, 2023 at 1:53 PM

Reply to comment by IntelArtiGen in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996

I think single-GPU SSL contrastive learning is a research direction worth pursuing. I'm not sure whether anyone has published papers on it, but I'd be surprised if no one has.

nashtownchang t1_j5tj0w3 wrote on January 25, 2023 at 1:45 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

Meh - now OpenAI has the best chat dataset available to mankind. Not sure if saying one is more advanced now will make any difference in a few years.

shingekichan1996 OP t1_j5tiupg wrote on January 25, 2023 at 1:44 PM

Reply to comment by IntelArtiGen in [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996

For 224x224 images, sure. But for larger images, for example satellite images, it is hard to fit a batch size of 200+ on a single GPU.
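
A back-of-the-envelope sketch of why, assuming (as a rough simplification, not a measurement) that per-image memory grows linearly with pixel count, and taking the batch size of 240 at 224x224 reported elsewhere in this thread as the reference point. The `estimated_batch_size` helper is hypothetical.

```python
def estimated_batch_size(ref_batch, ref_side, new_side):
    """Scale a reference batch size by the ratio of pixel counts.

    Assumes memory per image is proportional to height * width,
    which ignores fixed costs such as model weights.
    """
    scale = (ref_side * ref_side) / (new_side * new_side)
    return max(1, int(ref_batch * scale))

at_512 = estimated_batch_size(240, 224, 512)
at_1024 = estimated_batch_size(240, 224, 1024)
```

Under that assumption the estimate drops to a few dozen images at 512x512 and roughly a dozen at 1024x1024, far below 200.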

MysteryInc152 t1_j5tits4 wrote on January 25, 2023 at 1:44 PM

Reply to [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

Google has a few systems that would beat current public SOTA models. PaLM/Minerva/Med-PaLM are the best, but Flamingo and Chinchilla/Sparrow would also best ChatGPT.

Dunno about anything from Meta. They have released open-source GPT models, but they're not as good as OpenAI's stuff.

IntelArtiGen t1_j5tijjx wrote on January 25, 2023 at 1:41 PM

Reply to [D] Self-Supervised Contrastive Approaches that don’t use large batch size. by shingekichan1996

I managed to use SwAV on 1 GPU (8GB), batch size 240, 224x224 images, FP16, ResNet18.

Of course it works. The problem isn't just the batch size but the accuracy/batch-size trade-off, and the accuracy was quite bad (still usable for my task though). If 50% top-5 on ImageNet is OK for you, you can do it. But I'm not sure there are many tasks where it makes sense.

Perhaps contrastive learning isn't the best for single GPU. I'm not sure about the current SOTA on this task.

yarasa t1_j5tid8i wrote on January 25, 2023 at 1:40 PM

Reply to comment by NadaBrothers in [R] Easiest way to train RNN's in MATLAB or Julia? by NadaBrothers

Can you not train in Python, dump the results to a file, and run the analysis on that? Either you have to be an expert in the details of the implementation or you have to use the setup everyone else is using.
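
A minimal sketch of that workflow using only the Python standard library. The per-epoch numbers, the `dump_results` helper, and the file name are placeholders; real values would come from the training loop. CSV is used here because both MATLAB (for example via `readtable`) and Julia (for example via CSV.jl) can read it.

```python
import csv
import os
import tempfile

def dump_results(path, rows, header):
    """Write training results to a CSV file with a header row."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)

# Hypothetical per-epoch log; replace with values from the actual training run.
results = [(1, 0.92), (2, 0.71), (3, 0.55)]

out_path = os.path.join(tempfile.gettempdir(), "rnn_results.csv")
dump_results(out_path, results, header=["epoch", "loss"])

# Read the file back to confirm what the downstream tool will see.
with open(out_path, newline="") as f:
    loaded = list(csv.reader(f))
```

The analysis side then only needs to parse a plain table and never has to touch the training code.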