Recent comments in /f/MachineLearning
tysam_and_co OP t1_j6gzgf9 wrote
Reply to comment by McCheng_ in [R] Train CIFAR10 in under 10 seconds on an A100 (new world record!) by tysam_and_co
I've created a writeup for each release in https://github.com/tysam-code/hlb-CIFAR10/releases
Maybe I could do more at some point (like the original "bag of tricks" blogpost) once the smoke all clears, but I spent ~8-10 hours straight manually tuning hyperparameters yesterday night so I am smoked! Though usually I am pretty beat on release days due to trying to keep up with everything in the dopamine rush. :D
_poisonedrationality t1_j6gz4it wrote
Reply to comment by iamsunnycoast in [P] Automating a Youtube Shorts channel with Huggingface Transformers and After Effects by Ch1nada
Really? I see a lot of text to speech channels doing stuff like reading reddit comments. Are you sure youtube demonetizes these because I don't think people would bother without monetization.
Vegetable-Skill-9700 OP t1_j6gygb7 wrote
Reply to comment by jabarifowle in [P] Launching my first ever open-source project and it might make your ChatGPT answers better by Vegetable-Skill-9700
Thanks!
Main_Mathematician77 t1_j6gy1gn wrote
Reply to [D] AI Theory - Signal Processing? by a_khalid1999
signal decomposition (Fourier, SVD, Laplace), compression, convolutions, it’s also adjacent to lots of linear algebra/search(inner product spaces, kernels, mean squares error, optimization, etc), I’m only scratching the surface
jabarifowle t1_j6gx2ya wrote
McCheng_ t1_j6gwm0w wrote
Can you summarise the tricks that you have used to make it fast?
hapliniste t1_j6gvcgp wrote
Reply to comment by SaifKhayoon in [R] META presents MAV3D — text to 3D video by SpatialComputing
I guess AR glasses will make access to 3d video (as in first person scanned scenes) way easier (for the companies that control the glasses OS).
MrAcurite t1_j6gr1n8 wrote
Reply to comment by a_khalid1999 in [D] AI Theory - Signal Processing? by a_khalid1999
I would argue that EE is actually a better major than CS for ML. It beefs up your Math and Statistics chops with DiffEQ, Quantum, and the like, and also includes enough Linear Algebra and Statistics to get you sorted. As a Math major doing ML research, I'm kind of embarrassed by how weak my background in Signal Processing is, and am working through a textbook on DSP in my spare time to fix that.
LetWrong1932 t1_j6gpp34 wrote
Reply to comment by AvailablePresent1113 in [D] CVPR Reviews are out by banmeyoucoward
there's no such "minimum score" because there's always an exception but heard that 3 3 4 or 3 4 4 fairly gets accepted (not sure tho)
jobeta t1_j6gpgqj wrote
Reply to comment by tysam_and_co in [R] Train CIFAR10 in under 10 seconds on an A100 (new world record!) by tysam_and_co
What’s the trick you’re most proud of?
PretendiFendi t1_j6gox66 wrote
Reply to comment by PretendiFendi in [D] Remote PhD by TheRealMrMatt
Edit: just wanted to add that I only found out what he was doing through a friend of mine being friends with his wife independently at her university. Wild story.
PretendiFendi t1_j6goq91 wrote
Reply to [D] Remote PhD by TheRealMrMatt
No probably not. If you have money to travel and your work can be done at home, you could probably arrange to fly back and forth later in your phd. No one is going to know where you are.
I did an experimental PhD, and there was a guy in my group who had a baby with his phd student wife across the country. He flew back and forth without any of us knowing. He would typically do weeks of work in a single week by working literally 20 hours a day and then go stay with her for two weeks and fly back before his next meeting so no one knew. We all didn’t know bc he was co-advised, so we all thought when he wasn’t in our lab he was in the other.
What I’m saying is that as you get older and more independent you can get away with a lot, people don’t check how you’re doing your work.
bo_peng OP t1_j6gnqrp wrote
Reply to comment by Gody_Godee in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
2006.16236 is bad at any nontrivial task such as language modeling.
Red-Portal t1_j6gn8u1 wrote
Reply to comment by randomusername11010 in [D] AI Theory - Signal Processing? by a_khalid1999
Correlations are the same as convolutions with just the kernel flipped.
tysam_and_co OP t1_j6gjzzf wrote
Reply to comment by jobeta in [R] Train CIFAR10 in under 10 seconds on an A100 (new world record!) by tysam_and_co
Many thanks! I've found that the 'speed hunger' for me is truly insatiable -- we're almost at half the time it takes to train as when we started, and I find myself just as hungry to make it faster and faster. The Sisyphean hill is real, though I suppose it is more easily justified with a goal in mind! 😄😁
randomusername11010 t1_j6gjthg wrote
Reply to [D] AI Theory - Signal Processing? by a_khalid1999
CNNs are incorrectly named because they use autocorrelation functions which is super common technique from extracting signals from noise. The CNN is essentially learning filters to extract those signals from images
miracle_jinjin t1_j6gj5e8 wrote
Reply to comment by Late-Associate8835 in Apple AI Residency 2023 [R] by Extension-Reward5756
do you mind sharing some info that you got during the information session? I registered but missed it so have no idea on how to prepare the interview...
jobeta t1_j6gj0er wrote
dineNshine t1_j6gikpr wrote
Reply to comment by mirrorcoloured in [D] Couldn't devs of major GPTs have added an invisible but detectable watermark in the models? by scarynut
Children and pets are not the same as adults. Guns are also different from language models and image generators. A gun is a weapon, but a language model isn't.
Adding certain protections might be necessary for objects that can otherwise cause bodily harm to the user (e.g. gun safeties), but if you think that people must be prevented from accessing information because they are too stupid to properly evaluate it, then you might as well abolish democracy.
I am not doubting that people can evaluate information incorrectly. The issue is that nobody can do it in an unbiased way. The people doing the censorship don't know all that much better and often don't have the right intentions, as is often demonstrated.
It has been shown that ChatGPT has strong political biases as a result of the tampering applied to make it "safe". I find this concerning.
[deleted] t1_j6gfj9z wrote
Reply to comment by Drisku11 in [R] InstructPix2Pix: Learning to Follow Image Editing Instructions by Illustrious_Row_9971
[removed]
nmkd t1_j6gaf5g wrote
Reply to comment by onnod in [R] InstructPix2Pix: Learning to Follow Image Editing Instructions by Illustrious_Row_9971
I think so, haven't tried though
deep_noob t1_j6gab7l wrote
Reply to comment by toftinosantolama in [D] CVPR Reviews are out by banmeyoucoward
I think during discussion they will know about the other reviews, otherwise there is point of discussion
onnod t1_j6g9t00 wrote
Reply to comment by nmkd in [R] InstructPix2Pix: Learning to Follow Image Editing Instructions by Illustrious_Row_9971
>InstructPix2Pix
Can this run on CPU?
doctorjuice t1_j6h0hpn wrote
Reply to [P] Automating a Youtube Shorts channel with Huggingface Transformers and After Effects by Ch1nada
Very cool! How did you get Pexel to give you relevant videos? I’ve tried to use Pexel for video editing before, but it usually gave videos that didn’t really make sense for the content.