Recent comments in /f/MachineLearning
__mantissa__ t1_jameavh wrote
Reply to [D] Podcasts about ML research? by Tight-Vacation-9410
In addition to the already mentioned ones, I recommend The Thesis Review Podcast for deep conversations about pure research
TobusFire OP t1_jamcrd2 wrote
Reply to comment by [deleted] in [D] Are Genetic Algorithms Dead? by TobusFire
This is a reasonable question, but I believe you are misunderstanding. The randomization of parameters in a neural network (I assume you are talking about initialization?) is certainly not the same as a mutation in a GA. Mutations occur randomly, sure, but they are then selected for and crossed over, whereas hill climbing and gradient descent simply follow the gradient and use neither random mutation nor crossover, so they are not genetic.
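To make the distinction concrete, here is a minimal sketch of the mutate / select / cross-over loop. The toy objective (OneMax, i.e. counting 1-bits), the function names, and every parameter value are assumptions chosen for illustration, not anything from a specific library.

```python
import random

def fitness(bits):
    # Toy objective (assumed for illustration): number of 1-bits.
    return sum(bits)

def mutate(bits, rate, rng):
    # Flip each bit independently with probability `rate`.
    return [b ^ 1 if rng.random() < rate else b for b in bits]

def crossover(a, b, rng):
    # Single-point crossover: prefix from one parent, suffix from the other.
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]

def evolve(n_bits=32, pop_size=40, generations=60, rate=0.02, seed=0):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: the fitter half survives and becomes the parent pool.
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]
        # Variation: children come from crossover followed by mutation.
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rate, rng)
            for _ in range(pop_size - len(parents))
        ]
        pop = parents + children
    return max(pop, key=fitness)

best = evolve()
print(fitness(best), "of", len(best), "bits set")
```

Note that nothing here computes a gradient: progress comes only from random variation plus selection, which is the part that gradient descent lacks.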
rm-rf_ t1_jambi6b wrote
Reply to comment by sugar_scoot in [D] Are Genetic Algorithms Dead? by TobusFire
Don't Bayesian approaches generally work better in gradient-free optimization?
harharveryfunny t1_jamab7m wrote
Reply to comment by londons_explorer in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
The two pair up very well though - now that there's a natural language API, you could leverage that for speech->text->ChatGPT. From what I've seen of the Whisper demos, it seems to be the best out there by quite a margin. Does anything else perform as well?
londons_explorer t1_jam8409 wrote
Reply to [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
It was an interesting business decision to make a blog post announcing two rather different products (ChatGPT API and Whisper) at the same time...
ChatGPT is a best-in-class, or even only-in-class, chatbot API... while Whisper is one of many hosted speech-to-text solutions.
Smallpaul t1_jam83rb wrote
Reply to comment by MonstarGaming in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
I guess you haven’t visited any B2C websites in the last 5 years.
But also: there is a world model behind the chatbot which can translate between human languages, between computer languages, can compose marketing copy, summarise text...
Smallpaul t1_jam7abr wrote
Reply to comment by WarAndGeese in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
> Don't let it demotivate competitors. They are making money somehow,
What makes you so confident?
BigBayesian t1_jam6z7u wrote
Reply to [D] Are Genetic Algorithms Dead? by TobusFire
Genetic algorithms are good, as you said, when you really understand the space and can come up with a really good candidate generation system. They’re okayish (or, the same as everything else) when you have no understanding of the space at all, and you’re just totally guessing. They can’t latch onto a curve in design space as well as things that look at a simpler gradient can. So maybe they’re best used for really complex spaces where gradient based methods don’t do well. The kind of places you’d use Gibbs sampling, or general optimization algorithms.
So, basically, they’re useful when you have good feature engineering already done, like many methods that have fallen out of vogue in the age of letting algorithms and data do your feature engineering for you. And they’re as good a shot in the dark as any when standard methods fail and you’ve got no clue how to proceed.
So, yeah, the number of times genetic algorithms are the “right” choice is pretty limited these days.
Im2bored17 t1_jam6y5y wrote
Reply to comment by xGovernor in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
$20.00 / ($0.002 / 1k tokens) = 10M tokens. If you only used a few hundred k, you got scammed hard lol
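The arithmetic can be checked with a couple of lines. This is only a sketch: the helper name is made up, and it assumes a flat per-1k-token price with no other fees.

```python
def tokens_for_budget(budget_usd, price_per_1k_tokens_usd):
    # Hypothetical helper: how many tokens a budget buys at a flat per-1k price.
    return round(budget_usd / price_per_1k_tokens_usd * 1000)

tokens = tokens_for_budget(20.00, 0.002)
print(f"{tokens:,} tokens")
```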
londons_explorer t1_jam6r8g wrote
Reply to comment by LetterRip in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
Don't you mean the other way around?
mikonvergence OP t1_jam6qmo wrote
Reply to comment by mikonvergence in [P] A minimal framework for image diffusion (including high-resolution) by mikonvergence
MIT license has now been added to the project!
londons_explorer t1_jam6oyr wrote
Reply to comment by LetterRip in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
Aren't biases only a tiny tiny fraction of the total memory usage? Is it even worth trying to quantize them more than weights?
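As a rough sanity check on the "tiny fraction" intuition, here is a sketch for a single plain fully connected layer. The layer sizes are hypothetical and not taken from any particular model.

```python
def bias_fraction(d_in, d_out):
    # A dense layer has d_in * d_out weights and d_out biases.
    weights = d_in * d_out
    biases = d_out
    return biases / (weights + biases)

# Hypothetical 4096 x 4096 layer: the fraction simplifies to 1 / (d_in + 1).
print(f"{bias_fraction(4096, 4096):.6f}")
```

For a layer like this, biases are roughly 0.02% of the parameters, so compressing them further saves very little memory.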
Smallpaul t1_jam6mjl wrote
Reply to comment by Educational-Net303 in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
1 or 2 months??? How would that short a time achieve the goal against well-funded competitors?
It would need to be multiple years of undercutting, and even that might not be enough to lock Google out.
Smallpaul t1_jam6et8 wrote
Reply to comment by andreichiffa in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
They’ll need a stronger story around lock-in if that’s their strategy. One way would be to add structured and unstructured data storage to the APIs.
mikonvergence OP t1_jam6eal wrote
Reply to comment by blabboy in [P] A minimal framework for image diffusion (including high-resolution) by mikonvergence
Thanks for pointing this out! I'll add some permissive license to the repository today to allow free use!
Smallpaul t1_jam673c wrote
Reply to comment by jinnyjuice in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
I think you are using the word revenue when you mean profit.
blabboy t1_jam5ppi wrote
Looks great! Is the code under a specific licence?
M_Alani t1_jam3i7i wrote
Reply to comment by [deleted] in [D] Are Genetic Algorithms Dead? by TobusFire
It wasn't as bad as it sounds. The fun part was that you had to understand how every little piece of the algorithm works, and the nightmare was implementing all of this with 512 MB of RAM. We didn't have the luxury of trying different solutions.
[deleted] t1_jam354o wrote
Excellent
Stakbrok t1_jam0bpq wrote
Reply to comment by iTrooz_ in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
You can edit what it replied of course (and then hope it builds off of that and keeps that specific vibe going, which always works in the playground) but damn, they locked it down tight. 😅
Even when you edit the primer/setup into something crazy (you are a grumpy or deranged or whatever assistant) and change some of its replies into something crazy, it overrides the custom mood you set for it and goes right back to its ever-serious ChatGPT mode. Sometimes it even apologizes for saying something out of character (and by that it means the thing you 'made it say' by editing, so it believes it said that).
bjergerk1ng t1_jalzy46 wrote
Reply to comment by xEdwin23x in [D] What are the most known architectures of Text To Image models ? by AImSamy
That's not diffusion though
ID4gotten t1_jalytoe wrote
Reply to comment by protonpusher in [D] Are Genetic Algorithms Dead? by TobusFire
This is actually a very cool approach, if more narrow in application. Other domains that have these linear representations like the Morgan fingerprint might also benefit.
themrzmaster t1_jalxwx1 wrote
Reply to comment by _simple_machine_ in [D] Are Genetic Algorithms Dead? by TobusFire
Yes, there is a whole field https://arxiv.org/abs/2008.10937
ID4gotten t1_jalww3w wrote
Reply to comment by bo_peng in [P] ChatRWKV v2 (can run RWKV 14B with 3G VRAM), RWKV pip package, and finetuning to ctx16K by bo_peng
I'll give it a try, thanks
ShowerVagina t1_jamiqb4 wrote
Reply to comment by jturp-sc in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
> I had an exhausting number of conversations with confused product managers, engineers and marketing managers on “No, we’re not using ChatGPT”.
They use your conversations for further training, which means that if you use it to help you with proprietary code or documentation, you're effectively disclosing that.