Recent comments in /f/MachineLearning
LetterRip t1_j4sumo7 wrote
Reply to comment by limpbizkit4prez in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
Receptance Weighted Key Value (RWKV).
Unlikely-Advice-7168 t1_j4st6nt wrote
Reply to comment by Apprehensive-Tax-214 in [P] Built an at-cost, pay per second, open-source API for Tortoise text-to-speech (best I've heard!) by Apprehensive-Tax-214
This is what I found. My literacy in this stuff is limited, but I checked my account and all is good there.
gdpoc t1_j4ssata wrote
Reply to comment by Franck_Dernoncourt in [D] Unlocking the Potential of ChatGPT: A Community Discussion by North-Ad6756
ChatGPT (and large language models in general) is a great generalist and would likely be very useful for predicting 'root node' locations in a knowledge graph, which would allow finding the correct content from a minimal subset.
ChatGPT sucks with details, yes, but for use in a recommendation algorithm that depends on the graph, I think that issue could be minimized.
Apprehensive-Tax-214 OP t1_j4soopu wrote
Reply to comment by Unlikely-Advice-7168 in [P] Built an at-cost, pay per second, open-source API for Tortoise text-to-speech (best I've heard!) by Apprehensive-Tax-214
So only two other people have had this error. One was resolved by verifying their GitHub email. Haven't heard back from the other.
hayder978 t1_j4soixh wrote
How long does speaker diarization take per hour of audio?
Unlikely-Advice-7168 t1_j4soi7h wrote
Reply to comment by Apprehensive-Tax-214 in [P] Built an at-cost, pay per second, open-source API for Tortoise text-to-speech (best I've heard!) by Apprehensive-Tax-214
Yup, I'm going to try it on another computer to see if it works there.
If it's just me, then there must be a problem on my end.
UltimateGPower t1_j4snmvi wrote
Reply to comment by SnooWords6686 in [P] featureimpact: A Python package for estimating the impact of features on ML models by cblume
It's pip install scikit-learn. The sklearn package on PyPI was just a shadow package, and it has been deprecated.
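To make the naming concrete: the name you import stays sklearn, while the name you install is scikit-learn. Below is a small illustrative helper. The pip_name function and its lookup table are hypothetical, written only for this example, and the table lists a few well-known mismatches rather than anything exhaustive.

```python
# Hypothetical lookup table (not part of any library): import name -> PyPI name.
IMPORT_TO_PIP = {
    "sklearn": "scikit-learn",
    "cv2": "opencv-python",
    "PIL": "Pillow",
    "yaml": "PyYAML",
}


def pip_name(import_name: str) -> str:
    """Return the name to pass to `pip install` for a given import name."""
    # Fall back to the import name itself when no mismatch is recorded.
    return IMPORT_TO_PIP.get(import_name, import_name)


print(pip_name("sklearn"))  # scikit-learn
print(pip_name("numpy"))    # numpy
```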
fimari t1_j4slzng wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
Uh, getting customers is the biggest trick nobody will tell you (because they could just grab them for themselves). People in ML got rich by different means, like scraping Fiverr and auto-generating content, but as soon as everyone is on the wagon, no one is.
Kinwwizl t1_j4slam2 wrote
Reply to [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
That's one of the reasons GCP is nice for ML training workloads: you can kill the VM after training finishes by calling poweroff at the end of the training bash script.
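The same pattern can be sketched in Python for anyone who launches training from a script rather than bash. This is a minimal sketch, not the commenter's setup: the run_then_shutdown function, its parameters, and the default "sudo poweroff" command are assumptions, and the runner argument exists only so the shutdown step can be swapped out when trying it.

```python
import subprocess
import sys
from typing import Callable, Sequence


def run_then_shutdown(
    train_cmd: Sequence[str],
    shutdown_cmd: Sequence[str] = ("sudo", "poweroff"),  # assumed command
    runner: Callable[[Sequence[str]], int] = subprocess.call,
) -> int:
    """Run the training command, then the shutdown command, whatever the outcome."""
    try:
        exit_code = runner(train_cmd)
    finally:
        # Runs even if launching the job raises, so a crash does not leave the VM up.
        runner(shutdown_cmd)
    return exit_code


if __name__ == "__main__":
    # Harmless demo: both commands are no-op Python calls, not a real job or poweroff.
    noop = [sys.executable, "-c", "pass"]
    print(run_then_shutdown(noop, shutdown_cmd=noop))
```

In real use, train_cmd would be the actual training invocation and shutdown_cmd would be left at its default, which needs permission to power off the machine.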
Franck_Dernoncourt t1_j4scxq2 wrote
Reply to comment by gdpoc in [D] Unlocking the Potential of ChatGPT: A Community Discussion by North-Ad6756
Wouldn't ChatGPT inaccuracies be an issue if used for education?
gdpoc t1_j4sc11w wrote
Reply to comment by Franck_Dernoncourt in [D] Unlocking the Potential of ChatGPT: A Community Discussion by North-Ad6756
Education first.
Franck_Dernoncourt t1_j4sbqee wrote
Reply to comment by gdpoc in [D] Unlocking the Potential of ChatGPT: A Community Discussion by North-Ad6756
For which downstream application?
Franck_Dernoncourt t1_j4sbgvu wrote
Reply to comment by Haunting-Ad-5191 in [D] Unlocking the Potential of ChatGPT: A Community Discussion by North-Ad6756
> I mean some kind of home assistant that integrates CHATGPT is obvious right?
How do you handle the fact that some answers are inaccurate?
limpbizkit4prez t1_j4sarps wrote
Reply to [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
What does RWKV stand for?
GPT-5entient t1_j4s8q64 wrote
Reply to comment by ThirdMover in [D] Bitter lesson 2.0? by Tea_Pearce
>I think the point of the metaphor was Amazon stealing product ideas from third party vendors on their site and undercutting them. They know what sells better than anyone and can then just produce it.
In many cases they are probably just selling the same white label item outright, just slapping on "Amazon Basics"...
gdpoc t1_j4s74zi wrote
ChatGPT, coupled with a dynamic, searchable (log(n) query) knowledge graph, and an algorithm to optimize that graph to maximize educational growth.
ndemir t1_j4s5774 wrote
Reply to [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
good idea.
HolySoviet t1_j4s2p2c wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
I also work at FAANG, but as an SDE, and have been thinking of transitioning to an MLE role. May I ask what made you tired of your current role? Do you just want to do something different?
currentscurrents t1_j4s2n9t wrote
Reply to comment by _Arsenie_Boca_ in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
It looks like he goes into a lot more detail on his GitHub.
ClayStep t1_j4s2dfp wrote
Reply to [D] Is it possible to update random forest parameters with new data instead of retraining on all data? by monkeysingmonkeynew
Hackiest solution I can think of:
Just add new trees, trained on the new data, to the forest, and weight the trees by how new their data is... (assuming we care more about the new data)
(probably a terrible idea)
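A rough sketch of that hack, using only the standard library. Everything here is an assumption for illustration: the RecencyWeightedForest class, the exponential decay weighting, and the stand-in "trees" (plain callables) are hypothetical. With scikit-learn, the new trees could come from a forest fitted on the new data, and RandomForestClassifier's warm_start=True option can add trees on a later fit call, but it does not weight them by recency.

```python
from collections import defaultdict
from typing import Callable, Dict, Hashable, List, Sequence

# A "tree" here is any callable mapping a feature vector to a class label.
Tree = Callable[[Sequence[float]], Hashable]


class RecencyWeightedForest:
    """Hypothetical ensemble: newer batches of trees get larger vote weights."""

    def __init__(self, decay: float = 0.5) -> None:
        self.decay = decay  # assumed: weight shrinks by this factor per batch of age
        self.batches: List[List[Tree]] = []

    def add_batch(self, trees: List[Tree]) -> None:
        """Append trees trained on the newest data; older batches stay untouched."""
        self.batches.append(trees)

    def predict(self, x: Sequence[float]) -> Hashable:
        """Return the label with the largest recency-weighted vote."""
        votes: Dict[Hashable, float] = defaultdict(float)
        newest = len(self.batches) - 1
        for index, trees in enumerate(self.batches):
            weight = self.decay ** (newest - index)  # newest batch has weight 1.0
            for tree in trees:
                votes[tree(x)] += weight
        return max(votes, key=votes.get)


forest = RecencyWeightedForest(decay=0.5)
forest.add_batch([lambda x: "old", lambda x: "old", lambda x: "old"])  # 3 trees, old data
forest.add_batch([lambda x: "new", lambda x: "new"])                   # 2 trees, new data
print(forest.predict([0.0]))  # new  (2 * 1.0 beats 3 * 0.5)
```

The old trees are never retrained, so the forest only grows. Some pruning of the oldest batches would be needed in practice.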
_Arsenie_Boca_ t1_j4rxdt8 wrote
Reply to comment by bo_peng in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
Is there a more detailed description? It would be interesting to read about all these new ideas :)
mrconter1 OP t1_j4rsus2 wrote
Reply to comment by navillusr in [R] The Unconquerable Benchmark: A Machine Learning Challenge for Achieving AGI-Like Capabilities by mrconter1
Really appreciate your feedback.
> The distinctions you’re drawing, pixels vs selenium output and browser vs os, are far less significant than the complexity of the tasks (step-by-step vs entire processes). What they’ve achieved is strictly harder for humans than what you are testing. We can argue whether perception or planning are harder for current technology (the computer vision is far more developed than AI planning right now), but I think you need to reconsider the formulation of your tasks. It seems like they are designed to be easy enough for modern methods to solve.
I'm not sure about this. Being able to do the next click on a large, diverse benchmark of screenshots is extremely difficult for a computer today. It would need to be able to:
- Choose the next chess move if I am in a chess application
- Recognize the color palette icon on the keyboard if I ask it to change the color of the keyboard
- Recognize the Gmail icon if I say "send an email"
- Change the keyboard mode if I ask it to write an exclamation mark
- Press the key "2" if I ask it to type the number equivalent to the number of consuls that traditionally held the office at the same time in ancient Rome.
That's way outside what current models can do, but humans could do it easily. This benchmark would be extremely simple and intuitive for humans to complete (even with far-fetched goals), but there is no model today capable of even knowing that it should press on the new-line location given a screenshot and the instruction "Add line".
> On another note, most interesting tasks can’t be completed with just an x,y mouse location output. Why did you decide to restrict the benchmark to such a limited set of tasks?
I wrote about this in the README. There is no reason. It's just easier to explain the idea to people this way. I think the most powerful variant of this idea would take a series of frames (video context) and instructions and output one of the following:
- Click
- Press (X seconds)
- Move from P1 to P2 (X seconds)
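One way to picture that output space is as a small set of typed actions. This is only a sketch: the class names, the pixel-coordinate convention, and the idea that a press carries a screen position are all assumptions, not something the benchmark defines.

```python
from dataclasses import dataclass
from typing import Tuple, Union

Point = Tuple[int, int]  # (x, y) in screen pixels (assumed convention)


@dataclass(frozen=True)
class Click:
    at: Point


@dataclass(frozen=True)
class Press:
    at: Point        # assumed: a press also needs a location
    seconds: float   # how long to hold


@dataclass(frozen=True)
class Move:
    start: Point     # P1
    end: Point       # P2
    seconds: float   # duration of the movement


Action = Union[Click, Press, Move]


def describe(action: Action) -> str:
    """Render an action as a short human-readable string."""
    if isinstance(action, Click):
        return f"click at {action.at}"
    if isinstance(action, Press):
        return f"press at {action.at} for {action.seconds}s"
    return f"move from {action.start} to {action.end} over {action.seconds}s"


print(describe(Move(start=(10, 20), end=(30, 40), seconds=0.5)))
# move from (10, 20) to (30, 40) over 0.5s
```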
The benchmark is simple enough to understand and explain that you can start to envision what such a model would be able to do. Or, much more interesting, what it would not be able to do.
If you have any more feedback or thoughts, please reply. I wish more people were interested, but either the idea sucked or I need to create something interactive for people.
LetGoAndBeReal t1_j4t6yb6 wrote
Reply to [D] Unlocking the Potential of ChatGPT: A Community Discussion by North-Ad6756
I'm a bit unclear why this announcement is so significant, and frankly I'm not even sure I understand it. We already have API access to the text-davinci-003 model, and my understanding is that ChatGPT basically uses the same model with a small amount of incremental tuning.
Is this announcement just saying that this marginally revised model will now be available as a model option through the OpenAI API? If so, what benefit does this provide over the API access using text-davinci-003?