Recent comments in /f/MachineLearning
regular-jackoff t1_j69c2ar wrote
Reply to comment by ginsunuva in [D] MusicLM: Generating Music From Text by carlthome
Contractors paid $2 an hour
regular-jackoff t1_j69biyw wrote
Reply to comment by Acceptable-Cress-374 in [D] MusicLM: Generating Music From Text by carlthome
It's transformers all the way down
regular-jackoff t1_j69bh1m wrote
Reply to comment by ginsunuva in [D] MusicLM: Generating Music From Text by carlthome
Attention is all you need
zoontechnicon t1_j69b6g5 wrote
Reply to comment by zoontechnicon in [D] Simple Questions Thread by AutoModerator
The solution, as evidenced by code in huggingface/transformers is to force the probability of the end token to -Inf. What a hack...
GhostCheese t1_j699f2m wrote
Reply to comment by youcandigit in [R] META presents MAV3D — text to 3D video by SpatialComputing
in the offices of meta?
doesn't look like they provide a portal to use it, just showing off what they can do.
CKtalon t1_j695owv wrote
Reply to comment by NoFairYouCheated in [R] SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot by Secure-Technology-78
No. There are blog posts about it performing quite badly: https://www.surgehq.ai/blog/how-good-is-hugging-faces-bloom-a-real-world-human-evaluation-of-language-models
Then based on the Chinchilla paper, you can kind of infer that it's a result of undertraining.
SupplyChainPhd t1_j690ygz wrote
Reply to comment by Vegetable-Skill-9700 in [P] Launching my first ever open-source project and it might make your ChatGPT answers better by Vegetable-Skill-9700
Can’t wait to check it out Monday
NoFairYouCheated t1_j68z10h wrote
Reply to comment by CKtalon in [R] SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot by Secure-Technology-78
Are there any papers or blog posts discussing this undertraining?
kiteguycan t1_j68xk83 wrote
Would be cool if it could take a book as an input and immediately make it into a passable movie
Grimaldi2 t1_j68wr7g wrote
Reply to comment by marcingrzegzhik in [P] Using algorithms or models from papers for commercial use by romantimm25
Patents might be the real showstopper in this case. Be careful
deathtosquishy t1_j68vfh4 wrote
Now this is what I've been waiting for. Can it create obscene images is the question?
throwaway2676 t1_j68vbfq wrote
Reply to comment by currentscurrents in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents
> Not in 40 years when computers are 1000x better.
It won't take anywhere near that long. We've barely scratched the surface of ASICs and analog matrix multiplication, which is where the real fun is going to begin.
marcingrzegzhik t1_j68ugfe wrote
Great post! I'm really excited to explore this project and see what kind of applications it has! Can you tell us a bit more about what kind of data it works with and how it works?
Vegetable-Skill-9700 OP t1_j68t7rk wrote
Reply to comment by Acceptable-Cress-374 in [P] Launching my first ever open-source project and it might make your ChatGPT answers better by Vegetable-Skill-9700
Lol, I get the joke now, it's a good one! Thanks for bookmarking!
[deleted] t1_j68so1p wrote
Reply to Apple AI Residency 2023 [R] by Extension-Reward5756
[deleted]
marcingrzegzhik t1_j68pygm wrote
That's really interesting! I wonder what other advances they have made with their large team of contractors. It would be great to see the results of their work!
Acceptable-Cress-374 t1_j68pjdo wrote
Reply to comment by Vegetable-Skill-9700 in [P] Launching my first ever open-source project and it might make your ChatGPT answers better by Vegetable-Skill-9700
Ah, you're so sweet! I was actually setting up the updog joke :)
I checked & bookmarked the repo. Looks promising!
yazriel0 OP t1_j68nwcd wrote
So, this is +++ for codex quality.
But a --- for future prospects of GPT5-ish, AGI and our new overlords ?
bridgeton_man t1_j68njfs wrote
Reply to [D] Simple Questions Thread by AutoModerator
Quesiton about goodness of fit.
​
For regressions, R-squared and Adj. R-Squared are typically considered the primary goodness-of-fit measures.
​
But in many supervised machine-learning models, RMSE is the main measure that I keep running across. For example, decision tree models that I create in R using Rpart do that.
​
So, my question is how to compare the predictive accuracy of OLS regression models that report R-sq to equivalent Rpart regression trees that report RMSE.
Herrmaciek t1_j68kkbi wrote
Billions well spent
pulpquoter t1_j68hppt wrote
Brilliant. How about the thing that you put on your head and see images? This must be worth trillions.
DCBAtrader t1_j68h2fr wrote
Reply to [D] Simple Questions Thread by AutoModerator
Basic question on regression/AutoML (pycaret mainly).
When do p-values versus error metric (MAE, MSE, R Squared matter).
My previous model building experience (multivariate regression) was to first use various combinations of variables in OLS such that all the variables were statistically significant, and then use an AutoML (pycaret) to build models, and judge them by MAE, MSE or R squared. Using proper cross-validation test/train splits of course.
I'm wondering if this step is needed, and I just can just run the entire data-set in pycaret, and thus judge a model based on said metrics (MAE, MSE, R squared)?
My gut says that the simpler model with stat. significant variables should perform better but maybe I can just look at the best error metric?
Vegetable-Skill-9700 OP t1_j68g80z wrote
Reply to comment by Acceptable-Cress-374 in [P] Launching my first ever open-source project and it might make your ChatGPT answers better by Vegetable-Skill-9700
So, you know how it’s almost impossible to build 100% accurate and super-generalised ML models. On top, the performance of these models degrade over time. Furthermore, due to the black boxiness of ML models, identifying problems with them and fixing those problems is super-hard.
UpTrain solves for these exact issues. It identifies cases where the model is going wrong, collects those problematic data-points and retrains the model on them to improve it's accuracy!
You can checkout the repo here: https://github.com/uptrain-ai/uptrain
Anechoic-Chamber t1_j68f61p wrote
Reply to comment by Late-Associate8835 in Apple AI Residency 2023 [R] by Extension-Reward5756
Me too! What team(s)?
[deleted] t1_j69ceeq wrote
Reply to [R] META presents MAV3D — text to 3D video by SpatialComputing
[deleted]