Recent comments in /f/MachineLearning
[deleted] t1_jc2tugi wrote
[deleted]
TwoTurnWin t1_jc2thhm wrote
Reply to [D] Simple Questions Thread by AutoModerator
So I'm working with the UrbanSound 8k set on Kaggle.
I want to try two approaches:
- MFCCs and Mels for image classification.
- Raw audio data classification.
Would a 1DCNN work for both approaches?
light24bulbs t1_jc2s2oc wrote
Reply to comment by Kinexity in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
Oh, definitely, it's an amazing optimization.
But less than a token a second is going to be too slow for a lot of real time applications like human chat.
Still, very cool though
AnomalyNexus t1_jc2ictg wrote
Reply to [D] Simple Questions Thread by AutoModerator
Do I need a specific GPU generation for 4bit weights? Or just anything that supports tensorflow/pytorch?
jacobgil OP t1_jc2hgbr wrote
Reply to comment by Kaleidophon in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Thanks! Will add citation info there!
jacobgil OP t1_jc2heig wrote
Reply to comment by Balance- in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Thanks! Following your suggestion I posted to r/DataScience
jacobgil OP t1_jc2hcov wrote
Reply to comment by blablanonymous in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Thanks!
jacobgil OP t1_jc2hc8h wrote
Reply to comment by mfarahmand98 in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Thanks!
jacobgil OP t1_jc2hboz wrote
Reply to comment by francozzz in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Thanks!
jacobgil OP t1_jc2hb5f wrote
Reply to comment by fastglow in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Thanks!
jacobgil OP t1_jc2hal0 wrote
Reply to comment by jonnyyen in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Cool!
jacobgil OP t1_jc2h94t wrote
Reply to comment by Valuable-Kick7312 in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil
Yes. I think confidence intervals assume iid. If they are not iid, then the CI could be too short.
bo_peng OP t1_jc2alfm wrote
Reply to comment by KerfuffleV2 in [P] ChatRWKV v2 (can run RWKV 14B with 3G VRAM), RWKV pip package, and finetuning to ctx16K by bo_peng
Try rwkv 0.4.0 & latest ChatRWKV for 2x speed :)
threevox t1_jc29kpf wrote
Reply to comment by I_will_delete_myself in [R] Introducing Ursa from Speechmatics | 25% improvement over Whisper by jplhughes
It’s fine, open source SOTA will make them irrelevant sooner rather than later
optorobotics t1_jc24knt wrote
Reply to [D] Development challenges of an autonomous gardening robot using object detection and mapping. by science-raven
seriously, how much would you pay if such a robot exist say can do weeding, digging, sowing seeds, irrigation. Would you buy it?
luaks1337 t1_jc24dqa wrote
Reply to comment by remghoost7 in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
They managed to run the 7B model on a Raspberry PI and a Samsung Galaxy S22 Ultra.
bangbangwo t1_jc21eac wrote
Reply to [D] Simple Questions Thread by AutoModerator
Hey, I'm new at ML and I have a question. I've created a LSTM and XGBoost model etc, trained it, evaluated it etc. But now, how do I actually forecast future data ? Do you have a notebook where the creator actually plot predictions? I can't seem to find one !
MorallyDeplorable t1_jc1umt7 wrote
Reply to comment by Necessary_Ad_9800 in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
I'm not actually sure. I've just been chatting with people in an unrelated Discord's off topic channel about it.
I'd post some of what I've got from it but I have no idea what I'm doing with it and don't think what I'm getting would be decently representative of what it can actually do.
denxiaopin t1_jc1uk2f wrote
Reply to [D] Simple Questions Thread by AutoModerator
How difficult and time consuming is it to teach AI how to choose glasses according to the type of face with tools we have today?
serge_cell t1_jc1tqaq wrote
Reply to comment by OptimizedGarbage in [N] Man beats machine at Go in human victory over AI : « It shows once again we’ve been far too hasty to ascribe superhuman levels of intelligence to machines. » by fchung
see previous response
serge_cell t1_jc1to7o wrote
Reply to comment by ertgbnm in [N] Man beats machine at Go in human victory over AI : « It shows once again we’ve been far too hasty to ascribe superhuman levels of intelligence to machines. » by fchung
There was a paper about it. There was a find - specific set of positions not encountered or pooply represented during self-play. Fully trained AlphaGo was failing on those positions. However then they were explicitly added to the training set the problem was fixed and AlphaGo was able to play them well. This adversarial traning seems just an automatic way to find those positions.
PS fintess landscape is not convex it separated by hills and valleys. Self-play may have a problem in reaching all important states.
G_fucking_G t1_jc1tmli wrote
Reply to comment by CashyJohn in [R] Introducing Ursa from Speechmatics | 25% improvement over Whisper by jplhughes
On which metric are you basing this on? I'm not deep in ASR but in the Whisper paper it is compared to word2vec 2.0 and whisper is better in most categories.
[deleted] t1_jc1tflg wrote
Raise_Fickle t1_jc1tb4r wrote
Reply to [D] Is it possible to train LLaMa? by New_Yak1645
Any idea for finetuning llama on multi-gpu setup?
estrafire t1_jc2umln wrote
Reply to comment by bo_peng in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng
Any particular reason for moving from CNN to RNN?