Recent comments in /f/MachineLearning

estrafire t1_jc2umln wrote on March 13, 2023 at 5:15 PM

Reply to comment by bo_peng in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng

Any particular reason for moving from CNN to RNN?

[deleted] t1_jc2tugi wrote on March 13, 2023 at 5:09 PM

Reply to [D]: Generalisation ability of autoencoders by Blutorangensaft

[deleted]

TwoTurnWin t1_jc2thhm wrote on March 13, 2023 at 5:06 PM

Reply to [D] Simple Questions Thread by AutoModerator

So I'm working with the UrbanSound8K dataset on Kaggle.

I want to try two approaches:

MFCCs and Mels for image classification.
Raw audio data classification.

Would a 1DCNN work for both approaches?

light24bulbs t1_jc2s2oc wrote on March 13, 2023 at 4:57 PM

Reply to comment by Kinexity in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692

Oh, definitely, it's an amazing optimization.

But less than a token a second is going to be too slow for a lot of real time applications like human chat.

Still, very cool though

AnomalyNexus t1_jc2ictg wrote on March 13, 2023 at 3:54 PM

Reply to [D] Simple Questions Thread by AutoModerator

Do I need a specific GPU generation for 4-bit weights? Or just anything that supports TensorFlow/PyTorch?

jacobgil OP t1_jc2hgbr wrote on March 13, 2023 at 3:49 PM

Reply to comment by Kaleidophon in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Thanks! Will add citation info there!

jacobgil OP t1_jc2heig wrote on March 13, 2023 at 3:48 PM

Reply to comment by Balance- in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Thanks! Following your suggestion I posted to r/DataScience

jacobgil OP t1_jc2hcov wrote on March 13, 2023 at 3:48 PM

Reply to comment by blablanonymous in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Thanks!

jacobgil OP t1_jc2hc8h wrote on March 13, 2023 at 3:48 PM

Reply to comment by mfarahmand98 in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Thanks!

jacobgil OP t1_jc2hboz wrote on March 13, 2023 at 3:48 PM

Reply to comment by francozzz in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Thanks!

jacobgil OP t1_jc2hb5f wrote on March 13, 2023 at 3:48 PM

Reply to comment by fastglow in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Thanks!

jacobgil OP t1_jc2hal0 wrote on March 13, 2023 at 3:48 PM

Reply to comment by jonnyyen in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Cool!

jacobgil OP t1_jc2h94t wrote on March 13, 2023 at 3:47 PM

Reply to comment by Valuable-Kick7312 in [P] Introducing confidenceinterval, the long missing python library for computing confidence intervals by jacobgil

Yes. I think the confidence intervals assume the samples are i.i.d. If they are not, the CI could be too narrow.
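A rough sketch of why the independence assumption matters, using a plain percentile bootstrap written with NumPy. This is not the confidenceinterval library's API; the function name `bootstrap_mean_ci` and the toy data are assumptions made up for illustration. Repeating each observation 10 times adds no new information, yet a bootstrap that treats every row as independent reports a much tighter interval.

```python
import numpy as np

def bootstrap_mean_ci(samples, confidence=0.95, n_resamples=2000, seed=0):
    """Percentile bootstrap CI for the mean.

    Hypothetical helper: it resamples individual rows with replacement,
    which implicitly treats every row as an independent draw.
    """
    rng = np.random.default_rng(seed)
    samples = np.asarray(samples, dtype=float)
    n = samples.size
    # One row of indices per resample, each of the same size as the data.
    idx = rng.integers(0, n, size=(n_resamples, n))
    means = samples[idx].mean(axis=1)
    alpha = 1.0 - confidence
    lower, upper = np.quantile(means, [alpha / 2, 1 - alpha / 2])
    return float(lower), float(upper)

rng = np.random.default_rng(1)
independent = rng.normal(size=200)
# 2000 rows, but still only 200 independent pieces of information.
correlated = np.repeat(independent, 10)

lo_i, hi_i = bootstrap_mean_ci(independent)
lo_c, hi_c = bootstrap_mean_ci(correlated)
width_independent = hi_i - lo_i
width_correlated = hi_c - lo_c

print(f"independent rows: width = {width_independent:.3f}")
print(f"repeated rows:    width = {width_correlated:.3f}")
```

The second interval comes out narrower even though both datasets carry the same information, which is the "CI too narrow" effect. If the dependence structure is known (for example, several samples per patient or per recording), resampling whole groups instead of single rows is one common way to account for it.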

bo_peng OP t1_jc2alfm wrote on March 13, 2023 at 3:03 PM

Reply to comment by KerfuffleV2 in [P] ChatRWKV v2 (can run RWKV 14B with 3G VRAM), RWKV pip package, and finetuning to ctx16K by bo_peng

Try rwkv 0.4.0 & latest ChatRWKV for 2x speed :)

threevox t1_jc29kpf wrote on March 13, 2023 at 2:56 PM

Reply to comment by I_will_delete_myself in [R] Introducing Ursa from Speechmatics | 25% improvement over Whisper by jplhughes

It’s fine, open source SOTA will make them irrelevant sooner rather than later

optorobotics t1_jc24knt wrote on March 13, 2023 at 2:21 PM

Reply to [D] Development challenges of an autonomous gardening robot using object detection and mapping. by science-raven

Seriously, how much would you pay if such a robot existed, say one that can do weeding, digging, sowing seeds, and irrigation? Would you buy it?

luaks1337 t1_jc24dqa wrote on March 13, 2023 at 2:19 PM

Reply to comment by remghoost7 in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692

They managed to run the 7B model on a Raspberry Pi and a Samsung Galaxy S22 Ultra.

bangbangwo t1_jc21eac wrote on March 13, 2023 at 1:57 PM

Reply to [D] Simple Questions Thread by AutoModerator

Hey, I'm new to ML and I have a question. I've created an LSTM model and an XGBoost model, trained them, and evaluated them. But now, how do I actually forecast future data? Do you have a notebook where the creator actually plots predictions? I can't seem to find one!

MorallyDeplorable t1_jc1umt7 wrote on March 13, 2023 at 1:03 PM

Reply to comment by Necessary_Ad_9800 in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692

I'm not actually sure. I've just been chatting with people in an unrelated Discord's off topic channel about it.

I'd post some of what I've got from it but I have no idea what I'm doing with it and don't think what I'm getting would be decently representative of what it can actually do.

denxiaopin t1_jc1uk2f wrote on March 13, 2023 at 1:02 PM

Reply to [D] Simple Questions Thread by AutoModerator

How difficult and time-consuming is it, with the tools we have today, to teach an AI to choose glasses according to face type?

serge_cell t1_jc1tqaq wrote on March 13, 2023 at 12:55 PM

Reply to comment by OptimizedGarbage in [N] Man beats machine at Go in human victory over AI : « It shows once again we’ve been far too hasty to ascribe superhuman levels of intelligence to machines. » by fchung

see previous response

serge_cell t1_jc1to7o wrote on March 13, 2023 at 12:54 PM

Reply to comment by ertgbnm in [N] Man beats machine at Go in human victory over AI : « It shows once again we’ve been far too hasty to ascribe superhuman levels of intelligence to machines. » by fchung

There was a paper about it. They found a specific set of positions that were not encountered, or were poorly represented, during self-play. Fully trained AlphaGo was failing on those positions. However, once they were explicitly added to the training set, the problem was fixed and AlphaGo was able to play them well. This adversarial training seems to be just an automatic way to find those positions.

PS: the fitness landscape is not convex; it is separated by hills and valleys. Self-play may have trouble reaching all important states.

G_fucking_G t1_jc1tmli wrote on March 13, 2023 at 12:54 PM

Reply to comment by CashyJohn in [R] Introducing Ursa from Speechmatics | 25% improvement over Whisper by jplhughes

On which metric are you basing this? I'm not deep in ASR, but in the Whisper paper it is compared to wav2vec 2.0, and Whisper is better in most categories.

[deleted] t1_jc1tflg wrote on March 13, 2023 at 12:52 PM

Reply to [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692

[removed]

Raise_Fickle t1_jc1tb4r wrote on March 13, 2023 at 12:51 PM

Reply to [D] Is it possible to train LLaMa? by New_Yak1645

Any ideas for fine-tuning LLaMA on a multi-GPU setup?