AdFew4357 OP t1_j7ckkmx wrote on February 5, 2023 at 8:10 PM

Reply to comment by UnusualClimberBear in Are PhDs in statistics useful for ML research? [D] by AdFew4357

I see

Mechanical_Number t1_j7ck5au wrote on February 5, 2023 at 8:07 PM

Reply to Are PhDs in statistics useful for ML research? [D] by AdFew4357

This is a very broad question but in general, yes.

On multiple occasions there such a big overlap between the fields that unless someone is doing some highly specialised (e.g. some very particular problems in Measure Theory or Computer Vision) the underlying skills will be transferable and almost interchangeable (e.g. in Gaussian Processes- or Causality- related topics).

UnusualClimberBear t1_j7cjxu9 wrote on February 5, 2023 at 8:06 PM

Reply to comment by AdFew4357 in Are PhDs in statistics useful for ML research? [D] by AdFew4357

Basically, current trends just ignore any reasonable thing, such as train/valid/test set. For now, the bigger, the better. This requires quite a lot of tech support functions (parallelization and data pipelining in particular) rather than theory-related ones.

Cheap_Meeting t1_j7cit9i wrote on February 5, 2023 at 7:58 PM

Reply to comment by MysteryInc152 in [D] List of Large Language Models to play with. by sinavski

Are any benchmark scores such as MMLU or BigBench available for Aleph Alpha's models?

AdFew4357 OP t1_j7cidhf wrote on February 5, 2023 at 7:55 PM

Reply to comment by UnusualClimberBear in Are PhDs in statistics useful for ML research? [D] by AdFew4357

Could you elaborate

[deleted] t1_j7ci88m wrote on February 5, 2023 at 7:54 PM

Reply to comment by Veedrac in [N] "I got access to Google LaMDA, the Chatbot that was so realistic that one Google engineer thought it was conscious. First impressions" by That_Violinist_18

[removed]

geeky_username t1_j7chxwk wrote on February 5, 2023 at 7:52 PM

Reply to comment by Freed4ever in [N] "I got access to Google LaMDA, the Chatbot that was so realistic that one Google engineer thought it was conscious. First impressions" by That_Violinist_18

>the fact that they invented it, but didn't release it, it means that they thought the technology would be a threat to them

I slightly disagree with this.

Imo, from what I know of Google from people that do or used to work there - they likely didn't care or didn't think of it.

Inside Google is a researcher's playground and there's little to no pressure to ever go to market. I've seen things that are extremely impressive that's never been published or put into a product. Asking why - they just don't care to do so.

The higher-ups lack imagination now, and unless something can directly obviously improve ads, they don't care.

So for years you've had engineers not caring to make something marketable, and leadership not caring but still throwing money at it. My impression is that leadership was looking for something that was so obviously a home run they didn't want to bother with releasing and iterating.

Cheap_Meeting t1_j7chivx wrote on February 5, 2023 at 7:49 PM

Reply to [D] List of Large Language Models to play with. by sinavski

In terms of Consumer Apps, the Poe app from Quora has access to two models from Open AI and one from Anthropic.

Perplexity.ai, YouChat and Neeva are search engines that integrated LLMs.

Google has an AI + Search Event on Wednesday where they are likely to announce something as well.

In terms of APIs and getting a feeling for these models, I would use OpenAI's APIs. Their models are the best publically available models. Open Source models are still far behind.

UnusualClimberBear t1_j7chhkq wrote on February 5, 2023 at 7:49 PM

Reply to Are PhDs in statistics useful for ML research? [D] by AdFew4357

This is all about timing. Currently stats/maths capabilities are not are their best.

geeky_username t1_j7cgu86 wrote on February 5, 2023 at 7:44 PM

Reply to comment by 7366241494 in [N] "I got access to Google LaMDA, the Chatbot that was so realistic that one Google engineer thought it was conscious. First impressions" by That_Violinist_18

>and it’s naïve for people to think ChatGPT is so special that it’s a threat to Google.

Google does a ton of R&D but really sucks(or doesn't care) about productizing.

That's where the real "threat" to them lies.

blablanonymous t1_j7cgt6m wrote on February 5, 2023 at 7:44 PM

Reply to comment by po-handz in [N] GitHub CEO on why open source developers should be exempt from the EU’s AI Act by EmbarrassedHelp

There are a lot of people with absolutely no disposable income. Just having to move is a huge financial stress to them. Aside from the actual cost of moving, you might need to spend more time commuting which adds more cost. A ton of people are very vulnerable financially. Why do you think there are so many homeless people? They’re just lazy? I’m curious where you live? This stuff is really obvious

candidhorse4 OP t1_j7cg5xb wrote on February 5, 2023 at 7:40 PM

Reply to comment by suflaj in What text to speech does this guy use? [R] by candidhorse4

have you tried murf.ai and wellsaid labs?

velcher t1_j7ccfbs wrote on February 5, 2023 at 7:14 PM

Reply to Are PhDs in statistics useful for ML research? [D] by AdFew4357

Yes, it is useful. The breakdown between PhD types would depend on the specific needs of the hiring organization.

gopher9 t1_j7cbdlg wrote on February 5, 2023 at 7:07 PM

Reply to [D] List of Large Language Models to play with. by sinavski

RWKV 14B, trained on The Pile.

Murmeltier8000 t1_j7c9uyt wrote on February 5, 2023 at 6:57 PM

Reply to Are PhDs in statistics useful for ML research? [D] by AdFew4357

Only if you want to build AI with quantuum processors

danielbln t1_j7c9mpc wrote on February 5, 2023 at 6:56 PM

Reply to comment by 2blazen in [N] OpenAI starts selling subscriptions to its ChatGPT bot by bikeskata

I much prefer to see the tokens as they are generated, it's much better UX as you can abort the generation if you feel it's not going in the right direction. All my GPT3 integrations use stream:true and display every word as it comes in.

rudboi12 t1_j7c9lrd wrote on February 5, 2023 at 6:55 PM

Reply to comment by red75prime in [N] GitHub CEO on why open source developers should be exempt from the EU’s AI Act by EmbarrassedHelp

I think populism in EU reached it’s climax before covid. Now with the energy crisis in EU, people are starting to realize governments suck and they need to start relying a bit more on local corporations

red75prime t1_j7c810d wrote on February 5, 2023 at 6:45 PM

Reply to comment by rudboi12 in [N] GitHub CEO on why open source developers should be exempt from the EU’s AI Act by EmbarrassedHelp

Populism is on the rise in Europe.

Hyper1on t1_j7c7vga wrote on February 5, 2023 at 6:44 PM

Reply to comment by Cheap_Meeting in [N] GitHub CEO on why open source developers should be exempt from the EU’s AI Act by EmbarrassedHelp

This isn't true - GDPR puts much more onerous restrictions on what consent must be gained before personal data is processed. Much of what cookies collect is considered personal data, and so immediately on GDPR's passing, many websites started to change their cookie acceptance boxes to these massive things which take up half the screen and have granular consent check boxes. Another factor which just makes browsing the web increasingly inconvenient for the average user.

suflaj t1_j7c73af wrote on February 5, 2023 at 6:39 PM

Reply to comment by candidhorse4 in What text to speech does this guy use? [R] by candidhorse4

Make no mistake - there is no TTS more humanlike than Azure ATM, but the exact voice was likely fiddled around with a bit to get the exact pronunciation, or ran through a filter.

2 days ago I was comparing all the state-of-the-art TTS', and while Google's Neural2 came close to the video, it does not feature similar voices to the one in the video.

candidhorse4 OP t1_j7c6rhs wrote on February 5, 2023 at 6:36 PM

Reply to comment by suflaj in What text to speech does this guy use? [R] by candidhorse4

i gave it a look but the one in the video seems more proffesional, human-like almost so its not azure

matth0x01 t1_j7c3smm wrote on February 5, 2023 at 6:17 PM

Reply to comment by Ggronne in Information Retrieval book recommendations? [D] by Ggronne

Sorry, my library seems a bit outdated on that side.

But the one from Wikipedia looks great at first sight. Ralph., Kimball (2004). The data warehouse ETL toolkit : practical techniques for extracting, cleaning, conforming, and delivering data

Acrobatic-Name5948 t1_j7c2xwv wrote on February 5, 2023 at 6:12 PM

Reply to 15 years old and bad at math [D] by Daniel_C_____

I would start with learning programming very well. On the side you can learn necessary math on school and from youtube. For calculus i recommend professor Leonard. If you dont have good foundations he start from i guess literally adding numbers. You will need good software skills anyways and you can do it without math. Create a website to show your future projects. Solve some programming problems in codewars etc. Programming language is not that important you just need to get used to abstract thinking. You can embed some AI models from OpenAI etc. to your website with their public API's.

When you know enough calculus, you can dive into ML theory with good skills in your belt. Eventually you can implement research papers and become a research engineer to work on cutting edge development in ML world. When you know some calculus i suggest you to watch Tesla AI ex-Lead Engineer Andrej Karpathy's lectures, he also briefly mentions relevant calculus. Before Karpathy i recommend you neural nets from scratch series in youtube.

Freed4ever t1_j7c28dx wrote on February 5, 2023 at 6:07 PM

Reply to comment by gatorling in [N] "I got access to Google LaMDA, the Chatbot that was so realistic that one Google engineer thought it was conscious. First impressions" by That_Violinist_18

Agreed, but they are forced to play catch up now, and not sure if they are ready. It's not just about the pure tech, it's about the UX, the scalability, the liability, etc. It's safe to say Bing has worked on this before ChatGPT went public, so several months already. Also, OpenAI uses Azure, so they know exactly the loads and plan to scale. The fact that they have way less users currently helps as well.

MysteryInc152 t1_j7c1kwr wrote on February 5, 2023 at 6:03 PM

Reply to [D] List of Large Language Models to play with. by sinavski

GLM-130b https://huggingface.co/spaces/THUDM/GLM-130B

Cohere's models https://cohere.ai/

Aleph Alpha's models https://app.aleph-alpha.com/

AI21's models https://www.google.com/url?sa=t&source=web&rct=j&url=https://studio.ai21.com/&ved=2ahUKEwigktrH-_78AhWAFlkFHefHC3IQFnoECAsQAQ&usg=AOvVaw1L0TKoIvBtSFFB1oJsG5nW

Recent comments in /f/MachineLearning