Recent comments in /f/MachineLearning

currentscurrents t1_javx4pw wrote

The Winograd Schema is a test of commonsense reasoning. It's hard because it requires not just knowledge of English, but also knowledge of the real world.

But as you found, it's pretty much solved now. By 2019, LLMs could complete it with better than 90% accuracy, which means it was effectively already solved when Tom Scott made his video.
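For anyone who hasn't seen one: a Winograd schema is a sentence pair where changing a single word flips the referent of an ambiguous pronoun, so grammar alone can't resolve it. A minimal sketch of the classic example (the pair LetterRip's thread and the comment below both touch on):

```python
# Classic Winograd schema: swapping one verb flips who "they" refers to,
# so resolving the pronoun requires real-world knowledge, not just grammar.
schema = {
    "template": "The city councilmen refused the demonstrators a permit because they {verb} violence.",
    "answers": {"feared": "councilmen", "advocated": "demonstrators"},
}

for verb, referent in schema["answers"].items():
    sentence = schema["template"].format(verb=verb)
    print(f"{sentence}  ->  'they' = the {referent}")
```

Both sentences are grammatically identical; only the verb's meaning tells you which noun "they" points at.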

15

LetterRip t1_javpxbv wrote

> I mean... why were they not doing this already? They would have to code it but it seems like low hanging fruit

GPT-3 came out in 2020 (it launched at its initial price, with a modest price drop early on).

FlashAttention came out in June 2022.

Quantization we've only recently figured out how to do fairly losslessly (especially int4). Tim Dettmers' LLM.int8() is from August 2022.

https://arxiv.org/abs/2208.07339
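The core trick in that line of work is absmax quantization: scale a vector of floats so the largest magnitude maps to 127, then round to int8. A minimal sketch (function names are mine, not from the paper's code):

```python
def quantize_int8(xs):
    """Absmax quantization: scale floats into the int8 range [-127, 127]."""
    scale = max(abs(x) for x in xs) / 127.0
    return [round(x / scale) for x in xs], scale

def dequantize(qs, scale):
    """Recover approximate floats from int8 values and the stored scale."""
    return [q * scale for q in qs]

weights = [0.3, -1.2, 0.05, 0.9]
qs, scale = quantize_int8(weights)
restored = dequantize(qs, scale)
# Rounding error per value is at most scale/2. A single large outlier
# inflates the scale and crushes the small values -- which is why
# LLM.int8() routes outlier feature dimensions through fp16 separately.
```

That outlier handling is the paper's actual contribution; plain absmax like this was already standard.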

> That seems large, which paper has that?

See

https://github.com/HazyResearch/flash-attention/raw/main/assets/flashattn_memory.jpg

>We show memory savings in this graph (note that memory footprint is the same no matter if you use dropout or masking). Memory savings are proportional to sequence length -- since standard attention has memory quadratic in sequence length, whereas FlashAttention has memory linear in sequence length. We see 10X memory savings at sequence length 2K, and 20X at 4K. As a result, FlashAttention can scale to much longer sequence lengths.

https://github.com/HazyResearch/flash-attention
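The scaling in that graph follows from a back-of-envelope model: standard attention materializes the full N x N score matrix, while FlashAttention only keeps a fixed-size block resident at a time. A rough sketch (the block size here is an illustrative constant, not the repo's measured numbers; the point is that the savings ratio grows linearly with sequence length):

```python
def standard_attn_memory(seq_len):
    # Standard attention stores the full N x N attention matrix.
    return seq_len * seq_len

def flash_attn_memory(seq_len, block=256):
    # FlashAttention streams over fixed-size blocks, so memory is linear in N.
    # (block=256 is an assumed illustrative value)
    return seq_len * block

for n in (2048, 4096, 8192):
    ratio = standard_attn_memory(n) / flash_attn_memory(n)
    print(f"seq_len={n}: ~{ratio:.0f}x memory savings")
```

Doubling the sequence length doubles the savings ratio, which matches the 10x-at-2K / 20x-at-4K pattern in the graph.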

1

DSM-6 t1_javnmz2 wrote

Personally, I think the answer is existing bias in the training data.

I don’t know enough about ChatGPT to state this as fact, but I think it’s safe to assume it doesn’t explicitly encode grammar rules. I.e., nowhere in the code does it state “antecedent pronouns should refer to the subject of a sentence.”

Instead, I assume ChatGPT’s grammar comes from repeated convention in the training data. Enough data in which the antecedent refers to something other than the sentence subject means that “they” can refer to any of the preceding nouns. In that case, “councilmen fear violence” is a far more common pattern in the training data than “protesters fear violence.”

Then again, your example was in the passive voice, so I dunno 🤷‍♀️.

5