Recent comments in /f/MachineLearning

[deleted] t1_j85flmh wrote

The problem I found with ChatGPT and other AI tools is the length limit. I believe it is roughly 4,000 tokens max, and that includes the summary as well.

If anyone knows a fix, please let me know. In the meantime, I use an AI tool called Scholarcy, but it is limited in how much material you can feed it. I study a subject that is *very* reading-heavy, so I can't simply rely on the abstract, and 100 pages per week per course is usually too much to handle while working part-time.

8

LetterRip t1_j85b07d wrote

Why not int4? Why not pruning? Why not various model compression tricks? int4 roughly halves latency compared to int8. At minimum they would do mixed int4/int8.

https://arxiv.org/abs/2206.01861
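For anyone unfamiliar, the core idea behind these int8/int4 schemes is just storing weights as small integers plus a scale factor. A minimal sketch of symmetric int8 quantization (pure Python, no framework; the example weights are made up):

```python
# Symmetric per-tensor int8 quantization: floats are mapped to the
# integer range [-127, 127] with a single shared scale factor.

def quantize_int8(weights):
    """Map float weights to int8 values plus a scale factor."""
    scale = max(abs(w) for w in weights) / 127.0
    return [round(w / scale) for w in weights], scale

def dequantize(q_weights, scale):
    """Recover approximate float weights from the int8 values."""
    return [q * scale for q in q_weights]

weights = [0.31, -1.27, 0.05, 0.88, -0.42]  # toy example values
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)

# int8 storage is 4x smaller than float32, and the rounding error
# per weight is bounded by scale / 2.
max_err = max(abs(a - w) for a, w in zip(approx, weights))
print(q)
print(max_err)
```

Real deployments (as in the ZeroQuant paper above) use finer-grained scales and quantize activations too, but the storage/bandwidth savings come from exactly this trick.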

Why not distillation?

https://transformer.huggingface.co/model/distil-gpt2
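The distillation idea, roughly: train a small student to match the teacher's softened output distribution rather than just the hard labels. A toy sketch of the soft-target loss (logits here are invented for illustration; DistilGPT-2's actual recipe also includes other loss terms):

```python
import math

def softmax(logits, temperature=1.0):
    """Softmax with a temperature that softens the distribution."""
    exps = [math.exp(l / temperature) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Cross-entropy of the student's softened predictions against the
    teacher's softened targets (the soft-label part of distillation)."""
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student soft predictions
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))

teacher = [3.0, 1.0, 0.2]  # made-up teacher logits
student = [2.5, 1.2, 0.4]  # made-up student logits
print(distillation_loss(teacher, student))  # shrinks as student mimics teacher
```

By Gibbs' inequality the loss is minimized when the student distribution matches the teacher's, which is what pushes the small model toward the big model's behavior.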

NVIDIA, using FasterTransformer and the Triton Inference Server, reports a 32x speedup over baseline GPT-J:

https://developer.nvidia.com/blog/deploying-gpt-j-and-t5-with-fastertransformer-and-triton-inference-server/

I think their assumptions are pessimistic by at least an order of magnitude.

As someone else notes, the vast majority of queries can be cached. There would also likely be a mixture-of-experts setup: no need for the heavy-duty model when a trivial model can answer the question.

5

endless_sea_of_stars t1_j858dvn wrote

> the abstract is often a bit clickbaity.

Had a vision of a nightmare future where papers are written in clickbait fashion.

Top Ten Shocking Properties of Positive Solutions of Higher Order Differential Equations and Their Astounding Applications in Oscillation Theory. You won't believe number 7!

78