Recent comments in /f/MachineLearning
[deleted] OP t1_j85rq68 wrote
[removed]
currentscurrents t1_j85rpol wrote
Reply to comment by goj-145 in [D] Is it legal to use images or videos with copyright to train a model? by Tlaloc-Es
They use the open LAION-5B dataset, everybody knows what's in there.
Still, some preprocessing and deduplication would have been a good idea just for output quality.
bik1230 t1_j85oq4m wrote
Reply to [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Is the amount of context ChatGPT can process really enough for a typical research paper?
edunuke t1_j85opbe wrote
Reply to comment by ZestyData in [D] What ML or ML-powered projects are you currently building? by TikkunCreation
I'm in the same boat. Years in this ML business full-time. When I get out of work, I don't want to touch anything tech related, not even with a stick.
VelveteenAmbush t1_j85ngvn wrote
Reply to comment by [deleted] in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Do a two-step. Summarize each paper so the summaries all fit into the context window, then have it compare and contrast.
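A minimal sketch of that two-step approach, under stated assumptions: the limit is counted in whitespace-separated words (real models count tokens), `summarize` is a hypothetical callable standing in for whatever model call you use, and `truncate_summarizer` is only a placeholder so the example runs without any API.

```python
from typing import Callable, List


def count_words(text: str) -> int:
    """Crude size estimate: whitespace-separated words (an assumption; models count tokens)."""
    return len(text.split())


def two_step_compare(
    papers: List[str],
    summarize: Callable[[str, int], str],
    context_limit: int = 4000,
    reserve: int = 500,
) -> str:
    """Step 1: summarize each paper to an equal share of the budget.
    Step 2: build a single compare-and-contrast prompt from the summaries.

    `reserve` words are held back for the instruction text and the model's answer.
    """
    if not papers:
        raise ValueError("need at least one paper")
    budget = (context_limit - reserve) // len(papers)
    summaries = [summarize(paper, budget) for paper in papers]
    body = "\n\n".join(f"Paper {i + 1}: {s}" for i, s in enumerate(summaries))
    return "Compare and contrast these papers:\n\n" + body


def truncate_summarizer(text: str, max_words: int) -> str:
    """Placeholder for a model call: keeps the first `max_words` words."""
    return " ".join(text.split()[:max_words])


if __name__ == "__main__":
    papers = ["alpha " * 10000, "beta " * 10000]
    prompt = two_step_compare(papers, truncate_summarizer)
    print(count_words(prompt))
```

Swapping `truncate_summarizer` for a real model call keeps the structure the same; the per-paper budget shrinks as the number of papers grows, so very large batches may need a third level of summarization.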
Rieux_n_Tarrou t1_j85le82 wrote
Reply to comment by SatoshiNotMe in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Yes, it is confusing, and I don’t think OpenAI is incentivized to clear up the confusion 😄
Rieux_n_Tarrou t1_j85l5ek wrote
Reply to comment by DreamWithinAMatrix in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
No, only for GPT-3 models such as davinci.
Rieux_n_Tarrou t1_j85l08p wrote
Reply to comment by [deleted] in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Whisper is a speech-to-text model.
Iunaml t1_j85g996 wrote
Reply to comment by ktpr in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Cite one more paper to get 0.15% more chance of being accepted!
[deleted] t1_j85flmh wrote
Reply to comment by Trakeen in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
The problem I found with ChatGPT and other AI tools is the word limit. I believe it is 4000 words max, and that includes the summary as well.
If anyone knows a fix, please let me know. In the meantime, I use an AI tool called Scholarcy, but it lacks data to be fed with. I study a subject that is *very* reading-heavy, so I can't simply rely on the abstract, and 100 pages per week per course is mostly too much to handle while working part-time.
A_Light_Spark t1_j85cy4f wrote
Reply to comment by Trakeen in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Case in point:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3530294/
The title and the abstract are almost disjointed. I come across papers like this regularly, maybe 15% of the time?
[deleted] t1_j85cv32 wrote
[deleted] t1_j85b4hf wrote
Reply to comment by DreamWithinAMatrix in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
[deleted]
LetterRip t1_j85b07d wrote
Reply to comment by norcalnatv in The Inference Cost Of Search Disruption – Large Language Model Cost Analysis [D] by norcalnatv
Why not int4? Why not pruning? Why not various model compression tricks? int4 halves latency. At minimum they would do mixed int4/int8.
https://arxiv.org/abs/2206.01861
Why not distillation?
https://transformer.huggingface.co/model/distil-gpt2
NVIDIA, using FasterTransformer and Triton Inference Server, has a 32x speedup over baseline GPT-J.
I think their assumptions are at least an order of magnitude pessimistic.
As someone else notes, the vast majority of queries can be cached. Also there would likely be a Mixture of experts. No need for the heavy duty model when a trivial model can answer the question.
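A rough sketch of the caching and "trivial model first" idea, with loud assumptions: `small`, `large`, and `is_simple` are hypothetical callables, the cache key is just the lowercased, whitespace-normalized query, and this is plain query routing rather than a mixture-of-experts layer inside a single network.

```python
from typing import Callable, Dict


class CachedRouter:
    """Answer from cache when possible; otherwise pick the cheapest adequate model."""

    def __init__(
        self,
        small: Callable[[str], str],
        large: Callable[[str], str],
        is_simple: Callable[[str], bool],
    ) -> None:
        self.small = small          # hypothetical cheap model
        self.large = large          # hypothetical expensive model
        self.is_simple = is_simple  # hypothetical difficulty check
        self.cache: Dict[str, str] = {}
        self.large_calls = 0        # counts how often the expensive path runs

    def answer(self, query: str) -> str:
        # Normalize so trivially different spellings share one cache entry.
        key = " ".join(query.lower().split())
        if key in self.cache:
            return self.cache[key]
        if self.is_simple(key):
            result = self.small(key)
        else:
            self.large_calls += 1
            result = self.large(key)
        self.cache[key] = result
        return result


if __name__ == "__main__":
    router = CachedRouter(
        small=lambda q: "small:" + q,
        large=lambda q: "large:" + q,
        is_simple=lambda q: len(q.split()) <= 3,
    )
    print(router.answer("What is  ML"))
    print(router.answer("Explain the inference cost of large language models"))
    print(router.answer("explain the inference cost of large language models"))
    print(router.large_calls)
```

How much this saves depends entirely on the real cache hit rate and on how well the difficulty check works, neither of which the sketch measures.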
Trakeen t1_j85angn wrote
Reply to comment by radarsat1 in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Yeah, that certainly seems useful, but it also sounds like a mix of search engine and ChatGPT. MS's updates to Bing might be able to do that?
Trakeen t1_j85aa80 wrote
Reply to comment by A_Light_Spark in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Probably depends on field? I’ve not typically encountered this, and most other researchers are going to be looking at dozens of papers at least, so they really don’t want to have to dig into a paper to find the meat.
ktpr t1_j859ruq wrote
Reply to comment by endless_sea_of_stars in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Or, click here to auto-cite this paper to learn more about number 14!
ktpr t1_j859ne1 wrote
Reply to comment by Trakeen in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Imagine that!
Himalun t1_j8593ax wrote
Reply to comment by norcalnatv in The Inference Cost Of Search Disruption – Large Language Model Cost Analysis [D] by norcalnatv
It’s worth noting that both MS and Google own the data centers and hardware so it is likely cheaper for them to run. But still expensive.
endless_sea_of_stars t1_j858dvn wrote
Reply to comment by Iunaml in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
> abstract is meant is often a bit clickbaity.
Had a vision of a nightmare future where papers are written in click bait fashion.
Top Ten Shocking Properties of Positive Solutions of Higher Order Differential Equations and Their Astounding Applications in Oscillation Theory. You won't believe number 7!
SatoshiNotMe t1_j856nri wrote
Reply to comment by Rieux_n_Tarrou in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
A lot of people just write “using ChatGPT” in their app headlines when in fact they are using the GPT-3 API. I will generously interpret this as being due to this genuine confusion :)
A_Light_Spark t1_j8564n5 wrote
Reply to comment by import_social-wit in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Climax my ass, I'm trying to learn, not to cum
import_social-wit t1_j85565r wrote
Reply to comment by A_Light_Spark in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Nobody likes having the climax spoiled during the first few pages of a story!
sweatierorc t1_j854tn3 wrote
Reply to comment by goj-145 in [D] Is it legal to use images or videos with copyright to train a model? by Tlaloc-Es
On the training part, it is probably legal, though you need to be careful about something like GDPR. E.g. for facial recognition, there are extra rules.
The "sharing model and/or its prediction" is the gray area.
Edit: typo
_poisonedrationality t1_j85uxxg wrote
Reply to comment by _sshin_ in [P] Introducing arxivGPT: chrome extension that summarizes arxived research papers using chatGPT by _sshin_
Do you know the difference between ChatGPT and GPT? Are you being misleading on purpose?