Recent comments in /f/MachineLearning
[deleted] t1_j7ioi7b wrote
[removed]
aicharades OP t1_j7io17m wrote
Reply to comment by Ordinary-Tooth-5140 in [P] ChatGPT without size limits: upload any pdf and apply any prompt to it by aicharades
Up to 200mb!
aicharades OP t1_j7invof wrote
Reply to comment by wittfm in [P] ChatGPT without size limits: upload any pdf and apply any prompt to it by aicharades
This is a series of open source libraries that can extend OpenAI completions model: https://langchain.readthedocs.io/en/latest/
You can make your own ChatGPT with its own reference library versus the current pre-2022 snapshot.
It’s possible to leapfrog chatgpt with LangChain and OpenAI Completions (excluding some of their labeled training data) until gpt4 comes out
currentscurrents t1_j7innd5 wrote
Reply to comment by VeritaSimulacra in [N] Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement by Wiskkey
Getty is just the test case for the question of copyright and AI.
If you can't train models on copyrighted data this means that they can't learn information from the web outside of specific openly-licensed websites like Wikipedia. This would sharply limit their usefulness. It also seems distinctly unfair, since copyright is only supposed to protect the specific arrangement of words or pixels, not the information they contain or the artistic style they're in.
The big tech companies can afford to license content from Getty, but us little guys can't. If they win it will effectively kill open-source AI.
thiseye t1_j7imzm0 wrote
Reply to comment by bballerkt7 in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
I don't think Google will release something similar publicly for free until it's relatively solid. OpenAI isn't hurt by the dumb things ChatGPT says. Google has a brand to protect and will be held to a higher standard.
Also ChatGPT won't be free for long
Ordinary-Tooth-5140 t1_j7im3cp wrote
Proceed to input 20gb of text
-Rizhiy- t1_j7ilv1v wrote
Reply to comment by Sirisian in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
I feel that they won't be trying to generate novel responses from the model, but rather take knowledge graph + relevant data from the first few responses and ask the model to summarise that/change into an answer which humans find appealing.
That way you don't have to rely on the model to remember stuff, it can access all required information through attention.
reditum t1_j7ilt3y wrote
Reply to comment by trendafili in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
You just pay with unlimited access to your soul data
wittfm t1_j7ilmnm wrote
Are you using chatgpt or it is just a bait in the title? In any case, good work
[deleted] t1_j7ilc8a wrote
Reply to comment by PHEEEEELLLLLEEEEP in [N] Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement by Wiskkey
[deleted]
reditum t1_j7il5mj wrote
Facebook: move fast and b r e a k things
Also Facebook/Meta/Zuckerbronium when they actually need to do something different that doesn't involve buying companies to form a monopoly: look guys we're working really hard on these legs for your avatars
clueless1245 t1_j7il1dv wrote
Reply to [P] I have implemented an RL agent for trading EUR/USD and I don't know what to do next... by Kiizmod0
Your model is learning to do is predict future market data from past market data, which fundamentally is not worthwhile because market data hinges on real-world news. If you want massive quantities of real-world news data in a structured/tagged format, look at GDELT.
Also, look at using Kaggle's GPU notebooks instead of Google's. You get 30 hours a week if you verify with your phone number, instead of Google's arbitrary secret heuristic based cutoff. Or look at something like runpod or vast.ai, rates for non secure GPUs are like a few cents an hour and datacenter GPUs not that expensive either.
P.S There are arbitrage opportunities you can spot using purely market data, but those are generally very short-term, don't warrant powerful models to detect and are pounced on by trading bots run by trading firms.
jlaw54 t1_j7il0ih wrote
Reply to comment by [deleted] in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
Toss a coin…..
jlaw54 t1_j7iky3o wrote
Reply to comment by farmingvillein in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
Yeah, if google wants to be competitive here they have to offer something just as good or better. A half solution won’t convert. Consumers are too smart for that in this space (overall).
reditum t1_j7ikqi1 wrote
No, thanks
clueless1245 t1_j7ijokz wrote
Reply to comment by [deleted] in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
Nope, they're exactly the same as far as advancing human knowledge goes.
Centurion902 t1_j7ii94j wrote
Reply to comment by MisterBadger in [N] Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement by Wiskkey
This doesn't even mean anything unless you define inspiration.
aicharades OP t1_j7ii63v wrote
Reply to comment by __lawless in [P] ChatGPT without size limits: upload any pdf and apply any prompt to it by aicharades
exactly. the prompt is what the LangChain library uses to manage the text instructions for OpenAI
__lawless t1_j7ii2h1 wrote
Reply to comment by aicharades in [P] ChatGPT without size limits: upload any pdf and apply any prompt to it by aicharades
I see it is not a prompt per se, it is an analogue of map operation in ETL.
taleofbenji t1_j7ihwkj wrote
Reply to comment by telebierro in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
OpenAI uses tech pioneered by Google.
They didn't come out of nowhere.
jlaw54 t1_j7ihoki wrote
Reply to comment by datasciencepro in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
There are indications there has been some scrambling at google over this. But that they weren’t armed and researched, but they didn’t see this coming the way it did.
aicharades OP t1_j7ihm0p wrote
Reply to comment by __lawless in [P] ChatGPT without size limits: upload any pdf and apply any prompt to it by aicharades
Map prompt splits the text into chunks then summarizes those chunks. There's no reduce step.
Here's more detail on how it works: https://langchain.readthedocs.io/en/latest/modules/chains/combine_docs.html
HurricaneHenry t1_j7ihhyl wrote
Reply to comment by bballerkt7 in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
It’ll be baked into their search engine, which is free.
__lawless t1_j7ihgh5 wrote
Can you explain what map prompt is? Not sure I understood that part
currentscurrents t1_j7ioshb wrote
Reply to comment by _poisonedrationality in [N] Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement by Wiskkey
> Besides, it's not clear to me whether these AI tools be used to benefit humanity as a whole
Of course they benefit humanity as a whole.
>I really hope this case sets ome decent precedents about how AI developers can use data they did not create.
You didn't create the data you used to train your brain, much of which was copyrighted. I see no reason why we should put that restriction on people trying to create artificial brains.