Recent comments in /f/MachineLearning

currentscurrents t1_j7ioshb wrote

> Besides, it's not clear to me whether these AI tools will be used to benefit humanity as a whole

Of course they benefit humanity as a whole.

  • Language models allow computers to understand complex ideas expressed in plain English.
  • Automating art production will make custom art/comics/movies cheap and readily available.
  • ChatGPT-style AIs (if they can fix hallucination/accuracy problems) give you an oracle with all the knowledge of the internet.
  • They're getting less hype right now, but there are big advances in computer vision (CNNs/Vision Transformers) that are revolutionizing robotics and image processing.

> I really hope this case sets some decent precedents about how AI developers can use data they did not create.

You didn't create the data you used to train your brain, much of which was copyrighted. I see no reason why we should put that restriction on people trying to create artificial brains.

4

aicharades OP t1_j7invof wrote

This is a set of open-source libraries that can extend OpenAI's completions models: https://langchain.readthedocs.io/en/latest/

You can make your own ChatGPT with its own reference library, instead of relying on the current pre-2022 training snapshot.

It’s possible to leapfrog ChatGPT with LangChain and OpenAI Completions (excluding some of their labeled training data) until GPT-4 comes out.
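To make the "own reference library" idea concrete, here's a minimal, dependency-free sketch of the retrieval pattern LangChain automates: embed your documents, rank them against the question, and stuff the top matches into a completion prompt. The bag-of-words `embed` is a stand-in; a real setup would call OpenAI's embeddings API and send the final prompt to the completions endpoint.

```python
import math
from collections import Counter

def embed(text):
    # Toy bag-of-words "embedding" for illustration only; a real
    # pipeline would call an embeddings API instead.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def build_prompt(question, library, k=2):
    # Rank the reference library by similarity to the question and
    # prepend the top-k passages as context for the completion model.
    q = embed(question)
    ranked = sorted(library, key=lambda doc: cosine(q, embed(doc)),
                    reverse=True)
    context = "\n".join(ranked[:k])
    return f"Answer using only this context:\n{context}\n\nQ: {question}\nA:"

library = [
    "LangChain chains LLM calls together with external data sources.",
    "The 2022 World Cup was won by Argentina.",
    "Completions models take a text prompt and return a continuation.",
]
prompt = build_prompt("How do completions models work?", library)
# `prompt` would then be sent to the completions endpoint.
```

The point is that the model's knowledge cutoff stops mattering: whatever you put in the library is available at answer time.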

−4

currentscurrents t1_j7innd5 wrote

Getty is just the test case for the question of copyright and AI.

If you can't train models on copyrighted data this means that they can't learn information from the web outside of specific openly-licensed websites like Wikipedia. This would sharply limit their usefulness. It also seems distinctly unfair, since copyright is only supposed to protect the specific arrangement of words or pixels, not the information they contain or the artistic style they're in.

The big tech companies can afford to license content from Getty, but us little guys can't. If they win it will effectively kill open-source AI.

16

-Rizhiy- t1_j7ilv1v wrote

I feel that they won't try to generate novel responses from the model directly. Rather, they'll take the knowledge graph plus relevant data from the first few responses and ask the model to summarise that, or turn it into an answer which humans find appealing.

That way you don't have to rely on the model to remember facts; it can access all the required information through attention over the prompt.
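The pattern described above can be sketched in a few lines: pull facts out of a knowledge graph, then ask the model only to rephrase them, so factual recall comes from retrieval rather than from the model's weights. The graph contents and prompt wording here are hypothetical illustrations, not any vendor's actual pipeline.

```python
# Tiny stand-in knowledge graph: entity -> list of (relation, object) edges.
knowledge_graph = {
    "Eiffel Tower": [("located_in", "Paris"), ("height_m", "330")],
    "Paris": [("capital_of", "France")],
}

def facts_for(entity):
    # Flatten the entity's outgoing edges into plain-text facts.
    return [f"{entity} {rel.replace('_', ' ')} {obj}"
            for rel, obj in knowledge_graph.get(entity, [])]

def summarisation_prompt(question, entities):
    # The model is asked only to rephrase retrieved facts, not to
    # recall anything from its training data.
    facts = [f for e in entities for f in facts_for(e)]
    return ("Rewrite the following facts as a fluent answer. "
            "Do not add information.\n"
            + "\n".join(f"- {f}" for f in facts)
            + f"\nQuestion: {question}\nAnswer:")

prompt = summarisation_prompt("How tall is the Eiffel Tower?",
                              ["Eiffel Tower"])
```

Hallucination risk drops because the model's job shrinks from "know the answer" to "reword this text".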

14

clueless1245 t1_j7il1dv wrote

What your model is learning to do is predict future market data from past market data, which is fundamentally not worthwhile because market data hinges on real-world news. If you want massive quantities of real-world news data in a structured/tagged format, look at GDELT.

https://www.gdeltproject.org/

Also, look at using Kaggle's GPU notebooks instead of Google's. You get 30 hours a week if you verify with your phone number, instead of Google's arbitrary, secret, heuristic-based cutoff. Or look at something like runpod or vast.ai: rates for non-secure GPUs are a few cents an hour, and datacenter GPUs aren't that expensive either.

P.S. There are arbitrage opportunities you can spot using purely market data, but those are generally very short-term, don't warrant powerful models to detect, and are pounced on by trading bots run by trading firms.

1