Recent comments in /f/MachineLearning

utopiah t1_j4a9qq0 wrote

It's not controversial as long as you don't share it and make money with it, you are pretty much free to do whatever you want.

If you plan to share the output, meaning here what's generated, not just the code and checkpoints, or a training set that's under copyright, publicly then it's another question entirely and if you are serious about that I recommend seeking legal advice.

1

gdiamos t1_j4a96pu wrote

Reply to comment by mugbrushteeth in [D] Bitter lesson 2.0? by Tea_Pearce

Currently we have exascale computers, e.g. 1e18 flops at around 50e6 watts.

The power output of the sun is about 4e26 watts. That's 20 orders of magnitude on the table.

This paper claims that energy of computation can theoretically be reduced by another 22 orders of magnitude. https://arxiv.org/pdf/quant-ph/9908043.pdf

So physics (our current understanding) seems to allow at least 42 orders of magnitude bigger (computationally) learning machines than current generation foundation models, without leaving this solar system, and without converting mass into energy...

1

HaMMeReD t1_j4a7i5q wrote

ah it does say that, and noncommercial license at the top as well.

Doesn't matter much, you can't realistically use it anyways.

The source is there, but I don't think it's a pre-trained model, and it sounds really slow (like a second of audio will be like an hour of processing). It'd probably cost less to commission a real musician.

1

currentscurrents t1_j4a2las wrote

>Specifically, 1) we design an expert system to generate a melody by developing musical elements from motifs to phrases then to sections with repetitions and variations according to pre-given musical form; 2) considering the generated melody is lack of musical richness, we design a Transformer based refinement model to improve the melody without changing its musical form. MeloForm enjoys the advantages of precise musical form control by expert systems and musical richness learning via neural models.

16

markhachman OP t1_j4a1fyq wrote

I think what I'm talking would be an algorithm that understands the sounds of different instruments, their tonality, rhythm, and so on, in much the same way ChatGPT understands the relationship between words or presumably Vall-E understands phonemes -- and then understands how to put them together in the style of various artists.

I'll have to check out Riffusion, though, as I'm unfamiliar with it, thanks.

2

Smallpaul t1_j4a15b8 wrote

Reply to comment by nohat in [D] Bitter lesson 2.0? by Tea_Pearce

The first bitter lesson was "people who focused on 'more domain-specific algorithms' lost out to the people who just waited for massive compute power to become available." I think the second bitter lesson is intended to be Robotics-specific and it is "people who focus on 'robotics-specific algorithms' will lose out to the people who leverage large foundation models from non-robotics fields, like large language models."

4