Recent comments in /f/MachineLearning

suflaj t1_j9jjetb wrote

Because it requires the least amount of human intervention

Also because it subjectively sounds like magic to people who don't really understand it, so it both sells to management and to consumers.

At least it's easier for humans to cope it is like magic than to accept that a lot of what AI can do is just stuff that is trivial and doesn't require humanity to solve.

−12

Raaaaaav t1_j9jfmc7 wrote

I don't know if it is the current best AI to clone voices, but there is a zero-shot model named YOUR-TTS, it has pre-trained weights available and you only need around 1 min of your voice to make it sound quite similar. But you can always retrain it with more samples of your voice to improve the performance even more.

https://github.com/Edresson/YourTTS

I think it was also added to the Coqui-TTS toolkit.

https://github.com/coqui-ai/TTS

However I only played around with the demos in the original repo, and therefore don't know how to use it if you are serious about voice cloning.

1

chief167 t1_j9jev01 wrote

Define scale

Language models? Sure. Images? Sure. Huge amounts of transaction data to search for fraud? Xgboost all the way lol.

Church no free lunch theorem: there is no single approach best for every possible problem. Djeezes I hate it when marketing takes over. You learn this principle in the first chapter of literally every data course

20

IluvBsissa t1_j9j9ml9 wrote

Dr. Zhang, thank you so much. Please can you tell us more about your model's performance ? How would it do on standard MMLU ? Can it be improved by increasing parameters count ? The paper didn't mention if the human testers were average human or experts ?

6

VirtualHat t1_j9j8uvr wrote

For example, in IRIS dataset, the class label is not a linear combination of the input. Therefore, if your model class is all linear models, you won't find the optimal or in this case, even a good solution.

If you extend the model class to include non-linear functions, then your hypothesis space now at least contains a good solution, but finding it might be a bit more trickly.

15

VirtualHat t1_j9j8805 wrote

Linear models make an assumption that the solution is in the form of y=ax+b. If the solution is not in this form then the best solution will is likely to be a poor solution.

I think Emma Brunskill's notes are quite good at explaining this. Essentially the model will underfit as it is too simple. I am making an assumption though, that a large dataset implies a more complex non-linear solution, but this is generally the case.

10