Recent comments in /f/MachineLearning
Leptino t1_j4oxrdp wrote
Reply to comment by dmart89 in [D] Can ChatGPT flag it's own writings? by MrSpotgold
It shouldn't be too difficult to produce a watermark provided the output is something on the order of a paragraph. However, I don't think its always possible. For instance if I ask ChatGPT to replicate the previous paragraph by replacing all nouns and verbs and to keep the same meaning.
Further tweaking by a human should completely destroy any residual.
yahma t1_j4owot0 wrote
Reply to comment by MegavirusOfDoom in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
This may be the size of the datasets, but i it's hard to say how many parameters will be needed for a good llm that's just really good at explaining code.
armchair-progamer t1_j4ovjjm wrote
Reply to comment by dmart89 in [D] Can ChatGPT flag it's own writings? by MrSpotgold
> digital watermark
Wouldn't it be easier to store the model outputs or a perceptual hash, and then provide a way to determine if some text is similar to prior ChatGPT output? I assumed they were already doing something like this to collect usage data as they scrape new content.
ChatGPT already has a unique writing style, I'm not sure how you could add anything to the text which couldn't be trivially removed and do better
nateharada OP t1_j4ou7qz wrote
Reply to comment by MuonManLaserJab in [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
./popquiz_hotshot.sh
[deleted] t1_j4ou2kf wrote
[deleted]
nateharada OP t1_j4otocf wrote
Reply to comment by Fit_Schedule5951 in [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
This tool actually doesn't look at memory right now, just actual computation. Usually loading your model into memory eats up basically the max memory until the training is done, even if compute usage is very low.
If your training is hanging and still burning GPU cycles that'd be harder to detect I think.
T1METR4VEL t1_j4ol7zx wrote
Reply to comment by Ham05 in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
I sent you at chat
TedRabbit t1_j4okgzh wrote
Reply to [D] Model for detecting rectangle corners? by hundley10
I mean, seems like a basic convolution neural network would work well for this.
jloverich t1_j4oim54 wrote
Reply to [P] Looking for a CV/ML freelancer by bluebamboo3
Just use detectron2
sad_dad_is_a_mad_lad t1_j4ohl7t wrote
Reply to comment by avocadoughnut in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
I don't think there are any laws that protect their data in this way, except perhaps contract law because they have a hidden ToS that you have to accept to use their service. As long as you use it for free though, I'm not sure there is consideration, and well... I don't know how they would go about proving misuse or damages.
Certainly it would not be copyright law, given that GPT3 itself was trained on copyrighted data...
ML4Bratwurst t1_j4of1wg wrote
Reply to comment by junetwentyfirst2020 in [P] Looking for a CV/ML freelancer by bluebamboo3
I mean it's not like there are no libraries for human segmentation which already run on these devices....
MegavirusOfDoom t1_j4oelbd wrote
Reply to comment by LetGoAndBeReal in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
less than 500MB is used for code learning, 690GB is used for culture, geography, history, fiction and non-fiction... 2GB for cats, 2GB bread, horses, dogs, Cheese, Wine, Italy, France, Politics, Television, Music, Japan, Africa. less than 1% of the training is on science and technology, i.e. 300MB is biology, 200MB chemistry, 100MB physics, 400MB maths...
serge_mamian t1_j4ods2g wrote
The question is how the fuck does one get a 4090? I am really at my wits end, Amazon has a few at double MSRP.
nmfisher t1_j4odkrt wrote
Reply to comment by __lawless in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
- Choose your niche (speech recognition/image classification/LLMs/whatever)
- Start your own blog with good* technical content (i.e. not the shovel crap you see on Medium), and see if you can write some guest posts for an existing blog with decent traffic. Open-source your code on GH. Spread on social media.
- Give presentations at a few local events and make it clear you're also available for freelancing.
It might take a month or two but people will start contacting you.
* this is important, your blog content/presentation actually has to be worth reading. It doesn't have to be cutting-edge, but it has to be novel enough to convince someone that you have something special to offer. Implementing a lesser-known paper and showing your results is usually a good start (also it teaches you just how hard it is to recreate something based on a paper alone).
Fit_Schedule5951 t1_j4obl4w wrote
Reply to [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
Nice, I think an extension where this could be beneficial is when your process hangs - it's using full GPU memory but not training, this happened to me recently training models with fairseq. (I am not sure how you can catch these conditions)
NotDoingResearch2 t1_j4oaz7r wrote
Understanding what stable diffusion models are is easy. Understanding why they work and VAEs don’t is hard, especially when you consider they are just defective VAEs.
protocolypse t1_j4o8t8d wrote
This and the back blaze report are among my favorite articles.
m98789 t1_j4o7hlk wrote
Reply to comment by junetwentyfirst2020 in [P] Looking for a CV/ML freelancer by bluebamboo3
This is actually a reasonable estimate.
Alarmed_Syrup2670 t1_j4o39kr wrote
一一jv一一i17177
MuonManLaserJab t1_j4o2mlx wrote
Reply to [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
I have a little script called gpu_Speed that blows up my laptop if it drops below 50 mph % GPU utilization
junetwentyfirst2020 t1_j4ntud9 wrote
Reply to [P] Looking for a CV/ML freelancer by bluebamboo3
Sure. I’ll need an iOS engineer as well and god know what models are supported on device currently, so it’ll be 250k and I’ll need 6 months.
Cherubin0 t1_j4nt4cb wrote
Reply to [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
I always wonder how people get customers.
Ham05 t1_j4noofg wrote
If the goal is for a commercial endeavor I suggest bringing on an ML-specialized shop. Good ones can knock this out in a few sprints. PM me if you need more info.
asdfzzz2 t1_j4nl1tn wrote
Reply to comment by timdettmers in [D] Tim Dettmers' GPU advice blog updated for 4000 series by init__27
> This means, with an average of 60 watt idle and 350 watt max for a RTX 4090
RTX 4090 "idles" (stream at background) at 10-15 watt. 4k144hz monitor might change it, but 60 watt is way too much for GPU only.
bunni t1_j4ozu6b wrote
Reply to [D] Model for detecting rectangle corners? by hundley10
If you’re printing the cards you can use a different aruco marker for each size. These are easily detected and you can infer the dimensions and pose of the entire card.
http://mecaruco2.readthedocs.io/en/latest/notebooks_rst/Aruco/aruco_basics.html