Recent comments in /f/MachineLearning
monkeysingmonkeynew OP t1_j4pjoxj wrote
Reply to comment by thiru_2718 in [D] Is it possible to update random forest parameters with new data instead of retraining on all data? by monkeysingmonkeynew
Thanks! I'll muse this over
boo5000 t1_j4piqv8 wrote
Reply to comment by blueSGL in [P] I built arxiv-summary.com, a list of GPT-3 generated paper summaries by niclas_wue
Yeah — gotta run it through another AI to spruce up the variance lmao
thiru_2718 t1_j4piklu wrote
Reply to [D] Is it possible to update random forest parameters with new data instead of retraining on all data? by monkeysingmonkeynew
Interesting question. My intuition is that if you could maintain a continuously updated cache of the metric you're using to split your branches (i.e. continuously compute mutual information for each fork), and we assume your new data roughly follows the same distribution as your old data, you may be able to get away with only modifying the downstream branches of your trees, which should be more efficient.
But if that assumption isn't true, then the new data changes your trees closer to the root, and there's little benefit.
ML4Bratwurst t1_j4pibv1 wrote
Reply to comment by __mishy__ in [P] Looking for a CV/ML freelancer by bluebamboo3
Good luck with that without data coming from the "v0"
blimpyway t1_j4pi713 wrote
Reply to comment by chaosmosis in [D] The Illustrated Stable Diffusion (Video) by jayalammar
The order of the words/tokens is normally encoded via positional embeddings that are added each to their respective token embedding. See e.g. https://machinelearningmastery.com/a-gentle-introduction-to-positional-encoding-in-transformer-models-part-1/
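A minimal sketch of that idea, assuming the fixed sinusoidal scheme from the original Transformer paper (some models learn their positional embeddings instead). The function names and the toy all-zero token embeddings are made up for illustration:

```python
import math

def sinusoidal_position(pos, d_model):
    """Return the sinusoidal positional encoding vector for one position."""
    vec = []
    for i in range(d_model):
        # Dimensions (2k, 2k+1) share one frequency: sin on even, cos on odd.
        angle = pos / (10000 ** ((i - i % 2) / d_model))
        vec.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return vec

def add_positions(token_embeddings):
    """Add each position's encoding to the token embedding at that position."""
    d_model = len(token_embeddings[0])
    return [
        [t + p for t, p in zip(tok, sinusoidal_position(pos, d_model))]
        for pos, tok in enumerate(token_embeddings)
    ]

# Toy input (hypothetical): the same all-zero token embedding at three positions.
tokens = [[0.0, 0.0, 0.0, 0.0] for _ in range(3)]
encoded = add_positions(tokens)
```

Because the token embeddings here are identical, any difference between the rows of `encoded` comes only from the position, which is how the model can tell the order apart.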
[deleted] t1_j4ph4kz wrote
Reply to comment by ThirdMover in [D] Can ChatGPT flag it's own writings? by MrSpotgold
[deleted]
MegavirusOfDoom t1_j4pfdi1 wrote
Reply to comment by yahma in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
Then we'd have to crawl all of stack exchange, all of wiki, and 1 terabyte of programming books... This "generalist NLP" is for article writing, for poetry.
I'm a big fan of teaching ChatGPT how to interpret graphs, the origin lines, to record in a vector engine that is coupled with the NLP. For a coding engine, I believe NLP should be paired with a compiler, just like a maths-specialized NLP should also have a MATLAB-type engine.
theaimlguy t1_j4pceky wrote
If it were possible to try distillation on ChatGPT to produce smaller models that could run on mobile hardware, that would be great!
nmfisher t1_j4paqfc wrote
Reply to comment by Professional-Row9655 in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
Easiest way IMO is to scan the list of papers at the annual conferences in your given field, pick a handful with names that sound interesting, then try and find a paper that's referenced by two or more of them.
That's probably a good place to start - it's been around long enough that it's probably not a flash in the pan, but still "new" enough to be relevant.
bay_der t1_j4papbd wrote
Reply to comment by nateharada in [P] A small tool that shuts down your machine when GPU utilization drops too low. by nateharada
One way I have figured out is to put a watch on the log file.
__mishy__ t1_j4pamjg wrote
Reply to comment by ML4Bratwurst in [P] Looking for a CV/ML freelancer by bluebamboo3
Sure you can probably get v0 working in a week or two (assuming you know mobile stuff) but ironing out weird edge cases and assumed features takes forever
anothererrta t1_j4pagpo wrote
Reply to comment by timdettmers in [D] Tim Dettmers' GPU advice blog updated for 4000 series by init__27
If you go to all this trouble, please keep in mind that electricity prices vary a lot across the world. In some places in Europe people pay twice as much as you assumed above.
Making it clear how you arrive at your value calculation in an updated post (or even making it a dynamic calculator where people can enter their cost/kWh) would be very useful.
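A rough sketch of what such a calculator could look like. The power draw, usage hours, and both prices below are made-up placeholders, not figures from the blog post:

```python
def electricity_cost(power_watts, hours, price_per_kwh):
    """Cost of running a device: energy in kWh times the price per kWh."""
    energy_kwh = power_watts / 1000 * hours
    return energy_kwh * price_per_kwh

# Hypothetical usage: a 350 W GPU running 8 hours a day for a year.
hours_per_year = 8 * 365

# Hypothetical prices per kWh, one twice the other.
cost_low = electricity_cost(350, hours_per_year, 0.15)
cost_high = electricity_cost(350, hours_per_year, 0.30)

print(f"low price:  {cost_low:.2f} per year")
print(f"high price: {cost_high:.2f} per year")
```

The cost scales linearly with the price, so doubling the cost/kWh doubles the yearly running cost and shifts any value-per-dollar comparison accordingly.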
Acceptable-Cress-374 t1_j4pacws wrote
Reply to comment by Zondartul in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
> See teacher-student learning, and distillation.
Thanks, I'll check it out.
Professional-Row9655 t1_j4p9ub8 wrote
Reply to comment by nmfisher in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
Good one. It applies to software engineering practice too, or to any subfield of IT. On another note, what resources do you generally refer to when picking a paper to implement? Can you share some?
TheTerrasque t1_j4p9kk7 wrote
There is a project called Petals that has BLOOM running for everyone to use. It distributes the model over many machines and thus allows it to run on consumer hardware. There is a PoC chat at http://chat.petals.ml/
They just converted BLOOMZ and are currently setting up a network for that. That should be more suited for a chat interface. It's still missing GPUs though, so it would be great to get some more servers if people have some spare compute.
BeatLeJuce t1_j4p938f wrote
Reply to comment by royalemate357 in [D] Tim Dettmers' GPU advice blog updated for 4000 series by init__27
Thanks for the explanation! Why call it TF32 when it appears to have 19 bits? (IIUC it's bfloat16 with 3 additional bits of mantissa?)
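For reference, the bit counts behind that question can be checked with a few lines. The layouts below are the commonly documented (sign, exponent, mantissa) splits; the dictionary and helper names are just for illustration:

```python
# (sign bits, exponent bits, mantissa bits) for each format.
FORMATS = {
    "FP32": (1, 8, 23),
    "TF32": (1, 8, 10),
    "FP16": (1, 5, 10),
    "BF16": (1, 8, 7),
}

def total_bits(name):
    """Sum of sign, exponent, and mantissa bits for a format."""
    return sum(FORMATS[name])

for name in FORMATS:
    sign, exponent, mantissa = FORMATS[name]
    print(f"{name}: {sign}+{exponent}+{mantissa} = {total_bits(name)} bits")

# TF32 keeps the 8-bit exponent of FP32/BF16 and the 10-bit mantissa of FP16.
extra_mantissa_vs_bf16 = FORMATS["TF32"][2] - FORMATS["BF16"][2]
```

So TF32 does come out at 19 bits, with 3 more mantissa bits than bfloat16.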
BeatLeJuce t1_j4p8p5g wrote
Reply to comment by Freonr2 in [D] Tim Dettmers' GPU advice blog updated for 4000 series by init__27
thanks!
blackkettle t1_j4p8nqv wrote
Reply to [R] The Predictive Forward-Forward Algorithm by radi-cho
It doesn’t seem to discuss the computational advantages in any detail. How interesting is this whole FF idea at this point? I’d love to hear more detailed analysis.
So far it seems like an interesting alternative but the “brain inspired” part is pushed in every article. In terms of accuracy it always seems slightly below traditional back prop. If there’s a huge computational improvement that would seriously recommend it I guess, but is there? Or is it just too early to tell?
__mishy__ t1_j4p5mm8 wrote
Reply to comment by farox in [D] I’m a Machine Learning Engineer for FAANG companies. What are some places I can get started doing freelance work for ML? by doctorjuice
Completely agree, I would also add just a couple of tips I've found (not directly related to the question):
- have a good circle of friends in ML you chat to, you will sometimes find yourself in places where you are the only ML expert and you will need people you trust to bounce ideas off and to tell you about new things you missed
- invest in a decent workstation and if you can't afford one try to get your first gig where it's not needed and buy one as soon as you can. This has saved me tons of time over the years
- get good at showing results quickly to stakeholders... and I mean you should be able to hack it in an hour at most. They are paying you a lot of money and want the feeling of progress. Something in slides/powerpoint is OK, a dumb streamlit/whatever app is even better. Impressing a stakeholder is the best way to get repeat work
Unlikely-Advice-7168 t1_j4p515n wrote
Reply to [P] Built an at-cost, pay per second, open-source API for Tortoise text-to-speech (best I've heard!) by Apprehensive-Tax-214
Site doesn't work
[deleted] t1_j4p2vog wrote
Lethandralis t1_j4p15c3 wrote
Reply to comment by Lethandralis in [D] Model for detecting rectangle corners? by hundley10
This might not work if the cards have 6 degrees of freedom. You can check out CornerNet and its variants for anchor free corner estimation. The original paper detects two corners, but extending to four should be possible.
Another option is to use yolo to detect a rough bbox, and then use classical cv to refine corner locations.
Lethandralis t1_j4p0scb wrote
Reply to [D] Model for detecting rectangle corners? by hundley10
https://github.com/XinzeLee/RotateObjectDetection
There are YOLO versions that allow oriented bbox detection. The above is just one example; there are a lot of repos on GitHub.
Chocolate_Pickle t1_j4p01lm wrote
Reply to [R] The Predictive Forward-Forward Algorithm by radi-cho
Landing page here: https://arxiv.org/abs/2301.01452
Around-star t1_j4pk1lp wrote
Reply to Apple AI Residency 2023 [R] by Extension-Reward5756
Looking at the profiles of AI/ML Residents at Apple, it is mostly PhD students, some with master's degrees, but I did not find any holding only a bachelor's. In that case, I should not have much hope of hearing back :(