Recent comments in /f/MachineLearning
YOLOBOT666 t1_j6ziz4m wrote
Reply to comment by fuscarili in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
Yeah, this would be a course in RL, most likely using RL bible as main reference textbook. Agree with the other comment, these lectures are all available online.
What I found valuable in attending a course in person was the prof, lots of insights and intuitions explained in person/office hours was the most valuable part for me. While I was taking the RL course in person, I also referenced online lectures and notes.
In terms of data science interviews and jobs, Bayesian would be more useful, at least more than RL unless you found yourself in robotics or some very niche industry.
frequenttimetraveler t1_j6ziww8 wrote
Reply to comment by visarga in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
Chatgpt can be imperfect on cue
mtocrat t1_j6zin88 wrote
Reply to comment by koolaidman123 in [D] Why do LLMs like InstructGPT and LLM use RL to instead of supervised learning to learn from the user-ranked examples? by alpha-meta
supervised fine-tuning seems inherently limited here. You regress to the best in the set of answers but that's it. RLHF can improve beyond that, up to the point where the generalization capabilities of the reward model fail..
Professional_Poet489 t1_j6zgk6h wrote
Reply to comment by fuscarili in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
You can find good lectures on all of these topics on youtube, coursera, etc, but that's also true about Bayesian methods. RL is more fun IMO, but less employable for now. RL is used all over the place for things like recommender engines, ad promotion, etc. The concepts are super valuable. Bayesian methods are a bit more generic and common, and tbh are going out of vogue in most of robotics.
fuscarili OP t1_j6zfyha wrote
Reply to comment by t3co5cr in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
thx for your comment! I am reading and considering all of them
mongoosefist t1_j6zfxe8 wrote
Reply to comment by NitroXSC in [R] Extracting Training Data from Diffusion Models by pm_me_your_pay_slips
How would you know that you had recovered it if you didn't know the training data a priori?
fuscarili OP t1_j6zfun0 wrote
Reply to comment by northernbeggar in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
thx for your comment! I appreciate people speaking their minds
bjorneylol t1_j6zfs3j wrote
Reply to comment by CommunismDoesntWork in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
Except for the fact that it has next to zero usability if you use Firefox as a default browser, and there are no functional OS integrations
bjorneylol t1_j6zff0g wrote
Reply to comment by venustrapsflies in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
They announced back in like september it was no longer supported, so that tracks.
fuscarili OP t1_j6zfcnk wrote
Reply to comment by bring_dodo_back in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
Thx for your insights!
flyingbertman t1_j6zfccf wrote
Reply to comment by frequenttimetraveler in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
Peopel
fuscarili OP t1_j6zfazq wrote
Reply to comment by bablador in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
I wish! But my little brain is already overloading :(
fuscarili OP t1_j6zf6zc wrote
Reply to comment by [deleted] in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
Thx for your comment! Analyses like these help a lot draw a map in my mind of how these different fields are intertwined and applied in the different areas of ML, which is the first thing that I need to understand in order to make up my mind
t3co5cr t1_j6zf1mm wrote
Reply to [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
Not that it'll help you with your choice of elective, but reinforcement learning can be seen as a particular type of Bayesian modeling.
LeumasInkwater t1_j6zdg60 wrote
Reply to [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
Honestly all the GPT stuff they are introducing seems pretty useful.
I like the idea of having automatic tasks generated after a meeting. I usually jot down 'follow-up' items while in meetings, and send them out to relevant coworkers afterward. It would only save me 5 minutes or so after every call, but could maybe help me focus more on what's being said rather than writing everything down 🤷♂️.
Also flagging parts of a meeting that you missed, auto-chapters, and tagging sections by the speaker all seem genuinely helpful.
That being said, my company doesn't use Microsoft products, so I hope to see features like this come to other platforms.
HipposWild t1_j6zczw8 wrote
Reply to [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
How many gpus does that take to run?
northernbeggar t1_j6zcr9t wrote
Reply to [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
Judging from your question, Bayesian methods is the obvious choice. The basic of RL, which is likely to be the most significant part of your course, can be learned at your own convenience from the famous textbook by Sutton and Barto.
netw0rkf10w OP t1_j6zbfz4 wrote
Reply to comment by MadScientist-1214 in [D] ImageNet normalization vs [-1, 1] normalization by netw0rkf10w
Indeed. Maybe we have a new battle between [-1, 1] and [0, 1] lol.
netw0rkf10w OP t1_j6zbbkb wrote
Reply to comment by nicholsz in [D] ImageNet normalization vs [-1, 1] normalization by netw0rkf10w
Agreed!
visarga t1_j6zb9em wrote
Reply to comment by Nhabls in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
Many AI teams are scrambling now to label data with GPT-3 and train their small efficient models from GPT-3 predictions. This makes the hard part of data labelling much easier, speeds up development 10 times. In the end you get your cheap & fast models that work about as good as GPT-3 but only on a narrow task.
netw0rkf10w OP t1_j6zb957 wrote
Reply to comment by puppet_pals in [D] ImageNet normalization vs [-1, 1] normalization by netw0rkf10w
If I remember correctly it was first used in AlexNet, which started the deep learning era though. I agree that it doesn't make much sense nowadays, but it's still be used everywhere :\
fuscarili OP t1_j6zb7y9 wrote
Reply to comment by YOLOBOT666 in [D] I'm at a crossroads: Bayesian methods VS Reinforcement Learning, which to choose? by fuscarili
This is the syllabus:
Reinforcement learning in non-sequential problems:
- Non-contextual multi-armed bandits
- Contextual Multi-Armed Bandits
Reinforcement learning in sequential problems:
- Dynamic planning. Bellman's formula
- Value-Based Methods
- Policy-based methods
- Actor-critic methods
- Model-Based Methods
Would you say it's within the basic stuff? I honestly have no clue
visarga t1_j6zb3ca wrote
Reply to comment by wintermute93 in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
> largely the same kind of thing.
For what value of largely? How many coherent words can it write? Does it also obey commands and solve tasks?
visarga t1_j6zapcm wrote
Reply to comment by LeanderKu in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
That's why I keep pen and notebook open in front of my keyboard at all times, I take light notes during meetings and use it as scratchpad when I am thinking. I can fill 100 pages in a month, almost never re-read except for meeting notes.
halohunter t1_j6zj8gj wrote
Reply to comment by AnotherEuroWanker in [N] Microsoft integrates GPT 3.5 into Teams by bikeskata
Microsoft does not give a toss about Firefox. Power Bi and Power Apps also have bugs only in Firefox