Recent comments in /f/MachineLearning

akacukiii t1_j4ixh08 wrote

Hi. I'm an international grad student in the US and am looking for an internship for the summer. Please, if you have some tips, or if you care to have a look at my profile, just let me know. Thank you!

3

currentscurrents t1_j4ijlqv wrote

You can fine-tune image generator models and some smaller language models.

You can also do tasks that don't require super large models, like image recognition.

>that's beyond just some toy experiment?

Don't knock toy experiments too much! I'm having a lot of fun trying to build a differentiable neural computer or memory-augmented network in pytorch.

3

T1fa_nug t1_j4idpif wrote

Hello guys I'm new in the machine learning and I wanted to know if a i5 8th gen and a 1060 6 gb paired with 16 Gb of ram are they enough for any type work that could come my way??!

1

niclas_wue OP t1_j4i9r9w wrote

Thanks for your ideas. Building a paid experience for companies is a great idea, I will consider it.

Category tagging like ā€žcomputer visionā€œ, ā€žnatural language processingā€œ etc. should be relatively straightforward. Will implement this in the next couple of days :)

More paper specific tags could be generated using GPT-3, I think that would make sense, when the database is a bit larger. Right now, I would guess that most tags would be unique to a single paper.

3

RuairiSpain t1_j4i4t54 wrote

I left academia in the 1990s. When did paper titles becomes so vague? "In my day", you had a good idea what the paper was about just from the title. Reading the first 30-40 papers here, what are authors trying to do? Be comedians?

I need a more up-to-date buzzword thesaurus of research fields and fashions, so I can interpret the context/semantics of these titles! I feel old 😫

5

GasZealousideal8691 OP t1_j4hk6kz wrote

Im fairly certain it’s something with the model. Like even fine tuning is giving these weird errors, when it had no problems for GPT-Neo.

We also ran this stuff on T5, obviously had to configure the rest of the code differently but it was doing fine for that as well.

1

Reasonable_Ladder922 t1_j4hewm2 wrote

Your arxiv-summary.com project sounds like a great idea and a very useful tool for people in the field of machine learning. It's great that you're using PapersWithCode to filter out the most relevant papers, and that you're using GPT-3 to summarize the papers' sections and subsections.

The fact that the website is able to fetch new papers daily and parse their pdf and LaTeX source code to extract relevant sections and subsection, and then summarize those with GPT-3, it will make it more accessible for people to quickly understand the main ideas and contributions from the abstract.

It's great to hear that you have a search page and an archive page where users can get a chronological overview, this will help people to keep track of new publications in their field.

I wish you the best of luck with your project and I'm sure it will be a great resource for many people in the field of machine learning.

1