Recent comments in /f/MachineLearning

Mefaso t1_j9dbox6 wrote

>IMO reviewers at these journals/conferences need to be more mindful of this kind of plagiarism/low-effort submission.

Workshops in general have a very low bar, this surely wouldn't have been published in the main track.

Other than that I don't really see the point of this rant.

Yes there are a lot of bad papers, there are a lot of bad papers even in the main tracks, you just kind of get used to it.

It feels a lot like hitting down a well. Maybe these are some undergraduates doing their first research project and it's more about learning the methodologies and writing rather than very novel approaches.

16

PacmanIncarnate t1_j9d7yaw wrote

The bigger use isn’t games, but animation or VFX. They require high quality simulations that sometimes take days to render a few seconds of simulation. Every tech that can cut that time down without a substantial loss of quality is huge.

2

Borrowedshorts t1_j9cxi3r wrote

I'd really like to see more realistic ground (contact) physics with different textures and terrains. Someone might walk differently in a desert environments vs a forest environment vs a snow environment for example. If there's debris on the ground such as small rocks or other debris it may cause the character to adjust foot contact to compensate. Sloping features could also be incorporated and modeled. Walking is a big thing but vehicle movement in these environments is also something that can be drastically improved upon.

1

synth_mania t1_j9cug1p wrote

My point was that you said image classification has been around since before NNs. That is false. Image classification has only ever been done with NNs. Sometimes they are radically different than what is normally used today (e.g. RAMnets and WISARD), but they've always been NNs.

0

buyIdris666 t1_j9ctyh8 wrote

Yup. Nerf just replaced the construction step after you "register" all the camera positions using traditional algorithms. Usually via COLMAP.

Not saying that's a bad thing, existing algorithms are already good at estimating camera positions and parameters. It was the 3d reconstruction step that was previously lacking.

For anyone wanting to try this, I suggest using Nerf-W . The original Nerf required extremely accurate camera parameter estimates that you're not going to get with a cell camera and COLMAP. Nerf-w is capable of doing some fine adjustments as it runs. It even works decent reconstructing scenes using random internet photos.

The workflow is COLMAP to register the camera positions used to take the pictures and estimate camera parameters, then export those into the Nerf model. Most of the Nerf repos are already setup to make this easy.

This paper is a good overview of how to build a Nerf from random unaligned images. They did it using frames from a sitcom, but you could take a similar approach to Nerf almost anything https://arxiv.org/abs/2207.14279

12

ggf31416 t1_j9clwen wrote

I actually have a 3060 too, in theory a 3060ti should be up to 30% faster, but most of the times the 3060 is fast enough and faster than any T4.

For making a few images on stable diffusion maybe the difference will be 15 vs 20 seconds, for running whisper on several hours of audio it could be 45 minutes vs 1 hour. The difference will only matter if the model is optimized to fully use the GPU in the first place.

1

Stellar_____ t1_j9cd80k wrote

Hi guys,

I’m looking into machine learning and it’s use in shark conservation. The below figure shows the effectiveness of image classification of sharks.

Can anybody help me interpret this? The internet is telling me that if you follow two species to where they meet, the colour in the square represents how often one has been mistaken for the other. But if this is the case, why is there a uniform line down the middle showing a much higher number?

Thanks in advance from a confused biologist…

Normalized Confusion Matrix

1