Recent comments in /f/dataisbeautiful

Craig1207 t1_jbsb469 wrote

Hi all, I’m looking for data related resources like a ‘guess the movie’ based on a dashboard clues etc? I have used the Geckoboard examples but wondered if there are any others or suggestions in how to create own ones quickly? Thanks - want to make data related starter activities before sessions

2

PsychologicalEgg9377 OP t1_jbrqmco wrote

I assumed most people on this sub would be familiar with nonlinear dimensionality reduction but it looks some are not.

https://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction

This family of algorithms takes a data point that is normally represented in a high dimensional space and maps it to a lower dimensional representation. You generally lose information in the process, but in high-dimensional spaces there's often a lot of empty space that you can get rid of without losing much. The closest analogy I can think of is compressing a CD to mp3. You are losing information in the process, but if done correctly, the human ear can't tell much of a difference.

Why do this? One obvious reason is so you can plot highly dimensional data in 2D and 3D and get a better sense of certain spatial relationships. Once reduced, the vector components are difficult to describe in plain language. So it's not like "x is time and y is trades."

It's a confusing concept if you've never seen it before but it's very powerful and a common technique used in data science.

3

kompootor t1_jbrq4yy wrote

You really need to have vertical ticks indicating January for each year (and make that clear). Importantly, this will indicate that the dataset only runs through January 2023 (otherwise, that information must be indicated explicitly somewhere).

Also, if the thesis of the visualization is about the recent floods, then the time scale seems overly large, since you can pretty well capture the prior upward trend and volatility information by cutting off at 2018 or so. If however you were to include markings shading other major disasters and political turmoil, that would justify the time scale, and it would lend support to your thesis if, for example, you find other major events that don't correspond to dramatic changes in these economic indicators, and mark them as well. (You should be impartial in choosing events however. Significant events like the oil crash from 2014--2016 that sent Nigeria into a spiralling recession in 2016, should not be ignored.)

2

windowsfrozenshut t1_jbrp3yw wrote

Yeah, Utah is ridiculous. I was lucky and bought my house in 2018, but any median level earner here who waited until 2020 and after to try and buy is literally not able to afford anything right now. I'm not even in any of the metros listed on the chart, but my little bumfuck town is right there with them which is shocking. Double wides built in the 80's are selling for 100k at 10% interest plus 6-700 a month in lot rent, and that's the cheapest option.

2