Recent comments in /f/dataisbeautiful
Roughneck16 OP t1_jbsxi0v wrote
Reply to comment by float16 in Yield rate for Top 150 US Universities [OC] by Roughneck16
Its sponsoring institution isn't exactly strapped for cash 😉
adamjonah OP t1_jbsshn4 wrote
Reply to comment by djdood0o0o in Erling Haaland's Record Goal-Scoring Pace - Follow up [OC] by adamjonah
I'd have to check specifically but I scraped from the very beginning of the Premier League data on their website, I think that in the 90s there were 22 teams not 20 so that's probably the reason.
dml997 t1_jbso47w wrote
Your axes aren't labeled and the explanation is vague, so I have no idea what this is supposed to show.
[deleted] t1_jbsn44r wrote
Reply to comment by DM-me-ur-tits-plz- in [OC] Ratio of Median Sale Price of Single-Family Homes to Per Capita Income, by Metro Area by thatdude333
To be fair, you couldn't pay me enough to live in most of those top 10 towns in IL, and I'm from there.
[deleted] t1_jbsjsw2 wrote
Wooden_Imagination46 t1_jbsfgxz wrote
Color theme is irritating.
Craig1207 t1_jbsb469 wrote
Reply to [Topic][Open] Open Discussion Thread — Anybody can post a general visualization question or start a fresh discussion! by AutoModerator
Hi all, I’m looking for data related resources like a ‘guess the movie’ based on a dashboard clues etc? I have used the Geckoboard examples but wondered if there are any others or suggestions in how to create own ones quickly? Thanks - want to make data related starter activities before sessions
South5 t1_jbs4tvb wrote
Reply to comment by stalphonzo in John Wick's Weapon Usage - Original Trilogy [OC] by Crash_Recovery
‘A fuckin’ pencil’
[deleted] t1_jbs2v83 wrote
Reply to comment by PsychologicalEgg9377 in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
[removed]
GrizzlyHerder t1_jbs1t10 wrote
Reply to comment by datainspace in [OC] Impact of Russian Missile strikes on Ukraine's ICT Infrastructure by datainspace
Can someone smarter than me on this metric summarize what was learned here?
Serprotease t1_jbrzgsc wrote
Reply to comment by Kool-aid_Crusader in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
If it’s a PCA or similar, the labels are basically Dimension 1 and Dimension 2 so It’s kinda ok to skip them.
felix_using_reddit t1_jbry346 wrote
Reply to comment by Crash_Recovery in John Wick's Weapon Usage - Original Trilogy [OC] by Crash_Recovery
Will you update this for Chp4?
djdood0o0o t1_jbrxp2n wrote
Why does it go to 40 instead of 38?
extreme_imbecile t1_jbrxblv wrote
Reply to comment by captainpicard6912 in [OC] Ratio of Median Sale Price of Single-Family Homes to Per Capita Income, by Metro Area by thatdude333
Yeah haha. Teachers and bank tellers have no right to live within an hour of where they work but I have a right to own five rentals because the market says so haha. Wait what the fuck why are schools closed on Fridays and my favorite restaurants shutting down?!
rocket_labo t1_jbrta5y wrote
It’s hard to gain intuition about this plot as the legends don’t connect with the title. What trading patterns are you considering here (momentum, mean reverting, or chart patterns?) and why are they not represented in the legend?
Traitor_Donald_Trump t1_jbrriu1 wrote
Reply to comment by DataMan62 in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
I assume each dot is an individual blue chip company, the color is an investment sector.
It makes sense, this shows most movement by big money managers and ETFs readjusting in sync.
Turbulent-Key-5486 t1_jbrr9by wrote
Reply to comment by thehallmarkcard in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
energy/healthcare and communication services/basic materials share colors, yeah.
PsychologicalEgg9377 OP t1_jbrqr7z wrote
Reply to comment by Spillz-2011 in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
Tell me you are a data scientist without telling me you are a data scientist!
PsychologicalEgg9377 OP t1_jbrqmco wrote
I assumed most people on this sub would be familiar with nonlinear dimensionality reduction but it looks some are not.
https://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction
This family of algorithms takes a data point that is normally represented in a high dimensional space and maps it to a lower dimensional representation. You generally lose information in the process, but in high-dimensional spaces there's often a lot of empty space that you can get rid of without losing much. The closest analogy I can think of is compressing a CD to mp3. You are losing information in the process, but if done correctly, the human ear can't tell much of a difference.
Why do this? One obvious reason is so you can plot highly dimensional data in 2D and 3D and get a better sense of certain spatial relationships. Once reduced, the vector components are difficult to describe in plain language. So it's not like "x is time and y is trades."
It's a confusing concept if you've never seen it before but it's very powerful and a common technique used in data science.
kompootor t1_jbrq4yy wrote
Reply to comment by thexylom in [OC] Nigeria's inflation rate has raised to levels not seen in a decade, following devastating floods. by thexylom
You really need to have vertical ticks indicating January for each year (and make that clear). Importantly, this will indicate that the dataset only runs through January 2023 (otherwise, that information must be indicated explicitly somewhere).
Also, if the thesis of the visualization is about the recent floods, then the time scale seems overly large, since you can pretty well capture the prior upward trend and volatility information by cutting off at 2018 or so. If however you were to include markings shading other major disasters and political turmoil, that would justify the time scale, and it would lend support to your thesis if, for example, you find other major events that don't correspond to dramatic changes in these economic indicators, and mark them as well. (You should be impartial in choosing events however. Significant events like the oil crash from 2014--2016 that sent Nigeria into a spiralling recession in 2016, should not be ignored.)
goodluckonyourexams t1_jbrpz7a wrote
Reply to comment by DataMan62 in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
click link
windowsfrozenshut t1_jbrppie wrote
Reply to comment by [deleted] in [OC] Ratio of Median Sale Price of Single-Family Homes to Per Capita Income, by Metro Area by thatdude333
Pay that thing off and live like a king!
windowsfrozenshut t1_jbrp3yw wrote
Reply to comment by Muffinman3571 in [OC] Ratio of Median Sale Price of Single-Family Homes to Per Capita Income, by Metro Area by thatdude333
Yeah, Utah is ridiculous. I was lucky and bought my house in 2018, but any median level earner here who waited until 2020 and after to try and buy is literally not able to afford anything right now. I'm not even in any of the metros listed on the chart, but my little bumfuck town is right there with them which is shocking. Double wides built in the 80's are selling for 100k at 10% interest plus 6-700 a month in lot rent, and that's the cheapest option.
DataMan62 t1_jbrnxw0 wrote
Reply to comment by PsychologicalEgg9377 in [OC] Dimensionality reduction of stock trading patterns by PsychologicalEgg9377
I can’t tell which stocks trade like their group BECAUSE NOTHING IS LABELED!
djdood0o0o t1_jbsxmyk wrote
Reply to comment by adamjonah in Erling Haaland's Record Goal-Scoring Pace - Follow up [OC] by adamjonah
Yeah i thought that could be the answer. I suppose it would only have to take 1 season of that rule for it to change the graph. Very interesting table anyway.