Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf
[OC] Biggest e-commerce companies in the world
/r/dataisbeautiful
https://redd.it/yyq9m7
I developed an API to analyze domain names.
Hello guys, I recently launched my Domain Analysis API. This API allow you get thorough analysis of your domain ranges from domain length all the way to past domain (history) sales and number of mentions. For more information : https://rapidapi.com/getbishopi/api/domain-analysis/
/r/datasets
https://redd.it/yxw16f
Modern day slavery - 6500 dead migrants since Qatar was named World Cup host
https://www.statista.com/chart/24268/deaths-of-south-asian-migrants-in-qatar/
/r/dataisbeautiful
https://redd.it/yyjb10
Size comparison of the world's smallest countries.
/r/MapPorn
https://redd.it/yyg60s
A shaded relief map of Australia rendered from 3d data and satellite imagery [OC]
/r/dataisbeautiful
https://redd.it/yyd2at
Q Inverse version of the birthday problem
The birthday problem asks for the probability that, in a room of randomly selected people, any two of those people have the same birthday. Followed by asking how many people is required to reach a >50% chance that two people have the same birthday.
The question I have is similar:
How many people are required before there is a >50% probability that all 366 possible birthdays (because leap years) are represented amongst the randomly selected people?
/r/statistics
https://redd.it/yxfbhq
Coining the term "bowtiscate", not sure if it has been named before
/r/mathpics
https://redd.it/yxuats
USA recognition in the 18th century
/r/MapPorn
https://redd.it/yy0gs1
How to escape Jupyter in a corporate environment
There are quite a few recent posts about pros and cons of Jupyter in the DS field and I don't want to add to that. My question is more about tooling prevalent in corporate environments. Most of the time Python DS stack is limited to remotely hosted Jupyter lab instances (in contrast R stack includes paid subscriptions to Rstudio server, which is a great tool that can live in a browser!)
What are the options available of one wants a hybrid IDE + Notebook workspace if ssh is blocked (almost all the time in corporate)
I feel like Jupyter is hindering a push towards a Foss or paid alternate to Rstudio server for python. (I've heard VS code is attempting sever side tool which can be accessed through the browser but not sure if it's available for production (corporate use) yet. I know there can be workarounds using paid (new rstudio for python) options but since jupiter is so widespread its really hard to get the heads to spend money on something new and untested.
I'm surprised that as a community we seem to be content with J.
/r/datascience
https://redd.it/yy42j2
C Are ML interviews generally this insane?
ML positions seem incredibly difficult to get, and especially so in this job market.
Recently got to the final interview stage somewhere where they had an absolutely ridiculous. I don’t even know if its worth it anymore.
This place had a 4-6 hour long take home data analysis/ML assignment which also involved making an interactive dashboard, then a round where you had to explain the the assignment.
And if that wasnt enough then the final round had 1 technical section which was stat/ML that went well and 1 technical which happened to be hardcore CS graph algorithms which I completely failed. And failing that basically meant failing the entire final interview
And then they also had a research talk as well as a standard behavioral interview.
Is this par for the course nowadays? It just seems extremely grueling. ML (as opposed to just regular DS) seems super competitive to get into and companies are asking far too much.
Do you literally have to grind away your free time on leetcode just to land an ML position now? Im starting to question if its even worth it or just stick to regular DS and collect the paycheck even if its boring. Maybe just doing some more interesting ML/DL as a side hobby thing at times
/r/statistics
https://redd.it/yxje64
2022 congressional map results
/r/MapPorn
https://redd.it/yxvt1e
[OC] Gold Reserves by Country in Freight Cars (in Tons) | 2021-2022
/r/dataisbeautiful
https://redd.it/yxtkr6
[OC] Best selling video game consoles of all time - as of Nov 2022
/r/dataisbeautiful
https://redd.it/yxmuh0
At the very least, use rainbow colors.
/r/dataisugly
https://redd.it/ywsdub
[Academic] Survey about gaps in medical care system (US Residents, 18+)
https://docs.google.com/forms/d/e/1FAIpQLSe3M5lS6OMZvQ73bumdXKApotXsIXagNP0FWLSi-_H_X3-u_w/viewform?usp=sf_link
/r/SampleSize
https://redd.it/yxdqay
POV: you're on reddit rn procrastinating [OC]
/r/visualization
https://redd.it/yy07f3
[OC] The last 3 years were harsh for some billionaires. See who has lost the most in terms of net worth.
/r/dataisbeautiful
https://redd.it/yygh3w
Then & Now Portugal's Drug Decriminalization
/r/visualization
https://redd.it/yykcf3
Bigfoot Sightings Dataset (and some analysis)
Some extremely interesting data on the sightings of Bigfoot, aka Sasquatch. With this dataset, you can see exactly where the most sightings are coming from, what these sightings look like, and even the environment in which the sighting took place.
This data originated from [The Bigfoot Field Researchers Organization](https://www.bfro.net/), and became available on Kaggle [here](https://www.kaggle.com/datasets/josephvm/bigfoot-sightings-data). \[Self-promotion\] We've re-hosted the data in Gigasheet here for exploration before downloading the file: [https://app.gigasheet.com/spreadsheet/Bigfoot-Sightings/3f64218d\_3ea2\_47a4\_900c\_5c975ffcf0ad?public=true](https://app.gigasheet.com/spreadsheet/Bigfoot-Sightings/3f64218d_3ea2_47a4_900c_5c975ffcf0ad?public=true)
Here are some highlights:
* **Washington State has the highest number of sightings** out of all states with around 11% of all sightings.
* The **majority of Bigfoot sightings occur during the summer** in which approximately 34% of sightings take place. Fall is in second place with around 27% of sightings.
* **2012 was the year with the most recorded sightings** with 191. Bigfoot sightings seem to be trending upwards, which might mean we are close to finding them (or maybe we are going crazy!)
/r/datasets
https://redd.it/yxrjxa
[OC] How many people died for the Qatar World Cup.
/r/dataisbeautiful
https://redd.it/yyh8f5
The continents can be arranged to look like a chicken
/r/MapPorn
https://redd.it/yy8lom
[OC] Visualizing eight of Donald Trump’s false or misleading claims from his presidential bid announcement
/r/dataisbeautiful
https://redd.it/yxp3zv
[OC] Costco hotdog & soda combo price 1985 vs 2022
/r/dataisbeautiful
https://redd.it/yy08qy
Q Are the significance and p values conditional probabilities? If so, how can you conditional on the null being true if the null hypothesis is not a random variable? it should be true or not right?
/r/statistics
https://redd.it/yxdkk1
thats not it
/r/dataisugly
https://redd.it/ywbh91
Animal Showdown, Round 3 (Everyone)
https://docs.google.com/forms/d/e/1FAIpQLSetbnaoaDPabFxDLezsZ-4y4VPIPFo00m8qnPtwbdG3mL_wHA/viewform?usp=sf_link
/r/SampleSize
https://redd.it/yxrgw4
Is paid leave available for mothers of infants?
/r/MapPorn
https://redd.it/yxt5ew
[OC] Layoffs in the tech industry over the last month for selected companies
/r/dataisbeautiful
https://redd.it/yxlplb
[Advice] Seeking assistance with presenting data
/r/visualization
https://redd.it/yx5wcf
High IQ move by whoever plotted this
/r/dataisugly
https://redd.it/ywbmst