Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf
Population growth in each state 1950-2016
/r/MapPorn
https://redd.it/zxln5g
[OC] The Number of Endangered Species in Each US State
/r/dataisbeautiful
https://redd.it/zxs8k3
A Tool to create a dataset of semantic segmentation on website screenshots from their DOM
https://github.com/dmvaldman/html_semantic_seg
/r/datasets
https://redd.it/zxa1go
[OC] Race to the Bottom: Stocks That Lost the Most Market Cap in 2022
/r/dataisbeautiful
https://redd.it/zxa0fp
What Actually Happens When You Recycle
/r/Infographics
https://redd.it/zwewey
Number of World Cup Finals Lost
/r/MapPorn
https://redd.it/zx9ko7
The 100 Biggest Public Companies in the World in 2022
/r/Infographics
https://redd.it/zx9sfc
Spreadsheet of all FIFA 23 fut player data (OC)
https://docs.google.com/spreadsheets/d/1zdnAtkEwM8p21PsPP1KJg7vJuXWzvcWDYT8mxEt3Omw/edit?usp=sharing
/r/datasets
https://redd.it/zwp1q1
How do you pluralize English words? (All)
Google forms link
/r/SampleSize
https://redd.it/zwt5x4
[OC] November 2022 for five Boeing 737-800s
/r/dataisbeautiful
https://redd.it/zwo89h
Datasets - Cheminformatics related data
Can somebody give pointers as to where I can find antibody/protein/peptide datasets which contain experimental information related to,
* plasma half-life,
* thermal stability,
* solubility,
* Aggregation propensity,
* Immunogencity.
I need it for data mining and analytics.
/r/datasets
https://redd.it/zu017x
Top 50 Big Data Analytics Tools and Software You should know in 2023
https://bigdataanalyticsnews.com/top-big-data-analytics-tools/
/r/datasets
https://redd.it/zvj7pz
[OC] User m-rage posted their flower blooming records here, but that was deleted for lacking computer-generated content. I entered it into a spreadsheet, added graphs plus some date analysis. I used OpenOffice Calc. [13778x3084]
https://redd.it/zw044u
@datascientology
Advice for finding a detailed dataset on stocks listed in the S&P 500?
I need to find a dataset for a college course. I'm interested in finance and would like to do some exploratory analysis on S&P 500 stocks. I've already found something that is almost perfect for what I want to do: S&P 500 Companies with Financial Information | Kaggle
However, I would prefer a dataset of the exact same format but that has even more columns. I would especially like to have more categorical columns, since the only truly categorical one from this one is the sector. I know that there are a lot more columns that could make sense for such a dataset, such as historical return over the past N years, or other financials. So I really do feel like there must be some dataset out there that looks exactly like this but has even more columns.
I'm just having trouble with finding such a dataset. So far I've just been googling something like "dataset of s&p 500 stocks with financial information" or "dataset of s&p 500 stocks with many columns," but I haven't really found anything better yet by doing so.
I would appreciate any suggestions.
/r/datasets
https://redd.it/zw6lyy
Steam's average player does not obey the rules of scale
/r/dataisugly
https://redd.it/zvvj72
Can Someone Verify If This Economic Data From the Fed Is Accurate? I find it hard to believe
So, I am learning data analytics, and I got this dataset from the fed about per capita personal income in Teton, Wyoming. You can check it yourself here https://fred.stlouisfed.org/series/PCPI56039
But, I find this number hard to believe. It says per capita personal income in Teton is $318,297, which is way higher than even New York. Apparently their data comes from bea.gov, which confirms the same number (318,297).
Meanwhile, wikipedia lists Teton, Wyoming per capita personal income as $43,444. Sure, data on wikipedia might be old, but not as old as 1995 if you check wikipedia source in the footer (meanwhile, the fed data shows Teton per capita personal income has been above $43,000 since 1995).
Forgive my ignorance, I am not American, and I have never been to United States, I just want to make sure the numbers from bea.gov are accurate, since there seems to be conflicting information
/r/datasets
https://redd.it/zwi4i9
Rents by municipality in mainland France €/m² [OC]
https://redd.it/zx00bl
@datascientology
[OC] Shark Tank investors profile
https://redd.it/zxifdp
@datascientology
ChatGPT Extension for Jupyter Notebooks: Personal Code Assistant
Hi!
I want to share a browser extension that I have been working on. This extension is designed to help programmers get assistance with their code directly from within their Jupyter Notebooks, through ChatGPT.
The extension can help with code formatting (e.g., auto-comments), it can explain code snippets or errors, or you can use it to generate code based on your instructions. It's like having a personal code assistant right at your fingertips!
I find it boosts my coding productivity, and I hope you find it useful too. Give it a try, and let me know what you think!
You can find an early version here:
https://github.com/TiesdeKok/chat-gpt-jupyter-extension
/r/datascience
https://redd.it/zwppsu
Super Stylish Ways to Tie a Scarves
/r/Infographics
https://redd.it/zwdpdy
Wood smoking flavor guide
/r/Infographics
https://redd.it/zwktq2
[OC] How to remember the name (now with sound)
/r/dataisbeautiful
https://redd.it/zx96hp
Should r/SampleSize accept images in posts? Results
, - ~ ~ ~ - ,
, ' \ ' ,
, \ 33.3% ,
, \ Nope ,
, \ ,
, _,
, ,
, 66.7% ,
, Sure ,
, , '
' - , , '
N = 96
/r/SampleSize
https://redd.it/zw7mrc
Announcements: Image posts have returned! (Those who share results on r/SampleSize)
Users of r/SampleSize from before the last wave of moderators rejoice! After a couple people requested the return of image posts, including most recently u/ToLoveThemAll, we've hashed out in the background how it'll work, and now image posts are allowed when using the **Results** flair! They will work as follows.
* You have the option to make an Image post to have Reddit host an image to our subreddit.
* Results is the only flair that images will be allowed, any other flair posting an image will be removed.
* Results-flaired-image-posts will still be filtered, and will be pushed forward on an approval basis. We will receive a modmail every time a user attempts to post using the Results flair, so we can manually approve images and threads.
/r/SampleSize
https://redd.it/zw80ic
[OC] Breakdown of how dating went for me in 2022 as a 22M
/r/dataisbeautiful
https://redd.it/zw4e8l
The Endangered Alphabets Project is looking for volunteers
https://mobile.twitter.com/TBAlphabets/status/1607436824988852230
/r/datasets
https://redd.it/zvxy8n
[OC] 30 Most &. least prosperous countries in the world according to Legatum Prosperity Index.
/r/dataisbeautiful
https://redd.it/zwehys
Yearly Deaths by Natural Disaster, going backwards from 2021 to 1900 [OC]
/r/dataisbeautiful
https://redd.it/zw5fjs
Data science & analytics style guide and best practices.
Hi! I've been workin on DA for a bit more than a year now and I love it but I haven't found good documentation about it.
I see a lot of resources with a strong focus on tools (softwares or languages e.g.) but laking rigurosity on definitions and best practices.
I would like to find some book or documentation regarding the following topics:
Definitions: Dimentions, facts, types of analytics, wrangling, cleaning etc.
Best practices: recomended procedures for ETL, querys etc.
Thanks a Lot!
/r/datascience
https://redd.it/zw04md
[OC] North American cities by number of major sports championships (Updated December 2022)
/r/dataisbeautiful
https://redd.it/zvw7pf