Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf
[R] QUALCOMM demos 3D reconstruction on AR glasses — monocular depth estimation with self supervised neural network processed on glasses and smartphone in realtime
/r/MachineLearning
https://redd.it/z60wuh
Visualization of my Strava running data of the last 4 years. Any ideas on how to improve this?
/r/visualization
https://redd.it/z5i0uf
Dear Hiring Managers in DS field, how to boost your chances for landing entry job, with no prior experience in DS?
/r/datascience
https://redd.it/z5ho8z
Study on gamers. Just 40 people left. Please help us 🥺 (+16)
Dear gamers,
we are an international team of scientists studying video gaming communities. We are currently running a study on the psychological characteristics of various types of gamers (platforms, genres, etc.) and we would like to invite you to participate. It will take about 20 minutes and consist of a series of questionnaires presented on the free-to-use Psytoolkit experimental platform (https://www.psytoolkit.org/). The study will be entirely anonymous, but completing all the questionnaires and leaving a working email address will qualify you for a chance to win a 100, 200, or 300 Euro (or around 680$) gift card of your choice in a raffle at the end of the study. Every submission will be of great help. From casual to hardcore gamers, from PC, to mobile. to console players, we welcome all who engage in the gaming culture. The whole research team is thankful for your support.
For more information, follow this link below:
https://www.psytoolkit.org/c/3.4.0/survey?s=EdxEY
/r/SampleSize
https://redd.it/z58oxt
I need an English-Old English dataset in CSV format for training a machine translation model.
title
/r/datasets
https://redd.it/z5fb2q
Russian list of unfriendly countries 2021 and 2022
/r/MapPorn
https://redd.it/z597k9
C End of year Salary Sharing thread
This is the official thread for sharing your current salaries (or recent offers) for the end of 2022.
Please only post salaries/offers if you're including hard numbers, but feel free to use a throwaway account if you're concerned about anonymity. You can also generalize some of your answers (e.g. "Large CRO" or "Pharma"), or add fields if you feel something is particularly relevant.
1. Title(e.g statistical programmer, biostatistician, statistical analyst, data scientist):
2. Country/Location:
3. $Remote:
4. Salary:
5. Company/Industry:
6. Education:
7. Total years of Experience:
8. $Internship
9. $Coop
10. Relocation/Signing Bonus:
11. Stock and/or recurring bonuses:
12. Total comp:
Note that while the primary purpose of these threads is obviously to share compensation info, discussion is also encouraged.
/r/statistics
https://redd.it/z5cpvm
[OC] Percentage of quiz takers who knew each country in a 'Name the Countries of the World" quiz.
/r/dataisbeautiful
https://redd.it/z5dul3
[OC] - US Yield Curve, mean yield curve spread, and percent of all yield curve combinations that are inverted
/r/dataisbeautiful
https://redd.it/z5a7x0
D Paper Explained - CICERO: An AI agent that negotiates, persuades, and cooperates with people (Video)
https://youtu.be/ciNMc0Czmfc
A team from Meta AI has developed Cicero, an agent that can play the game Diplomacy, in which players have to communicate via chat messages to coordinate and plan into the future.
​
OUTLINE:
0:00 - Introduction
9:50 - AI in cooperation games
13:50 - Cicero agent overview
25:00 - A controllable dialogue model
36:50 - Dialogue-conditional strategic planning
49:00 - Message filtering
53:45 - Cicero's play against humans
55:15 - More examples & discussion
​
Homepage: https://ai.facebook.com/research/cicero/
Code: https://github.com/facebookresearch/diplomacy\_cicero
Blog: https://ai.facebook.com/blog/cicero-ai-negotiates-persuades-and-cooperates-with-people/
Paper: https://www.science.org/doi/10.1126/science.ade9097
​
Abstract:
Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce Cicero, the first AI agent to achieve human-level performance in Diplomacy, a strategy game involving both cooperation and competition that emphasizes natural language negotiation and tactical coordination between seven players. Cicero integrates a language model with planning and reinforcement learning algorithms by inferring players' beliefs and intentions from its conversations and generating dialogue in pursuit of its plans. Across 40 games of an anonymous online Diplomacy league, Cicero achieved more than double the average score of the human players and ranked in the top 10% of participants who played more than one game.
​
Authors: Anton Bakhtin, Noam Brown, Emily Dinan, Gabriele Farina, Colin Flaherty, Daniel Fried, Andrew Goff, Jonathan Gray, Hengyuan Hu, Athul Paul Jacob, Mojtaba Komeili, Karthik Konath, Minae Kwon, Adam Lerer, Mike Lewis, Alexander H. Miller, Sasha Mitts, Adithya Renduchintala, Stephen Roller, Dirk Rowe, Weiyan Shi, Joe Spisak, Alexander Wei, David Wu, Hugh Zhang, Markus Zijlstra
/r/MachineLearning
https://redd.it/z4s2kp
Sunrise and sunset times throughout the year, arranged in a circle [OC]
/r/dataisbeautiful
https://redd.it/z55n18
[P] I trained a dog to fetch a stick using Deep Reinforcement Learning
/r/MachineLearning
https://redd.it/z52bsl
The 2022 brain
/r/funnycharts
https://redd.it/yw768h
P OpenELM, a library combining evolutionary algorithms and language models
Hi all,
This is a new library combining large language models with evolutionary algorithms for code synthesis, by CarperAI.
Github: https://github.com/CarperAI/OpenELM
Huggingface model: https://huggingface.co/CarperAI/diff-codegen-350m
Blog post: https://carper.ai/openelm-release/
ELM stands for Evolution Through Large Models, a technique from a recent OpenAI paper demonstrating that large language models can act as intelligent mutation operators in an evolutionary algorithm, enabling diverse and high quality generation of code in domains not seen in the language model’s training set.
The library contains an implementation of MAP-Elites with a language model as the mutatation operator, and the Sodaracer 2D environment as a testbed where you can evolve robots with a language model.
In addition, there is also an an open-source diff model fine-tuned on GitHub diffs from Salesforce’ CodeGen 350M code synthesis model, under an MIT license. This diff model will let you more easily generate intelligent code suggestions in ELM.
/r/MachineLearning
https://redd.it/z4pjnt
UN Convention on the rights of persons with Disabilities
/r/MapPorn
https://redd.it/z5zd9w
Importance of technology knowledge (AWS, GCP, Spark) vs. system design and research for 2nd role Data Scientist
tl;dr: Dear hiring managers, when you're interviewing candidates for a job that requires knowledge of a technology, how important that knowledge is in comparison with research, data-science and system design experience?
​
Hi, sorry for the long title,
I have been working for a startup for about a year, and on a personal level, I am having a great time. It is fully funded, and the bust in the economy didn't hit us hard. The is gone through a pivot, so the old product is funding the new product which I taking part in.
On the professional level, in the past year, I had the chance to work with NLP, tabular, and touch several subjects (feature selection/importance, data drift, time series). Because of the pivot, the architecture of the whole system needs to be defined. How we are working with clients, serving them models, making sure the models' performance doesn't decay over time, for example.
Consequentially, in the past year, I read a lot, both blog posts, and academic papers. In general, the management is open to new ideas, so, being a researcher at heart I try to promote several projects with different departures.
My five-year plan is to move to one of the large companies (not necessarily FAANG, but who knows, I hope it doesn't sound like I am full of myself). I was taking the day to look into job descriptions in such companies, and I saw that many of them require familiarity with technologies like GCP/Hadoop/AWS and others that I am not familiar with.
Now for the real question - I am entirely sure that taking the job at a startup for my first job was a great decision in terms of personal growth. Friends that started working for corporations have dealt with one or two tasks at most in the past year. They didn't take any part in planning, and their research experience is little to nothing. Nevertheless, while I worked mostly in Jupyter notebooks, they worked with giant Git repositories. They know Spark, Hadoop, AWS and probably more.
I feel like I am in a good place, so I am not planning on leaving anytime soon, but given this five-year plan, when should be the ideal time to start looking for a new place?
​
EDIT: added the tl;dr
/r/datascience
https://redd.it/z5kco9
The songs which have reached 1 billion streams on Spotify
/r/Infographics
https://redd.it/z5kk89
[OC] Bot that creates timelapses of a websites history
/r/dataisbeautiful
https://redd.it/z5ogvf
[OC] Crime statistics from the USA and Australia. This is the result of an embarrassing amount of time researching because of a pointless internet argument. These are just found statistics NOT a social commentary.
https://redd.it/z5lq5v
@datascientology
The Biggest Source of Power in Every State and Province [OC]
/r/dataisbeautiful
https://redd.it/z5bjhk
Countries with recorded temperature extremes above 48°C and under -48°C
/r/MapPorn
https://redd.it/z5i5kz
Most important skills to cultivate
I’m finishing a physics/astronomy program in about a year and have a few elective spots open. I’ve heard data science is a good route for math/physics people. What kind of skills are most important to get your foot in the door and which classes would help most with those? Thanks!
/r/datascience
https://redd.it/z4spvt
[OC] The Slow Decline of Key Changes in Popular Music
/r/dataisbeautiful
https://redd.it/z5dty1
LGBT+ Rights in the Middle East.
/r/MapPorn
https://redd.it/z4zh7m
Original Data > Processing > Output [OC]
https://redd.it/z51gpw
@datascientology
My iPod touch decided to display 58% of battery charge as roughly 75% on the lock screen
/r/dataisugly
https://redd.it/z4gex7
Got promoted to manage a small team (less than 4)
So I got promoted and now will manage a small team. We do a mix of BI and basic datascience. Any tips how to organize the work for a small team from your experience? Or any other tips for that matter
Thanks
/r/datascience
https://redd.it/z4q7sg
[OC] Top 10 largest oil fields by 2021 production
/r/dataisbeautiful
https://redd.it/z4ipyx
[OC] - Google searches for "food bank"
/r/dataisbeautiful
https://redd.it/z4ut5j