Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf
Personal project for PhDs and scientists P
Hello!
I've developed a project NaimAI, to help PhDs and scientists in their scientific literaure review. To describe it brievely, it has 3 main features : 1 search in papers, 2 structures abstracts into objectives, methods and results and 3 generates automatically a (pseudo) literature review.
I wrote a yaassinekaddi/literature-review-with-naimai-open-sourced-fcbdb36762de">medium article that goes through the details.
Github repos : https://github.com/yassinekdi/naimai
I've created a subreddit in case : r/naimai4science
I'd be happy to have your opinion about it and hopefully this could be useful!
/r/MachineLearning
https://redd.it/zg3bsd
Length of bars means nothing. lol, nice, reddit.
/r/dataisugly
https://redd.it/zg6da4
D We're the Meta AI research team behind CICERO, the first AI agent to achieve human-level performance in the game Diplomacy. We’ll be answering your questions on December 8th starting at 10am PT. Ask us anything!
EDIT 11:58am PT: Thanks for all the great questions, we stayed an almost an hour longer than originally planned to try to get through as many as possible — but we’re signing off now! We had a great time and thanks for all thoughtful questions!
PROOF: https://i.redd.it/8skvttie6j4a1.png
We’re part of the research team behind CICERO, Meta AI’s latest research in cooperative AI. CICERO is the first AI agent to achieve human-level performance in the game Diplomacy. Diplomacy is a complex strategy game involving both cooperation and competition that emphasizes natural language negotiation between seven players. Over the course of 40 two-hour games with 82 human players, CICERO achieved more than double the average score of other players, ranked in the top 10% of players who played more than one game, and placed 2nd out of 19 participants who played at least 5 games. Here are some highlights from our recent announcement:
NLP x RL/Planning: CICERO combines techniques in NLP and RL/planning, by coupling a controllable dialogue module with a strategic reasoning engine.
Controlling dialogue via plans: In addition to being grounded in the game state and dialogue history, CICERO’s dialogue model was trained to be controllable via a set of intents or plans in the game. This allows CICERO to use language intentionally and to move beyond imitation learning by conditioning on plans selected by the strategic reasoning engine.
Selecting plans: CICERO uses a strategic reasoning module to make plans (and select intents) in the game. This module runs a planning algorithm which takes into account the game state, the dialogue, and the strength/likelihood of various actions. Plans are recomputed every time CICERO sends/receives a message.
Filtering messages: We built an ensemble of classifiers to detect low quality messages, like messages contradicting the game state/dialogue history or messages which have low strategic value. We used this ensemble to aggressively filter CICERO’s messages.
Human-like play: Over the course of 72 hours of play – which involved sending 5,277 messages – CICERO was not detected as an AI agent.
You can check out some of our materials and open-sourced artifacts here:
Research paper
[Project overview](https://ai.facebook.com/research/cicero/)
Diplomacy gameplay page
[Github repo](https://github.com/facebookresearch/diplomacy_cicero)
Our latest blog post
Joining us today for the AMA are:
Andrew Goff (AG), 3x Diplomacy World Champion
Alexander Miller (AM), Research Engineering Manager
Noam Brown (NB), Research Scientist [(u/NoamBrown)](https://www.reddit.com/user/NoamBrown/)
Mike Lewis (ML), Research Scientist (u/mikelewis0)
David Wu (DW), Research Engineer [(u/icosaplex)](https://www.reddit.com/user/icosaplex/)
Emily Dinan (ED), Research Engineer
Anton Bakhtin (AB), Research Engineer
Adam Lerer (AL), Research Engineer
Jonathan Gray (JG), Research Engineer
Colin Flaherty (CF), Research Engineer (u/c-flaherty)
We’ll be here on December 8, 2022 @ 10:00AM PT - 11:00AM PT.
/r/MachineLearning
https://redd.it/zfeh67
[OC] How media divides us: MSNBC vs Fox News - What stories or topics are they pushing over the last week (Dec 1st to Dec 8th)? How do they compare to Reuters?
https://redd.it/zg1ezn
@datascientology
Going beyond 100%
/r/dataisugly
https://redd.it/zfql16
Judea Pearl, a pioneering figure in artificial intelligence, long argued that AI has been stuck in a decades-long rut because of our struggles digitising causal reasoning. That's why the outcome of this basic test is sending chills down my spine.
/r/datascience
https://redd.it/zfrynz
Using Open Data to find NIMBY counties in the USA
Hi all,
This is Tim Sehn CEO of DoltHub, famous on this sub for our data bounties and maybe more importantly, the novel open data they produce.
For our US Housing prices bounty we collected 112GB of schema-ed housing sale data from around the US.
We had a bounty contributor use the data to analyze where the NIMBY and YIMBY counties were in the US using a novel method.
https://www.dolthub.com/blog/2022-12-08-nimby-yimby/
I thought this sub would be interested in the open dataset and use the analysis as an inspiration on how to use it.
/r/datasets
https://redd.it/zfe9lo
Obesity in North America (2021)
/r/MapPorn
https://redd.it/zfa5qh
Swans: The ultimate gift from your true love [OC]
/r/dataisbeautiful
https://redd.it/zfcoks
[OC] Where each US House member (of next Congress) received their undergraduate degree
/r/dataisbeautiful
https://redd.it/zfhpja
How many graph design rules can be broken at once?
/r/dataisugly
https://redd.it/zf4goy
MapPorn Discussion Thread for December, 2022
This thread is for general MapPorn discussion. Exchange ideas, ask for maps, talk about cartography, etc. Have a thought that doesn't fit in another thread, post it here.
/r/MapPorn
https://redd.it/z9lo41
A Comprehensive FIFA World Cup 2022 dataset with detailed player and team statistics.
https://www.kaggle.com/datasets/swaptr/fifa-world-cup-2022-statistics
/r/datasets
https://redd.it/zdn0m4
A map from LIFE magazine, February 10, 1916. It shows readers the possible consequences of the US refusal to help the Entente countries in the war against Germany.
/r/MapPorn
https://redd.it/zf5fy2
[OC] I spent the past two years pouring my soul into a website that allows you to visualize virtually every U.S. company's international supply chain. E.x. What products, how much, which factories and where does GameStop import from? (Just type a company in the search box)
https://www.importyeti.com/company/gamestop
/r/dataisbeautiful
https://redd.it/zf3gva
Topic Sentence - Infographics
/r/Infographics
https://redd.it/zfvx7i
The State of World Press Freedom
/r/MapPorn
https://redd.it/zfstet
Countries with English-speaking Leaders, Europe (Dec. 2022)
/r/MapPorn
https://redd.it/zg21t5
The Average Age and Income of Home Buyers and Home Sellers in the 50 Biggest Metro Areas
/r/Infographics
https://redd.it/zg1z64
3D Graph of Riemann sums
/r/mathpics
https://redd.it/zf20gl
[OC] The most popular tool brands on Reddit 2022 (r/Tools)
/r/dataisbeautiful
https://redd.it/zf8j7x
Sex composition of baby names in the USA and England/Wales: 2021 [OC]
/r/dataisbeautiful
https://redd.it/zff7ih
ELI5 stationarity q
The more I read about stationarity, the more confused I get. I'd be grateful for a plain English explanation of what it actually is.
/r/statistics
https://redd.it/zer98e
CEO pay has skyrocketed 1,460% since 1978: CEOs were paid 399 times as much as a typical worker in 2021
https://www.epi.org/publication/ceo-pay-in-2021/?utm_source=sillychillly
/r/dataisbeautiful
https://redd.it/zf594b
E Is a bachelors degree in applied statistics sustainable for a career in data analysis?
I'm currently enrolled for a BS with a Major in Applied Statistics at my university, but as I am nearing the end of my undergraduate studies, I have started to research different masters and PhD programs within statistics.
In all transparency, I know my GPA so far isn't fantastic, and although I am on track to get my bachelors degree, I am debating whether or not getting my masters or PhD in this field is worth it, given the requirements, skills, time, and money needed in order to accomplish this.
My primary interest within applied statistics is data analysis, and I think I can get by just fine with a bachelors degree, but would anyone recommend furthering my education in the field? What would be the advantages and disadvantages of doing so?
Thank you so much for the help!
/r/statistics
https://redd.it/zevlny
Which Countries Have the Highest Inflation?
/r/visualization
https://redd.it/ze09cg
How is the first bar 35 but not able to reach 35 on the axis?
/r/dataisugly
https://redd.it/zedo2d