datascientology | Education

Telegram channel datascientology - Data Scientology

Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf

Data Scientology

Sober days vs High/Drunk days 2021 and 2022 compared

/r/visualization
https://redd.it/100kemu

Data Scientology

[D] Simple Questions Thread

Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead!

Thread will stay alive until next one so keep posting after the date in the title.

Thanks to everyone for answering questions in the previous thread!

/r/MachineLearning
https://redd.it/100mjlp

Data Scientology

30 years since the Velvet Divorce of Czechoslovakia

/r/MapPorn
https://redd.it/100kebq

Data Scientology

Today Joseph and Mary would have to pass through 15 checkpoints to get from Nazareth to Bethlehem. - MAP

/r/MapPorn
https://redd.it/100j9p4

Data Scientology

PyTorch discloses malicious dependency chain compromise
https://www.bleepingcomputer.com/news/security/pytorch-discloses-malicious-dependency-chain-compromise-over-holidays/

/r/datascience
https://redd.it/100o1a3

Data Scientology

How can you become a senior data scientist if there are no jobs for juniors

Hello, I graduate soon with a computer engineering degree, and I'm checking which specializations have the highest salaries (mainly in the UAE, KSA, and MEA). I stumbled upon data science, but the more I read, the more it seems that there are almost no junior-level jobs and a lot of senior-level jobs. How does that make sense? If I can't get a junior-level job, how can I become a senior? Unless I'm mistaken, don't you need years of experience to become a senior? Before anyone tells me: I understand that I shouldn't choose a career based on money alone, but I honestly have zero passion and nothing in the working world appeals to me at the moment, so this is the most objective criterion I can think of. And I meant an MSc in DS.

/r/datascience
https://redd.it/100ponn

Data Scientology

[D] Data cleaning techniques for PDF documents with semantically meaningful parts

I am seeking insights and best practices for data preprocessing and cleaning in PDF documents. I am interested in extracting only the body text content from a PDF and discarding everything else, such as page numbers, footnotes, headers, and footers (see attached image for an example of semantically meaningful sections).

I have noticed that in Microsoft Word, a user can simply drag in a PDF and Word seems to automatically understand which parts are headers, footnotes, etc. I speculate that Word may be using machine learning techniques to analyze the layout and formatting of the PDF and classify the different sections accordingly. Alternatively, Word may be using predefined rules or patterns to identify common elements such as headers and footnotes. I know of related techniques, for example for extracting layout information from receipts and the like (LayoutLM, Xu et al., https://arxiv.org/abs/1912.13318) and for tabular data (TableNet, Paliwal et al., https://ieeexplore.ieee.org/document/8978013), but nothing that solves layout extraction in this particular domain.

I am curious to know whether any techniques or algorithms can replicate this behavior of Word. Any suggestions or recommendations for data cleaning in PDF documents would be greatly appreciated.

Image of PDF with semantically meaningful sections

/r/MachineLearning
https://redd.it/100rbhp

Data Scientology

6 Carbon | 07-04-18 | by Xponentialdesign

/r/mathpics
https://redd.it/xbf2dg

Data Scientology

Hilbert Curve

/r/mathpics
https://redd.it/ynrov6

Data Scientology

Animated version of the bowtiscate from my previous post

/r/mathpics
https://redd.it/yxw6h9

Data Scientology

Clock with only 9s

/r/mathpics
https://redd.it/yz2smk

Data Scientology

A Spherical triangle and a Cardioid on a surface of a unit Sphere
https://youtu.be/bwPWsahg-cs

/r/mathpics
https://redd.it/zbpwg9

Data Scientology

Non-Euclidean Cone (Complex Set)

/r/mathpics
https://redd.it/zhh94j

Data Scientology

Truncated Octahedron 3D model I made from scratch (explanation in a comment)

/r/mathpics
https://redd.it/zuljxi

Data Scientology

Doomslayer VS Topology

/r/mathpics
https://redd.it/zyztx3

Data Scientology

[E] FYI: Statistical Rethinking (2023) by rmcelreath

The first pre-recorded lecture of Statistical Rethinking (2023) by Richard McElreath (MPI EVA) is now available on his YouTube channel (Richard McElreath). Pre-recorded lectures will be uploaded twice a week. There is also a weekly discussion you can join, but it is full at the moment.

The course covers causal inference and Bayesian data analysis. The course materials are on GitHub in the rmcelreath/stat_rethinking_2023 repository (Statistical Rethinking Course for Jan-Mar 2023).

Last year's course trailer: Statistical Rethinking 2022 - Theatrical Trailer (YouTube)

/r/statistics
https://redd.it/1015ekv

Data Scientology

Astrophysics background for data science

Hello all, I have a PhD in astrophysics and 5+ years of postdoc experience at an Ivy League university in the US. I have used Python and statistics extensively in my research, and I have been looking into data science as a career. Can anyone from a similar background share their experience of this transition? I don't have any industry experience, but can I use my academic experience to apply for mid-level data scientist positions? Any advice would be great.

/r/datascience
https://redd.it/10134pi

Data Scientology

[OC] Animated GIF showing People with a Wikipedia page by Continent over time 1500 - 1990 (per decade)

https://redd.it/100jltw
@datascientology

Data Scientology

Let's share some resources to improve our soft skills

The beginning of the new year is a great time to learn something new (or to deepen what you already know). Soft skills are extremely important to our careers, so I suggest sharing some resources (books, courses, blog posts, etc.) to improve them.

Some links from me:

* https://www.howtodeal.dev/ - any large project has a lot of people in different positions working on it; this website gives a broad categorization of them and suggestions on how to deal with each
* https://www.eviltester.com/post/online-presentation-speaking-tips/ - suggestions on how to give great online presentations (especially relevant in a post-COVID world)
* https://eleganthack.com/presenting-is-performance/ - general advice on giving awesome presentations

/r/datascience
https://redd.it/100eea9

Data Scientology

My frustrating year of applying to internships (you won't believe what happens at the end!)

/r/datascience
https://redd.it/100adix

Data Scientology

[N] Compromised PyTorch-nightly dependency
https://pytorch.org/blog/compromised-nightly-dependency/

/r/MachineLearning
https://redd.it/100amit

Data Scientology

A slightly modified version of my primality checking algorithm plotted over polar coordinates.

/r/mathpics
https://redd.it/xaij03

Data Scientology

Element N14 ( by ojovivoMotion )

/r/mathpics
https://redd.it/xm1415

Data Scientology

Base 3, diagonal elementary CA, made in MS Excel and colorized with Photoshop. I wanna get into making prints. (LIC)

/r/mathpics
https://redd.it/z36z3w

Data Scientology

Hilbert Curve, part II

/r/mathpics
https://redd.it/yz9kzy

Data Scientology

Help me.

/r/mathpics
https://redd.it/yzuljn

Data Scientology

Simulations of Effect of Detonation Upon Plate

/r/mathpics
https://redd.it/zq3ig8

Data Scientology

XVII Circulo circum consumitur | 03-10-19 | by Xponentialdesign

/r/mathpics
https://redd.it/zu37xr

Data Scientology

Please suggest some cool mathematical models that I can 3D print.

I will be getting a 3D printer in the near future. What are some cool and crazy mathematical models that I could print? A link to an .stl file would be great, but I am also looking forward to creating my own models, so links to formulas and pictures are also appreciated.

I am going to start with Platonic solids (nested wireframes, maybe), fractal solids (Sierpiński pyramid, Mandelbulb), a Gömböc, a Klein bottle, a slide rule, etc. But I am looking for more ideas.

Editing to add more ideas: solids of constant width, sphericons

/r/mathpics
https://redd.it/zw64c4

Data Scientology

[OC] Most Popular Movie Genre Combinations up to 2023

/r/dataisbeautiful
https://redd.it/100il3k
