datascientology | Образование

Telegram-канал datascientology - Data Scientology

1234

Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf

Подписаться на канал

Data Scientology

[R]eading List for Andrej Karpathy’s “Busy person’s intro to Large Language Models” Video

I loved Andrej’s talk about in his “Busy person’s intro to Large Language Models” video, so I decided to create a reading list to dive in deeper to a lot of the topics. I feel like he did a great job of describing the state of the art for anyone from an ML Researcher to any engineer who is interested in learning more.

The full talk can be found here:
https://youtu.be/zjkBMFhNj_g?si=fPvPyOVmV-FCTFEx

Here’s the reading list:
https://blog.oxen.ai/reading-list-for-andrej-karpathys-intro-to-large-language-models-video/

Let me know if you have any other papers you would add!

/r/MachineLearning
https://redd.it/184qeuo

Читать полностью…

Data Scientology

Who wants to take on Duolingo?

Hello /r/LanguageTechnology \-

I've recently built my prototype for an NLP based language learning application which was received very positively. I've decided to go full time on it and make it a reality, and am looking for another technical co-founder to join me on this journey as CTO.

I'm a technical founder, but as I'm currently doing all the development, design, admin, sales, support, funding, etc, I'm looking for someone to just focus on the tech. The tech includes:

\- python backend

\- clojurescript frontend

\- NLP (currently via stanza, although partial support for spacy as well)

\- lots of LLM integrations

\- lots of TTS integrations

\- lots of vector embeddings

​

If any polyglots out there are passionate about NLP/LLMs/AI, genuinely believe they can change the world, dream in docker files and model weights, and just want to work on really cool tech, please send me a DM 🙏 let's change the world together.


Site can be found at https://phrasing.app. The app is currently in private beta, if you're interested in the application but not the job, fill out the beta application and send me a DM and I'll prioritize your access 😁

PS: I messaged mods first for permission, but heard no response. I hope this post is well received.

​

/r/LanguageTechnology
https://redd.it/180h4s9

Читать полностью…

Data Scientology

801. @r_softwaregore
802. @ShitLiberalsSay

Читать полностью…

Data Scientology

601. @r_shittyramen
602. @r_scrubs
603. @r_denmark
604. @r_piano
605. @r_dndmemes
606. @demonslayer_newz
607. @r_gaming
608. @r_ramiayana
609. @r_Computers
610. @r_fightporn
611. @r_sciencegeeks
612. @r_pinetime
613. @r_Literaturememes
614. @Asus_Tuf
615. @dailydankmemes
616. @r_aviation
617. @nature_eco
618. @r_creepyasterisks
619. @r_holdmybeer
620. @r_FemaleCelebrityBiceps
621. @r_lua
622. @Octoberstrike
623. @r_chels
624. @r_pornhubcomments
625. @r_Podcasts
626. @eristocracia
627. @kingKillerChronicle
628. @minecraft_en
629. @r_space
630. @ChannelZeroNetwork
631. @rSurrealMemes
632. @r_engrish
633. @r_comics
634. @r_pics_redux
635. @memes_Evangelion
636. @r_sweatypalms
637. @r_MiraculousLadybug
638. @r_funny
639. @r_ece
640. @OldSchoolRuneScape2007
641. @r_izlam
642. @r_djs
643. @r_war
644. @r_Arabfunny
645. @r_quotesporn
646. @asexualityonreddit
647. @r_coding
648. @r_metalmemes
649. @r_thelastairbender
650. @hub_posts
651. @r_btd6
652. @r_emacs
653. @FakeHistoryP0RN
654. @r_remotejs
655. @r_wholesomememes
656. @r_udemyfreebies
657. @r_HentaiMemes
658. @anime_wallpaper_HQ
659. @r_emulation
660. @r_preppers
661. @rCarsIndia
662. @r_edc
663. @CosplayReddit
664. @northkoreanews
665. @r_reallifedoodles
666. @r_Otonokizaka
667. @MinecraftModded
668. @anime_gifs_hub
669. @r_0sanitymemes
670. @CrossfitGirlsbackup
671. @r_boxoffice
672. @r_behindthegifs
673. @r_FantasyPL
674. @r_climbing
675. @r_iww
676. @r_television
677. @Indiancelebs
678. @r_bodybuilding
679. @programmer_humor
680. @r_adhd
681. @r_RimWorld
682. @r_xxxtentacion
683. @r_pcmasterrace
684. @r_lifeprotips
685. @r_Ultrakill
686. @r_battlestations
687. @r_adhdmeme
688. @r_thehatedone
689. @subgeniuschurch
690. @giveaway_gift
691. @r_lal_salaam
692. @r_SelfHosted
693. @FamilyGuyMemes
694. @reddit_whatcouldgowrong
695. @r_linuxmemes_1
696. @r_PhoenixSC
697. @r_shitposting0
698. @r_Animemes
699. @wallpapers_desktop_mobile
700. @redditart

Читать полностью…

Data Scientology

401. @r_ItemShop
402. @ichimechtenleben
403. @r_Tottenham
404. @durrmemes
405. @r_52book
406. @r_memetemplatesofficial
407. @r_historicalmemes
408. @r_confidentlyincorrect
409. @r_Jeles
410. @r_yakuzagames
411. @r_dark_humor
412. @r_moescape
413. @awwnime
414. @r_libertarian
415. @TerribleFacebookMemes
416. @r_silenthill
417. @r_hamsters
418. @r_PokemonRMXP
419. @Reddit_NBA
420. @r_devops
421. @r_StardewValley
422. @frontlinegirls
423. @r_stray
424. @artificialintelligence24x7
425. @r_systemadmin
426. @r_xboxone
427. @r_thedivision
428. @r_okbuddybaka
429. @animewaifuss
430. @r_AmongUs
431. @rkolc
432. @r_communism
433. @CallOfDutyWarzone_reddit
434. @r_linuxmemes
435. @r_nottheonion
436. @r_beamazed
437. @r_onejob
438. @animemaids_hot
439. @r_listentothis
440. @miku_nakano_fandom
441. @premierleague_r
442. @reddit_argentina
443. @r_cpp
444. @SailingX
445. @r_copypasta
446. @mangareddit
447. @r_DaniDev
448. @DragonBallShitposts
449. @iamatotalpieceofshit
450. @r_thesilphroad
451. @r_ps2
452. @datascientology
453. @fullegoism
454. @odd_takes
455. @passdenied
456. @r_Ratorix
457. @r_interestingasfuck
458. @r_rallyporn
459. @r_jokes
460. @r_jailbreak
461. @r_movies2
462. @r_Morocco
463. @r_books
464. @r_technoblade
465. @r_Blursedimages
466. @r_imaginary_network
467. @r_fpv
468. @r_GranTurismo
469. @chessmemesenglish
470. @r_Avicii
471. @rddit
472. @VaporwaveAesthetics
473. @r_overwatch
474. @rtf2memes
475. @r_battlecats
476. @Chainsaw_Man_Topic
477. @r_usenet
478. @r_egg_irl
479. @r_kcv
480. @dailyfoodporn
481. @r_sdarksouls
482. @r_googleplaydeals
483. @r_notinteresting
484. @r_dota2
485. @okbuddyretardd
486. @r_cricket
487. @attack_on_titan_topic
488. @PraiseTheCameraMan
489. @memanon
490. @r_rimesegate
491. @r_travis_scott
492. @r_kemonomimi
493. @whitepeopletweets
494. @r_latestagecapitalism
495. @rExmuslim
496. @r_okbuddyfresca
497. @RedditHistory
498. @r_apphookup
499. @ThereWasAnAttempt
500. @rdataisbeautiful

Читать полностью…

Data Scientology

201. @reddit_brasil
202. @reddit_pride
203. @r_nootropics
204. @mikuichika_nakano
205. @r_ShitpostTC
206. @r_arma
207. @r_youshouldknow
208. @r_psychology1
209. @r_HistoryAnimemes
210. @r_gharkekalesh
211. @r_diy
212. @rStableDiffusion
213. @BrandNewSentence
214. @r_tensei
215. @stablediffusion_r
216. @r_TrashTaste
217. @r_technicallythetruth
218. @r_furrypasta
219. @holdmycosmo
220. @r_embedded
221. @worldnews_reddit
222. @fakealbumcovers
223. @r_tylerthecreator
224. @r_mlp
225. @r_youtubehaiku
226. @r_crappydesign
227. @r_coys
228. @wutttttttt
229. @rAnarchism
230. @trans_memes
231. @fate_hot
232. @r_wheredidthesodago
233. @GnarMains
234. @Mapporncirclejerk
235. @redditpiracy
236. @r_fantheories
237. @CricketShitpost
238. @r_TechSupportGore
239. @r_ToolBand
240. @r_dotnet
241. @r_privacy
242. @r_Tipovi
243. @r_redpillmalayalam
244. @r_digimon
245. @r_technology
246. @r_Bloodborne
247. @r_stonks
248. @imaginarylands
249. @r_outrun
250. @r_opm
251. @r_SlimeRancher
252. @r_Ferrets
253. @r_SuperSentai
254. @reddit_animalsbeingderps
255. @r_trackers
256. @r_mwiii
257. @rdogelore
258. @AllTwitter
259. @r_breadtube
260. @r_badcode
261. @r_suicidewatch
262. @r_animearmpits
263. @Windows11Group
264. @chainsawfolk
265. @r_oneshot
266. @r_wallpapers
267. @r_Windows_Redesign
268. @r_tupac
269. @r_IKEAhacks
270. @r_vinyl
271. @r_trashpandas
272. @footballmanagergames
273. @r_simpsonshitpost
274. @r_chodi
275. @CanalLixo
276. @r_araragi
277. @r_MinecraftMemes
278. @r_programmerhumor
279. @jenkinsci
280. @reddit_androiddev
281. @dailygratitudee
282. @r_chemistry
283. @r_abandoned
284. @animals_telegram
285. @loliconsunite
286. @wallstreetnewsitalia
287. @minimalwallz
288. @r_workreform
289. @R_Punny
290. @r_zargoryangalaksisi
291. @r_imaginarylandscapes
292. @r_indiangaming
293. @MoviePosterTG
294. @r_Julia
295. @r_rainbow6
296. @r_radiocontrol
297. @r_imgoingtohellforthis
298. @reddit_gif
299. @lyricalquotes
300. @r_PoliticalMemes

Читать полностью…

Data Scientology

Other @reddit2telegram channels powered by @r_channels:
01. @r_greenandpleasant
02. @r_photoshopbattles
03. @r_apple
04. @reddit_OSHA
05. @r_moviequotes
06. @PrivacyGuides
07. @reddit_lego
08. @r_nijisanji
09. @facepalmers
10. @rhyderabad
11. @sffpc
12. @reddit_all
13. @r_combatfootage
14. @r_nosafetysmokingfirst
15. @manpill
16. @r_valorant
17. @rshittymoviedetails
18. @r_mapporn
19. @food_from_reddit
20. @r_changemyview
21. @r_smugs
22. @r_freegamefindings
23. @r_Davie504
24. @r_neovim
25. @aesthetic_waifu_wallpapers
26. @r_adporn
27. @r_fantasy
28. @r_indianmemes
29. @Awwducational
30. @RealRacing3TG
31. @r_videomemes
32. @DongistanSub
33. @r_BigAnimeTiddies
34. @R_MildlyPenis
35. @r_animegifs
36. @bestoftweets
37. @lostbackup
38. @r_League_Of_Memes
39. @r_twinpeaks
40. @r_vexillology
41. @r_educationalgifs
42. @r_magiarecord
43. @r_bakchodi
44. @r_movieclub
45. @GameplayMation
46. @quotesporn
47. @r_apexlegends
48. @r_me_irl
49. @r_formuladank
50. @soccerx
51. @harrypotterbackup
52. @rselfie
53. @Boku_No_Hero_Academia_Topic
54. @Emulationx
55. @r_devilmaycry
56. @moonshotcryptos
57. @r_Sino
58. @AlternateReality
59. @BeautifulFemalesbackup
60. @r_wholesome
61. @rickandmorty_en
62. @r_turkeyjerky
63. @r_ComedyCemetery
64. @MemeArea
65. @Fgrandorder
66. @SkinnyWithAbsbackup
67. @MarbleRacing
68. @Unexpected_Reddit
69. @denpasong
70. @r_algotrading
71. @r_animememe
72. @r_leftistvexillology
73. @r_gtaonline
74. @r_wow
75. @aapexlegends_game
76. @r_cutelittlefangs
77. @r_indiaa
78. @r_thinkpad
79. @r_churchofemma
80. @news_reddit
81. @worldnewsvideo
82. @r_InternetIsBeautiful
83. @r_ilMasseo
84. @r_LeagueOfLegends
85. @hololive_yuri
86. @vtuber_en
87. @r_WatchPeopleDieInside
88. @r_bangalore
89. @InstaReality
90. @r_witcher3
91. @r_indiandankmemes
92. @GGPoE
93. @arkotonog
94. @r_jacksepticeye
95. @r_houkai3rd
96. @r_science
97. @r_kochin
98. @r_crackwatch
99. @r_Gintama
100. @JojosBizarreShitposts

Читать полностью…

Data Scientology

Have a live conversation about a basketball game with GPT4V, Whisper, TTS

/r/computervision
https://redd.it/17ywiwp

Читать полностью…

Data Scientology

Start with Large Language Models (LLMs) in 2023

This is a complete guide to start and improve your LLM skills in 2023 without an advanced background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

The complete article: https://www.louisbouchard.ai/from-zero-to-hero-with-llms/

All the links on GitHub: https://github.com/louisfb01/start-llms

Artificial is a fantastic field, and so are language models like GPT-4, Claude..., but it goes extremely fast. Don't miss out on the most important and exciting news by joining great communities, people, newsletters, and more you can all find in this guide!

This guide is intended for anyone with a small background in programming and machine learning. Simple python knowledge is enough to get you started. There is no specific order to follow, but a classic path would be from top to bottom. If you don't like reading books, skip it, if you don't want to follow an online course, you can skip it as well. There is not a single way to become a "LLM expert" and with motivation, you can absolutely achieve it.

/r/deeplearning
https://redd.it/17qo9lt

Читать полностью…

Data Scientology

how well do NLP masters prepare you for industry jobs?

Hey everyone!

I am currently a second-year student enrolled in a linguistics program but I am taking some NLP courses (python programming, machine learning, neural networks, machine translation, databases, maybe statistics). I have a great interest in phonetics & speech technology and am therefore looking into Edinburgh's speech and language processing master as well as other one-year more general NLP masters. However, with all of this I am just unsure if a one-year masters, especially a highly specialised one such as edinburgh's, will sufficiently prepare me for an industry job right after graduation? Should I be looking more into two-years programs such as the ones in Germany?

I would greatly appreciate any input!

/r/LanguageTechnology
https://redd.it/17nm5i9

Читать полностью…

Data Scientology

Part 1: Building Vision Transformer from Scratch: A PyTorch Deep Dive

I've just published the first installment of my Vision Transformer series article. find the full functional example of colab link given in the article
Part 1: Building Vision Transformer from Scratch: A PyTorch Deep Dive Plus a Teaser on LORA for Part 2

pashashaik/part-1-building-vision-transformers-from-scratch-a-pytorch-deep-dive-plus-a-teaser-on-lora-for-beef0f3aef5c">pashashaik/part-1-building-vision-transformers-from-scratch-a-pytorch-deep-dive-plus-a-teaser-on-lora-for-beef0f3aef5c" rel="nofollow">https://medium.com/@pashashaik/part-1-building-vision-transformers-from-scratch-a-pytorch-deep-dive-plus-a-teaser-on-lora-for-beef0f3aef5c

/r/deeplearning
https://redd.it/17m6cix

Читать полностью…

Data Scientology

Give a few prompts to A.I. and it can now generate videos such as this...
https://vimeo.com/877454859

/r/LanguageTechnology
https://redd.it/17g7lfu

Читать полностью…

Data Scientology

Is it useful to take a statistics class for computer vision?

Apologies for inundating the subreddit with questions about courses lol. Am wondering if it is useful somehow though, or only marginally (since stats helps with ML, which helps with computer vision).

Also, is it more useful to know in context of a research/academia setting, or in a industrial setting?

/r/computervision
https://redd.it/17cp5d5

Читать полностью…

Data Scientology

Is it better to take a class on 3D understanding, or a course on the physics of visual appearance?

The former course talks about: Explicit, Implicit, and Neural 3D Representations, Differentiable Rendering, Single-view 3D Prediction: Objects, Scenes, and Humans, Neural Rendering, Multi-view 3D Inference: Radiance Fields, Multi-plane Images, Implicit Surfaces, etc., Generative 3D Models, Shape Abstraction, Mesh and Point cloud processing.

The second course talks about more physics and optics stuff, like principles pf photometry, light fields, reflection, refraction, polarization, caustics, lighting and shadows, BRDFs, vision in bad weather, and applications in aerial, underwater, medical, and microscopic imaging.

At this point, I think I'm interested in biomedical applications of computer vision, so I am leaning towards the second course. However, I don't know enough about the job market out there for computer vision -- it would seem to me that self-driving cars and all would prefer the former course. Furthermore, I wonder if that course is better for expanding my understanding, or the second one. I know that I should pick based on what I want to do, not what's more popular necessarily -- but I also don't quite have that figured out either. I just feel that I don't find generative AI my type of thing (despite it being a big deal lol).

I also have a background in classical signal processing, electromagnetism, etc. being an EE so I thought maybe the 2nd course would complement my background more.

Any advice is greatly appreciated!

/r/computervision
https://redd.it/17bsmnh

Читать полностью…

Data Scientology

Context aware chunking with LLM


I'm working on an embedding and recalll project.

My database is made mainly on a small amount of selected textbooks. With my current chunking strategy, however, the recall does not perform very well since lots of info are lost during the chunking process. I've tried everything... Even with a huge percentage of overlap and using the text separators, lots of info are missing. Also, I tried with lots of methods to generate the text that I use as query: the original question, rephrased (by llm) question or a generic answer generated by LLM. I also tried some kind of keyword or "key phrases ", but as I can see the problem is in the chunking process, not in the query generations.

I then tried to use openai api to chunk the file: the results are amazing... Ok, i had to do a lots of "prompt refinement", but the result is worth it. I mainly used Gpt-3.5-turbo-16k
(obviously gpt4 is best, but damn is expensive with long context. Also text-davinci-003 and it's edit version outperform gpt3.5, but they have only 4k context and are more expensive than 3.5 turbo)

Also, I used the llm to add a series of info and keywords to the Metadata.
Anyway, as a student, that is not economically sustainable for me.

I've seen that llama models are quite able to do that task if used with really low temp and top P, but 7 (and I think even 13B) are not enough to have a an acceptable reliability on the output.

Anyway, I can't run more than a 7B q4 on my hardware.
I've made some research and I've found that replicate could be a good resources, but it doesn't have any model that have more than 4k of context length. The price to push a custom model is too much for me.

Someone have some advice for me? There is some project that is doing something similar? Also, there is some fine tuned llama that is tuned as "edit" model and not "complete" or chat?

Thanks in advance for any kind of answers.

/r/LanguageTechnology
https://redd.it/171r2c1

Читать полностью…

Data Scientology

I built an open source motion capture system that costs $20 and runs at 150fps! Details in comments

/r/computervision
https://redd.it/17xi20j

Читать полностью…

Data Scientology

🙋
Q: How can I help?
A: Support us on Patreon and promote your favorite channels!

Q: How to make similar channels?
A: Ask at @r_channels or use manual at https://github.com/Fillll/reddit2telegram.

Q: Where to donate?
A: Patreon: https://www.patreon.com/reddit2telegram. Other ways: https://bit.ly/r2t_donate.

Читать полностью…

Data Scientology

701. @r_KamenRider
702. @r_megane
703. @r_ik_ihe
704. @r_marvelunlimited
705. @asiangirlsbeingcute
706. @r_KaIT
707. @r_php
708. @r_frankocean
709. @JEENEETardsReddit2
710. @tyingherhairup
711. @r_SuperModelIndia
712. @r_tessafowler
713. @r_rust
714. @proceduralgeneration
715. @r_modelmakers
716. @r_BlackMagicFuckery
717. @COMPLETE_ANARCHY
718. @r_FallGuysGame
719. @r_PokemonMasters
720. @musictheorymemes
721. @r_progresspics
722. @reddit_wtf
723. @r_CryptoMoonShot
724. @r_islam_channel
725. @r_riscv
726. @rantitrampo
727. @macsetupsbackup
728. @WhatsWrongWithYourDog
729. @Tumblrcontent
730. @r_crappyoffbrands
731. @r_desktops
732. @r_mildlyinfuriating
733. @r_yesyesyesyesno
734. @r_PoliticalCompassMemes
735. @Genshin_Impact_reddit
736. @Mootivati0n
737. @r_memesITA
738. @r_cryptocurrency
739. @r_furry
740. @r_sbubby
741. @grndordr
742. @r_deepintoutube
743. @r_suggest
744. @rdrawing
745. @r_CozyPlaces
746. @r_lostgeneration
747. @r_Windows
748. @r_starfield
749. @uminekoreddit
750. @r_okaybuddyhololive
751. @r_Euroleague
752. @r_00ag9603
753. @M2_D4
754. @r_burdurland
755. @NewGreentexts
756. @r_dalle2
757. @r_gifs
758. @IngressReddit
759. @r_iNoobChannel
760. @r_ps3
761. @r_arknights
762. @OneTrueMegumin
763. @r_masterhacker
764. @AAAAAGGHHHH
765. @r_formula1
766. @tnomod
767. @ranalog
768. @r_til
769. @r_CursedComments
770. @instantkarma_XO
771. @r_onepiecer
772. @r_catgifs
773. @r_ClashOfClans
774. @pythondaily
775. @r_FreeGamesOnSteam
776. @IngressPrimeFeedback
777. @r_eldenring
778. @NatureIsLit
779. @r_SwitchHacks
780. @r_atheism
781. @anime_titties
782. @r_Pony_irl
783. @r_truefilm
784. @sub_eminem
785. @r_dankinindia
786. @r_persona5
787. @spider_man_memes
788. @r_unexpectedhamilton
789. @cryptoinstantnews2
790. @r_mildlyvagina
791. @r_DataHoarder
792. @r_Showerthoughts
793. @r_JapanPics
794. @r_Chargers
795. @intensememes
796. @r_Padres
797. @r_WikiLeaks
798. @r_malaysia
799. @r_MWII
800. @news756

Читать полностью…

Data Scientology

501. @r_moviescirclejerk
502. @r_dgb
503. @rnosleep
504. @reddit_facepalm
505. @r_climbingcirclejerk
506. @TheyDidTheMath
507. @r_scala
508. @anime_streetwear
509. @r_reddevils
510. @r_documentaries
511. @soccermemer
512. @r_comedyheaven
513. @r_goodanimemes
514. @r_designporn
515. @indepthstories
516. @r_ODSP
517. @r_bolehland
518. @r_texans
519. @r_bugbounty
520. @Cinema4Dbackup
521. @r_turkey
522. @r_TikTok_Tits
523. @r_SatisfactoryGame
524. @r_3dprinting
525. @r_dankmemes
526. @InstaIndia
527. @r_demisexuality
528. @redmeme
529. @AnimeHindiMemes
530. @prolifetipss
531. @r_africa
532. @notme_irl
533. @r_SISwimsuitGirls
534. @r_HighQualityGifs
535. @r_MuscularCelebrities
536. @r_animalcrossing
537. @r_MashuKyrielight
538. @r_Maybe
539. @bollybng
540. @r_MCFC
541. @privacymemes1
542. @r_metalgearsolid
543. @r_ShitpostXIV
544. @r_gamingmemes
545. @DankNaruto
546. @admeme
547. @reddit_android
548. @r_woooosh
549. @redditvideos
550. @GIFFFs
551. @r_PuppyLinux
552. @r_ihadastroke
553. @blue_archive_reddit
554. @r_disneyvacation
555. @r_marvelstudios
556. @r_heraldry
557. @r_progmetal
558. @ya_metro
559. @r_Celebs
560. @r_Librandu
561. @r_kgbtr
562. @r_bitcoin
563. @r_mechanicalkeyboards
564. @r_osugame
565. @r_weirdcore
566. @r_theexpanse
567. @r_LiverpoolFC
568. @r_2meirl4meirl
569. @r_evilbuildings
570. @r_scp
571. @r_frugalmalefashion
572. @r_nofap
573. @MSILaptops
574. @r_ramen
575. @FitAndNaturalbackup
576. @r_gentoo
577. @ShrineOfNino
578. @reddit_cartoons
579. @grimdank
580. @r_zig
581. @r_androidapps
582. @ani_bm
583. @r_opensignups
584. @BikiniMoe
585. @r_edgerunners
586. @r_homeassistant
587. @reddit_fashion
588. @r_corgi
589. @r_publicfreakout
590. @r_tamamo
591. @DetroitBecomeHumanbackup
592. @r_porn
593. @r_channels_tifu
594. @r_antimeme
595. @trueoffmychest
596. @r_foxes
597. @r_PlipPlip
598. @r_grandorder
599. @r_punee
600. @r_wasletztepreis

Читать полностью…

Data Scientology

301. @r_PSX
302. @pc_gaming_memes
303. @r_animeirl
304. @cutie_kittycats
305. @r_StarWarsMemes
306. @r_vexillologycirclejerk
307. @r_vault_hunters
308. @dankscpmemes
309. @medieval_memes
310. @slavelabour
311. @rAnimewallpaper
312. @OldSchoolCool
313. @IndiaSocialSubreddit
314. @okbuddyretard
315. @anime_hot_wallpapers
316. @soccer_reddit
317. @streetmoe
318. @r_kratom
319. @r_MaxEstLa
320. @stardewvalley_en
321. @r_ChinaDress
322. @r_frogs
323. @r_vim
324. @comedynecrophilia
325. @r_redfall
326. @WandaVision_reddit
327. @dash_cams
328. @rcarporn
329. @r_perfecttiming
330. @r_proseporn
331. @r_cyberpunk2077
332. @rDDLC
333. @r_tfirl
334. @r_hiphopheads
335. @TheVampireDiariesbackup
336. @r_sandman
337. @qt_reddit
338. @r_linux
339. @r_PewdiepieSubmissions
340. @mash_kyrie
341. @r_therewasanattempt
342. @r_TIHI
343. @rnerds
344. @just_hmmm
345. @r_technope
346. @dogecoin_reddit
347. @r_polandball
348. @WatchPeopleVim
349. @CringyTiktok
350. @antimlms
351. @r_shitposters_paradise
352. @onepunchmansubreddit
353. @r_getmotivated
354. @r_shitpostcrusaders
355. @eye_bleach
356. @r_fatestaynight
357. @BetterEveryLoop
358. @macappsbackup
359. @r_talesfromtechsupport
360. @r_explainmelikeimfive
361. @r_naruto
362. @redditmovie
363. @weirddalle
364. @ShrineOfMiku
365. @r_starterpacks
366. @r_sweden
367. @r_greentext
368. @r_thatsinsane
369. @env_chat
370. @r_askmen
371. @r_mild
372. @r_HermitCraft
373. @r_deathStranding
374. @LivestreamFail
375. @r_invites
376. @r_Damnthatsinteresting
377. @r_moviesuggestions
378. @r_playboicarti
379. @ATBGE
380. @r_vinesauce
381. @r_komisan
382. @r_gunners
383. @r_econ
384. @DoctorWhumour
385. @r_hearthstone
386. @cheerleadersbackup
387. @okbuddypersona
388. @r_DiscoElysium
389. @AssholeDesign
390. @reddit_Dota2
391. @r_bash
392. @didntknowiwantedthat
393. @r_buildapcsales
394. @r_wellthatsucks
395. @r_kopyamakarna
396. @r_MadokaMagica
397. @r_wireguard
398. @r_Bertra
399. @r_kanye
400. @reddit196

Читать полностью…

Data Scientology

101. @r_DeepFriedMemes
102. @chessmemes
103. @r_dontdeadopeninside
104. @r_AskReddit
105. @r_kerala
106. @r_AxieInfinity
107. @r_thinkpadsforsale
108. @foxgirls_hot
109. @ImaginaryPics
110. @NFL_reddit
111. @r_etymology
112. @reddit2telegram
113. @AzurLane_sub
114. @rekabufeed
115. @r_kendricklamar
116. @WTF_PICTURES
117. @r_antiwork
118. @RedditCats
119. @redditshortfilms
120. @r_plsnobulli
121. @legalcatadvice
122. @r_nvidia
123. @YoutubeCompendium
124. @NikonBackup
125. @catmemes_reddit
126. @coolguides
127. @rmallubabes
128. @PoliticalHumor
129. @programmingreddit
130. @rsoccerbetting
131. @r_BokuNoMetaAcademia
132. @reddit_trackballs
133. @r_ExpandDong
134. @vfxbackup
135. @RedditGames
136. @r_propagandaposters
137. @r_Unity3D
138. @r_pubgmobile
139. @RussianIsSoHard
140. @Rattit
141. @r_DetroitPistons
142. @r_catastrophicfailure
143. @anime_bikini_waifus
144. @Idiots_In_Cars
145. @r_bapcsalescanada
146. @r_minecraft
147. @r_teenagers
148. @r_raspberry_pi
149. @r_one_punch_man
150. @r_blueteamsec
151. @JEENEETardsReddit
152. @r_traumacore
153. @r_illegallysmolcats
154. @r_chemicalreactiongifs
155. @r_cursed
156. @r_dndgreentext
157. @r_ContraPoints
158. @r_terraria
159. @reddit_elm
160. @Next_Level_Skills
161. @The100backup
162. @r_hackintosh
163. @gameofthronesbackup
164. @r_avatar_memes
165. @failures_of_capitalism
166. @r_okbuddyretard
167. @g4m3savisos
168. @thefalconandthews_reddit
169. @r_k12sysadmin
170. @churchoftohsaka
171. @r_CoolGithubProjects
172. @r_streetwear
173. @CallOfDutyMobile_reddit
174. @r_versus
175. @ImaginationExplorer
176. @one_piece_topic
177. @r_VirginVsChad
178. @r_manga2
179. @r_creepy
180. @UNBGBBIIVCHIDCTIICBG
181. @EliteDanger0us
182. @r_malazan
183. @kstxi
184. @r_okbuddychicanery
185. @r_BikiniBottomTwitter
186. @r_WritingPrompts
187. @SubredditMix
188. @rareinsults
189. @blackpeopletweets
190. @r_okbuddyrintard
191. @r_funnystories
192. @saber_fgo
193. @r_outerwilds
194. @r_hololive
195. @brasildob
196. @Dreamcatcher_reddit
197. @rJackSucksAtLife
198. @oddly_satisfy
199. @instant_regret
200. @imaginary_maps

Читать полностью…

Data Scientology

🎂🎂🎂🎂🎂🎂🎂
🎁 Today @datascientology is 7 years old.
🎉 Congratulations! 🎈

Читать полностью…

Data Scientology

Serverless development experience for embedded computer vision

I recently published the version 1.0 of the Pipeless framework. It provides the development experience of serverless web frameworks to create computer vision applications that run directly on devices.

What that means is you provide some Python functions and those are executed when there is a new video frame. The framework manages everything for you including parallelization, streams management, executing your functions when they have to be executed, etc. You can run it in your devices and provide input streams via a CLI or REST API. It supports multi-stream processing, dynamic stream configuration, ships some inference runtimes so you just need to provide a model, and a bunch of other cool features.

It is working at a very decent performance. I have reached real-time 15 FPS in a CPU of 4 cores with a YOLO model. When using GPU it more than doubles that.

If someone is interested I really appreciate feedback!

You can find the repo here: https://github.com/pipeless-ai/pipeless

/r/computervision
https://redd.it/17vemga

Читать полностью…

Data Scientology

D What AI topics are you curious about but rarely see in the spotlight?

I'm a data engineer who somehow ended up as a software developer. So many of my friends are working now with the OpenAI api to add generative capabilities to their product, but they lack A LOT of context when it comes to how LLMs actually works.

This is why I started writing popular-science style articles that unpack AI concepts for software developers working on real-world application. It started kind of slow, honestly I wrote a bit too "brainy" for them, but now I've found a voice that resonance with this audience much better and I want to ramp up my writing cadence.

I would love to hear your thoughts about what concepts I should write about next?
What get you excited and you find hard to explain to someone with a different background?

/r/MachineLearning
https://redd.it/17riznw

Читать полностью…

Data Scientology

R Idempotent Generative Network

Paper: https://arxiv.org/abs/2311.01462

Blog: https://assafshocher.github.io/IGN/

Abstract:

>We propose a new approach for generative modeling based on training a neural network to be idempotent. An idempotent operator is one that can be applied sequentially without changing the result beyond the initial application, namely f(f(z))=f(z). The proposed model f is trained to map a source distribution (e.g, Gaussian noise) to a target distribution (e.g. realistic images) using the following objectives: (1) Instances from the target distribution should map to themselves, namely f(x)=x. We define the target manifold as the set of all instances that f maps to themselves. (2) Instances that form the source distribution should map onto the defined target manifold. This is achieved by optimizing the idempotence term, f(f(z))=f(z) which encourages the range of f(z) to be on the target manifold. Under ideal assumptions such a process provably converges to the target distribution. This strategy results in a model capable of generating an output in one step, maintaining a consistent latent space, while also allowing sequential applications for refinement. Additionally, we find that by processing inputs from both target and source distributions, the model adeptly projects corrupted or modified data back to the target manifold. This work is a first step towards a ``global projector'' that enables projecting any input into a target data distribution.

​

/r/MachineLearning
https://redd.it/17otzfw

Читать полностью…

Data Scientology

What’s your responsibility as computer vision developer?

I feel like I’m not doing computer vision. I have started a project at my current organization where I am building the defect detection system from scratch. I am mainly spending my time collecting the dataset, labeling them and trying various models. My team uses highly accurate available models from AWS Rekognition and Roboflow which is one click training process. I feel like anyone can collect, label and test the models.

/r/computervision
https://redd.it/17kssfp

Читать полностью…

Data Scientology

imops - ultra fast classical CV algorithms for Python

Hi everyone!

tl/dr: imops is a collection of carefully optimized CV algorithms for numpy arrays of any dimension!

I work in medical imaging, mostly with 3D CT/MRI. This is a pretty computational heavy field with a focus on near-real time processing.

To my surprise, many of the CV algorithms from scipy and skimage are painfully slow. We reimplemented some of them in Cython and added support for arrays of any dimension.

You can find the project here, and the benchmarks section contains a comparison with scipy/skimage counterparts.

If you're interested in contributing, or would like to see another function implemented, don't hesitate to open a PR or create an issue!

/r/computervision
https://redd.it/17h0i9t

Читать полностью…

Data Scientology

D Is Computer Vision dead? - “Quo Vadis, Computer Vision?”

In ICCV23, several top notch researchers shared their insights (in a workshop called “Quo Vadis, Computer Vision?”) wrt the current state of Computer Vision, especially in light of the meteoric raise of LLMs. Has CV stalled? Is CV dead?

E.g.MIT’s professor Bill Freeman, has some interesting points on foundation models: “FM aren’t fundamental, therefore not stable". Jitendra Malik argues "video can describe the world better than text."

/r/MachineLearning
https://redd.it/17eak3w

Читать полностью…

Data Scientology

What should I self-study specifically to become hirable in the NLP/machine learning field

Hi, I have majors in cognitive science, linguistics, philosophy, and a minor in computer science. I know python, java, and SQL at the leetcode/interview level. I have studied algebra, linear algebra, calculus, differential equations. I would love to apply my linguistics knowledge in a tech job (I have not had a tech job at all before). However, I did not get the chance to study NLP or machine learning in college, so I feel like I do not know how to bridge the gap between these two disciplines. What should I do or study to know what I am doing, to be able to get an entry-level NLP position?

/r/LanguageTechnology
https://redd.it/16mrr90

Читать полностью…

Data Scientology

Seeking learning resources that go deep into NLP foundations and which target advanced-intermediate technical learners

# TL;DR:

Semi-experienced MLE seeking to deepen my knowledge of the modeling side of NLP. Can you recommend any courses or other resources to pursue for this?

Ideally I'd like resources which target advanced-intermediate practitioners and which spend only minimal time on theoretical linguistics concepts (e.g., "What is syntax?"; "What is a morpheme?"; "What is distributional semantics?" - Less because that stuff is unimportant, and more because I already know it inside and out.)

------------

# My brief background

Theoretical linguistics grad here. Several years ago and many years out of school, I set off to STEM-up and become a machine learning engineer (MLE) in NLP. I spent a few years hardcore self-studying math, programming, ML/DL, and NLP. In the end I succeeded, learning just enough to get hired as an NLP MLE using only web-based resources and a few PDFs.

I've been in the role for 3-4 years now, and in the interim I have learned a TON of practical skills about SWE and DevOps that I hadn't learned while self-studying the theory. However, my knowledge of ML/DL/NLP theory hasn't actually grown much. This is mostly because where I work, the modeling is left to PhD'ed researchers, not as much the engineers, and not at all the junior engineers like me. So I'd like to get back into learning that side of things.

Because I prioritized breadth over depth during that initial self-study period, I'm passingly familiar with a fair number of NLP tasks and techniques. But I am a master of none. So on this second pass, because now I know the basics and can "speak the lingo", I'd like to go deep, especially on the foundations on NLP.

# What I'm hoping to get from this post

With this background, can anyone recommend (ideally) courses, books, blogs, or other resources for learning things such as the following?

- foundational NLP tasks (e.g., POS-tagging, dependency parsing, NER, sentiment analysis, summarization) and associated popular models/approaches
- probability theory for NLP
- traditional/non-deep NLP
- sequence classification
- clustering/unsupervised methods for NLP
- multitask learning in NLP


# Why? "Who cares?"

At this point, someone may ask:

> You've already "made it" as an MLE, and modeling clearly is not required. So who cares?

I would retort that as one advances from junior to mid to senior and beyond, "hey I can write great code" should gradually take a backseat to "hey I can make informed decisions about what data and models to pursue given the goals and constraints of a specific business case."

I really want to be able to reason about and make pragmatic arguments like "Well, business case A can be re-conceptualized as a kind of question-answering task (random example), therefore it would be reasonable to start with model B and optimize a cost function C with optimizer D. For that, we'd want to collect at least E examples of data type F, and run some model experiments on G cloud resource, ..." Etc. etc etc. Right now I could follow along if a senior engineer came up with such an argument, then probably execute the work mostly on my own. But I would struggle to come up with and then stand behind the argument myself.

I recently had my first technical interviews (for senior level), and while I did well in the coding rounds, I really felt my deficiencies when answering "What would you do in scenario X, and why?" type questions and talking about the nitty gritty of model architectures. My responses were often just like "Um, I'd throw a transformer model at it", in part because that's 90% of what we do where I work, but also because I lack experience which makes me tend towards "when all you have is a hammer everything looks like a nail" style thinking. Hence, this post.

Anyway, enough ramble. I really look forward to any suggestions I receive. Thanks in advance.

/r/LanguageTechnology
https://redd.it/174k2wv

Читать полностью…
Подписаться на канал