Data Science by ODS.ai 🦜
46.1K subscribers
664 photos
77 videos
7 files
1.75K links
First Telegram Data Science channel. Covering all technical and popular staff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of former. To reach editors contact: @malev
加ε…₯钑道
VeGAN release

A small library to easily train various existing #GAN using #PyTorch. Based on provided generator and discriminator, VeGANs trains them with a selected GAN algorithm.

Link: https://github.com/unit8co/vegans
StanfordNLP release.

New Python #NLProc package. StanfordNLP provides native, neural (PyTorch) tokenization, POS tagging and dependency parsing for 53 languages based on UD v2 β€” and a Python CoreNLP interface.

Pip: pip install stanfordnlp
PyPI: https://pypi.org/project/stanfordnlp/
Link: https://stanfordnlp.github.io/stanfordnlp/

#NLP #Stanford
Forwarded from Just links
Typical data scientist/machine learning engineer:
> understands that data science is generally engineering most of programmers can deal with
> do want to be overpaid and thus to keep others away from field
> keeps saying rare useless skill he possesses is definitively necessary for data science
Towards reconstructing intelligible speech from the human auditory cortex



Columbia neuroengineers have created a system that translates thought into intelligible, recognizable speech. By monitoring someone's brain activity, the technology can reconstruct the words a person hears with unprecedented clarity.

Link: https://www.nature.com/articles/s41598-018-37359-z

#BCI #thought2text
​​StNet: Local and Global Spatial-Temporal Modeling for Action Recognition

Baidu in one of #AAAI19 papers, proposed StNet, a novel framework for both local and global spatial-temporal modeling in videos. StNet outperforms several state-of-the-art approaches in action recognition and balances btw accuracy and model complexity.

ArXiV: https://arxiv.org/pdf/1811.01549.pdf

#SOTA #Baidu
​​DeepSlide: A Sliding Window Framework for Classification of High Resolution Microscopy Images (Whole-Slide Images)

DeepSlide, our #DeepLearning library for classification/visualization of high-resolution pathology images is open-source and available on GitHub.

GitHub: https://github.com/BMIRDS/deepslide

#histopathology #healthcare #PyTorch
​​Introducing AresDB: Uber’s GPU-Powered Open Source, Real-time Analytics Engine

Link: https://eng.uber.com/aresdb/

#Uber #analytics #opensource
​​Fair Regression for Health Care Spending

What happens, if fairness built into the objective function for continuous outcomes & see large improvements in group undercompensation?

This is the most interesting & potentially impactful analysis of fairness in #ML for #healthcare, which can lead to significant improvement in the life of millions.

ArXiV: https://arxiv.org/abs/1901.10566
GitHub: https://github.com/zinka88/Fair-Regression

#statistics #regression
​​Bayesian Statistics explained to Beginners in Simple English

Now some #entrylevel material, which still might be useful to review, because repetitio est mater studiorum.

Link: https://www.analyticsvidhya.com/blog/2016/06/bayesian-statistics-beginners-simple-english/
State-of-the-art (SOTA) collection of Paperswithcode

A great site, worth spreading word about: Papers With Code now includes 950+ ML tasks, 500+ evaluation tables (including SOTA results) and 8500+ papers with code. Probably the largest collection of NLP tasks, including 140+ tasks and 100 datasets.

Link: https://paperswithcode.com
Sota link: https://paperswithcode.com/sota

#Meta #collection #sota #useful
Manifold: A Model-Agnostic Visual Debugging Tool for Machine Learning at Uber

Seesm like there is no week without any news from #Uber engineering team. This time Uber built Manifold, a model-agnostic visualization tool for #ML performance diagnosis and model debugging, to facilitate a more informed and actionable model iteration process.

Link: https://ubere.ng/2Hac0O8

#Pipeline #administration
​​Implementation of character based convolutional neural network

A #PyTorch implementation of Character Based ConvNets for text classification published by Yan LeCun in 2015 is open-sourced on. Many training features and hacks are implemented.

Link: https://github.com/ahmedbesbes/character-based-cnn
Forwarded from Karim Iskakov - ΠΊΠ°Π½Π°Π» (karfly_bot)
"Flickr-Faces-HQ (FFHQ) dataset is out now. 70 000 high-quality 1024Γ—1024 PNG images. Good variety. Used for Style-GAN paper"
πŸ”Ž github.com/NVlabs/ffhq-dataset
πŸ“‰ @loss_function_porn
πŸ‘1
EE-559 – DEEP LEARNING (SPRING 2019)

Deep learning course covering the main deep learning tools and theoretical results, with examples in the #PyTorch framework.

Taught by FranΓ§ois Fleuret from Γ‰cole Polytechnique FΓ©dΓ©rale de Lausanne, Switzerland.

Link: https://fleuret.org/ee559/

#DL #course #learnhardgopro
​​Neural Networks seem to follow a puzzlingly simple strategy to classify images

Interesting article on how actually #NN see images and what helps to distinct different classes.

Link: https://medium.com/bethgelab/neural-networks-seem-to-follow-a-puzzlingly-simple-strategy-to-classify-images-f4229317261f

#BagNet #ResNet #Dl #CV