Data Science by ODS.ai ๐Ÿฆœ
46K subscribers
676 photos
77 videos
7 files
1.75K links
First Telegram Data Science channel. Covering all technical and popular staff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of former. To reach editors contact: @malev
ๅŠ ๅ…ฅ้ข‘้“
#python 3.8 is released. The worst python release so far. :=

I hope, that python4 will concentrate on removing useless stuff from the core, performance, and extending typing support.

Ideally, asyncio should be moved to a separate package, := should be undone. We all make mistakes.

https://docs.python.org/3/whatsnew/3.8.html
โ€‹โ€‹Generative Image Translation for Data Augmentation in Colorectal Histopathology Images

#GAN that generates near-real #histology images according to a Turing test with 4 pathologists. The results can be used for training #DL models for detecting rare histological patterns.

ArXiV: https://arxiv.org/abs/1910.05827
Code: https://github.com/BMIRDS/HistoGAN

#CV #healthlearning #biolearning #medical
ODS breakfast in Paris! See you this Saturday (19th) at 10:30 at Malongo Cafรฉ, 50 Rue Saint-Andrรฉ des Arts.
๐ŸŽ“ Reinforcement Learning Course from OpenAI

Reinforcement Learning becoming significant part of the data scientist toolbox.
OpenAI created and published one of the best courses in #RL. Algorithms implementation written in #Tensorflow.
But if you are more comfortable with #PyTorch, we have found #PyTorch implementation of this algs

OpenAI Course: https://spinningup.openai.com/en/latest/
Tensorflow Code: https://github.com/openai/spinningup
PyTorch Code: https://github.com/kashif/firedup

#MOOC #edu #course #OpenAI
Applying deep learning and Tensorflow to improve brain MRI images quality

Taking brain MRI images is complicated procedure as the orientation, location, and coverage needs to be correct in all three spatial dimentsions. The quality and consistency of positioning and orientation of the slices relies heavily on the skill and experience of the scan operator. This process can be time-consuming and difficult, especially for complex anatomies. As a result, there can be inconsistencies from scan operator to scan operator. This lack of consistency can make the job of the radiologist in interpreting these images more difficult especially when a patient is being scanned as a follow up to previous MRI exam and they are trying to identify subtle changes in anatomy or disease progression over time.

The researchers from GE Healthcare Magnetic Resonance Imaging team developed an approach to aid the scan operator. The approach is based on 3 deep neural networks, can be adopted to take MRI images of the other body parts and achieves 99.2% accuracy score. The researchers notice that Tensorflow significantly helped them to develop and deliver the approach to the production.

Medium article: https://medium.com/tensorflow/intelligent-scanning-using-deep-learning-for-mri-36dd620882c4
GE Helthcare website: https://www.gehealthcare.com

#Tensorflow #medicine #casestudy #DL #CV
โ€‹โ€‹Using open repositories to create ageing mirror

@Genekogan on Twitter reported working on a prototype, which is capable of #aging person's image in real time, developing the trend started by #FaceApp

Github: https://github.com/genekogan/glow/
Client: https://github.com/genekogan/ofxRunway

#GAN #DL #CV #WIP
Applying deep learning to Airbnb search

Story of how #Airbnb research team moved from using #GBDT (gradient boosting) to #NN (neural networks) for search, with all the metrics and hypothesises.

Link: https://blog.acolyer.org/2019/10/09/applying-deep-learning-to-airbnb-search/
ODS breakfast in Paris! See you this Saturday (26th) at 10:30 at Malongo Cafรฉ, 50 Rue Saint-Andrรฉ des Arts.
Efficient multi-lingual language model fine-tuning

Most of the worldโ€™s text is not in English. To enable researchers and practitioners to build impactful solutions in their domains, understanding how our NLP architectures fare in many languages needs to be more than an afterthought.
In this post, we introduce our latest paper that studies multilingual text classification and introduces #MultiFiT, a novel method based on #ULMFiT.

MultiFiT, trained on 100 labeled documents in the target language, outperforms multi-lingual BERT. It also outperforms the cutting-edge LASER algorithm-even though LASER requires a corpus of parallel texts, and MultiFiT does not.

Post: http://nlp.fast.ai/classification/2019/09/10/multifit.htmlโ€ฆ
Paper: https://arxiv.org/abs/1909.04761
Tweet: https://twitter.com/seb_ruder/status/1186744388908654597?s=20

#NLP #DL #FineTuning
Learning a unified embeding for visual search at #Pinterest

How Pinterest created unified embeddings for images from different fields searching instead three different by use #multitask approach.

Link: https://blog.acolyer.org/2019/10/11/learning-a-unified-embedding-for-visual-search-at-pinterest/

#Search #CV #embeddings
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5)

The approach casts every language problem as a text-to-text task. For example, English-to-German translation โ€“ input: "translate English to German: That is good." target: "Das ist gut." or sentiment ID โ€“ input: "sentiment: This movie is terrible!", target: "negative"

Transfer learning for NLP usually uses unlabeled data for pre-training, so they assembled the "Colossal Clean Crawled Corpus" (C4), ~750GB of cleaned text from Common Crawl.

Compared to different architectural variants including encoder-decoder models and language models in various configurations and with various objectives. The encoder-decoder architecture performed best in our text-to-text setting.

More at the thread by the tweet: https://twitter.com/colinraffel/status/1187161460033458177?s=20

Paper: https://arxiv.org/abs/1910.10683
Code/models/data/etc: https://github.com/google-research/text-to-text-transfer-transformer

#NLP #DL #transformer
โ€‹โ€‹ICCV 2019 papers

ICCV 2019 โ€“ one of the major tier A conferences on Computer Vision. These are papers presented at the conference. We are definitely going to post short descriptions of the most influential ones, but if you don't want to wait, here is the link:

Link: http://openaccess.thecvf.com/ICCV2019.py

#CV #Papers
This media is not supported in your browser
VIEW IN TELEGRAM
FUNIT: Few-Shot Unsupervised Image-to-Image Translation

A team of NVIDIA researchers has defined new AI techniques that give computers enough smarts to see a picture of one animal and recreate its expression and pose on the face of any other creature. The work is powered in part by generative adversarial networks (GANs), an emerging AI technique that pits one neural network against another.

Blog: https://blogs.nvidia.com/blog/2019/10/27/ai-gans-pets-ganimals/
Paper: https://arxiv.org/abs/1905.01723
ะกode: https://github.com/NVlabs/FUNIT
GANimal app: http://nvidia-research-mingyuliu.com/ganimal/

#CV #GAN #ICCV
YOLACT_ Real-Time Instance Segmentation [ICCV Trailer].mp4
19.2 MB
YOLACT: Real-time Instance Segmentation

Fully-convolutional model for real-time instance segmentation that achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is significantly faster than any previous competitive approach. They obtain this result after training on only one GPU.


video: https://www.youtube.com/watch?v=0pMfmo8qfpQ
paper: https://arxiv.org/abs/1904.02689
code: https://github.com/dbolya/yolact

#yolo #instance_segmentation #segmentation #real_time
๐ŸŽƒMoscow Data Halloween on the 31st of October

Itโ€™s gonna be one of the most unusual data science meetups!

We will have several Black ML talks, Data Science PPT Karaoke from Hell, costume contest with prizes, lots of fun and afterparty.

Registration link: https://corp.mail.ru/ru/press/events/678/