BigData
Data Science : Big Data : Machine Learning : Deep Learning. For all inquiries: @evgenycarter
Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping

PyGOD is a Python library for graph outlier detection (anomaly detection).

Github: https://github.com/pygod-team/pygod

Dataset : https://paperswithcode.com/dataset/ogb

Paper: https://arxiv.org/abs/2206.10071v1

👉 @bigdata_1
StrengthNet

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

Github: https://github.com/ttslr/strengthnet

Paper: https://arxiv.org/abs/2110.03156

MOSNet: https://github.com/lochenchou/MOSNet

👉 @bigdata_1
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

This library implements some of the most common (Variational) Autoencoder models.

Github: https://github.com/clementchadebec/benchmark_VAE

Paper: https://arxiv.org/abs/2206.08309v1

Dataset: https://paperswithcode.com/dataset/celeba

👉 @bigdata_1
DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation

DaisyRec-v2.0 is a Python toolkit for benchmarking top-N recommendation tasks.

Github: https://github.com/recsys-benchmark/daisyrec-v2.0

Command Generator : http://daisyrecguicommandgenerator.pythonanywhere.com/

Paper: https://arxiv.org/abs/2206.10848v1

Tutorial: https://github.com/recsys-benchmark/DaisyRec-v2.0/blob/main/DaisyRec-v2.0-Tutorial.ipynb

👉 @bigdata_1
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning

We introduce SoundSpaces 2.0, a platform for on-the-fly geometry-based audio rendering for 3D environments.

Github: https://github.com/facebookresearch/sound-spaces

Paper: https://arxiv.org/abs/2206.08312v1

Dataset: https://paperswithcode.com/dataset/librispeech

👉 @bigdata_1
Frequency Dynamic Convolution-Recurrent Neural Network (FDY-CRNN) for Sound Event Detection

Frequency Dynamic Convolution applies kernels that adapt to each frequency bin of the input, in order to remove the translation equivariance of 2D convolution along the frequency axis.
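
To make the idea concrete, here is a minimal pure-Python sketch, not the code from the FDY-SED repository. It assumes a simplified setting: a single 1D frame along the frequency axis, a small set of basis kernels, and attention logits supplied by hand for each frequency bin (the names `frequency_dynamic_conv`, `basis_kernels`, and `attention_logits` are hypothetical). Each output bin mixes the basis-kernel responses with its own softmax weights, so the effective kernel differs from bin to bin.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a short list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def conv1d_same(x, kernel):
    """Cross-correlate x with an odd-length kernel, zero-padded to keep the length."""
    half = len(kernel) // 2
    out = []
    for i in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + j - half
            if 0 <= idx < len(x):
                acc += w * x[idx]
        out.append(acc)
    return out

def frequency_dynamic_conv(x, basis_kernels, attention_logits):
    """Mix basis-kernel outputs with a separate softmax weighting per frequency bin."""
    basis_outputs = [conv1d_same(x, k) for k in basis_kernels]
    out = []
    for f in range(len(x)):
        weights = softmax(attention_logits[f])
        out.append(sum(w * basis_outputs[k][f] for k, w in enumerate(weights)))
    return out

def shift(values, n):
    """Shift a list n bins toward higher frequencies, filling with zeros."""
    return [0.0] * n + values[:len(values) - n]

def close(a, b, tol=1e-9):
    return all(abs(p - q) < tol for p, q in zip(a, b))

# Assumed toy data: one active bin, and the same pattern moved up by two bins.
x = [0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
x_shifted = shift(x, 2)
basis_kernels = [[0.0, 1.0, 0.0], [1.0, 1.0, 1.0]]

# Same weights in every bin: behaves like an ordinary (equivariant) convolution.
uniform_logits = [[0.0, 0.0] for _ in x]
# Low bins favor kernel 0, high bins favor kernel 1: the kernel now depends on frequency.
dynamic_logits = [[4.0, 0.0] if f < 3 else [0.0, 4.0] for f in range(len(x))]

equivariant_uniform = close(
    shift(frequency_dynamic_conv(x, basis_kernels, uniform_logits), 2),
    frequency_dynamic_conv(x_shifted, basis_kernels, uniform_logits),
)
equivariant_dynamic = close(
    shift(frequency_dynamic_conv(x, basis_kernels, dynamic_logits), 2),
    frequency_dynamic_conv(x_shifted, basis_kernels, dynamic_logits),
)
print(equivariant_uniform, equivariant_dynamic)
```

With identical weights in every bin, shifting the input simply shifts the output. With per-bin weights, the same pattern produces a different response depending on where it sits on the frequency axis, which is the property the post describes.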

Github: https://github.com/frednam93/FDY-SED

Paper: https://arxiv.org/abs/2206.11645v1

Dataset: https://paperswithcode.com/dataset/desed

👉 @bigdata_1
Retrosynthetic Planning with Retro*

A graph-based search policy that eliminates redundant exploration of intermediate molecules.

Github: https://github.com/binghong-ml/retro_star

Paper: https://arxiv.org/abs/2206.11477v1

Dataset: https://www.dropbox.com/s/ar9cupb18hv96gj/retro_data.zip?dl=0

👉 @bigdata_1
Bi-DexHands: Bimanual Dexterous Manipulation via Reinforcement Learning

Bi-DexHands provides a collection of bimanual dexterous manipulation tasks and reinforcement learning algorithms.

Github: https://github.com/pku-marl/dexteroushands

Isaac Gym: https://developer.nvidia.com/isaac-gym

Paper: https://arxiv.org/abs/2206.08686

👉 @bigdata_1
MindWare: Efficient Open-source AutoML System

MindWare is an efficient open-source system that helps users automate: 1) data pre-processing, 2) feature engineering, 3) algorithm selection, 4) architecture design, 5) hyper-parameter tuning, and 6) model ensembling.

Github: https://github.com/PKU-DAIR/mindware

Docs: https://mindware.readthedocs.io/en/latest/

Paper: https://arxiv.org/abs/2206.09423v1

👉 @bigdata_1
Tntorch - Tensor Network Learning with PyTorch

PyTorch-powered modeling and learning library using tensor networks. Installation: pip install tntorch

Github: https://github.com/rballester/tntorch

Docs site: http://tntorch.readthedocs.io/

Paper: https://arxiv.org/abs/2206.11128v1

👉 @bigdata_1
Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech

IMS Toucan is a toolkit for teaching, training and using state-of-the-art Speech Synthesis models.

Github: https://github.com/DigitalPhonetics/IMS-Toucan

Pre-Generated Audios: https://multilingualtoucan.github.io/

Cloning prosody across speakers: https://toucanprosodycloningdemo.github.io/

Paper: https://arxiv.org/abs/2206.12229v1

👉 @bigdata_1
Insubstantial Object Detection

A dataset of 600 videos (141,017 frames) covering various distances, sizes, visibility levels, and scenes, captured in different spectral ranges.

Github: https://github.com/calayzhou/iod-video

Project: https://calayzhou.github.io/

Paper: https://arxiv.org/abs/2206.11459v1

Dataset: https://paperswithcode.com/dataset/coco

👉 @bigdata_1
How to Grid Search Hyperparameters for Deep Learning Models in Python With Keras

https://machinelearningmastery.com/grid-search-hyperparameters-deep-learning-models-python-keras/
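
As a rough illustration of what grid search does, here is a minimal standard-library sketch. It is not the code from the linked tutorial and uses no Keras or scikit-learn API. `grid_search`, `toy_score`, and the parameter names are hypothetical; `toy_score` stands in for "train a model with these hyperparameters and return its validation score".

```python
from itertools import product

def grid_search(param_grid, score_fn):
    """Score every combination in param_grid; return (best_params, best_score, results)."""
    names = list(param_grid)
    results = []
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        results.append((params, score_fn(**params)))
    best_params, best_score = max(results, key=lambda item: item[1])
    return best_params, best_score, results

def toy_score(batch_size, learning_rate):
    # Hypothetical stand-in for training a model and measuring validation accuracy.
    return 1.0 - abs(learning_rate - 0.01) * 10 - abs(batch_size - 32) / 1000

# Assumed search space: 3 x 3 = 9 combinations, each evaluated once.
param_grid = {
    "batch_size": [16, 32, 64],
    "learning_rate": [0.001, 0.01, 0.1],
}

best_params, best_score, results = grid_search(param_grid, toy_score)
print(best_params, best_score)
print(len(results), "combinations evaluated")
```

The cost grows multiplicatively with each added hyperparameter, so with a real model each call to the scoring function is a full training run and the grid should stay small.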

👉 @bigdata_1
A Comprehensive Survey on Deep Gait Recognition: Algorithms, Datasets and Challenges

Github: https://github.com/shiqiyu/opengait

Paper: https://arxiv.org/abs/2206.13732v1

Dataset: https://paperswithcode.com/dataset/usf

👉 @bigdata_1
FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

Github: https://github.com/timothyhtimothy/fast-vqa

Paper: https://arxiv.org/abs/2207.02595v1

Dataset: https://paperswithcode.com/dataset/kinetics

👉 @bigdata_1
Aug-NeRF: Training Stronger Neural Radiance Fields with Triple-Level Physically-Grounded Augmentations

Aug-NeRF brings robust data augmentation to the regularization of NeRF training for the first time.

Github: https://github.com/vita-group/aug-nerf

Paper: https://arxiv.org/abs/2207.01164v1

Cloud Drive: https://drive.google.com/drive/folders/128yBriW1IG_3NJ5Rp7APSTZsJqdJdfc1

👉 @bigdata_1