BigData
3.3K subscribers
754 photos
89 videos
3 files
835 links
Data Science : Big Data : Machine Learning : Deep Learning. По всем вопросам @evgenycarter
加入频道
Learning Protein Representations via Complete 3D Graph Networks

DIG: Dive into Graphs is a turnkey library for graph deep learning research.

Github: https://github.com/divelab/DIG

Paper: https://arxiv.org/abs/2207.12600v1

Tutorials: https://diveintographs.readthedocs.io/en/latest/tutorials/graphdf.html

Documentation: https://diveintographs.readthedocs.io/

Benchmarks: https://github.com/divelab/DIG/tree/dig-stable/benchmarks

Dataset: https://paperswithcode.com/dataset/atom3d

👉 @bigdata_1
1
ALBench: A Framework for Evaluating Active Learning in Object Detection

An active learning benchmark framework named as ALBench for evaluating active learning in object detection.

Github: https://github.com/industryessentials/ymir

Paper: https://arxiv.org/abs/2207.13339v1

Projects: https://github.com/IndustryEssentials/ymir/projects

Dataset: https://paperswithcode.com/dataset/coco

👉 @bigdata_1
1
Rewriting Geometric Rules of a GAN

Method which allows edit a GAN model to synthesize many unseen objects with the desired shape

Github: https://github.com/peterwang512/ganwarping

Paper: https://arxiv.org/abs/2207.14288v1

Project: https://peterwang512.github.io/GANWarping/

Dataset: https://paperswithcode.com/dataset/ffhq

Video: https://www.youtube.com/watch?v=2m7_rbsO6Hk

👉 @bigdata_1
1😁1
1
This media is not supported in your browser
VIEW IN TELEGRAM
Expanding Language-Image Pretrained Models for General Video Recognition by Microsoft.

Video-specific prompting scheme, which leverages video content information for generating discriminative textual prompts.

Github: https://github.com/microsoft/VideoX/tree/master/X-CLIP

Paper: https://arxiv.org/abs/2208.02816v1

Dataset: https://paperswithcode.com/dataset/ucf101

👉 @bigdata_1
11
P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting

Github: https://github.com/wangzy22/P2P

Paper: https://arxiv.org/abs/2208.02812v1

Dataset: https://paperswithcode.com/dataset/imagenet

Model: https://shapenet.cs.stanford.edu/media/modelnet40_normal_resampled.zip

👉 @bigdata_1
1
Per-Clip Video Object Segmentation

Progressive matching mechanism for efficient information-passing within a clip.

Github: https://github.com/pkyong95/PCVOS

Paper: https://arxiv.org/abs/2208.01924v1

Dataset: https://paperswithcode.com/dataset/davis

Video: https://youtu.be/6QATHDwrUx0

👉 @bigdata_1
1
1
ROC: A New Paradigm for Lyric-to-Melody Generation

Muzic is a research project on AI music that empowers music understanding and generation with deep learning and artificial intelligence.

Github: https://github.com/microsoft/muzic

Paper: https://arxiv.org/abs/2208.05697v1

Project: https://www.microsoft.com/en-us/research/project/ai-music/

👉 @bigdata_1
❤‍🔥1
Speech Enhancement and Dereverberation with Diffusion-based Generative Models

Github: https://github.com/sp-uhh/sgmse

Paper: https://arxiv.org/abs/2208.05830v1

Pretrained checkpoints: https://drive.google.com/drive/folders/1CSnkhUSoiv3RG0xg7WEcVapyLuwDaLbe?usp=sharing

👉 @bigdata_1
StyleFaceV - Official PyTorch Implementation

StyleFaceV produces high-fidelity identity-preserving face videos with vivid movements

Github: https://github.com/arthur-qiu/stylefacev

Project: http://haonanqiu.com/projects/StyleFaceV.html

Video: https://youtu.be/BZNLcD04-Fc

Paper: https://arxiv.org/abs/2208.07862v1

Dataset: https://paperswithcode.com/dataset/faceforensics-1

👉 @bigdata_1
1
Unifying Visual Perception by Dispersible Points Learning

Conceptually simple, flexible, and universal visual perception head for variant visual task

Github: https://github.com/sense-x/unihead

Paper: https://arxiv.org/abs/2208.08630v1

Model: https://drive.google.com/file/d/1TwFCog_PMd1HWA7s-s9pN2F_fgyMyR3x/view

Datasets: https://paperswithcode.com/dataset/imagenet

👉 @bigdata_1
1
2
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks

Masked "language" modeling on images (Imglish), texts (English), and image-text pairs ("parallel sentences") in a unified manner.

Github: https://github.com/microsoft/unilm/tree/master/beit

Paper: https://arxiv.org/abs/2208.10442v1

Datasets: https://paperswithcode.com/dataset/visual-genome

👉 @bigdata_1
2