Data Science Jupyter Notebooks
8.7K subscribers
85 photos
24 videos
9 files
202 links
Explore the world of Data Science through Jupyter Notebooks—insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
加入频道
🔖 ImageBind: One Embedding Space To Bind Them All

📝 This project is a significant step forward in understanding and connecting information from diverse sources like images, text, audio, video, and even motion sensor data.

⚙️ Supports 6 Modalities:

📷 Image
📝 Text
🔈 Audi
🎥 Video
🦴 IMU sensor data (e.g., accelerometer)
🙄 Depth/Thermal & 3D data
Interestingly, only some modalities had labels, yet ImageBind learned to align them through self-supervised learning.


💫 Key Features:

..No need for paired data (e.g., images and audio don’t have to be aligned)..Leverages contrastive learning for learning joint embedding space
..Competes with CLIP and AudioCLIP, but with better accuracy and coverage..Enables zero-shot retrieval (e.g., finding relevant video using just a sentence)


📌 Repo: https://github.com/facebookresearch/ImageBind

🔍 By: https://yangx.top/DataScienceN 🌟

#ImageBind #MultimodalAI #MetaAI #DeepLearning #SelfSupervised
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3🔥2
This media is not supported in your browser
VIEW IN TELEGRAM
🐈‍⬛ TTT Long Video Generation 🐈‍⬛

▶️ A novel architecture for video generation, adapting the #CogVideoX 5B model by incorporating #TestTimeTraining (TTT) layers.
Adding TTT layers into a pre-trained Transformer enables generating a one-minute clip from text storyboards.
Videos, code & annotations released 💙

🔗 Review: https://t.ly/mhlTN
📄 Paper: arxiv.org/pdf/2504.05298
🌐 Project: test-time-training.github.io/video-dit
🧑‍💻 Repo: github.com/test-time-training/ttt-video-dit

#AI #VideoGeneration #MachineLearning #DeepLearning #Transformers #TTT #GenerativeAI

🔍 By: https://yangx.top/DataScienceN5
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3🥰2
🚀 New Tutorial: Automatic Number Plate Recognition (ANPR) with YOLOv11 + GPT-4o-mini!


This hands-on tutorial shows you how to combine the real-time detection power of YOLOv11 with the language understanding of GPT-4o-mini to build a smart, high-accuracy ANPR system! From setup to smart prompt engineering, everything is covered step-by-step. 🚗💡

🎯 Key Highlights:
YOLOv11 + GPT-4o-mini = High-precision number plate recognition
Real-time video processing in Google Colab
Smart prompt engineering for enhanced OCR performance

📢 A must-watch if you're into computer vision, deep learning, or OpenAI integrations!


🔗 Colab Notebook
▶️ Watch on YouTube


#YOLOv11 #GPT4o #OpenAI #ANPR #OCR #ComputerVision #DeepLearning #AI #DataScience #Python #Ultralytics #MachineLearning #Colab #NumberPlateRecognition

🔍 By : https://yangx.top/DataScienceN
👍21🔥1