⚙️ Supports 6 Modalities:
Interestingly, only some modalities had labels, yet ImageBind learned to align them through self-supervised learning.
..No need for paired data (e.g., images and audio don’t have to be aligned)..Leverages contrastive learning for learning joint embedding space
..Competes with CLIP and AudioCLIP, but with better accuracy and coverage..Enables zero-shot retrieval (e.g., finding relevant video using just a sentence)
#ImageBind #MultimodalAI #MetaAI #DeepLearning #SelfSupervised
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3🔥2
This media is not supported in your browser
VIEW IN TELEGRAM
Adding TTT layers into a pre-trained Transformer enables generating a one-minute clip from text storyboards.
Videos, code & annotations released
#AI #VideoGeneration #MachineLearning #DeepLearning #Transformers #TTT #GenerativeAI
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3🥰2
🚀 New Tutorial: Automatic Number Plate Recognition (ANPR) with YOLOv11 + GPT-4o-mini!
This hands-on tutorial shows you how to combine the real-time detection power of YOLOv11 with the language understanding of GPT-4o-mini to build a smart, high-accuracy ANPR system! From setup to smart prompt engineering, everything is covered step-by-step. 🚗💡
🎯 Key Highlights:
✅ YOLOv11 + GPT-4o-mini = High-precision number plate recognition
✅ Real-time video processing in Google Colab
✅ Smart prompt engineering for enhanced OCR performance
📢 A must-watch if you're into computer vision, deep learning, or OpenAI integrations!
🔗 Colab Notebook
▶️ Watch on YouTube
#YOLOv11 #GPT4o #OpenAI #ANPR #OCR #ComputerVision #DeepLearning #AI #DataScience #Python #Ultralytics #MachineLearning #Colab #NumberPlateRecognition
🔍 By : https://yangx.top/DataScienceN
This hands-on tutorial shows you how to combine the real-time detection power of YOLOv11 with the language understanding of GPT-4o-mini to build a smart, high-accuracy ANPR system! From setup to smart prompt engineering, everything is covered step-by-step. 🚗💡
🎯 Key Highlights:
✅ YOLOv11 + GPT-4o-mini = High-precision number plate recognition
✅ Real-time video processing in Google Colab
✅ Smart prompt engineering for enhanced OCR performance
📢 A must-watch if you're into computer vision, deep learning, or OpenAI integrations!
🔗 Colab Notebook
▶️ Watch on YouTube
#YOLOv11 #GPT4o #OpenAI #ANPR #OCR #ComputerVision #DeepLearning #AI #DataScience #Python #Ultralytics #MachineLearning #Colab #NumberPlateRecognition
🔍 By : https://yangx.top/DataScienceN
👍2❤1🔥1
𝑯𝒐𝒎𝒐𝒈𝒓𝒂𝒑𝒉𝒚 𝒂𝒏𝒅 𝑲𝒆𝒚𝒑𝒐𝒊𝒏𝒕 𝒇𝒐𝒓 𝑭𝒐𝒐𝒕𝒃𝒂𝒍𝒍 𝑨𝒏𝒂𝒍𝒚𝒕𝒊𝒄𝒔 ⚽️📐
🚀 Highlighting the latest strides in football field analysis using computer vision, this post shares a single frame from our video that demonstrates how homography and keypoint detection combine to produce precise minimap overlays. 🧠🎯
🧩 At the heart of this project lies the refinement of field keypoint extraction. Our experiments show a clear link between both the number and accuracy of detected keypoints and the overall quality of the minimap. 🗺️
📊 Enhanced keypoint precision leads to a more reliable homography transformation, resulting in a richer, more accurate tactical view. ⚙️⚡
🏆 For this work, we leveraged the championship-winning keypoint detection model from the SoccerNet Calibration Challenge:
📈 Implementing and evaluating this state‑of‑the‑art solution has deepened our appreciation for keypoint‑driven approaches in sports analytics. 📹📌
🔗 https://lnkd.in/em94QDFE
📡 By: https://yangx.top/DataScienceN
#ObjectDetection hashtag#DeepLearning hashtag#Detectron2 hashtag#ComputerVision hashtag#AI
hashtag#Football hashtag#SportsTech hashtag#MachineLearning hashtag#ComputerVision hashtag#AIinSports
hashtag#FutureOfFootball hashtag#SportsAnalytics
hashtag#TechInnovation hashtag#SportsAI hashtag#AIinFootball hashtag#AI hashtag#AIandSports hashtag#AIandSports
hashtag#FootballAnalytics hashtag#python hashtag#ai hashtag#yolo hashtag
🚀 Highlighting the latest strides in football field analysis using computer vision, this post shares a single frame from our video that demonstrates how homography and keypoint detection combine to produce precise minimap overlays. 🧠🎯
🧩 At the heart of this project lies the refinement of field keypoint extraction. Our experiments show a clear link between both the number and accuracy of detected keypoints and the overall quality of the minimap. 🗺️
📊 Enhanced keypoint precision leads to a more reliable homography transformation, resulting in a richer, more accurate tactical view. ⚙️⚡
🏆 For this work, we leveraged the championship-winning keypoint detection model from the SoccerNet Calibration Challenge:
📈 Implementing and evaluating this state‑of‑the‑art solution has deepened our appreciation for keypoint‑driven approaches in sports analytics. 📹📌
🔗 https://lnkd.in/em94QDFE
📡 By: https://yangx.top/DataScienceN
#ObjectDetection hashtag#DeepLearning hashtag#Detectron2 hashtag#ComputerVision hashtag#AI
hashtag#Football hashtag#SportsTech hashtag#MachineLearning hashtag#ComputerVision hashtag#AIinSports
hashtag#FutureOfFootball hashtag#SportsAnalytics
hashtag#TechInnovation hashtag#SportsAI hashtag#AIinFootball hashtag#AI hashtag#AIandSports hashtag#AIandSports
hashtag#FootballAnalytics hashtag#python hashtag#ai hashtag#yolo hashtag
lnkd.in
LinkedIn
This link will take you to a page that’s not on LinkedIn
👍4❤1🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
Introducing CoMotion, a project that detects and tracks detailed 3D poses of multiple people using a single monocular camera stream. This system maintains temporally coherent predictions in crowded scenes filled with difficult poses and occlusions, enabling online tracking through frames with high accuracy.
🔍 Key Features:
- Precise detection and tracking in crowded scenes
- Temporal coherence even with occlusions
- High accuracy in tracking multiple people over time
This project advances 3D human motion tracking by offering faster and more accurate tracking of multiple individuals compared to existing systems.
#AI #DeepLearning #3DTracking #ComputerVision #PoseEstimation
Please open Telegram to view this post
VIEW IN TELEGRAM
👍2🔥1
🎯 Trackers Library is Officially Released! 🚀
If you're working in computer vision and object tracking, this one's for you!
💡 Trackers is a powerful open-source library with support for a wide range of detection models and tracking algorithms:
✅ Plug-and-play compatibility with detection models from:
Roboflow Inference, Hugging Face Transformers, Ultralytics, MMDetection, and more!
✅ Tracking algorithms supported:
SORT, DeepSORT, and advanced trackers like StrongSORT, BoT‑SORT, ByteTrack, OC‑SORT – with even more coming soon!
🧩 Released under the permissive Apache 2.0 license – free for everyone to use and contribute.
👏 Huge thanks to Piotr Skalski for co-developing this library, and to Raif Olson and Onuralp SEZER for their outstanding contributions!
📌 Links:
🔗 GitHub
🔗 Docs
📚 Quick-start notebooks for SORT and DeepSORT are linked 👇🏻
https://www.linkedin.com/posts/skalskip92_trackers-library-is-out-plugandplay-activity-7321128111503253504-3U6-?utm_source=share&utm_medium=member_desktop&rcm=ACoAAEXwhVcBcv2n3wq8JzEai3TfWmKLRLTefYo
#ComputerVision #ObjectTracking #OpenSource #DeepLearning #AI
📡 By: https://yangx.top/DataScienceN
If you're working in computer vision and object tracking, this one's for you!
💡 Trackers is a powerful open-source library with support for a wide range of detection models and tracking algorithms:
✅ Plug-and-play compatibility with detection models from:
Roboflow Inference, Hugging Face Transformers, Ultralytics, MMDetection, and more!
✅ Tracking algorithms supported:
SORT, DeepSORT, and advanced trackers like StrongSORT, BoT‑SORT, ByteTrack, OC‑SORT – with even more coming soon!
🧩 Released under the permissive Apache 2.0 license – free for everyone to use and contribute.
👏 Huge thanks to Piotr Skalski for co-developing this library, and to Raif Olson and Onuralp SEZER for their outstanding contributions!
📌 Links:
🔗 GitHub
🔗 Docs
📚 Quick-start notebooks for SORT and DeepSORT are linked 👇🏻
https://www.linkedin.com/posts/skalskip92_trackers-library-is-out-plugandplay-activity-7321128111503253504-3U6-?utm_source=share&utm_medium=member_desktop&rcm=ACoAAEXwhVcBcv2n3wq8JzEai3TfWmKLRLTefYo
#ComputerVision #ObjectTracking #OpenSource #DeepLearning #AI
📡 By: https://yangx.top/DataScienceN
Linkedin
Trackers Library is Out! | Piotr Skalski
Trackers Library is Out! 🔥 🔥 🔥
- Plug‑and‑play integration with detectors from Transformers, Inference, Ultralytics, PaddlePaddle, MMDetection, and more.
- Built‑in support for SORT and DeepSORT today, with StrongSORT, BoT‑SORT, ByteTrack, OC‑SORT, and…
- Plug‑and‑play integration with detectors from Transformers, Inference, Ultralytics, PaddlePaddle, MMDetection, and more.
- Built‑in support for SORT and DeepSORT today, with StrongSORT, BoT‑SORT, ByteTrack, OC‑SORT, and…
👍4❤1🔥1
🚀 The new HQ-SAM (High-Quality Segment Anything Model) has just been added to the Hugging Face Transformers library!
This is an enhanced version of the original SAM (Segment Anything Model) introduced by Meta in 2023. HQ-SAM significantly improves the segmentation of fine and detailed objects, while preserving all the powerful features of SAM — including prompt-based interaction, fast inference, and strong zero-shot performance. That means you can easily switch to HQ-SAM wherever you used SAM!
The improvements come from just a few additional learnable parameters. The authors collected a high-quality dataset with 44,000 fine-grained masks from various sources, and impressively trained the model in just 4 hours using 8 GPUs — all while keeping the core SAM weights frozen.
The newly introduced parameters include:
* A High-Quality Token
* A Global-Local Feature Fusion mechanism
This work was presented at NeurIPS 2023 and still holds state-of-the-art performance in zero-shot segmentation on the SGinW benchmark.
📄 Documentation: https://lnkd.in/e5iDT6Tf
🧠 Model Access: https://lnkd.in/ehS6ZUyv
💻 Source Code: https://lnkd.in/eg5qiKC2
#ArtificialIntelligence #ComputerVision #Transformers #Segmentation #DeepLearning #PretrainedModels #ResearchAndDevelopment #AdvancedModels #ImageAnalysis #HQ_SAM #SegmentAnything #SAMmodel #ZeroShotSegmentation #NeurIPS2023 #AIresearch #FoundationModels #OpenSourceAI #SOTA
🌟https://yangx.top/DataScienceN
This is an enhanced version of the original SAM (Segment Anything Model) introduced by Meta in 2023. HQ-SAM significantly improves the segmentation of fine and detailed objects, while preserving all the powerful features of SAM — including prompt-based interaction, fast inference, and strong zero-shot performance. That means you can easily switch to HQ-SAM wherever you used SAM!
The improvements come from just a few additional learnable parameters. The authors collected a high-quality dataset with 44,000 fine-grained masks from various sources, and impressively trained the model in just 4 hours using 8 GPUs — all while keeping the core SAM weights frozen.
The newly introduced parameters include:
* A High-Quality Token
* A Global-Local Feature Fusion mechanism
This work was presented at NeurIPS 2023 and still holds state-of-the-art performance in zero-shot segmentation on the SGinW benchmark.
📄 Documentation: https://lnkd.in/e5iDT6Tf
🧠 Model Access: https://lnkd.in/ehS6ZUyv
💻 Source Code: https://lnkd.in/eg5qiKC2
#ArtificialIntelligence #ComputerVision #Transformers #Segmentation #DeepLearning #PretrainedModels #ResearchAndDevelopment #AdvancedModels #ImageAnalysis #HQ_SAM #SegmentAnything #SAMmodel #ZeroShotSegmentation #NeurIPS2023 #AIresearch #FoundationModels #OpenSourceAI #SOTA
🌟https://yangx.top/DataScienceN
lnkd.in
LinkedIn
This link will take you to a page that’s not on LinkedIn
❤2👍2🔥1
🔥Powerful Combo: Ultralytics YOLO11 + Sony Semicon | AITRIOS (Global) Platform + Raspberry Pi
We’ve recently updated our Sony IMX model export to fully support YOLO11n detection models! This means you can now seamlessly run YOLO11n models directly on Raspberry Pi AI Cameras powered by the Sony IMX500 sensor — making it even easier to develop advanced Edge AI applications. 💡
To test this new export workflow, I trained a model on the VisDrone dataset and exported it using the following command:
👉
🔍Benchmark results for YOLO11n on IMX500:✅ Inference Time: 62.50 ms✅ mAP50-95 (B): 0.644📌 Want to learn more about YOLO11 and Sony IMX500? Check it out here ➡️
https://docs.ultralytics.com/integrations/sony-imx500/
#EdgeAI#YOLO11#SonyIMX500#AITRIOS#ObjectDetection#RaspberryPiAI#ComputerVision#DeepLearning#OnDeviceAI#ModelDeployment
🌟https://yangx.top/DataScienceN
We’ve recently updated our Sony IMX model export to fully support YOLO11n detection models! This means you can now seamlessly run YOLO11n models directly on Raspberry Pi AI Cameras powered by the Sony IMX500 sensor — making it even easier to develop advanced Edge AI applications. 💡
To test this new export workflow, I trained a model on the VisDrone dataset and exported it using the following command:
👉
yolo export model=<path_to_drone_model> format=imx data=VisDrone.yaml🎥 The video below shows the result of this process!
🔍Benchmark results for YOLO11n on IMX500:✅ Inference Time: 62.50 ms✅ mAP50-95 (B): 0.644📌 Want to learn more about YOLO11 and Sony IMX500? Check it out here ➡️
https://docs.ultralytics.com/integrations/sony-imx500/
#EdgeAI#YOLO11#SonyIMX500#AITRIOS#ObjectDetection#RaspberryPiAI#ComputerVision#DeepLearning#OnDeviceAI#ModelDeployment
🌟https://yangx.top/DataScienceN
Ultralytics
SONY IMX500
Learn to export Ultralytics YOLO11 models to Sony's IMX500 format for efficient edge AI deployment on Raspberry Pi AI Camera with on-chip processing.
👍1🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
NVIDIA introduces GENMO, a unified generalist model for human motion that seamlessly combines motion estimation and generation within a single framework. GENMO supports conditioning on videos, 2D keypoints, text, music, and 3D keyframes, enabling highly versatile motion understanding and synthesis.
Currently, no official code release is available.
Review:
https://t.ly/Q5T_Y
Paper:
https://lnkd.in/ds36BY49
Project Page:
https://lnkd.in/dAYHhuFU
#NVIDIA #GENMO #HumanMotion #DeepLearning #AI #ComputerVision #MotionGeneration #MachineLearning #MultimodalAI #3DReconstruction
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3
Forwarded from Python | Machine Learning | Coding | R
LLM Interview Questions.pdf
71.2 KB
Top 50 LLM Interview Questions!
#LLM #AIInterviews #MachineLearning #DeepLearning #NLP #LLMInterviewPrep #ModelArchitectures #AITheory #TechInterviews #MLBasics #InterviewQuestions #LargeLanguageModels
✉️ Our Telegram channels: https://yangx.top/addlist/0f6vfFbEMdAwODBk📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
❤1
Forwarded from Python | Machine Learning | Coding | R
10 GitHub repos to build a career in AI engineering:
(100% free step-by-step roadmap)
1️⃣ ML for Beginners by Microsoft
A 12-week project-based curriculum that teaches classical ML using Scikit-learn on real-world datasets.
Includes quizzes, lessons, and hands-on projects, with some videos.
GitHub repo → https://lnkd.in/dCxStbYv
2️⃣ AI for Beginners by Microsoft
This repo covers neural networks, NLP, CV, transformers, ethics & more. There are hands-on labs in PyTorch & TensorFlow using Jupyter.
Beginner-friendly, project-based, and full of real-world apps.
GitHub repo → https://lnkd.in/dwS5Jk9E
3️⃣ Neural Networks: Zero to Hero
Now that you’ve grasped the foundations of AI/ML, it’s time to dive deeper.
This repo by Andrej Karpathy builds modern deep learning systems from scratch, including GPTs.
GitHub repo → https://lnkd.in/dXAQWucq
4️⃣ DL Paper Implementations
So far, you have learned the fundamentals of AI, ML, and DL. Now study how the best architectures work.
This repo covers well-documented PyTorch implementations of 60+ research papers on Transformers, GANs, Diffusion models, etc.
GitHub repo → https://lnkd.in/dTrtDrvs
5️⃣ Made With ML
Now it’s time to learn how to go from notebooks to production.
Made With ML teaches you how to design, develop, deploy, and iterate on real-world ML systems using MLOps, CI/CD, and best practices.
GitHub repo → https://lnkd.in/dYyjjBGb
6️⃣ Hands-on LLMs
- You've built neural nets.
- You've explored GPTs and LLMs.
Now apply them. This is a visually rich repo that covers everything about LLMs, like tokenization, fine-tuning, RAG, etc.
GitHub repo → https://lnkd.in/dh2FwYFe
7️⃣ Advanced RAG Techniques
Hands-on LLMs will give you a good grasp of RAG systems. Now learn advanced RAG techniques.
This repo covers 30+ methods to make RAG systems faster, smarter, and accurate, like HyDE, GraphRAG, etc.
GitHub repo → https://lnkd.in/dBKxtX-D
8️⃣ AI Agents for Beginners by Microsoft
After diving into LLMs and mastering RAG, learn how to build AI agents.
This hands-on course covers building AI agents using frameworks like AutoGen.
GitHub repo → https://lnkd.in/dbFeuznE
9️⃣ Agents Towards Production
The above course will teach what AI agents are. Next, learn how to ship them.
This is a practical playbook for building agents covering memory, orchestration, deployment, security & more.
GitHub repo → https://lnkd.in/dcwmamSb
🔟 AI Engg. Hub
To truly master LLMs, RAG, and AI agents, you need projects.
This covers 70+ real-world examples, tutorials, and agent app you can build, adapt, and ship.
GitHub repo → https://lnkd.in/geMYm3b6
(100% free step-by-step roadmap)
A 12-week project-based curriculum that teaches classical ML using Scikit-learn on real-world datasets.
Includes quizzes, lessons, and hands-on projects, with some videos.
GitHub repo → https://lnkd.in/dCxStbYv
This repo covers neural networks, NLP, CV, transformers, ethics & more. There are hands-on labs in PyTorch & TensorFlow using Jupyter.
Beginner-friendly, project-based, and full of real-world apps.
GitHub repo → https://lnkd.in/dwS5Jk9E
Now that you’ve grasped the foundations of AI/ML, it’s time to dive deeper.
This repo by Andrej Karpathy builds modern deep learning systems from scratch, including GPTs.
GitHub repo → https://lnkd.in/dXAQWucq
So far, you have learned the fundamentals of AI, ML, and DL. Now study how the best architectures work.
This repo covers well-documented PyTorch implementations of 60+ research papers on Transformers, GANs, Diffusion models, etc.
GitHub repo → https://lnkd.in/dTrtDrvs
Now it’s time to learn how to go from notebooks to production.
Made With ML teaches you how to design, develop, deploy, and iterate on real-world ML systems using MLOps, CI/CD, and best practices.
GitHub repo → https://lnkd.in/dYyjjBGb
- You've built neural nets.
- You've explored GPTs and LLMs.
Now apply them. This is a visually rich repo that covers everything about LLMs, like tokenization, fine-tuning, RAG, etc.
GitHub repo → https://lnkd.in/dh2FwYFe
Hands-on LLMs will give you a good grasp of RAG systems. Now learn advanced RAG techniques.
This repo covers 30+ methods to make RAG systems faster, smarter, and accurate, like HyDE, GraphRAG, etc.
GitHub repo → https://lnkd.in/dBKxtX-D
After diving into LLMs and mastering RAG, learn how to build AI agents.
This hands-on course covers building AI agents using frameworks like AutoGen.
GitHub repo → https://lnkd.in/dbFeuznE
The above course will teach what AI agents are. Next, learn how to ship them.
This is a practical playbook for building agents covering memory, orchestration, deployment, security & more.
GitHub repo → https://lnkd.in/dcwmamSb
To truly master LLMs, RAG, and AI agents, you need projects.
This covers 70+ real-world examples, tutorials, and agent app you can build, adapt, and ship.
GitHub repo → https://lnkd.in/geMYm3b6
#AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers
✉️ Our Telegram channels: https://yangx.top/addlist/0f6vfFbEMdAwODBk📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
❤3