Data Science Jupyter Notebooks
8.69K subscribers
85 photos
24 videos
9 files
202 links
Explore the world of Data Science through Jupyter Notebooks—insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
加入频道
🚀 FREE IT Study Kits for 2025 — Grab Yours Now!

Just found these zero-cost resources from SPOTO👇
Perfect if you're prepping for #Cisco, #AWS, #PMP, #AI, #Python, #Excel, or #Cybersecurity!
100% Free
No signup traps
Instantly downloadable

📘 IT Certs E-book: https://bit.ly/4fJSoLP
☁️ Cloud & AI Kits: https://bit.ly/3F3lc5B
📊 Cybersecurity, Python & Excel: https://bit.ly/4mFrA4g
🧠 Skill Test (Free!): https://bit.ly/3PoKH39
Tag a friend & level up together 💪

🌐 Join the IT Study Group: https://chat.whatsapp.com/E3Vkxa19HPO9ZVkWslBO8s
📲 1-on-1 Exam Help: https://wa.link/k0vy3x
👑Last 24 HOURS to grab Mid-Year Mega Sale prices!Don’t miss Lucky Draw👇
https://bit.ly/43VgcbT
10 GitHub repos to build a career in AI engineering:

(100% free step-by-step roadmap)

1️⃣ ML for Beginners by Microsoft

A 12-week project-based curriculum that teaches classical ML using Scikit-learn on real-world datasets.

Includes quizzes, lessons, and hands-on projects, with some videos.

GitHub repo → https://lnkd.in/dCxStbYv

2️⃣ AI for Beginners by Microsoft

This repo covers neural networks, NLP, CV, transformers, ethics & more. There are hands-on labs in PyTorch & TensorFlow using Jupyter.

Beginner-friendly, project-based, and full of real-world apps.

GitHub repo → https://lnkd.in/dwS5Jk9E

3️⃣ Neural Networks: Zero to Hero

Now that you’ve grasped the foundations of AI/ML, it’s time to dive deeper.

This repo by Andrej Karpathy builds modern deep learning systems from scratch, including GPTs.

GitHub repo → https://lnkd.in/dXAQWucq

4️⃣ DL Paper Implementations

So far, you have learned the fundamentals of AI, ML, and DL. Now study how the best architectures work.

This repo covers well-documented PyTorch implementations of 60+ research papers on Transformers, GANs, Diffusion models, etc.

GitHub repo → https://lnkd.in/dTrtDrvs

5️⃣ Made With ML

Now it’s time to learn how to go from notebooks to production.

Made With ML teaches you how to design, develop, deploy, and iterate on real-world ML systems using MLOps, CI/CD, and best practices.

GitHub repo → https://lnkd.in/dYyjjBGb

6️⃣ Hands-on LLMs

- You've built neural nets.
- You've explored GPTs and LLMs.

Now apply them. This is a visually rich repo that covers everything about LLMs, like tokenization, fine-tuning, RAG, etc.

GitHub repo → https://lnkd.in/dh2FwYFe

7️⃣ Advanced RAG Techniques

Hands-on LLMs will give you a good grasp of RAG systems. Now learn advanced RAG techniques.

This repo covers 30+ methods to make RAG systems faster, smarter, and accurate, like HyDE, GraphRAG, etc.

GitHub repo → https://lnkd.in/dBKxtX-D

8️⃣ AI Agents for Beginners by Microsoft

After diving into LLMs and mastering RAG, learn how to build AI agents.

This hands-on course covers building AI agents using frameworks like AutoGen.

GitHub repo → https://lnkd.in/dbFeuznE

9️⃣ Agents Towards Production

The above course will teach what AI agents are. Next, learn how to ship them.

This is a practical playbook for building agents covering memory, orchestration, deployment, security & more.

GitHub repo → https://lnkd.in/dcwmamSb

🔟 AI Engg. Hub

To truly master LLMs, RAG, and AI agents, you need projects.

This covers 70+ real-world examples, tutorials, and agent app you can build, adapt, and ship.

GitHub repo → https://lnkd.in/geMYm3b6

#AIEngineering #MachineLearning #DeepLearning #LLMs #RAG #MLOps #Python #GitHubProjects #AIForBeginners #ArtificialIntelligence #NeuralNetworks #OpenSourceAI #DataScienceCareers


✉️ Our Telegram channels: https://yangx.top/addlist/0f6vfFbEMdAwODBk

📱 Our WhatsApp channel: https://whatsapp.com/channel/0029VaC7Weq29753hpcggW2A
Please open Telegram to view this post
VIEW IN TELEGRAM
3
NUMPY FOR DS.pdf
4.5 MB
Let's start at the top...

NumPy contains a broad array of functionality for fast numerical & mathematical operations in Python

The core data-structure within #NumPy is an ndArray (or n-dimensional array)

Behind the scenes - much of the NumPy functionality is written in the programming language C

NumPy functionality is used in other popular #Python packages including #Pandas, #Matplotlib, & #scikitlearn!

✉️ Our Telegram channels: https://yangx.top/addlist/0f6vfFbEMdAwODBk
Please open Telegram to view this post
VIEW IN TELEGRAM
Topic: Python Script to Convert a Shared ChatGPT Link to PDF – Step-by-Step Guide

---

### Objective

In this lesson, we’ll build a Python script that:

• Takes a ChatGPT share link (e.g., https://chat.openai.com/share/abc123)
• Downloads the HTML content of the chat
• Converts it to a PDF file using pdfkit and wkhtmltopdf

This is useful for archiving, sharing, or printing ChatGPT conversations in a clean format.

---

### 1. Prerequisites

Before starting, you need the following libraries and tools:

#### • Install pdfkit and requests

pip install pdfkit requests


#### • Install wkhtmltopdf

Download from:
[https://wkhtmltopdf.org/downloads.html](https://wkhtmltopdf.org/downloads.html)

Make sure to add the path of the installed binary to your system PATH.

---

### 2. Python Script: Convert Shared ChatGPT URL to PDF

import pdfkit
import requests
import os

# Define output filename
output_file = "chatgpt_conversation.pdf"

# ChatGPT shared URL (user input)
chat_url = input("Enter the ChatGPT share URL: ").strip()

# Verify the URL format
if not chat_url.startswith("https://chat.openai.com/share/"):
print("Invalid URL. Must start with https://chat.openai.com/share/")
exit()

try:
# Download HTML content
response = requests.get(chat_url)
if response.status_code != 200:
raise Exception(f"Failed to load the chat: {response.status_code}")

html_content = response.text

# Save HTML to temporary file
with open("temp_chat.html", "w", encoding="utf-8") as f:
f.write(html_content)

# Convert HTML to PDF
pdfkit.from_file("temp_chat.html", output_file)

print(f"\n PDF saved as: {output_file}")

# Optional: remove temp file
os.remove("temp_chat.html")

except Exception as e:
print(f" Error: {e}")


---

### 3. Notes

• This approach works only if the shared page is publicly accessible (which ChatGPT share links are).
• The PDF output will contain the web page version, including theme and layout.
• You can customize the PDF output using pdfkit options (like page size, margins, etc.).

---

### 4. Optional Enhancements

• Add GUI with Tkinter
• Accept multiple URLs
• Add PDF metadata (title, author, etc.)
• Add support for offline rendering using BeautifulSoup to clean content

---

### Exercise

• Try converting multiple ChatGPT share links to PDF
• Customize the styling with your own CSS
• Add a timestamp or watermark to the PDF

---

#Python #ChatGPT #PDF #WebScraping #Automation #pdfkit #tkinter

https://yangx.top/CodeProgrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
8
📚 JaidedAI/EasyOCR — an open-source Python library for Optical Character Recognition (OCR) that's easy to use and supports over 80 languages out of the box.

### 🔍 Key Features:

🔸 Extracts text from images and scanned documents — including handwritten notes and unusual fonts
🔸 Supports a wide range of languages like English, Russian, Chinese, Arabic, and more
🔸 Built on PyTorch — uses modern deep learning models (not the old-school Tesseract)
🔸 Simple to integrate into your Python projects

### Example Usage:

import easyocr

reader = easyocr.Reader(['en', 'ru']) # Choose supported languages
result = reader.readtext('image.png')


### 📌 Ideal For:

Text extraction from photos, scans, and documents
Embedding OCR capabilities in apps (e.g. automated data entry)

🔗 GitHub: https://github.com/JaidedAI/EasyOCR

👉 Follow us for more: @DataScienceN

#Python #OCR #MachineLearning #ComputerVision #EasyOCR
2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🧹 ObjectClear — an AI-powered tool for removing objects from images effortlessly.

⚙️ What It Can Do:

🖼️ Upload any image
🎯 Select the object you want to remove
🌟 The model automatically erases the object and intelligently reconstructs the background

⚡️ Under the Hood:

— Uses Segment Anything (SAM) by Meta for object segmentation
— Leverages Inpaint-Anything for realistic background generation
— Works in your browser with an intuitive Gradio UI

✔️ Fully open-source and can be run locally.

📎 GitHub: https://github.com/zjx0101/ObjectClear

#AI #ImageEditing #ComputerVision #Gradio #OpenSource #Python
Please open Telegram to view this post
VIEW IN TELEGRAM
2🔥1
python-docx: Create and Modify Word Documents #python

python-docx is a Python library for reading, creating, and updating Microsoft Word 2007+ (.docx) files.

Installation
pip install python-docx

Example
from docx import Document

document = Document()
document.add_paragraph("It was a dark and stormy night.")
<docx.text.paragraph.Paragraph object at 0x10f19e760>
document.save("dark-and-stormy.docx")

document = Document("dark-and-stormy.docx")
document.paragraphs[0].text
'It was a dark and stormy night.'

https://yangx.top/DataScienceN 🚗
Please open Telegram to view this post
VIEW IN TELEGRAM
2👍2
🔥 Trending Repository: awesome-machine-learning-interpretability

📝 Description: A curated list of awesome responsible machine learning resources.

🔗 Repository URL: https://github.com/jphall663/awesome-machine-learning-interpretability

📖 Readme: https://github.com/jphall663/awesome-machine-learning-interpretability#readme

📊 Statistics:
🌟 Stars: 3.8K stars
👀 Watchers: 132
🍴 Forks: 608 forks

💻 Programming Languages: Not available

🏷️ Related Topics:
#python #data_science #machine_learning #awesome #r #awesome_list #transparency #fairness #ai_safety #privacy_enhancing_technologies #interpretability #interpretable_ai #interpretable_ml #explainable_ml #xai #interpretable_machine_learning #privacy_preserving_machine_learning #machine_learning_interpretability #secure_ml #reliable_ai


==================================
🧠 By: https://yangx.top/DataScienceN
🔥 Trending Repository: awesome-learning

📝 Description: A curated lists of awesome learning resources for a Software Test Automation Engineer

🔗 Repository URL: https://github.com/mfaisalkhatri/awesome-learning

📖 Readme: https://github.com/mfaisalkhatri/awesome-learning#readme

📊 Statistics:
🌟 Stars: 1.1K stars
👀 Watchers: 36
🍴 Forks: 198 forks

💻 Programming Languages: Not available

🏷️ Related Topics:
#github #javascript #python #docker #programming_language #learning #devops #awesome #learning_path #tdd #continuous_integration #continuous_delivery #unicorns #test_automation #awesome_list #cicd #testautomation #software_testing #hacktoberfest


==================================
🧠 By: https://yangx.top/DataScienceN
🔥 Trending Repository: coding-problems

📝 Description: Solutions for various coding/algorithmic problems and many useful resources for learning algorithms and data structures

🔗 Repository URL: https://github.com/MTrajK/coding-problems

📖 Readme: https://github.com/MTrajK/coding-problems#readme

📊 Statistics:
🌟 Stars: 3.3K stars
👀 Watchers: 85
🍴 Forks: 625 forks

💻 Programming Languages: Python

🏷️ Related Topics:
#python #education #algorithms #leetcode #interview #data_structures #learn #problem_solving #coding_problem


==================================
🧠 By: https://yangx.top/DataScienceN
🔥 Trending Repository: DevOps-Roadmap

📝 Description: DevOps Roadmap for 2025. with learning resources

🔗 Repository URL: https://github.com/milanm/DevOps-Roadmap

🌐 Website: https://newsletter.techworld-with-milan.com/

📖 Readme: https://github.com/milanm/DevOps-Roadmap#readme

📊 Statistics:
🌟 Stars: 16.7K stars
👀 Watchers: 258
🍴 Forks: 2.8K forks

💻 Programming Languages: Not available

🏷️ Related Topics:
#python #go #linux #docker #kubernetes #computer_science #aws #devops #roadmap #jira #continuous_integration #azure #grafana #prometheus #sre #study_plan #continous_delivery #developer_roadmap #devops_roadmap


==================================
🧠 By: https://yangx.top/DataScienceN