Data Science Jupyter Notebooks – Telegram
Data Science Jupyter Notebooks
11.7K subscribers
289 photos
43 videos
9 files
847 links
Explore the world of Data Science through Jupyter Notebooks—insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
Download Telegram
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

element = WebDriverWait(driver, 10).until(
EC.presence_of_element_located((By.ID, "myDynamicElement"))
)

• Get the page source after JavaScript has executed.
dynamic_html = driver.page_source

• Close the browser window.
driver.quit()


VII. Common Tasks & Best Practices

• Handle pagination by finding the "Next" link.
next_page_url = soup.find('a', text='Next')['href']

• Save data to a CSV file.
import csv
with open('data.csv', 'w', newline='', encoding='utf-8') as f:
writer = csv.writer(f)
writer.writerow(['Title', 'Link'])
# writer.writerow([noscript, url]) in a loop

• Save data to CSV using pandas.
import pandas as pd
df = pd.DataFrame(data, columns=['Title', 'Link'])
df.to_csv('data.csv', index=False)

• Use a proxy with requests.
proxies = {'http': 'http://10.10.1.10:3128', 'https': 'http://10.10.1.10:1080'}
requests.get('http://example.com', proxies=proxies)

• Pause between requests to be polite.
import time
time.sleep(2) # Pause for 2 seconds

• Handle JSON data from an API.
json_response = requests.get('https://api.example.com/data').json()

• Download a file (like an image).
img_url = 'http://example.com/image.jpg'
img_data = requests.get(img_url).content
with open('image.jpg', 'wb') as handler:
handler.write(img_data)

• Parse a sitemap.xml to find all URLs.
# Get the sitemap.xml file and parse it like any other XML/HTML to extract <loc> tags.


VIII. Advanced Frameworks (Scrapy)

• Create a Scrapy spider (conceptual command).
scrapy genspider example example.com

• Define a parse method to process the response.
# In your spider class:
def parse(self, response):
# parsing logic here
pass

• Extract data using Scrapy's CSS selectors.
noscripts = response.css('h1::text').getall()

• Extract data using Scrapy's XPath selectors.
links = response.xpath('//a/@href').getall()

• Yield a dictionary of scraped data.
yield {'noscript': response.css('noscript::text').get()}

• Follow a link to parse the next page.
next_page = response.css('li.next a::attr(href)').get()
if next_page is not None:
yield response.follow(next_page, callback=self.parse)

• Run a spider from the command line.
scrapy crawl example -o output.json

• Pass arguments to a spider.
scrapy crawl example -a category=books

• Create a Scrapy Item for structured data.
import scrapy
class ProductItem(scrapy.Item):
name = scrapy.Field()
price = scrapy.Field()

• Use an Item Loader to populate Items.
from scrapy.loader import ItemLoader
loader = ItemLoader(item=ProductItem(), response=response)
loader.add_css('name', 'h1.product-name::text')


#Python #WebScraping #BeautifulSoup #Selenium #Requests

━━━━━━━━━━━━━━━
By: @DataScienceN
3
🔥 Trending Repository: nocobase

📝 Denoscription: NocoBase is the most extensible AI-powered no-code/low-code platform for building business applications and enterprise solutions.

🔗 Repository URL: https://github.com/nocobase/nocobase

🌐 Website: https://www.nocobase.com

📖 Readme: https://github.com/nocobase/nocobase#readme

📊 Statistics:
🌟 Stars: 17.7K stars
👀 Watchers: 147
🍴 Forks: 2K forks

💻 Programming Languages: TypeScript - JavaScript - Smarty - Shell - Dockerfile - Less

🏷️ Related Topics:
#internal_tools #crud #crm #admin_dashboard #self_hosted #web_application #project_management #salesforce #developer_tools #airtable #workflows #low_code #no_code #app_builder #internal_tool #nocode #low_code_development_platform #no_code_platform #low_code_platform #low_code_framework


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: alertmanager

📝 Denoscription: Prometheus Alertmanager

🔗 Repository URL: https://github.com/prometheus/alertmanager

🌐 Website: https://prometheus.io

📖 Readme: https://github.com/prometheus/alertmanager#readme

📊 Statistics:
🌟 Stars: 7.3K stars
👀 Watchers: 166
🍴 Forks: 2.3K forks

💻 Programming Languages: Go - Elm - HTML - Makefile - TypeScript - JavaScript

🏷️ Related Topics:
#notifications #slack #monitoring #email #pagerduty #alertmanager #hacktoberfest #deduplication #opsgenie


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: gopeed

📝 Denoscription: A modern download manager that supports all platforms. Built with Golang and Flutter.

🔗 Repository URL: https://github.com/GopeedLab/gopeed

🌐 Website: https://gopeed.com

📖 Readme: https://github.com/GopeedLab/gopeed#readme

📊 Statistics:
🌟 Stars: 21K stars
👀 Watchers: 167
🍴 Forks: 1.5K forks

💻 Programming Languages: Dart - Go - C++ - CMake - Swift - Ruby

🏷️ Related Topics:
#android #windows #macos #golang #http #ios #torrent #downloader #debian #bittorrent #cross_platform #ubuntu #https #flutter #magnet


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: vertex-ai-creative-studio

📝 Denoscription: GenMedia Creative Studio is a Vertex AI generative media user experience highlighting the use of Imagen, Veo, Gemini 🍌, Gemini TTS, Chirp 3, Lyria and other generative media APIs on Google Cloud.

🔗 Repository URL: https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio

📖 Readme: https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio#readme

📊 Statistics:
🌟 Stars: 512 stars
👀 Watchers: 19
🍴 Forks: 200 forks

💻 Programming Languages: Jupyter Notebook - Python - TypeScript - Go - JavaScript - Shell

🏷️ Related Topics:
#google_cloud #gemini #chirp #imagen #veo #lyria #vertex_ai #nano_banana


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: Parabolic

📝 Denoscription: Download web video and audio

🔗 Repository URL: https://github.com/NickvisionApps/Parabolic

🌐 Website: https://flathub.org/apps/details/org.nickvision.tubeconverter

📖 Readme: https://github.com/NickvisionApps/Parabolic#readme

📊 Statistics:
🌟 Stars: 4.1K stars
👀 Watchers: 28
🍴 Forks: 188 forks

💻 Programming Languages: C++ - CMake - Python - Inno Setup - C - CSS

🏷️ Related Topics:
#music #windows #downloader #youtube #qt #cpp #youtube_dl #gnome #videos #flathub #gtk4 #yt_dlp #libadwaita


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: localstack

📝 Denoscription: 💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

🔗 Repository URL: https://github.com/localstack/localstack

🌐 Website: https://localstack.cloud

📖 Readme: https://github.com/localstack/localstack#readme

📊 Statistics:
🌟 Stars: 61.1K stars
👀 Watchers: 514
🍴 Forks: 4.3K forks

💻 Programming Languages: Python - Shell - Makefile - ANTLR - JavaScript - Java

🏷️ Related Topics:
#python #testing #aws #cloud #continuous_integration #developer_tools #localstack


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: go-sdk

📝 Denoscription: The official Go SDK for Model Context Protocol servers and clients. Maintained in collaboration with Google.

🔗 Repository URL: https://github.com/modelcontextprotocol/go-sdk

📖 Readme: https://github.com/modelcontextprotocol/go-sdk#readme

📊 Statistics:
🌟 Stars: 2.7K stars
👀 Watchers: 39
🍴 Forks: 249 forks

💻 Programming Languages: Go

🏷️ Related Topics: Not available

==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: rachoon

📝 Denoscription: 🦝 Rachoon — A self-hostable way to handle invoices

🔗 Repository URL: https://github.com/ad-on-is/rachoon

📖 Readme: https://github.com/ad-on-is/rachoon#readme

📊 Statistics:
🌟 Stars: 292 stars
👀 Watchers: 4
🍴 Forks: 14 forks

💻 Programming Languages: TypeScript - Vue - HTML - SCSS - Dockerfile - JavaScript - Shell

🏷️ Related Topics: Not available

==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: Kotatsu

📝 Denoscription: Manga reader for Android

🔗 Repository URL: https://github.com/KotatsuApp/Kotatsu

🌐 Website: https://kotatsu.app

📖 Readme: https://github.com/KotatsuApp/Kotatsu#readme

📊 Statistics:
🌟 Stars: 7.2K stars
👀 Watchers: 72
🍴 Forks: 366 forks

💻 Programming Languages: Kotlin

🏷️ Related Topics:
#android #manga #comics #mangareader #manga_reader #webtoon


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: ggml

📝 Denoscription: Tensor library for machine learning

🔗 Repository URL: https://github.com/ggml-org/ggml

📖 Readme: https://github.com/ggml-org/ggml#readme

📊 Statistics:
🌟 Stars: 13.4K stars
👀 Watchers: 141
🍴 Forks: 1.4K forks

💻 Programming Languages: C++ - C - Cuda - Metal - GLSL - CMake

🏷️ Related Topics:
#machine_learning #automatic_differentiation #tensor_algebra #large_language_models


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: asm-lessons

📝 Denoscription: FFMPEG Assembly Language Lessons

🔗 Repository URL: https://github.com/FFmpeg/asm-lessons

📖 Readme: https://github.com/FFmpeg/asm-lessons#readme

📊 Statistics:
🌟 Stars: 9.7K stars
👀 Watchers: 153
🍴 Forks: 288 forks

💻 Programming Languages: Not available

🏷️ Related Topics: Not available

==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
1
🔥 Trending Repository: lima

📝 Denoscription: Linux virtual machines, with a focus on running containers

🔗 Repository URL: https://github.com/lima-vm/lima

🌐 Website: https://lima-vm.io/

📖 Readme: https://github.com/lima-vm/lima#readme

📊 Statistics:
🌟 Stars: 18.4K stars
👀 Watchers: 83
🍴 Forks: 722 forks

💻 Programming Languages: Go - Shell - Makefile - Perl - HTML - SCSS

🏷️ Related Topics:
#macos #vm #qemu #containerd


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: mcp

📝 Denoscription: AWS MCP Servers — helping you get the most out of AWS, wherever you use MCP.

🔗 Repository URL: https://github.com/awslabs/mcp

🌐 Website: https://awslabs.github.io/mcp/

📖 Readme: https://github.com/awslabs/mcp#readme

📊 Statistics:
🌟 Stars: 7K stars
👀 Watchers: 68
🍴 Forks: 1K forks

💻 Programming Languages: Python - Shell - Dockerfile - HTML - TypeScript - Jinja

🏷️ Related Topics:
#aws #mcp #mcp_servers #mcp_server #modelcontextprotocol #mcp_client #mcp_tools #mcp_host #mcp_clients


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: strix

📝 Denoscription: Open-source AI hackers for your apps 👨🏻‍💻

🔗 Repository URL: https://github.com/usestrix/strix

🌐 Website: https://usestrix.com/

📖 Readme: https://github.com/usestrix/strix#readme

📊 Statistics:
🌟 Stars: 3K stars
👀 Watchers: 38
🍴 Forks: 394 forks

💻 Programming Languages: Python - Jinja - Dockerfile

🏷️ Related Topics:
#artificial_intelligence #cybersecurity #penetration_testing #agents #llm #generative_ai


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: frigate

📝 Denoscription: NVR with realtime local object detection for IP cameras

🔗 Repository URL: https://github.com/blakeblackshear/frigate

🌐 Website: https://frigate.video

📖 Readme: https://github.com/blakeblackshear/frigate#readme

📊 Statistics:
🌟 Stars: 26.8K stars
👀 Watchers: 218
🍴 Forks: 2.5K forks

💻 Programming Languages: TypeScript - Python - CSS - Shell - Dockerfile - JavaScript

🏷️ Related Topics:
#home_automation #mqtt #ai #camera #rtsp #tensorflow #nvr #realtime #home_assistant #homeautomation #object_detection #google_coral


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: gumroad

📝 Denoscription: Sell stuff and see what sticks

🔗 Repository URL: https://github.com/antiwork/gumroad

🌐 Website: https://gumroad.com

📖 Readme: https://github.com/antiwork/gumroad#readme

📊 Statistics:
🌟 Stars: 7.4K stars
👀 Watchers: 50
🍴 Forks: 1.4K forks

💻 Programming Languages: Ruby - TypeScript - HTML - SCSS - Shell - JavaScript

🏷️ Related Topics: Not available

==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: code-server

📝 Denoscription: VS Code in the browser

🔗 Repository URL: https://github.com/coder/code-server

🌐 Website: https://coder.com

📖 Readme: https://github.com/coder/code-server#readme

📊 Statistics:
🌟 Stars: 74.6K stars
👀 Watchers: 734
🍴 Forks: 6.3K forks

💻 Programming Languages: TypeScript - Shell - HTML - CSS - HCL - JavaScript

🏷️ Related Topics:
#ide #vscode #development_environment #remote_work #dev_tools #browser_ide #vscode_remote


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: dbeaver

📝 Denoscription: Free universal database tool and SQL client

🔗 Repository URL: https://github.com/dbeaver/dbeaver

🌐 Website: https://dbeaver.io

📖 Readme: https://github.com/dbeaver/dbeaver#readme

📊 Statistics:
🌟 Stars: 46.1K stars
👀 Watchers: 527
🍴 Forks: 3.9K forks

💻 Programming Languages: Java - C++ - ANTLR - CSS - HTML - XSLT

🏷️ Related Topics:
#mysql #java #gui #sql #database #ai #nosql #jdbc #sqlite #postgresql #oracle #openai #dbeaver #erd #redshift #db2 #sqlserver #copilot


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: umami

📝 Denoscription: Umami is a modern, privacy-focused alternative to Google Analytics.

🔗 Repository URL: https://github.com/umami-software/umami

🌐 Website: https://umami.is

📖 Readme: https://github.com/umami-software/umami#readme

📊 Statistics:
🌟 Stars: 31.8K stars
👀 Watchers: 128
🍴 Forks: 5.8K forks

💻 Programming Languages: TypeScript - JavaScript

🏷️ Related Topics:
#charts #statistics #analytics #google_analytics #web_analytics


==================================
🧠 By: https://news.1rj.ru/str/DataScienceM
🔥 Trending Repository: GulfOfMexico

📝 Denoscription: perfect programming language

🔗 Repository URL: https://github.com/TodePond/GulfOfMexico

🌐 Website: https://dreamberd.computer

📖 Readme: https://github.com/TodePond/GulfOfMexico#readme

📊 Statistics:
🌟 Stars: 12.9K stars
👀 Watchers: 45
🍴 Forks: 446 forks

💻 Programming Languages: Not available

🏷️ Related Topics: Not available

==================================
🧠 By: https://news.1rj.ru/str/DataScienceM