Opensource by Reddit – Telegram
Opensource by Reddit
21 subscribers
5 photos
2 videos
9.61K links
Reddit's ♨️ take on Open Source Technology.

Join the discussion ➡️ @opensource_chats

Channel Inquiries ➡️ @group_contacts_bot

👄 TIPS ➡️➡️➡️ https://news.1rj.ru/str/addlist/mB9fRZOHTUk5ZjZk

🌈 made possible by
@reddit2telegram
@r_channels
Download Telegram
I was tired of dealing with image-based subnoscripts, so I built Subnoscript Forge, a cross-platform tool to extract and convert them to SRT.

Hey everyone,




  Like many of you who manage a media library, I often run into video files with embedded image-based subnoscripts (like PGS for Blu-rays or VobSub for DVDs). Getting those

  into the universally compatible .srt format was always a hassle, requiring multiple tools and steps.




  To solve this for myself, I created Subnoscript Forge, a desktop application for macOS, and Linux that makes the process much simpler.




  It's a tool with both a GUI and a CLI, but the main features of the GUI version are:




   * Extract & Convert: Pulls subnoscripts directly from MKV files.

   * OCR for Image Subnoscripts: Converts PGS (SUP) and VobSub (SUB/IDX) subnoscripts into text-based SRT files using OCR. It also handles ASS/SSA to SRT conversion.

   * Batch Processing: You can load a video file and process multiple subnoscript tracks at once.

   * Insert Subnoscripts: You can also use it to add an external SRT file back into an MKV.

   * Modern GUI: It has a clean, simple drag-and-drop interface, progress bars with time estimates, and dark theme support.




  The app is built with Go and the Fyne (https://fyne.io/) toolkit for the cross-platform GUI. It's open-source, and I'm hoping to get some feedback from the community to

  make it even better.




  You can check it out, see screenshots, and find the installation instructions over on GitHub:




  **https://github.com/VenimK/Subnoscript-Forge**




  I'd love to hear what you think! Let me know if you have any questions or suggestions.

https://redd.it/1mqpmkb
@r_opensource
Shark WebAuthn library for .NET

Hello everyone,

Over the past few months, I have been working on a server-side implementation of the WebAuthn standard for .NET as an alternative to existing solutions.

You can check out the project here: https://github.com/linuxchata/fido2

I’d love to hear what you think. Do you see any areas for improvement? Are there features you’d like to see added? Any kind of feedback, advice, or questions are appreciated.

Thanks in advance!

https://redd.it/1mqsw2p
@r_opensource
Build the buddy that gets you! We open-sourced a complete AI voice interaction system!

Hey everyone, we just open-sourced Buddie: a complete, AI-powered voice interaction system we built from the ground up, so you can create your own AI buddy.

It's a full-stack platform for developers, hackers, and students, including custom hardware, firmware, and a mobile app. Therefore, you can use our solution to create various forms of AI devices, such as earphones, speakers, bracelets, toys, or desktop ornaments.

What it can do:

Live transcribe & summarize meetings, calls, or in-person chats.
Get real-time hints during conversations .
Talk to LLMs completely hands-free.
Context-aware help without needing to repeat yourself.

We've put everything on GitHub, including docs, to get you started. We're just getting started and would love to hear your ideas, questions, or even wild feature requests. Let us know what you think!

https://redd.it/1mqxxjs
@r_opensource
Anyone else got charged a few cents by GitHub for an open-source repo?

I just noticed something odd and wanted to check if it’s only me.

On July 27, 2025, I opened a support ticket with GitHub after receiving an invoice that showed my public open-source repository being billed under “metered” usage. From what I understand, public repos shouldn’t trigger these charges.

I only got a reply on August 12, and the next day they explained it was a bug: some users were charged a couple of cents for metered billing products, even when they shouldn’t have been. They reversed the charge and said they’re working on a fix.

That’s fine — but now I’m wondering: how many other people saw a tiny $0.02 or $0.03 charge and didn’t bother contacting support?

Has anyone else here noticed small, unexpected charges for public repos recently?

https://redd.it/1mqz5sr
@r_opensource
Project: Unstructored -> structured

I’m building an open-source AI Agent that converts messy, unstructured documents into clean, structured data.

The idea is simple:

You upload multiple documents — invoices, purchase orders, contracts, medical reports, etc. — and get back structured data (CSV tables) so you can visualize and work with your information more easily.

Here’s the approach I’m testing:

1. inference_schema

A vLLM analyzes your documents and suggests the best JSON schema for them — regardless of the document type.
This schema acts as the “official” structure for all files in the batch.

2. invoice_data_capture

A specialized LLM maps the extracted fields strictly to the schema.
For each uploaded document, it returns something like this, always following the same structure:

>

3. generate_csv

Once all documents are structured in JSON, another specialized LLM (with tools like Pandas) designs CSV tables to clearly present the extracted data.

💬 What do you think about this approach? All feedback is welcome

https://redd.it/1mr9rms
@r_opensource
Open-Source Civic Framework – Looking for Collaborators & Review

Open-source governance toolkit — modular, forkable, and maybe just a little bit sci-fi. Want to help shape it?



I’ve published the **first draft** of an open-source civic framework called **Constella**. It’s intended as a **modular governance toolkit** for communities, blending practical civic processes with some creative concepts (cosmic citizenship, AI companions).



GitHub repo:

📄 [Constella Framework – GitHub](https://github.com/Nightmarejam/constella-framework)



Looking for:



* Code review & contribution
* Ideas for modular features
* Advice on making the repo more contributor-friendly



https://redd.it/1mrdc52
@r_opensource
I needed an efficient way to convert 5tb of unstructured html into dictionaries using just my laptop, so I wrote doc2dict.

I'm the developer of an open source package to work with SEC data. It turns out the SEC has 5tb of html. This data is visually standardized to humans, but under the hood is a mess of different tags and css.

There are a couple existing solutions for parsing html, but they usually involve a combination of LLMs and OCR, which is slow and expensive. So, I decided to write a flexible, algorithmic solution: doc2dict.

Installation

pip install doc2dict

User interface

dct = html2dict(content,mappingdict=None) # converts content to dictionary
visualize
dict(dct) # visualizes the dictionary using your browser.

Note: I don't use this UI much, as I mostly use it via my SEC package. Docs

# Architecture

1. Iterate through DOM and via inheritance get characteristics such as bold, visual height, italics, etc for text on same line (e.g. within a block) to create instructions, e.g.[{'text': 'BOARD MEETINGS', 'all_caps': True, 'bold': True, 'font-size': 15.995999999999999}]
2. Use a rule set to determine how to convert instructions into a nested dictionary. This is customizable. For example, the mapping dict below tells the parser that 'items' should be nested under 'parts', in addition to the default rules.

​

tenkmappingdict = {
('part',r'^part\s([ivx]+)$') : 0,
('signatures',r'^signatures?\.
$') : 0,
('item',r'^item\s(\d+)') : 1,
}

Note: This approach kinda works for modern pdfs. The text stream is often in the order a human would view as correct, so this kinda works. I've added the functionality to doc2dict, but it's in an early stage. (AKA, it sucks).

# Benchmarks

Benchmarks vary as I update the package w.r.t. to features (tables are slow!). Via my laptop:

500 pages per second single threaded
5,000 pages per second multi threaded

# Links

doc2dict GitHub
[raw html](https://html-preview.github.io/?url=https://raw.githubusercontent.com/john-friedman/doc2dict/refs/heads/main/example_output/html/msft_10k_2024.html#:~:text=embracing)
dictionary visualization (old)
[instructions visualization](https://html-preview.github.io/?url=https://github.com/john-friedman/doc2dict/blob/main/example_output/html/instructions_visualization.html) (old)
dictionary (old)

https://redd.it/1mrbkno
@r_opensource
Best practice for including third-party licenses in an OSS library?

I built a public library that’s MIT-licensed (the license is in a LICENSE file).
The package uses some third-party code, each with its own license.

I’m trying to figure out the standard way to include those third-party licenses in my repo:

Add them directly to my LICENSE file?

Create a separate file like THIRDPARTYLICENSES or NOTICE?


Also, when someone uses my package, do they need to include all these third-party licenses in their app?

One concern: I’ve noticed that some app license generators only pull the main LICENSE file of each dependency, so if third-party licenses are in a separate file, they might be missed. How do you handle this?

My library has 300k downloads a month, and I think it’s time to fix this in the best way.

Currently I only have in the readme a section with links to the third party code that I use with their license type.

Thanks

https://redd.it/1mrep4m
@r_opensource
Seeking code review for open source Canadian shopping extension before launch

Built a browser extension for Canadian e-commerce (keeping details light until launch). Looking for a code review from experienced developers

Stack is TypeScript + Vue. Considering the Canadian angle, this might interest Canadian devs, but would welcome feedback from anyone

Send a DM for the repo link

Thanks

https://redd.it/1mrimzr
@r_opensource
Need Contributors for PairPay

Need a contributor to add a feature for PairPay

PairPay uses:

1. React Native
2. react-native-reanimated
3. expo
4. supabase

The feature is about adding a chart for customers to see their data on a chart. The chart can show data how much they owe in which currencies and how much they are owed and in which currency.

If you would like to be part of this project DM.
https://play.google.com/store/apps/details?id=com.alisinayousofi.greenred

https://redd.it/1mrnl2l
@r_opensource
Rust Utility for Managing PATH

✦ Global Path Add - Rust Utility for Managing PATH



I've built a Rust utility that permanently adds directories to your PATH environment variable across different shell environments.



What it does:

Makes persistent PATH changes that apply to all new terminal sessions, unlike temporary solutions.



Current status (Pre-Alpha):

\- Works with Bash shell

\- ⚠️ Fish shell support semi-implemented (files created but not fully functional)

\- ⚠️ Only works with absolute paths

\- ⚠️ Not thoroughly tested - use at your own risk!



Usage:



1 global_path_add /absolute/path/to/directory



Why I'm sharing:

This is my first Rust project and I'm looking for feedback and contributors to help improve it. I need help with:

\- Completing Fish shell support

\- Support for other shells

\- Better error handling

\- Unit tests

\- Code refactoring



Licensed under MIT. Any feedback or contributions would be greatly appreciated!



GitHub: https://github.com/streamtechteam/global\_path\_add

What do you think? Would you find this useful?

https://redd.it/1mrplcl
@r_opensource
🛡️ Find security pitfalls fast: heuristics + local AI (StarCoder2‑3B) — NeuralScan

\- 💻 Lightweight desktop code scanner with a minimal GUI. Fast heuristics + optional on-device AI explanations.

\- 🧭 What it flags: command exec, unsafe deserialization, weak crypto (MD5/SHA1/DES), destructive FS, secrets, network IOCs. Works on common source/configs (e.g., .py/.sh/Dockerfile).

\- 🤖 AI: bigcode/starcoder2‑3b via HF Transformers; local-only, with deterministic fallback when AI isn’t available.

\- 🐳 Optional Trivy integration (Docker) for dependency scanning. Safe degradation if Docker is off.

\- 📊 Outputs a security score, risk categories (with severity weighting), and keeps recent scan history locally.

\- 🧰 Cross‑platform (Linux/Win/macOS), Python 3.9+, MIT.

GitHub

https://redd.it/1mrteh0
@r_opensource
What are some cool open source projects where I can contribute ?

I am a full stack developer having 1.5 YOE but no projects in my resume, so it gets rejected everytime.

My skillset -
- Javanoscript
- Typenoscript
- Nodejs
- Nestjs
- ReactJS
- Postgres & Mongodb
- Sequelize & Momgoose
- Docker

I am more interested in backend.
Any help would be appreciated

Thanks in adv.


https://redd.it/1mrteef
@r_opensource