-
1.
FlashML-org/flashlib ⭐ 383
Fast and memory-efficient classical machine learning operators
-
2.
Convert Google Gemini web into OpenAI-compatible API. Zero auth, cross-platform, single file.
-
3.
Hyperliquid trading bot, perp trading bot, Hyperliquid profitable trading bot.
-
4.
aimili-vpngate is a proxy tool that uses vpngate.net to give Linux outbound traffic a clean IP.
-
5.
DeltaForce OBS Lockhead Plugin: smart aim assist via OBS render injection, with QQ Music/NetEase Cloud integration. Precise bone recognition, smooth aimbot, recoil suppression, stable anti-cheat bypass.
-
6.
-p-e-w- on /r/LocalLLaMA
-
7.
AI tool for generating uncensored images and videos (18+)
-
8.
veryyoldman/Genspark-AI ⭐ 262
Genspark AI open-source, self-hosted Super Agent. Free alternative to Genspark.ai with multi-agent orchestration, deep research, Sparkpages, AI slides & sheets, image generation and 80+ tools. One-command Windows install. Run locally with any LLM (OpenAI, Anthropic, Gemini, Ollama)
-
9.
leochlon/ntkmirror ⭐ 247
-
10.
Powerful upscaling and frame generation tool with LSFG technology for sharper visuals and higher FPS. One-command install.
-
11.
HumanDrone8721 on /r/LocalLLaMA
-
12.
henliveira/av-curator ⭐ 226
Audio-visual data curation pipeline — scene cuts, silence trim, dedup, CLIP/Whisper filtering for messy web video.
-
13.
bandyah/uni-mm-trainer ⭐ 223
A small library for training multimodal LLMs combining text, vision, and audio
-
14.
Probe and compare the prosody (pitch / energy / duration) of TTS outputs.
-
15.
GordenSun/GordenPPTSkill ⭐ 206
AI-friendly PPT builder skill: 17 hand-polished Chinese PPTX templates + non-destructive text-only editing tools (python-pptx based). Pick a template, write edits.json, build a real .pptx with the layout intact. Personal/research use only.
-
16.
Content-aware frame sampling strategies for video-LLMs.
-
17.
Benchmark for LLM-based ASR n-best rescoring (ngram, neural-LM, MLM-PLL, LLM-prompt strategies).
-
18.
Spatial-VQA-Bench: a focused benchmark of spatial visual reasoning for multimodal LLMs.
-
19.
cortsdine/LightVLM ⭐ 201
Efficient inference toolkit for vision-language models: KV-cache compression, INT4/INT8 quantization, and visual token pruning.
- 20.
-
21.
openbmb/MiniCPM5-1B 🤗 194
We are releasing **MiniCPM5-1B**, the first model in the **MiniCPM5** series.
-
22.
edmicho/mm-probe-kit ⭐ 189
A small, hackable toolkit for probing multimodal LLMs — attention, hidden states, alignment, and causal tracing.
-
23.
Training and evaluation toolkit for audio-visual contrastive representation alignment (CLIP-style, but for audio + video).
-
24.
kepengxu/PRISM-VL ⭐ 180
PRISM-VL studies measurement-grounded VLM learning with RAW-derived Meas.-XYZ inputs, camera-conditioned grounding, and exposure-bracketed supervision transfer.
-
25.
quarqlabs/agent-oss ⭐ 177
A recursive evidence-gated cognitive runtime for memory-native AI agents, combining hybrid retrieval, temporal reasoning, async learning, and plug-and-play tools.
-
26.
TX-Leo/HumanEgo ⭐ 163
HumanEgo: Zero-Shot Robot Learning from Minutes of Human Egocentric Videos
-
27.
LiquidAI/LFM2.5-8B-A1B 🤗 155
LFM2.
-
28.
MackThax on /r/LocalLLaMA
-
29.
UPDATE! 5/28/26 **Z-Engineer V6 in training now!** The **Z-Engineer** returns — now with a PhD in "not being mid.
- 30.
- 31.
-
32.
Orange2019220/ReluPruner ⭐ 139
-
33.
Open-source CLI for semantic Taiwan legal judgment retrieval. Search judgments, package them for your own AI (ChatGPT/Claude/Gemini), and run a bundle-level citation check. Bring your own LLM; retrieval-only.
- 34.
-
35.
-
36.
## Table of Contents - Open-source list - Models - Data - Engineering Solutions - Introduction - Chat with MOSS - GPU Requirements - Installation - Try MOSS - Fine-tuning MOSS -...
-
37.
pmv143 on /r/LocalLLaMA
-
38.
ScenemaAI/scenema-audio 🤗 128
Zero-shot expressive voice cloning and speech generation.
-
39.
## Table of Contents - Open-source list - Models - Data - Engineering Solutions - Introduction - Chat with MOSS - GPU Requirements - Installation - Try MOSS - Fine-tuning MOSS -...
-
40.
Porespellar on /r/LocalLLaMA
-
41.
AriPath/AriMando ⭐ 127
This project aims to implement the powerful SNI-Spoofing method as a modern, very simple graphical (GUI) application for Windows, so that users can enjoy the open internet with 100% stability, without dealing with a terminal, complex code, or repeated disconnections.
-
42.
SUZ-tsinghua/smp ⭐ 127
-
43.
Hrethric on /r/LocalLLaMA
-
44.
Python port of Claude Code's agent-runtime architecture, on LangChain
-
45.
nekocode/filetree-skill ⭐ 115
A Claude Code plugin that maintains `FILETREE.md`.
-
46.
KevinXu02/R3 ⭐ 108
-
47.
Reproducible benchmark for adversarial attacks on multimodal large language models
-
48.
imperia-ran/DTLN_v2.0 ⭐ 103
-
49.
haskaomni/serenity ⭐ 102
-
50.
GU-Cryptography/anykb ⭐ 101
AnyKB: private RAG knowledge base + transparent agent
- 51.
-
52.
TumbleweedNew6515 on /r/LocalLLaMA
-
53.
Scared-Biscotti2287 on /r/LocalLLaMA
-
54.
pmttyji on /r/LocalLLaMA
-
55.
We introduce PaddleOCR-VL-1.
- 56.
-
57.
_BreakingGood_ on /r/StableDiffusion
-
58.
Ambitious_Fold_2874 on /r/LocalLLaMA
-
59.
NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) 👽 270
Gailenstorm on /r/LocalLLaMA
-
60.
Gao-Ruilin/AutoRun ⭐ 80
An AI agent for coding and other tasks
-
61.
CLI harness for WPS Office -- let AI agents control Writer, Calc & Impress via COM automation
-
62.
prunaai/p-video-animate ®️ 1484
p-video-animate animates a reference image with the motion and audio of a source video. Optimized for speed and cost — 5.24s per 1s of video.
-
63.
JustFinishedBSG on /r/LocalLLaMA
-
64.
Speech-aware KV cache pruning for long-form speech LLMs (Qwen2-Audio, SALMONN). Token/head/chunk-level pruners + eval on LibriSpeech-long & GigaSpeech.
-
65.
Total-Resort-3120 on /r/StableDiffusion
-
66.
Trinity Nano Preview is a preview of Arcee AI's 6B MoE model with 1B active parameters.
-
67.
EvilEnginer on /r/LocalLLaMA
-
68.
Forward_Jackfruit813 on /r/LocalLLaMA
-
69.
kaggleqrdl on /r/LocalLLaMA
- 70.
-
71.
Step 3.
-
72.
Drop-in prompt-caching fixes for the LLM agent harness you use. Point your AI coding agent at this repo and it ships the patches.
-
73.
nvidia/PiD 🤗 73
-
74.
DeltaSqueezer on /r/LocalLLaMA
-
75.
nalltama/RAIV ⭐ 72
Realtime AI Image Viewer - AI upscaling image viewer for Windows
-
76.
Simple_Library_2700 on /r/LocalLLaMA
-
77.
DCDmllm/InstructSAM ⭐ 71
The code for "InstructSAM: Segment Any Instance with Any Instructions"
-
78.
## Table of Contents - Open-source list - Models - Data - Engineering Solutions - Introduction - Chat with MOSS - GPU Requirements - Installation - Try MOSS - Fine-tuning MOSS -...
-
79.
A LoRA for **Flux Kontext Dev** that fuses a **reference image (left)** with a **depth map (right)**.
-
80.
laginimaineb on /r/MachineLearning
-
81.
FineVLM-Probe: a lightweight harness for fine-grained probing of frozen vision-language models (CLIP / SigLIP / BLIP-2 / LLaVA).
-
82.
A small CLI harness for evaluating speech LLMs and ASR models on standard benchmarks (LibriSpeech, FLEURS, VoxPopuli).
-
83.
A complete roadmap for learning Python in 2026
-
84.
vick2djax on /r/LocalLLaMA
- 85.
-
86.
Creating these models takes significant time, work and compute.
-
87.
pardcomper/safegate ⭐ 65
Lightweight runtime safety guard for multimodal LLM I/O
-
88.
marived/vlm-probe ⭐ 64
Probing fine-grained perception in open-source vision-language models — companion code for a writeup.
-
89.
pulgog/whisperkv ⭐ 64
KV-cache compression for Whisper-family speech models. Drop-in patch, three eviction policies.
-
90.
mitkox/SkillOpt ⭐ 64
SkillOpt with local AI is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.
-
91.
Open Source Reimplementation of Google Deepmind's Co-Scientist
-
92.
Multi-dimensional trustworthiness evaluation for multimodal LLMs
-
93.
HornyGooner4402 on /r/LocalLLaMA
-
94.
**Non-autoregressive ASR with ONNX runtime** — optimized for deployment without PyTorch dependency.
-
95.
Paradigmind on /r/LocalLLaMA
-
96.
Practical CLIP fine-tuning recipes — DDP training, LoRA, hard-negative mining, leakage checks.
-
97.
A free detector capable of identifying content generated by all advanced AI models.
-
98.
fairydreaming on /r/LocalLLaMA
- 99.
-
100.
funasr/paraformer-zh 🤗 59
**Non-autoregressive end-to-end speech recognition** — 120x realtime on GPU, production-ready for Mandarin Chinese.
-
101.
WeChat MP article harvester with DLL injection for no-GUI automation
-
102.
Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios.
-
103.
Yes-Scale-9723 on /r/LocalLLaMA
- 104.
- 105.
-
106.
anthropic/claude-sonnet-4.6 ®️ 811
Claude Sonnet 4.6 from Anthropic: a full upgrade to coding, computer use, long-context reasoning, agent planning, knowledge work, and design, with a 1 million token context window in beta.
-
107.
parrot42 on /r/LocalLLaMA
-
108.
izscc/cc2image ⭐ 54
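A Skill for generating visual systems for Chinese-language articles: cognitive-anchor image breakdown, 40 matched style sets, and batch image-generation prompts, with support for covers, in-body illustrations, and series key visuals.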
-
109.
End-to-end Machine Learning pipeline for Truck Delay Prediction using XGBoost, Flask API, MLflow, and Lightning AI deployment.
-
110.
A real-time AI and Machine Learning based healthcare application that predicts diseases from user symptoms using text, voice, and image inputs. The system supports multilingual communication, severity analysis, diet recommendations, PDF report generation, and nearby hospital navigation for smart medical assistance.
-
111.
cv-cat/All-IN-ONE ⭐ 53
CLI and Skills for platforms such as Xiaohongshu, Douyin, and Weibo
-
112.
DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models
-
113.
The model has been trained on a collection of 500k articles with headings.
- 114.
-
115.
MarcCDB on /r/LocalLLaMA
-
116.
jacek2023 on /r/LocalLLaMA
-
117.
Training-free style transfer for DiT models.
-
118.
HelpFreedom/youthub ⭐ 51
YouHub — console YouTube client. TUI grid, custom ffplay-yt player, SABR streaming, SponsorBlock, 1080p.
- 119.
-
120.
[Image: Keye logo]
-
121.
A real traffic light for AI agents.
-
122.
JuneYaooo/nihaixia ⭐ 49
An Agent Skill for Ni Haixia TCM course materials: course retrieval, formula-pattern and acupoint analysis, study-note organization, and evidence indexing of blackboard screenshots.
-
123.
AgentGuard: An Attribute-Based Access Control Framework for Tool-Use LLM-Based Agent
-
124.
This is an attempt to improve the original open mythos by adding a chat UI.
-
125.
Turn any source tree into a local SQLite database. FTS5 trigram search across 89K files in seconds. One file. No server. No network.
-
126.
Makapic/RocoPilot ⭐ 47
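Automation assistant for Roco Kingdom: World. It uses Interception driver-level input to simulate keyboard and mouse at the kernel level, which the game's anti-cheat cannot detect. A game automation tool built on OpenCV template matching and YOLO sprite detection, designed specifically for Roco Kingdom: World. Supports automatic ball throwing, sprite locating and aiming, patrol-based pet catching, and custom response strategies during battles.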
- 127.
-
128.
觀瀾 · A-share research workstation with 24 AI sub-agents — one command, deep-dive report in ~10 min.
-
129.
LLMFan46 on /r/LocalLLaMA
-
130.
Implements the Anima plugin for artist mixing by hooking into the attention layer
-
131.
A Budget Tracker is a financial management application that helps users record, monitor, and manage their income and expenses efficiently. It allows users to track daily spending, categorize transactions, set budget limits, and analyze savings patterns through reports and visual charts.
-
132.
1ove9/antenna-forge ⭐ 45
AI-driven inverse antenna design with real NEC2 in the loop
-
133.
Hephaestite on /r/LocalLLaMA
-
134.
kyrtstn/syv ⭐ 44
syv ⚡ A dual-threat optimization CLI. Combines local Static Site Generation (SSG) caching and Single Page Application (SPA) payload compression into one lightweight, zero-dependency daemon. Built for Termux and Linux.
-
135.
srigi on /r/LocalLLaMA
-
136.
NielsRogge on /r/MachineLearning
-
137.
lantern_lol on /r/LocalLLaMA
-
138.
Engineering workflows for AI coding agents or flesh engineers. It helps absorb silent base-model quality drift.
- 139.
-
140.
ivari on /r/LocalLLaMA
-
141.
Rude_Substance_8904 on /r/LocalLLaMA
-
142.
meiukinn/CardLens ⭐ 42
QR-based physical card recognition for digital profile display.
-
143.
aipmer/book ⭐ 42
"Codex in Practice Blue Book": a hands-on guide to product development and multi-platform orchestration in the AI-native era.
-
144.
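A complete toolchain for mathematical modeling competitions: an end-to-end solution from receiving the problem to submitting the paper. Covers all problem types of the national CUMCM (A/B/C) and the MCM/ICM (A-F).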
-
145.
sjtuplayer/showvi ⭐ 42
AI video agent [Seedance2 agent, one-click video creation]
-
146.
A Python runtime for multi-entity AI collaboration — agents, humans, and tools on a shared protocol layer.
-
147.
A now-playing overlay for OBS on macOS & Windows — Apple Music, Spotify, YouTube, and more — with artwork, track info, and a progress bar.
-
148.
AI is not for everyone 👽 139
Scutoidzz on /r/LocalLLaMA
-
149.
Agent tutorial
-
150.
A proxy service that adds visual understanding, web search, and Anthropic / OpenAI-compatible APIs to the DeepSeek v4 series
-
151.
**Streaming speech recognition** — real-time transcription with low latency for Chinese.
-
152.
Jorlen on /r/LocalLLaMA
-
153.
Cross-architecture LLM internal observation database (23 models, 13 architecture families). Exposed as MCP tools for any AI coding agent.
-
154.
batman634/coral-reef ⭐ 40
-
155.
Merserk/ComfyUI-PiD ⭐ 40
ComfyUI custom node for NVIDIA PiD pixel diffusion decoding and upscale workflows
-
156.
OnkelBB on /r/LocalLLaMA
-
157.
hongruhou89/ProRL ⭐ 39
ICML 2026: "ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation"
-
158.
0xtotem/peek-dspy ⭐ 39
A DSPy port of PEEK
-
159.
Staged self-improvement engine for Hermes-style memory, skill, and fact updates with explicit review and apply/discard gates.
-
160.
paf1138 on /r/LocalLLaMA
-
161.
bobaburger on /r/LocalLLaMA
-
162.
joshhu/skillopt-qa ⭐ 38
Minimal faithful re-implementation of Microsoft SkillOpt: a text-space optimizer that trains a deployable natural-language skill for a frozen LLM agent on HotpotQA.
-
163.
Python project that predicts the next day’s attendance based on past attendance data using NumPy and a basic moving average approach.
-
164.
Prompt-first starter kit for making repositories safer for AI coding agents.
-
165.
biao994/DocPaws ⭐ 37
-
166.
Self-evolving memory palace for AI agents — persistent memory with automatic learning, knowledge graph, and multi-agent support
-
167.
This is a text-only GGUF build of XiaomiMiMo/MiMo-V2.
-
168.
siegekeebsofficial on /r/StableDiffusion
-
169.
akira3weet on /r/LocalLLaMA
-
170.
krea/krea-2-medium ®️ 403
Foundation image model from Krea, tuned for expressive illustration, anime, and painterly styles. Fast and consistent across artistic directions.
-
171.
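Lets an agent take queue numbers on Dianping automatically: scheduled number-taking, condition-triggered number-taking, and periodic queue-status checks, all driven by automated scripts with no manual execution by the AI.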
-
172.
EveningIncrease7579 on /r/StableDiffusion
- 173.
-
174.
[Figure: LocateAnything teaser]
-
175.
**PaddleOCR-VL-1.
-
176.
MiniCPM5-1B 👽 115
kevinlch on /r/LocalLLaMA
-
177.
rlouf/sigil ⭐ 34
-
178.
Autodesk AutoCAD 2027 Professional Download | Desktop Installer with Autodesk AI Activation Patch & Keygen | Pre-Activated License Key Setup
-
179.
Harahan/RTDMD ⭐ 34
[Arxiv 2026] This is the official PyTorch implementation of "RTDMD: Reinforcing Few-step Generators via Reward-Tilted Distribution Matching"
-
180.
fxyz666/LogicPipe ⭐ 34
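LogicPipe is an open-source software project for collaborative LLM inference across multiple edge devices, providing offline pipeline planning, distributed stage weight loading, dependency-aware task scheduling, and contextual KV cache reuse.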
-
181.
futterneid on /r/LocalLLaMA
-
182.
ALEX-nlp/DenoiseRL ⭐ 33
-
183.
boheling/deltasci ⭐ 32
Two-perspective co-reasoning for AI4Science hypothesis generation — grounded, falsifiable, with a citation-audit trail. CLI + Claude Code skill.
-
184.
PolyTalkIO/polytalk ⭐ 32
Privacy-first, self-hosted real-time speech-to-speech translation.
-
185.
Trinity-Large-Preview is a 398B-parameter sparse Mixture-of-Experts (MoE) model with approximately 13B active parameters per token.
-
186.
fallingdowndizzyvr on /r/LocalLLaMA
-
187.
Nid_All on /r/StableDiffusion
-
188.
MironV on /r/LocalLLaMA
- 189.
-
190.
highkay/keytoauth ⭐ 31
A script that automatically fetches codes from iCloud, logs in to ChatGPT to obtain the auth session, and finally converts it to cpa+sub2api
-
191.
Cross-database ID mapping tool for cancer cell lines (DepMap/COSMIC/Sanger)
-
192.
MITM server for Roco Kingdom: World
-
193.
[CVPR 2026] ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction
-
194.
Alternative-Cat-1347 on /r/LocalLLaMA
-
195.
Some-Cauliflower4902 on /r/LocalLLaMA
-
196.
Possible-Active-1903 on /r/MachineLearning
-
197.
Student placement automation using UiPath and Excel for validating records, calculating scores, detecting errors, and predicting salary packages based on student performance data.
-
198.
tsolful/ComfyUI-PiD ⭐ 30
PiD Decode Custom Node
-
199.
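You describe your business; I translate it into things AI can do. A structured methodology that helps business owners quickly map out a path to AI adoption.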
-
200.
ZYPGITA/astra-flux ⭐ 30
-
201.
color4-alt/CiteCheck ⭐ 30
Check academic paper citations for format, queryability, thematic relevance, and semantic accuracy.
-
202.
ernie-research/NAVA ⭐ 30
Official Code of NAVA: Native Audio-Visual Alignment for Generation.
-
203.
The key difference from the Wasserstein release and the old Genesis release is data regeneration in the model via mathematical statistics, based on what it has already learned and stored in tensors.
-
204.
Trinity Nano Preview is a preview of Arcee AI's 6B MoE model with 1B active parameters.
- 205.
-
206.
AI-powered multilingual FIR drafting assistant using Flask, NLP, and Machine Learning.
-
207.
Stop getting your stops hunted. SL/TP never touch your broker - only fires when the underlying actually breaches your level. And skip the options chain: drag your levels on the chart, we auto-pick the strike + DTE + contracts. The first open-source platform that does both.
-
208.
g3t-paper/g3t ⭐ 29
Code for G3T and G3T-Long
-
209.
XiaokunFeng/MIGA ⭐ 29
Accepted by ICML 2026.
-
210.
A minimalistic Preact-like signals implementation.
-
211.
Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios.
-
212.
crystal_alpine on /r/StableDiffusion
-
213.
Installable Serenity tweet archive + AI/semi supply-chain skill. Install: npx skills add yan-labs/serenity-aleabitoreddit
-
214.
Pre-alpha local AI video editor for prompt-based motion graphics, Figma frame import, timeline editing, and LTX video generation.
-
215.
PS5 exFAT Library Scraper
- 216.
-
217.
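I wrote this small tool so I would not have to keep watching the screen while farming shiny boxes in Roco Kingdom Season 2. Thanks to DeepSeek, GPT, Gemini, and cc. Built on OpenCV, with roughly 80% accuracy; it can detect the middle and lower tiers of surprise boxes.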
-
218.
XuToWei/Image-To-UI ⭐ 28
Image-to-UI Codex skill: convert game UI screenshots and sliced sprites into ui_structure.json.
-
219.
Convert Hyprland .conf to Lua for v0.55+ — drop-in replacement for deprecated hyprlang. ~97% auto-conversion, 0% guesswork.
-
220.
5uck1ess/tts-bench ⭐ 27
Speed and sample benchmark for all types of TTS systems on Windows/Mac/Linux.
-
221.
pulseio76/ArgusMind ⭐ 27
AI-driven multi-agent autonomous code security auditor: audit planning, dangerous sink discovery, and call-chain analysis.
- 222.
-
223.
Alpha Forge — an agentic AI operating system for systematic trading.
-
224.
Tejas-TA/predikit ⭐ 27
pypi-predikit: Turn any sklearn/XGBoost model into an LLM-callable tool. Framework-agnostic
-
225.
Anbeeld on /r/LocalLLaMA
-
226.
pmttyji on /r/LocalLLaMA
-
227.
A toolkit of Claude Code skills for long-form content creators: research, score, rewrite, and publish.
-
228.
A Codex skill for daily AI literature digests
-
229.
Hy-MT2-1.
-
230.
manelinux/nixard ⭐ 25
Interactive terminal UI for exploring NixOS package closures, analyzing real installation costs, and generating ready-to-use Nix declarations.
- 231.
-
232.
buptwz/holmes-kb ⭐ 25
Knowledge-driven troubleshooting assistant — structured, git-native KB with AI-powered import and intelligent conflict resolution
-
233.
randomfoo2 on /r/LocalLLaMA
-
234.
Rudy_AA on /r/StableDiffusion
- 235.
-
236.
PulseVector on /r/LocalLLaMA
- 237.
-
238.
Headless bot used at Eden, modified version of Sweepy's bot. All restrictions removed + multiple account support. 200M+ fans per day.
- 239.
-
240.
One-click integration of Chinese domestic models into Codex
- 241.
-
242.
SkillOpt treats markdown skill files as trainable parameters with proper optimization machinery 👽 79
agentic-doc on /r/LocalLLaMA
-
243.
CuriousPlatypus1881 on /r/LocalLLaMA
-
244.
Deep Learning Based Air Gesture Text Recognition is an advanced AI-based project that combines computer vision and deep learning to enable users to write in the air naturally. The system improves human-computer interaction by providing a smart, contactless, and efficient method of text input.
- 245.
-
246.
kouhxp/textsnap ⭐ 23
Snap any image, screenshot, or webpage into plaintext. No GPU. No cloud. One command.
- 247.
-
248.
PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolutions
-
249.
CBC Disease Prediction System is a healthcare application that analyzes CBC blood parameters to predict possible blood-related disorders such as anemia, leukemia, thrombocytopenia, and infections. The system evaluates blood values and provides prediction results for basic health monitoring and analysis.
-
250.
Automatic Auction House sniper bot for Forza Horizon 6 (FH6). Screen-reading buyout automation. Windows, open source.
-
251.
A production-ready Django SaaS boilerplate built with Clean Architecture (Service Layer), Tenant-Aware RBAC, JWT Auth, Celery, and Enterprise-Grade Observability (Structlog & Request-ID).
-
252.
AMAPVOICE/PilotTTS ⭐ 23
-
253.
pmttyji on /r/LocalLLaMA
-
254.
Turbulent_Corner9895 on /r/StableDiffusion
-
255.
XTXinverseXTY on /r/MachineLearning
- 256.
-
257.
ShaneLiu04/Step-RL ⭐ 22
A reinforcement-learning-based system for optimizing long-horizon decision-making in LLM agents
-
258.
ShaneLiu04/NeoAgent ⭐ 22
An AI agent framework with multi-agent orchestration, streaming transparency, and a secure sandbox.
- 259.
-
260.
PoC for an integer overflow vulnerability in ImageIO patched in iOS/macOS 26.5
-
261.
lipzh5/REALM ⭐ 22
REALM: A Coarse-to-Fine Generative Framework for Embodied Reactive Listening (Under Review)
-
262.
AXI4-Stream protocol compliance checker skill for Claude — open-source alternative to commercial EDA tools
-
263.
A curated collection of AI agent skills for biomedical research, covering genomics, proteomics, single-cell analysis, clinical AI, and protein design.
-
264.
Dance is an end-to-end framework that detects and classifies events in EEG signals. In a single forward pass, it extracts a set of events directly from the raw, unaligned recording.
- 265.
-
266.
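End-to-end GPT registration and payment system: protocol-based registration, Stripe/PayPal payment, web console, resource pool, concurrent queue, and session-json/getrt export. Full protocol registration + headless payment + protocol OAuth.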
-
267.
funasr/fsmn-vad 🤗 22
**Voice Activity Detection** — accurately detect speech segments in audio, essential for long-audio processing pipelines.
-
268.
Trinity-Large-Thinking is a reasoning-optimized variant of Arcee AI's Trinity-Large family: a 398B-parameter sparse Mixture-of-Experts (MoE) model with approximately 13B active...
-
269.
Aaaaaaaaaeeeee on /r/LocalLLaMA
-
270.
chloeqxq/MACD ⭐ 21
-
271.
zetaneko/AnimaDex ⭐ 21
Self-hosted, searchable gallery for Anima AI-generated anime character and artist references. Flask backend, vanilla JS frontend, SQLite storage, zero external dependencies for the core feature set.
-
272.
ActiveGraph/GBrain bridge proof of concept for Apprentice launch.
-
273.
Official PyTorch implementation of "Steerable Rhythmic Complexity in Autoregressive Music Generation" (EI Accepted). A bar-level conditional micro-Transformer using REMI+ syntax for exact density control and harmonic decoupling.
-
274.
garyqlin/gbase ⭐ 21
GBase — Recursive Self-Improvement Agent Framework. Memory, evolution, quality gates, identity system, and 40+ auto-registered tools.
-
275.
Zero-dependency multi-agent workflow orchestration engine. YAML pipelines, shared event bus, auto-recovery, real-time dashboard.
-
276.
Tiktok Viewers | Last update 2026 | No shadows or bans | Custom amount
-
277.
An open-source RAG platform to explore the unsealed Jeffrey Epstein court documents.
-
278.
pmttyji on /r/LocalLLaMA
-
279.
arsocekaj/anime-sdxl-v17 ®️ 154
High‑quality anime image generator based on an SDXL v17 checkpoint.
-
280.
StandardLovers on /r/LocalLLaMA
-
281.
Majestic_Department7 on /r/StableDiffusion
-
282.
Re-obtains the at for sub2api accounts that return 401
-
283.
A framework for running a multi-agent software development team — autonomously or semi-autonomously. Six specialized agents collaborate through a shared database pipeline: requirements → planning → architecture → implementation → QA → human review. Works with GitHub Issues, ships real PRs.
- 284.
-
285.
Wiki-based retrieval for AI coding agents. 65× token reduction, +24pp Coverage@5 on SWE-bench Verified.
-
286.
Autoform Bot
-
287.
redai-infra/PIPO ⭐ 20
Implementation of an efficient LLM architecture: the Pair-In / Pair-Out Model (PIPO)
-
288.
Finance Sentiment ZH (base) is a model based on bert-base-chinese for analyzing sentiment of Chinese financial news.
-
289.
sword-in-stone on /r/LocalLLaMA
-
290.
wywywywy on /r/StableDiffusion
-
291.
CopilotCoding/GSM ⭐ 19
GSM — Geometric State Machine. A new type of AI architecture. No attention. No KV cache. No quadratic scaling. Just a fixed point in R^4096 being continuously deformed by a learned algebra of transformations. Sounds like Bach.
-
292.
Python refactor of StaMPS for PS-InSAR processing, focused on ISCE/ISCE2 stack preprocessing, HDF5-based Steps 1-8 workflow, snaphu unwrapping, SCLA/SCN correction, and GeoPackage/Shapefile export of velocity and displacement time series. GPL-3.0.
-
293.
A simple Python-based Book Management System that allows users to add, view, search, and delete books using a menu-driven console application. This project demonstrates basic Python concepts, file handling, and GitHub project management.
-
294.
Cintu07/ciot ⭐ 19
cpu inference for ternary neural nets. no deps. just c++ and simd.
-
295.
AMAAI-Lab/MERIT ⭐ 19
-
296.
Open-source legal AI agent skills for PKULaw, citations, and DOCX workflows
-
297.
Evidence gate for LLM agent claims - verify claims against evidence, tool results, and policies.
-
298.
Daily AI literature digest and visual Paper Vault for researchers. Monitors new papers by custom keywords, emails summaries via Gmail, and organizes full-text-read High/Medium papers into a categorized local web vault.
-
299.
g-wellsa/3and_agents ⭐ 19
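Today's AI tools all "work alone": you ask a question, they give an answer. **三和 3And turns AI into a team**: you say one sentence, 天书 (the dispatcher) identifies the intent and assigns the work, 观澜 (the intelligence officer) does research and analysis, and 执戈 (the executor) writes code and builds web pages and games. The three AI roles divide the work automatically, run in parallel, review each other's output, and deliver the results.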
-
300.
LEGO Batman Legacy of the Dark Knight repack 2026 | Nothing is cut out | voices38 | PC | Steam Edition | DLCs | MULTi16
-
301.
funasr/campplus 🤗 19
**Speaker Verification & Diarization** — identify and distinguish speakers in audio.
-
302.
A LoRA adapter trained on FLUX.2-klein-4B for generating pixel art character sprites. Optimized for game-ready assets with transparent backgrounds.
-
303.
jacek2023 on /r/LocalLLaMA
-
304.
iamMess on /r/LocalLLaMA
-
305.
jslominski on /r/LocalLLaMA
-
306.
jacek2023 on /r/LocalLLaMA
-
307.
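An experimental project exploring the capabilities of the Claude Agent SDK. A Skill system lets the AI understand the SDK documentation and then autonomously build a conversational assistant for TikHub social media data. The project uses .claude/skills/claude-agent-sdk to set up a claude-agent-sdk project, wrapping the tikhub skill from https://github.com/liangdabiao/tikhub_api_skill into a web app.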
-
308.
林月半子's open-source AI skill library, continuously updated.
-
309.
bug-maker6/TAAC-2026 ⭐ 18
Tencent Advertising Algorithm Competition 2026, academic track: code scoring 0.83220
- 310.
- 311.
-
312.
power97992 on /r/LocalLLaMA
-
313.
Ryoiki-Tokuiten on /r/LocalLLaMA
-
314.
dh7net on /r/MachineLearning
-
315.
A local-first Codex skill that turns natural-language requests into live results or free API discovery.
-
316.
A senior-engineer protocol for polyglot code generation, architecture, testing, security, performance, and agent validation.
- 317.
- 318.
-
319.
A Codex / Claude Code Skill that turns a single Douyin/Xiaohongshu link into local video, audio, subtitles, and a verbatim transcript.
-
320.
zhnt/loushang ⭐ 17
AI-native coding orchestration platform: unified multi-model agent runtime with stateful sessions, tool governance, and traceable delivery.
-
321.
XiaoBei skill for converting academic images into editable Office VBA Shapes and PowerPoint reconstructions.
-
322.
abijitgowda/Muninn ⭐ 17
-
323.
pmttyji on /r/LocalLLaMA
-
324.
common_yarrow on /r/MachineLearning
-
325.
MiMo-V2.5-coder 👽 54
jedisct1 on /r/LocalLLaMA
-
326.
A-share quantitative trading factor analysis and generation framework (a quantitative factor analysis framework based on the Tushare data source).
-
327.
A skill that lets an agent call gpt-image2 directly to generate images.
-
328.
A tool used for hosting your own Formula 1 telemetry broadcast server using real F1 data 🏎️
-
329.
An agent that lives in Canvas and does all of your homework
-
330.
snake eat eat eat
- 331.
-
332.
Laith0003/ux-skill ⭐ 16
Design intelligence engine for AI coding. 120 anti-pattern linter rules, 110 brand DESIGN.md specs, MCP server, 22 commands, 17 IDEs, 17 languages. MIT.
-
333.
shyftlabs/continuum ⭐ 16
Continuum — the agent runtime by ShyftLabs. Build, orchestrate, ship.
-
334.
Small Windows-only patcher for the Codex desktop app to enable its remote/mobile features.
-
335.
manizada/CIFSwitch ⭐ 16
-
336.
Quantized GGUF weights of the Equinox-31B model. When in doubt which specific file to download, take 80% of VRAM capacity as a guideline, leaving the remaining 20% for context.
-
337.
BitCPM-CANN is the first end-to-end 1.
-
338.
-
339.
UkieTechie on /r/LocalLLaMA
-
340.
krea/krea-2-large ®️ 96
Krea's flagship foundation image model. Larger and more flexible than Krea 2 Medium, with particular strength in photorealism and expressive artistic styles.
-
341.
ttyd + tmux + Cloudflare Zero Trust — log into a persistent web terminal from any browser. Single-prompt Claude Code setup.
-
342.
L2 orchestrator for chained Claude L3 agents (analyst -> sdd_writer -> implementer -> auditor) on 1C projects
-
343.
GCRL in JAX. Official repository for LEO (ICML 2026).
-
344.
Transmit malware payload via sound: DFT + FSK + Goertzel algorithm + IDFT
-
345.
FH6 image-to-vinyl painter with PySide launcher, setup/update flow, Luma Bands, V2 repair, checkpoint browser, and FH6 JSON importer.
-
346.
This project lets you inspect and list Docker environment information through simple HTTP endpoints, including running containers, images, networks, and volumes.
-
347.
orange90/MiMo-TUI ⭐ 15
-
348.
A modern poster overlay
-
349.
fscdc/dMoE ⭐ 15
[arXiv 2026] dMoE: dLLMs with Learnable Block Experts
-
350.
ctx-0/lazyllama ⭐ 15
a smol tool for managing local models
-
351.
A lightweight open-source tool for tracking daily RSI(6) signals on a stock watchlist.
-
352.
Conditional tool-schema loading for Hermes Agent to reduce first-turn token bloat by loading only the tools a prompt actually needs, with safe full-surface fallback for long or ambiguous requests.
-
353.
AFK-surf/safeclipper ⭐ 15
-
354.
Work-in-progress, not for production use.
-
355.
🧬 SelfEvolvingAI - Self-evolving AI system with 70 modules
-
356.
LAW1223/AlignVid ⭐ 15
-
357.
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps 👽 49
pmttyji on /r/LocalLLaMA
-
358.
last_llm_standing on /r/LocalLLaMA
-
359.
Borkato on /r/LocalLLaMA
-
360.
Uiqueblhats on /r/MachineLearning
-
361.
Curated skill pack for LLM agents in engineering and science workflows (Cursor & Claude ready).
- 362.
-
363.
MCG-NJU/FreeRet ⭐ 14
[ICML2026] FreeRet: MLLMs as Training-Free Retrievers
- 364.
-
365.
Clean EPUB of Pope Leo XIV's Magnifica Humanitas
-
366.
The official repo of ICML2026 Paper: Adversarial Latent Embedding Repair for LLM Continual Learning
-
367.
An open-source End-to-End AI model for self driving
-
368.
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
-
369.
📄 Forgetting in Language Models: Capacity, Optimization, and Self-Generated Replay
-
370.
kocurvik/beampptx ⭐ 14
Convert LaTeX Beamer slides to PowerPoint with flawless vector graphics and embedded video support.
- 371.
-
372.
bovod-sjtu/HoliTok ⭐ 14
HoliTok: A Continuous Holistic Tokenization with Robust Dual Capabilities of Speech Generation and Understanding
-
373.
A simple prompt enhancer node built with a Qwen 3.5-4b GGUF backbone for low-VRAM users. Takes your prompts and enhances them based on settings. Supports T2I, image editing, and T2V.
-
374.
Use Hermes Agent as the control plane for local coding agents like Codex, Kimi Code, Claude Code, OpenCode, and Gemini CLI.
-
375.
Live sportsbook arbitrage tool with research-backed betting strategy (Ontario focused)
-
376.
Creating these models takes significant time, work and compute.
-
377.
It speaks.
-
378.
The NVIDIA DeepSeek-V4-Pro-NVFP4 model is the quantized version of the DeepSeek-V4-Pro model, which is a Mixture-of-Experts (MoE) language model with 1.
-
379.
Any-Chipmunk5480 on /r/LocalLLaMA
-
380.
NightCR_ on /r/MachineLearning
-
381.
Uiqueblhats on /r/LocalLLaMA
- 382.
-
383.
NSM-Barii/war-rig ⭐ 13
WarDriving Rig
-
384.
TAAC 2026 (Tencent Advertising Algorithm Competition) KDD Cup solution; best score: 0.832321, rank: 51
- 385.
-
386.
GUI tool for analyzing, decoding, visualizing, and editing Flipper Zero Sub-GHz RAW .sub capture files.
-
387.
NeuroForge – A brain‑inspired AI that learns continuously on a laptop. No pretrained models. No cloud. Just math and biology.
-
388.
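RealAvatarN is an Unreal Engine digital-human plugin based on the open-source project HeyGem. It tightly integrates AI-driven real-time lip sync with UE's rendering pipeline. The plugin wraps the core inference algorithms and exposes concise Blueprint and C++ interfaces, so developers can quickly build high-quality interactive digital-human applications without dealing with low-level details.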
-
389.
Open-source infrastructure for automatic book translation via Claude Code.
- 390.
-
391.
深度云创科技: focused on developing AI agents and AI applications that support research, work, and study!
-
392.
Forza Horizon 6 YouTube music player
-
393.
mlhiter/skills ⭐ 13
Reusable agent skills
-
394.
MohitDabas/malshark ⭐ 13
AI-powered malware traffic analysis and network forensics via the Model Context Protocol
-
395.
CVPR26: UniRefiner
-
396.
wxzyd123/LoSATok ⭐ 13
Low-dimensional Unified Audio Tokenizer for Understanding and Generation
-
397.
Hihiczx/Xetrieval ⭐ 13
Xetrieval: Mechanistically Explaining Dense Retrieval
-
398.
LoveMind_AI on /r/LocalLLaMA
-
399.
ExoticYesterday8282 on /r/LocalLLaMA
-
400.
quietsubstrate on /r/LocalLLaMA
-
401.
Known_Ice9380 on /r/LocalLLaMA
- 402.
-
403.
Pressure-test research claims with falsifiable evidence plans, adversarial checks, frozen verifiers, and proof ledgers.
- 404.
-
405.
Turn a jailbroken Kindle into a local e-ink side display.
-
406.
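Hello-AI is a beginner-oriented entry platform for learning AI / LLMs: it links AI fundamentals, prompting, tool use, RAG, agents, deployment, evaluation, and safety basics into one workable path, and provides case studies, exercises, and reproducible projects so beginners can follow that path end to end.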
-
407.
Intelligent battery health analysis for Home Assistant
-
408.
Cowork Cookbook
-
409.
mane23-ai/solaria ⭐ 12
Shared skill operations layer for Claude, Codex, personas, ontology, trigger evaluation, and runtime governance
-
410.
imbue-bit/NS-NTK ⭐ 12
Official implementation for the paper: Deep Learning under Continuous Distribution Shift: The Non-Stationary NTK and Spectral Tracking SDE for Quantitative Finance
-
411.
Auto-evolving LLM Agent Harness - Benchmark-driven evolution via Claude Code + self_evolution.md guide
-
412.
Long-form audio-visual generation evaluation framework.
- 413.
-
414.
This is an open source project to improve tumor board discussions for oncologists.
-
415.
A lightweight dashboard rendering add-on / app for Home Assistant
-
416.
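BOSS Zhipin job radar · AI-driven smart job-hunting assistant. Automatically searches for AI positions, batch-applies in one click, lets AI take over HR chats, and automatically exchanges WeChat IDs, phone numbers, and résumés. Supports multiple model platforms such as DeepSeek, OpenRouter, and Xiaomi MiMo, managed through a web console.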
- 417.
-
418.
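This project is an intelligent knowledge Q&A system based on an Agentic RAG architecture, with a decoupled frontend and backend. The backend is built on FastAPI and the frontend uses React + Vite, integrating four knowledge sources: the Qdrant vector database, BM25 keyword retrieval, MySQL structured queries, and Serper web search. The core of the system is a 7-stage pipeline: query optimization, intent recognition, task decomposition, ReAct agent retrieval, relevance checking, answer generation, and Self-RAG quality evaluation, with a closed-loop correction mechanism to safeguard answer quality. It supports multi-user registration and login; each user has an independent personal knowledge base and can upload files in formats such as PDF, Word, and Excel, which are vectorized automatically. The system also implements three-level compressed conversation memory, automatic user-profile extraction, and SSE streaming responses, providing a real-time, accurate, and traceable Q&A experience.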
-
419.
A library-science-inspired personal knowledge management system with LLM agents
- 420.
- 421.
-
422.
Audio Transcript
-
423.
Isalia20/metalBLAS ⭐ 12
GEMMs with metal
-
424.
funasr/ct-punc 🤗 12
**Punctuation Restoration** — automatically add punctuation to ASR output text.
-
425.
This is an experimental merge of BeaverAI/Artemis-31B-v1h-GGUF with zerofata/G4-MeroMero-31B.
-
426.
Step 3.
-
427.
pmttyji on /r/LocalLLaMA
-
428.
Henrie_the_dreamer on /r/LocalLLaMA
-
429.
supracode on /r/LocalLLaMA
-
430.
Scriabinical on /r/StableDiffusion
-
431.
Bulky-Priority6824 on /r/LocalLLaMA
- 432.
-
433.
W01: LlamaIndex + Pydantic — RAG
-
434.
Authorized web application security testing platform built with Django, React, Celery, and Redis.
-
435.
Codex plugin that injects the ultrawork orchestration directive when the user prompt contains 'ultrawork' or 'ulw'.
-
436.
An intuitive FFmpeg GUI rewritten from the classic Maruko Toolbox. It retains nearly all of the original core functions and adds practical new features and improved compatibility with vintage video files, for a simple, user-friendly experience.
-
437.
A multi-agent cloud-native security platform for Docker, Kubernetes, IaC, cloud config and runtime eBPF event analysis.
- 438.
-
439.
tars-robotics/RTR ⭐ 11
Learning High-Frequency Continuous Action Chunks in Latent Space (ICML'26)
-
440.
OpenAI-compatible HTTP serving for diffusion language models. Continuous batching + LocalLeap acceleration.
-
441.
Shuo-Zheng/MML-FSAR ⭐ 11
PyTorch implementation of Multi-stage Metric Learning with CLIP-based Adaptation for Few-shot Action Recognition.
- 442.
-
443.
High-performance, differentiable quantum state-vector & tensor network simulator in 100% pure JAX (no classical framework overhead). Accelerated on NVIDIA GPUs and Google Cloud TPU v6e-64/v5e VM clusters up to 40 qubits! Supported by Google's TPU Research Cloud (TRC) program.
-
444.
Your tasks with 99% accuracy using any LLM (Claude, DeepSeek, Codex, Gemini, Hermes, OpenClaw, Cursor).
-
445.
Zero-config glassmorphic media downloader for Windows & Android — powered by yt-dlp + ffmpeg. Download videos, music and playlists from YouTube and 1000+ sites.
-
446.
paper experiment code template
-
447.
A command-line tool that detects the development framework or language used to build any Android APK without needing to manually inspect its contents. It works by decompiling the APK using **apktool** and analyzing the resulting files and folder structure to identify the underlying technology.
-
448.
An Interstellar-inspired black hole simulation made in Python.
-
449.
iOS Syscall Explorer for IDA 9.X
-
450.
prs-eth/PaGeR ⭐ 11
PaGeR — Unified Panoramic Geometry Estimation via Multi-View Foundation Models
-
451.
Implementation of "CollectionLoRA: Collecting 50 Effects in 1 LoRA via Multi-Teacher On-Policy Distillation"
-
452.
深度云创科技: focused on developing AI agents and AI applications that support research, work, and study!
-
453.
“Steal” any product’s GTM playbook, with the receipts. A Claude Skill for evidence-backed teardown, minus the “it went viral” hand-waving.
-
454.
Claude Code workflow for mapping projects into clean-room behavior docs, file maps, batch reports, verification, and rebuild-ready blueprints.
-
455.
Image-only consulting deck skill for Codex
- 456.
-
457.
The end of web parsing. The beginning of scalable pixel-native search.
- 458.
-
459.
quantize_version: 2; output_tensor_quantised: 1; convert_type: hf; vocab_type: ; tags: nicoboss; quants: Q2_K IQ3_M Q4_K_S...
-
460.
mossy_troll_84 on /r/LocalLLaMA
-
461.
isnaiter on /r/StableDiffusion
-
462.
Select-Cheesecake236 on /r/StableDiffusion
-
463.
SarcasticBaka on /r/LocalLLaMA
-
464.
RandomMan0880 on /r/MachineLearning
-
465.
slandercode/TAAC2026 ⭐ 10
-
466.
Python Student Record Management
- 467.
-
468.
USA TV Next - Stremio addon - live TV channels
- 469.
-
470.
powerycy/BossHunter ⭐ 10
Smart job hunting Agent - AI-powered automation from scraping to delivery
-
471.
NBA Machine Learning Tools
-
472.
RamaAditya49/tutur ⭐ 10
Indonesian writing skills for natural, human, register-aware text
-
473.
merlresearch/unic ⭐ 10
UNIC: Learning Unified Multimodal Extrinsic Contact Estimation
- 474.
- 475.
-
476.
🚀 A free, open-source tool to generate high-quality AI videos and AI images locally or via free APIs. No subscriptions, no watermarks, just pure creation.
-
477.
Mach-O file structure
-
478.
penglele/boncml ⭐ 10
- 479.
-
480.
Ok_Warning2146 on /r/LocalLLaMA
-
481.
Interesting-Print366 on /r/LocalLLaMA
-
482.
faldore on /r/LocalLLaMA
-
483.
mxforest on /r/LocalLLaMA
-
484.
LiveAccident5312 on /r/MachineLearning
-
485.
techstacknerd on /r/StableDiffusion
-
486.
ArkCoon on /r/StableDiffusion
-
487.
dreameroutloud on /r/MachineLearning
-
488.
Upper_Emphasis2664 on /r/StableDiffusion
-
489.
Potential_Hippo1724 on /r/MachineLearning
-
490.
DCGAN inference on a microcontroller: 12.6M parameters, 512KB SRAM, 26-second generation, pure C [P] 👽 18
Separate-Choice on /r/MachineLearning
-
491.
not-your-typical-cs on /r/MachineLearning
-
492.
Blandmarrow on /r/StableDiffusion
-
493.
Just_Jaguar3701 on /r/MachineLearning
-
494.
traceml-ai on /r/MachineLearning
-
495.
Chinese (Mandarin/Japanese/Korean) + English podcast transcription with speaker diarization, word-level timestamps, hotword injection. Whisper large-v3-turbo + pyannote 3.3.
-
496.
StayingUp4AFeeling on /r/MachineLearning
-
497.
Bobby-Ly on /r/MachineLearning