ZeroGPU Explorers

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

BK-Lee authored a paper 8 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

BK-Lee submitted a paper 8 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

BK-Lee authored a paper 9 days ago

Hide to See: Reasoning-prefix Masking for Visual-anchored Thinking in VLM Distillation

View all activity

innovation64

authored a paper 17 days ago

AURA: Intent-Directed Probing for Implicit-Need Surfacing in Situated LLM Agents

Paper • 2606.05557 • Published 22 days ago • 1

innovation64

submitted a paper to Daily Papers 20 days ago

AURA: Intent-Directed Probing for Implicit-Need Surfacing in Situated LLM Agents

Paper • 2606.05557 • Published 22 days ago • 1

Tonic

posted an update about 1 month ago

Post

2953

🙋🏻‍♂️ Hey there folks ,

Turns out : if we predict 🌏 earth we can save a lot of time looking for interesting things and less time looking at things that we expect to see.

Sentinel-2 imagery 🛰️basically takes a long time to download towards earth. so our "near real time" systems are quite far from that in practical terms.

meanwhile , if we "predict" what we will see , based on what we do see , we can send down much less data in a timely way , and prioritize 📡earth-bound response .

I'm talking about illegal fishing , logging , mining or building in nature reserves , the more of that we predict early the more we're able to stop it on time.

At least that's the concept !

check out the blog : https://huggingface.co/blog/Tonic/save-patagonia-by-predicting-earth

- Collection: https://huggingface.co/collections/NuTonic/earth-observation-with-temporal-and-general-understanding
- Code: https://github.com/Josephrp/Nutonic
- Dataset: NuTonic/sat-vl-sft-training-ready-v1
- Model: NuTonic/lspace
- Training: NuTonic/lspace-trackio
- Evals: NuTonic/Patagonia_Eval

2 replies

blanchon

posted an update about 1 month ago

Post

2727

I'm releasing OpenCS2 a 11TB dataset of around 5000 hours of counter strike gameplay recording.
- HD resolution - 1280×720 · 32 fps
- For each frame keyboard and mouse + world state (player position, velocity, weapon ...)
- HD Stereo audio
- All 10 players perspective

https://huggingface.co/collections/blanchon/opencs2

1 reply

Tonic

posted an update about 2 months ago

Post

4343

🙋🏻‍♂️ Hey there folks,

since everyone liked my previous announcement post ( https://huggingface.co/posts/Tonic/338509028435394 ) so much , i'm back with more high quality proceedural datasets in the Geospacial domain for SFT training !

Check this one out :
NuTonic/sat-bbox-metadata-sft-v1

the goal is to be able to train vision models on multiple images for remote sensing analysis with one shot .

hope you like it ! 🚀

2 replies

Tonic

posted an update 2 months ago

Post

3676

🙋🏻‍♂️ Hey there folks ,

I'm sharing huggingface's largest dataset of annotated statelite images today.

check it out here : NuTonic/sat-image-boundingbox-sft-full

I hope you like it , the idea is to be able to use this with small vision models 🚀

ozayezerceli

authored a paper 2 months ago

RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models

Paper • 2604.19321 • Published Apr 21 • 8

PeterL1n

authored a paper 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 166

mrfakename

in zero-gpu-explorers/README 2 months ago

Why doesn't anyone host llms in zerogpu spaces?

#172 opened 2 months ago by

Reality123b

nroggendorff

in zero-gpu-explorers/README 2 months ago

Why doesn't anyone host llms in zerogpu spaces?

#172 opened 2 months ago by

Reality123b

awinml

authored a paper 2 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 49

PeterL1n

authored a paper 2 months ago

Continuous Adversarial Flow Models

Paper • 2604.11521 • Published Apr 13 • 11

PeterL1n

submitted a paper to Daily Papers 2 months ago

Continuous Adversarial Flow Models

Paper • 2604.11521 • Published Apr 13 • 11

MaziyarPanahi

posted an update 3 months ago

Post

3884

Training mRNA Language Models Across 25 Species for $165

We built an end-to-end protein AI pipeline covering structure prediction, sequence design, and codon optimization. After comparing multiple transformer architectures for codon-level language modeling, CodonRoBERTa-large-v2 emerged as the clear winner with a perplexity of 4.10 and a Spearman CAI correlation of 0.40, significantly outperforming ModernBERT. We then scaled to 25 species, trained 4 production models in 55 GPU-hours, and built a species-conditioned system that no other open-source project offers. Complete results, architectural decisions, and runnable code below.

https://huggingface.co/blog/OpenMed/training-mrna-models-25-species

szymanowiczs

submitted a paper to Daily Papers 3 months ago

LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis

Paper • 2603.20176 • Published Mar 20 • 11

MaziyarPanahi

posted an update 3 months ago

Post

2367

We annotated 119K medical images with two frontier VLMs (Qwen 3.5, Kimi K2.5), cross-validated at 93% agreement, and produced 110K training records, all for under $500. Fine-tuning 3 small models (2-3B params) improved all benchmarks: best model reaches +15.0% average exact match.

Everything is open-sourced: datasets, adapters, and code.

https://huggingface.co/blog/OpenMed/synthvision

2 replies

szymanowiczs

authored a paper 3 months ago

LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis

Paper • 2603.20176 • Published Mar 20 • 11

PereLluis13

authored a paper 3 months ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 23

MaziyarPanahi

posted an update 4 months ago

Post

4940

DNA, mRNA, proteins, AI. I spent the last year going deep into computational biology as an ML engineer. This is Part I of what I found. 🧬

In 2024, AlphaFold won the Nobel Prize in Chemistry.

By 2026, the open-source community had built alternatives that outperform it.

That's the story I find most interesting about protein AI right now. Not just the science (which is incredible), but the speed at which open-source caught up. Multiple teams, independently, reproduced and then exceeded AlphaFold 3's accuracy with permissive licenses. The field went from prediction to generation: we're not just modeling known proteins anymore, we're designing new ones.

I spent months mapping this landscape for ML engineers. What the architectures actually are (spoiler: transformers and diffusion models), which tools to use for what, and which ones you can actually ship commercially.

New post on the Hugging Face blog: https://huggingface.co/blog/MaziyarPanahi/protein-ai-landscape

Hope you all enjoy! 🤗

2 replies

Tonic

posted an update 4 months ago

Post

3795

🤔 Who would win ?

- a fully subsidized ai lab
OR
- 3 random students named

kurakurai ?

demo : Tonic/fr-on-device

if you like it give the demo a little star and send a shoutout to : @MaxLSB @jddqd and @GAD-cell for absolutely obliterating the pareto frontier of the french language understanding .

4 replies

AI & ML interests

Recent Activity

Team members 748

zero-gpu-explorers's activity

Why doesn't anyone host llms in zerogpu spaces?

Why doesn't anyone host llms in zerogpu spaces?