Yif Yang

Yif29

Yif-Yang

AI & ML interests

None yet

Recent Activity


authored a paper 3 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 11 days ago • 100

upvoted a paper 6 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 11 days ago • 100

authored a paper 9 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 11 days ago • 67

upvoted a paper 10 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 11 days ago • 67

updated a dataset 22 days ago

microsoft/AVGen-Bench

Viewer • Updated 22 days ago • 3.01k • 4.92k • 4

authored 2 papers 24 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 28 days ago • 236

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Paper • 2605.23899 • Published 28 days ago • 29

upvoted a paper 24 days ago

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Paper • 2605.23899 • Published 28 days ago • 29

commented on 2 papers 24 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 28 days ago • 236

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 28 days ago • 236

upvoted a paper 25 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 28 days ago • 236

authored a paper about 1 month ago

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Paper • 2605.12501 • Published May 12 • 16

upvoted a paper about 1 month ago

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Paper • 2605.12501 • Published May 12 • 16

updated a dataset about 2 months ago

microsoft/World-R1

Viewer • Updated Apr 29 • 6.48k • 205 • 8

published a dataset about 2 months ago

microsoft/World-R1

Viewer • Updated Apr 29 • 6.48k • 205 • 8

authored a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 118

upvoted a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 118

updated a Space about 2 months ago

BizGenEval Leaderboard

🥇

Official BizGenEval leaderboard on Hugging Face.

authored a paper about 2 months ago

AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

Paper • 2604.08540 • Published Apr 9 • 5

updated a dataset 2 months ago

microsoft/MM-WebGen-Bench

Viewer • Updated Apr 17 • 120 • 39
