Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
👋
Open to Work
612.7
TFLOPS
Banaxi Inc.
PRO
Banaxi-Tech
22
12
Follow
LayerDynamics's profile picture
SiruiZhang's profile picture
jsob7's profile picture
41 followers
·
14 following
Banaxi-Tech
AI & ML interests
SLMs, training from scratch, LoRA, TTS, Ternary models
Recent Activity
replied
to
their
post
about 7 hours ago
A new model is coming! Its going to take a long time on my 5070 Ti so expect a release in ~1 month. We think this model is going to be SOTA For its size. Our Mini Version will be 25M Parameters and Pro with 140M. The Pro version has a 3072 Context Window (Extensible to up to 6K with RoPE) And the Mini version has a context window of 4096 (Up to 8K with RoPE) Meanwhile we are currently working on a Instruct Version of our BananaMind 1.5 Base. The training will start this weekend We are very exited to release it when its done!
updated
a model
about 13 hours ago
BananaMind/BananaMind-KV1-8M-2Bit-Experimental
new
activity
about 15 hours ago
AxiomicLabs/GPT-X2-125M:
Code
View all activity
Organizations
Banaxi-Tech
's datasets
2
Sort: Recently updated
Banaxi-Tech/HumanNumberEval
Viewer
•
Updated
9 days ago
•
6.6k
•
110
Banaxi-Tech/Deepseek-V4-Reasoning-Code-2500
Viewer
•
Updated
May 24
•
2.56k
•
64