As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.
67.6K Pulls 4 Tags Updated 3 days ago
OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.
6.3M Pulls 5 Tags Updated 3 months ago
The most powerful vision-language model in the Qwen model family to date.
1.2M Pulls 59 Tags Updated 3 months ago
Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.
299K Pulls 6 Tags Updated 1 month ago
The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.
322.4K Pulls 16 Tags Updated 1 month ago
The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.
281.2K Pulls 10 Tags Updated 1 month ago
Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.
591.5K Pulls 17 Tags Updated 3 months ago
Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models
134.8K Pulls 6 Tags Updated 1 month ago
24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.
116.1K Pulls 6 Tags Updated 1 month ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
298K Pulls 8 Tags Updated 4 months ago
Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.
64.2K Pulls 10 Tags Updated 1 month ago
FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.
50.7K Pulls 4 Tags Updated 1 month ago
123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.
59.4K Pulls 6 Tags Updated 1 month ago
gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built-upon gpt-oss
54.9K Pulls 3 Tags Updated 3 months ago
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
18.2M Pulls 58 Tags Updated 3 months ago
Alibaba's performant long context models for agentic and coding tasks.
2.7M Pulls 10 Tags Updated 4 months ago
An update to Mistral Small that improves on function calling, instruction following, and less repetition errors.
1.2M Pulls 5 Tags Updated 7 months ago
Magistral is a small, efficient reasoning model with 24B parameters.
1M Pulls 5 Tags Updated 7 months ago
Cogito v1 Preview is a family of hybrid reasoning models by Deep Cogito that outperform the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen across most standard benchmarks.
1.3M Pulls 20 Tags Updated 9 months ago
Meta's latest collection of multimodal models.
1.1M Pulls 11 Tags Updated 7 months ago