X-Message

merve

#120

RT @googlegemma: Building super fast experiences with Gemma just got easier. Gemma 4 MTP is now officially merged into llama.cpp. Develope…

中文: RT @googlegemma:与Gemma合作打造超快体验变得更加简单。 Gemma 4 MTP 现已正式合并为 lamama.cpp。开发......

2026-06-08 20:25:41

merve

#119

RT @ben_burtenshaw: So excited to be opening up OpenEnv to the whole community. It will now be owned by @huggingface , Meta-PyTorch, @refle…

中文: RT @ben_burtenshaw:非常期待向整个社区开放OpenEnv。现在将由 @huggingface、Meta-PyTorch、@refle 拥有......

2026-06-08 16:33:24

merve

#118

RT @osanseviero: Gemma 4 MTP just got officially merged into llama.cpp This means you can use Gemma 4 QAT + MTP for a lightweight + super…

中文: RT @osanseviero:Gemma 4 MTP 刚刚正式合并至 lamama.cpp 这意味着您可以使用 Gemma 4 QAT + MTP 进行轻量级 + 超级...

2026-06-07 17:44:22

merve

#117

RT @julien_c: Your monthly reminder that HF is much cheaper at scale, for both storage and egress (especially if you use several cloud prov…

中文: RT @julien_c:您的月度提醒，即HF在规模上价格便宜得多，无论是存储还是进步(尤其是如果你使用多个云证明......)

2026-06-06 22:48:11

merve

#116

this is how @giffmana signs his emails https://twitter.com/mervenoyann/status/2063262072369041770/photo/1

中文: @giffmana 就是这样签署电子邮件的

2026-06-06 14:09:46

merve

#115

RT @victormustar: Before the week ends, let's acknowledge one of the most INSANE week ever for open AI, with 25+ notable open-weight drops…

中文: RT @victormustar:在本周结束前，让我们确认开放式人工智能领域有史以来最疯狂的一周之一，其中开放重量下降了25个以上......

2026-06-06 09:17:38

merve

#114

RT @googlegemma: We just dropped Gemma 4 Quantization-Aware Training (QAT) checkpoints on Hugging Face! All Gemma 4 model sizes and their…

中文: RT @googlegemma:我们刚刚在拥抱面上投放了Gemma 4量化感知训练(QAT)检查点! 所有 Gemma 4 型号及其...

2026-06-05 23:28:03

merve

#113

I'm working on a bit of a something, here's a spoiler I learned so much from the process about VLM labelling & judging currently adding instance segmentation and adding more infra options https://twitter.com/mervenoyann/status/2062918401845026928/photo/1

中文: 我正在做点事情，这里有个剧透我从关于VLM标签和示例的流程中学到了很多;评判目前正在添加实例细分并添加更多基础设施选项

2026-06-05 15:24:08

merve

#112

RT @liquidai: Introducing LFM2.5-VL-1.6B-Extract and LFM2.5-VL-450M-Extract: Vision-language models that return structured JSON, not free-f…

中文: RT @ liquidai:引入 LFM2.5-VL-1.6B 提取和 LFM2.5-VL-450M 提取:用于返回结构化 JSON 的视觉语言模型，而不是 free-f...

2026-06-05 13:18:27

merve

#111

RT @nathanhabib1011: World models feel like the future... almost... We can still see some weird artifacts. That means we need high-quality…

中文: RT @nathanhabib1011:世界模特们感觉像是未来......几乎......我们仍然能看到一些奇怪的文物。这意味着我们需要高质量......

2026-06-05 10:35:49

merve

#110

RT @googlegemma: Introducing Magenta RealTime 2, a new open model musicians can play as an instrument! Run low-latency, live music synthes…

中文: RT @googlegemma:推出Magenta RealTime 2，一款全新的开放式音乐模特，可作为乐器演奏! 低延迟的现场音乐合成器......

2026-06-04 23:26:20

merve

#109

RT @PiotrZelasko: Second big release from us today: Nemotron-3.5-ASR-Streaming! 🌎40 languages ⚡️80ms - 1s controllable latency 🔥240 - 2400…

中文: RT @PiotrZelasko:今日我们发布的第二大版本:Nemotron-3.5-ASR-Streaming! 🌎40 种语言 ⚡️80ms - 1s 可控延迟 🔥240 - 2400...

2026-06-04 18:21:20

merve

#108

RT @julien_c: Today I'm launching a new project called SynthTraces 🔥 It is a minimal codebase to generate synthetic coding agent session t…

中文: RT @julien_c:今天我将启动一个名为 SynthTraces 的新项目 🔥 它是一个用于生成合成编码代理会话的最小代码库。

2026-06-04 15:02:30

merve

#107

NVIDIA Nemotron Ultra is here 😍 > 55B/550B a hybrid MoE 🦖 with 1M context window > supports MTP speculative decoding 💨 > day-0 supported in transformers sits in the most attractive quadrant per performance/efficiency in AA Index 🔥 https://twitter.com/mervenoyann/status/2062526071203938703/photo/1

中文: NVIDIA Nemotron Ultra 已到来 😍 采用55B/550B混合式MoE【EE1】，带1万个窗口支持MTP投机解码💨 支持“天”式变压器在AA指数🔥中，每分表现/效率最具吸引力的象限中

2026-06-04 13:25:09

merve

#106

RT @LeRobotHF: Train AI robots without writing a single line of code. 🤖 We just launched LeLab, the official graphical user interface for…

中文: RT @LeRobotHF:无需编写一行代码即可训练AI机器人。🤖 我们刚刚推出了LeLab，这是用于...的官方图形用户界面

2026-06-04 12:39:25

merve

#105

just replaced my Qwen3.6 35B 8-bit quant with Gemma 12B bf16 for local coding & Hermes, will report my findings 🙌🏻

中文: 刚刚用Gemma 12B bf16替换了我的Qwen3.6 35B 8位量子点，用于本地编码和编程;Hermes将报告我的发现🙌🏻

2026-06-04 10:56:11

merve

#104

RT @drfeifei: https://x.com/i/article/2062244283940544512

2026-06-03 22:16:36

merve

#103

RT @rasbt: It's been a while! 4 nice additions to the open-weight local-LLM-on-consumer-hardware ecosystem: https://twitter.com/rasbt/status/2062235700636873082/photo/1

中文: RT @rasbt:已经有一段时间了!开源本地-LLM-on-Consumer-硬件生态系统的4个不错附加功能:

2026-06-03 18:48:03

merve

#102

Google dropped Gemma-4 12B, it's a beast 🔥 > unified: audio + image go straight into model, no encoder > multimodal + tool calling > dense 12B with 256K context, comes with assistants for MTP (faster!⚡️) > day-0 in transformers, llama.cpp & MLX > A2.0 🤗 https://twitter.com/mervenoyann/status/2062214149476683900/photo/1

中文: 谷歌放弃了Gemma-4 12B，这真是个大不前的选择统一:音频+图像直接进入模型，无编码器加:多式联运+工具调用高密度12B，带256K上下文，配备适用于MTP的助手(更快!EE1) > 日间:变压器、lamama.cpp & MLX 网址:A2.0 🤗

2026-06-03 16:45:41

merve

#101

RT @victormustar: Reminder: every Hugging Face Space is an API your agents can call :) I asked mine to build a website about the flowers o…

中文: RT @victormustar:提醒:每个拥抱的人脸空间都是你的代理可以调用的API :) 我让我建立了一个关于花朵的网站......

2026-06-02 18:02:38

merve

#100

RT @hcompany_ai: Computer-use agents are moving from the cloud to your local machine. Fast. When we launched Holo3 two months ago, the pro…

中文: RT @hcompany_ai:计算机使用代理正在从云端迁移到本地机器。快。两个月前我们推出Holo3时，这位专业人士......

2026-06-02 18:02:17

merve

#99

your AI agent thinks you're lame and I'll prove it upload your agent traces (CC/Codex/Pi/Claw) to @huggingface and let this app roast you here's what boss' agent thinks of him @julien_c share yours below https://twitter.com/mervenoyann/status/2061756306281611607/photo/1

中文: 你的人工智能代理认为你很蹩脚，我来证明将您的代理线索(CC/Codex/Pi/Claw)上传至@huggingface，让这款应用为您烘焙老板的经纪人对他的看法是:@julien_c 在下方分享您的内容:

2026-06-02 10:26:23

merve

#98

everyone's building simple agents meanwhile IBM is building robust enterprise agents in production, and it's open-source they just dropped a blog on HF breaking down how to go beyond LLMs & agents: structured reasoning, tool use, and more to scale AI across enterprise https://twitter.com/mervenoyann/status/2061450307523985469/photo/1

中文: 每个人的建筑都是简单的代理与此同时，IBM正在生产中构建强大的企业代理，并且开源他们刚刚删除了一篇关于HF的博客，将其分解如何超越LLMs&代理:结构化推理、工具使用等，以在企业范围内扩展人工智能

2026-06-01 14:10:27

merve

#97

RT @ctnzr: Nemotron 3 Ultra: Frontier smart. 5X faster. 30% cheaper. 💚💚💚 https://twitter.com/ctnzr/status/2061308138838729121/photo/1

2026-06-01 10:24:31

merve

#96

NVIDIA just dropped Cosmos 3 at GTC 🔥 closest thing to AGI as world model > it can reason, understand AND generate videos, images, actions, text > sota, comes in 16B, 65B, with datasets > diffusers support 🧨 > open license 🤗

中文: 英伟达刚刚在GTC上退出了Cosmos 3 🔥 最接近AGI作为世界模型它能够推理、理解并生成视频、图像、操作和文字数据集(Sota)提供16B、65B、数据集支持扩散器 🧨 开放许可证 🤗

2026-06-01 07:12:26

merve

#95

this is super cool

中文: 这太酷

2026-05-30 18:53:35

merve

#94

RT @NVIDIAAI: This #CVPR2026 paper from our research team is trending #1 on @HuggingFace 🤗 Meet LocateAnything: a vision-language detectio…

中文: RT @NVIDIAAI:我们研究团队的这篇#CVPR2026论文在@HuggingFace上排名第一🤗 认识查找一切:视觉语言检测...

2026-05-28 22:21:46

merve

#93

RT @fangfu0830: 🔥 We release Gamma-World from @nvidia — a generative multi-agent world model that finally goes beyond 2 players. ⚡ 24 FPS r…

中文: RT @fangfu0830:🔥 我们从 @nvidia 发布 Gamma-World——一个生成式多代理世界模型，最终超越了两名玩家。 ⚡ 24 FPS r...

2026-05-28 17:11:06

merve

#92

RT @julien_c: We are starting to be quite bullish about getting in the data infrastructure business. I just cloned 68 TB (while I only hav…

中文: RT @julien_c:我们开始对进入数据基础设施业务持相当乐观的态度。我刚克隆了68TB(而我只有......

2026-05-28 16:49:48

merve

#91

RT @liquidai: Today, we're releasing LFM2.5-8B-A1B, a device-optimized model designed to power real-life applications on phones, laptops, P…

中文: RT @ liquidai:今天，我们将推出LFM2.5-8B-A1B，这是一款专为手机、笔记本电脑和P...上实际应用供电而设计的设备优化型型。

2026-05-28 16:08:05

merve

#90

RT @skalskip92: RF-DETR is now available in @huggingface transformers state of the art in both detection and segmentation, outperforming Y…

中文: RT @skalskip92:RF-DETR 现已在 @huggingface 变压器中提供在检测和细分领域都处于技术状态，表现优于

2026-05-27 16:36:20

merve

#89

RF-DETR just landed to @huggingface transformers 🥵🔥 sota real-time detection & segmentation models by @roboflow 💜 > play with our real-time demo > fine-tune the models on your use case with our tutorials (takes a toaster's VRAM) > or just hand them to your agents 😄 https://twitter.com/mervenoyann/status/2059647988373373253/video/1

中文: RF-DETR 刚刚登陆至 @huggingface 变压器 🥵🔥 实时检测与实时检测;由 @roboflow 进行细分模型 💜 玩我们的实时演示使用我们的教程(请使用烤面包机的VRAM)，为您的使用用箱中的型号进行精细调整或将它们交给您的代理公司 😄

2026-05-27 14:48:41

merve

#88

RT @victormustar: cool new release: a tiny open video VLM that understands what happens in videos and when 👀 Marlin-2B (Apache 2.0!) can c…

中文: RT @victormustar:酷炫的新发布:一段微小的开放视频，可了解视频中的情况以及👀 马林-2B(Apache 2.0!)可以......

2026-05-27 08:41:50

merve

#87

RT @victormustar: Made a free Pixal3D demo (Tencent's new image-to-3D model) because I like it a lot 🔥 What's interesting: pixel-aligned g…

中文: RT @victormustar:免费制作一个Pixal3D演示版(腾讯全新图像到3D模式)，因为我非常喜欢它🔥 有趣的是:像素对齐的 g...

2026-05-22 11:13:09

merve

#86

Cohere dropped Command A+ 🔥 > 25B/219B MoE vision language model > supports 48 languages with efficient tokenizer > tool-calling/agentic + 128k context window > transformers day-0 support 🤗 free license 💗 https://twitter.com/mervenoyann/status/2057128432190787643/photo/1

中文: 科赫雷放弃了命令A+ 🔥 25B/219B 视觉语言模型支持48种语言，支持高效的令牌化工具调用/代理 + 1.28k 上下文窗口支持:>变压器日间支持 🤗 免费许可证 💗

2026-05-20 15:56:52

merve

#85

RT @victormustar: it's open source time, with a real leap for world models 🎉 NVIDIA's SANA-WM: a camera-conditioned world model that fits…

中文: RT @victormustar:现在是开源时间，世界模特们确实实现了飞跃🎉 NVIDIA 的 SANA-WM:一款符合相机条件的世界型号，适合......

2026-05-20 10:21:00

merve

#84

RT @victormustar: llama.cpp with MTP support makes local models fast enough to use as daily drivers 🚀 Qwen3.6-27B dense generation (on A10…

中文: RT @victormustar:支持MTP的 llama.cpp 使本地车型速度足够快，能够作为日常驾驶者使用 🚀 Qwen3.6-27B 密度生成(在 A10 上)

2026-05-18 19:33:54

merve

#83

finally faster Qwen3.6 models with MTP support ⚡️ brb updating my Pi & Hermes setup 🤝

中文: 支持MTP的Qwen3.6型号速度更快⚡️ 更新我的Pi & Hermes 设置 🤝

2026-05-18 18:59:31

merve

#82

RT @ben_burtenshaw: if you're doing RL on agent use cases, check out this video. agents might seem like the most obvious application of p…

中文: RT @ben_burtenshaw:如果你在代理使用情况下使用RL，请观看此视频。代理可能看起来像是p...最明显的应用

2026-05-18 14:45:33

merve

#81

RT @MaziyarPanahi: Arabic. Japanese. Turkish. Redacting clinical discharge summaries in real-time. 30+ new open-source PII models shipped…

中文: RT @MaziyarPanahi:阿拉伯语。日语。土耳其语。实时编辑临床出院摘要。 30 个以上新的开源 PII 型号已发布...

2026-05-18 14:08:46

merve

#80

reason why self-improving personal agents (Claw, Hermes) are hyped is due to how people actually like the idea but they just never try it, so all they do is to yap because it gets engagements quite sad. on the contrast I really find them useful

中文: 自我改进的个人代理人(克劳，爱马仕)之所以被大肆宣传，是因为人们其实对这个想法很喜欢，但他们从不尝试，所以他们所做的只是努力，因为会进行互动非常难过。从对比上来说，我确实觉得它们很有用

2026-05-17 20:10:10

merve

#79

TIL Hermes Agent has optional skills for.. *checks notes* tokenizers and accelerate? 😄 joke aside it also has a peft and trl which can really be useful https://twitter.com/mervenoyann/status/2056102443830677874/photo/1

中文: TIL Hermes Agent 具备可选技能，可选择 . . * 勾选笔记* 标记器并加速?😄 开玩笑说，它还具有一种 peft 和 trl 功能，这确实很有用

2026-05-17 19:59:57

merve

#78

I finally got the tattoo @huggingface https://twitter.com/mervenoyann/status/2055302977158607211/photo/1

中文: 我终于得到了纹身 @huggingface

2026-05-15 15:03:10

merve

#77

RT @onusoz: People were asking at @clawcon singapore how to setup eg. gemma with OpenClaw, and I realize for some time that there is no eas…

中文: RT @onusoz:人们在@clawcon singapore 询问如何使用 OpenClaw 来设置 eg. gemma，我已意识到一段时间以下没有这个问题......

2026-05-15 07:47:26

merve

#76

this week @huggingface crossed 1M datasets 🚀 every open model you love was built on top of them next objective: more open coding session traces on Hub to push coding models even further 🤝 help push the open frontier by uploading your traces! https://twitter.com/mervenoyann/status/2054891459053039938/photo/1

中文: 本周，@huggingface 跨越了 1M 数据集 🚀 你所热爱的每一个开放模型都建立在它们之上下一个目标:在Hub上进行更多开放编码会话，以进一步推动编码模型 🤝 上传你的痕迹，帮助突破开阔的前沿!

2026-05-14 11:47:56

merve

#75

RT @aiDotEngineer: Your Agent Can Now Train Models The argument from @mervenoyann: open source models have caught up. GLM 5.1 is leading t…

中文: RT @aiDotEngineer:您的代理现在可以训练模型 @mervenoyann:开源模型的争论已经追上来。GLM 5.1 处于领先地位......

2026-05-13 18:06:47

merve

#74

RT @_lewtun: You can now have an AI researcher running on your laptop 24/7 for free! Running Qwen3-35B-A3B with llama.cpp and a 4-bit qua…

中文: RT @_lewtun:现在你可以免费让一台人工智能研究人员在笔记本电脑上免费运行! 使用 lama.cpp 和 4 位 qua 运行 Qwen3-35B-A3B ...

2026-05-13 11:29:19

merve

#73

RT @JulienBlanchon: I'm releasing OpenCS2 a 11TB dataset of around 5000 hours of counter strike gameplay recording. - HD resolution - 1280…

中文: RT @JulienBlanchon:我将发布 OpenCS2 一个包含约 5000 小时反击游戏记录的 11TB 数据集。 - 高清分辨率 - 1280...

2026-05-13 11:28:18

merve

#72

look mom I'm on my favorite YT channel this evening @aiDotEngineer 💗 I talked about how @huggingface meets your agent: you can ask your agent to do all ML workflows from training models to label data now https://twitter.com/mervenoyann/status/2054496147914252394/photo/1

中文: 看妈妈，今晚我在我最喜欢的YT频道上 @aiDotEngineer 💗 我谈到了@huggingface 如何与你的经纪人见面:你可以要求你的代理完成从训练模型到标注数据的所有机器学习工作流程，现在

2026-05-13 09:37:07

merve

#71

RT @sergeynazarovx: We used to go to a special website, ask strangers for help with programming, and get humiliated in return https://t.co/…

中文: RT @sergeynazarovx:我们过去常常访问一个特别网站，向陌生人求助编程，并会遭受羞辱，

2026-05-12 20:13:16

merve

#70

RT @huggingface: We've just hit 1M open datasets on the Hugging Face Hub 🎉 Open models need open data. Today we hit that milestone, togeth…

中文: RT @huggingface:我们刚刚在“拥抱”面部中心(EE0)上点击了100万个开放数据集开放模型需要开放数据。今天我们达到了那个里程碑，图集......

2026-05-12 15:35:51

merve

#69

Meta silently dropped Sapiens2 last week 🔥 a family of high-res models trained on 1B human images > for pose estimation, body-part segmentation, surface normals, pointmaps (sota) > 6 sizes: 0.1B → 5B params (all ViT patch 16) > high-res: 1024×768 and 4K https://twitter.com/mervenoyann/status/2054187884417102319/video/1

中文: Meta上周悄然放弃了Sapiens2 🔥 一个基于1B张人类图像训练的高分辨率模型家族 >用于姿势估计、身体与身体分割、表面正常、点图(sota) 6 种尺寸:0.1B → 5B 参数(所有 ViT 补丁 16) 高空:1024×768 和 4K

2026-05-12 13:12:11

merve

#68

this project uses entire @huggingface infra to build agentic medical intelligence 🔥 signup for preview ⤵️

中文: 该项目使用整个 @huggingface infra 来构建特化医疗智能 🔥 注册预览 ⤵️

2026-05-12 12:03:01

merve

#67

MiniCPM-V-4.6 is in 🔥 > 1B (SigLIP2-400M + Qwen3.5-0.8B) > beats Qwen3.5-0.8B on AA with 19x fewer tokens > beats other larger small VLMs > deploy to iOS + Android with GGUF and other quants https://twitter.com/mervenoyann/status/2053912404774248895/photo/1

中文: miniCPM-V-4.6 已登录 🔥 1B(SigLIP2-400M + Qwen3.5-0.8B) 在AA上以19倍的代币数量击败Qwen3.5-0.8B 比其他较大的小型VLM更胜一负与GGUF及其他量子点一起部署到iOS+安卓系统

2026-05-11 18:57:31

merve

#66

RT @victormustar: This feature is quite cool to run Hermes Agent locally because: - You can filter on the +60k models compatible with Herme…

中文: RT @victormustar:此功能在本地运行 Hermes Agent 非常酷，因为: - 您可以筛选与 Herme 兼容的 +60k 型号...

2026-05-11 15:41:27

merve

#65

🆕 Hugging Face 🤝 Hermes Agent 🔥 > we added Hermes Agent to local apps: run it locally with any compatible GGUF/MLX model > shipped native traces support for Hermes Agent: visualize your Hermes traces directly on the Hub Very soon most agents will run locally and we want to… https://twitter.com/mervenoyann/status/2053857347429151163/photo/1

中文: 🆕 拥抱面容 🤝 爱马仕探员 🔥 我们已将 Hermes Agent 添加到本地应用程序中:使用任何兼容的 GGUF/MLX 型号本地运行已发货的Hermes Agent原生痕迹支持:直接在Hub上直观显示您的Hermes痕迹很快大多数代理人员将在当地运营，我们希望......

2026-05-11 15:18:45

merve

#64

RT @onusoz: I have a new job! Excited to announce that I will be working with Hugging Face to make local models work great in OpenClaw and…

中文: RT @onusoz:我有一份新工作! 很高兴宣布，我将与Hugging Face合作，让本地模特在OpenClaw和...上大有作为

2026-05-11 13:35:42

merve

#63

RT @victormustar: Exciting: local ML is (finally) going mainstream 🔥 - new GGUF uploads on HF nearly doubled in 2 months - smaller models…

中文: RT @victormustar:令人兴奋:本地机器学习(终于)成为主流了 🔥 - HF 上新增 GGUF 上传量在两个月内几乎翻倍 - 更小的模型......

2026-05-11 11:15:10

merve

#62

RT @nathanhabib1011: 🦞 Claw-Eval 🦞 🥇 @XiaomiMiMo's MiMo-V2.5-Pro at 1T 🥈 @Zai_org GLM5.1 at 754B 🥉 @XiaomiMiMo MiMo-V2.5 at 310B Congrats…

中文: RT @nathanhabib1011:🦞 爪子-埃瓦尔 🦞 🥇 @XiaomiMiMo 的 MiMo-V2.5-Pro 1T 🥈 @Zai_org GLM5.1 at 754B 🥉 @XiaomiMiMo MiMo-V2.5，310B 恭喜......

2026-05-11 11:01:02

merve

#61

Istanbul Open-source AI meet-up @huggingface was 🔥 we had many stories from building in-house models cutting costs to agentic apps 🙌🏼 many thanks @trendyoltech @nsrt_py @anil_ozturkk for hosting us 🤗 https://twitter.com/mervenoyann/status/2053447237364002877/photo/1

中文: 伊斯坦布尔开源人工智能会议 @huggingface 是 🔥 我们从打造内部模型，向代理应用程序降低成本，经历了许多故事🙌🏼 非常感谢@trendyoltech @nsrt_py @anil_ozturkk 为我们提供的接待 🤗

2026-05-10 12:09:07

merve

#60

RT @ben_burtenshaw: PSA: If you put your blood, sweat, and tears into a custom model for your use case on OpenAI, make sure you get the wei…

中文: RT @ben_burtenshaw:PSA:如果你在 OpenAI 上为使用案例定制了血、汗和泪，请务必使用 wei...

2026-05-10 11:45:47

merve

#59

read our response here https://x.com/mervenoyann/status/2052752660537676198?s=46 I wish journalists can do better in the future wrt avoiding baiting

中文: 请在此处阅读我们的回复: 我希望记者们未来能做得更好，以免被诱骗

2026-05-08 14:09:49

merve

#58

RT @AnthropicAI: New Anthropic research: Natural Language Autoencoders. Models like Claude talk in words but think in numbers. The numbers…

中文: RT @AnthropicAI:新人类研究:自然语言自动编码器。像克劳德这样的模特会用言语说话，但会用数字来思考。数字......

2026-05-08 12:52:26

merve

#57

people don't even read articles these days and jump in to conclusions 👀

中文: 如今人们甚至不会阅读文章，而是贸然得出结论👀

2026-05-08 12:41:23

merve

#56

RT @adithya_s_k: Excited to release the Ultimate guide to RL environments! Definitions of RL environments differ wildly in the LLM era, so…

中文: RT @adithya_s_k:很高兴发布RL环境终极指南! 在LLM时代，RL环境的定义差异很大，因此......

2026-05-07 17:07:35

merve

#55

RT @Tu7uruu: Big announcement for speech AI Benchmarks get gamed. So we added a repellent. The Open ASR Leaderboard now includes private…

中文: RT @Tu7ruu:语音AI发布重要公告基准会被游戏。于是我们加了个驱人者。开放的ASR排行榜现在包含私有...

2026-05-06 11:35:45

merve

#54

RT @pcuenq: transformers v5.8.0 is here, and it's a biggie 🚀 Three massive model additions: 🐳 DeepSeek-V4: next-gen efficient MoE 🪨 Granit…

中文: RT @pcuenq:变压器 v5.8.0 到来，这很大🚀 三个庞大的模型新增功能: 🐳 DeepSeek-V4:下一代高效 MOE 🪨 格拉尼特......

2026-05-06 11:11:53

merve

#53

Gemma 4 just got a massive speed-up with MTP drafters ⚡️ > speculative decoding (up to 3x tokens/sec improvement compared to normal Gemma-4 🔥) > identical reasoning, just faster > day-0 support in transformers, MLX, vLLM > A2.0 licensed 🤗 https://twitter.com/mervenoyann/status/2051702372339003841/photo/1

中文: 杰玛4队刚刚凭借MTP选秀者迅速加速上场⚡ >推测性解码(与正常的Gemma-4相比，可提高3倍的代币/秒量🔥) 相同的推理，速度更快支持 > 用于变压器、MLX、vLLM 的 Day-0 获得A2.0授权🤗

2026-05-05 16:35:39

merve

#52

RT @osanseviero: Excited to introduce Gemma 4 Multi-Token Prediction Drafters⚡️Accelerated inference right in your pockets - Up to a 3x sp…

中文: RT @osanseviero:很高兴能在口袋里直接引入Gemma 4多代币预测绘图员⚡️ - 最高可达3倍 sp...

2026-05-05 16:21:14

merve

#51

RT @ben_burtenshaw: Introducing the context course: a free course on doing ML with agent context. You will learn how to train models, opti…

中文: RT @ben_burtenshaw:介绍上下文课程:一门关于使用代理语境进行机器学习的免费课程。您将学习如何训练模型，选择......

2026-05-05 14:48:13

merve

#50

I forked openclaw to build a small local rescue agent that debugs whenever the agent is down in just half an hour my fork was behind 15 commits insane pace

中文: 我把openclaw分叉来构建一个小型本地救援代理，每当代理人员倒下时都会进行调试在短短半小时内，我的分叉就落后于15个提交疯狂的步伐

2026-05-05 10:21:12

merve

#49

RT @victormustar: honestly Granite 4.1 8b might be the best model to run at this size https://huggingface.co/blog/ibm-granite/granite-4-1

中文: RT @victormustar:老实说，Granite 4.1 8b 可能是这个尺寸的最佳型号

2026-05-04 18:42:30

merve

#48

gpt-5.5-extra-high-as-a-kite

中文: gpt 5.5-ext-high-a-kite

2026-05-04 15:09:54

merve

#47

RT @0xSero: Weekly best models for your hardware: ~~ 8 to 16gb ~~ Granite models are amazing: [NEW] - https://huggingface.co/ibm-granite/granite-4.1-8b Gemma-E4B…

中文: RT @0xSero:每周最适合您硬件的型号: ~8到16克～～花岗岩模型令人惊叹:[新] - 杰玛-E4B...

2026-05-04 10:52:00

merve

#46

İstanbul'da buluşalım, konuşmacı ya da katılımcı olmak isterseniz başvurular aşağıda 🙌🏼

2026-05-03 10:58:50

merve

#45

RT @nsrt_py: Herkese selamlar! Hugging Face ile ilk etkinliğimiz olan "Open-source AI Meet-up with Hugging Face" 9 Mayıs saat 13:00'da Tren…

2026-04-30 12:24:00

merve

#44

RT @Tu7uruu: IBM just dropped TWO new open ASR models with very strong performance! > ~5.3 WER on Open ASR leaderboard (strong accuracy fo…

中文: RT @Tu7ruu:IBM刚刚推出了两款新的开放式ASR机型，性能非常出色! 在Open ASR排行榜上表现出色(准确性极强)

2026-04-30 09:16:39

merve

#43

RT @alvarobartt: IBM Granite just released two multilingual embedding models with 97M and 311M parameters 🤏🏻 ModernBERT-based, 200+ langua…

中文: RT @alvarobartt:IBM Granite 刚刚发布了两个多语言嵌入模型，参数为 97M 和 311M [EE] 总部位于现代伯特，200岁以上...

2026-04-30 05:21:14

merve

#42

nvidia cooked 😍 nemotron-3-nano-omni all modalities LLM 🔥 • 30B-A3B MoE hybrid Mamba-Transformer • 9x throughput vs other open omni models • native audio (1200s), long video, 100+ page docs in single turn • agentic CUA built in • BF16 / FP8 / NVFP4 https://twitter.com/mervenoyann/status/2049216818150015255/photo/1

中文: nvidia 已烹饪😍 nemotron-3-nano-omni 所有模式 LLM 🔥 • 30B-A3B MoE 混合型 Mamba-Transformer • 与其他开放式全系统模型相比，吞吐量为9倍 • 原生音频(1200年代)，长视频，100多页单折 • 内置的代理 CUA • BF16 / FP8 / NVFP4

2026-04-28 19:58:56

merve

#41

nvidia cooked 😍 nemotron-3-nano-omni all modalities LLM 🔥 • 30B-A3B MoE hybrid Mamba-Transformer • 9x throughput vs other open omni models • native audio (1200s), long video, 100+ page docs in single turn • agentic CUA built in • BF16 / FP8 / NVFP4

中文: nvidia 已烹煮 😍 - nemotron-3-nano-omni 所有模式 LLM 🔥 • 30B-A3B MoE 混合型 Mamba-Transformer • 与其他开放式全系统模型相比，吞吐量为9倍 • 原生音频(1200年代)，长视频，100多页单折 • 内置的代理 CUA • BF16 / FP8 / NVFP4

2026-04-28 19:54:11

merve

#40

any-to-any model based on Nemotron 3 Nano 🔥

中文: 基于 Nemotron 3 Nano 🔥 的任何型号

2026-04-28 17:58:07

merve

#39

we compile all the best benchmarks and model results so agents can find the best model for fine-tuning and inference for your hardware budget turns out it was a "skill" issue

中文: 我们编制所有最佳基准和模型结果以便代理为您的硬件预算找到最佳的微调和推理模型事实证明这是一个“技能”问题

2026-04-28 15:46:46

merve

#38

RT @ben_burtenshaw: Humanity's Last Hackathon is NOW OPEN for registration. This is not a normal hackathon. You will be judged on the cont…

中文: RT @ben_burtenshaw:Humanity's Last Hackathon 现已开放注册。这并非一场普通的黑客马拉松。你将在比赛中受到评判......

2026-04-28 14:43:58

merve

#37

RT @LysandreJik: I've been trying to make transformers more agent-friendly: agentic CLI, a skill, doc rewrites, canonical examples. It fe…

中文: RT @LysandreJik:我一直试图让变压器更环保:一种技术技能，文档重写，以规范为例。这......

2026-04-28 13:29:37

merve

#36

beauty of open-sourcing powerful models & datasets 😍

中文: 开源强大模型的美观;数据集 😍

2026-04-28 10:17:58

merve

#35

learn how to use, fine-tune, optimize and deploy bleeding edge audio models 🙌🏼

中文: 学习如何使用、微调、优化和部署出血边缘音频模型 🙌🏼

2026-04-27 14:22:42

merve

#34

RT @TencentHunyuan: 👋Hi /haɪ/, we're the Tencent Hy /haɪ/ team🐧 Today, we open source Hy3 preview (295B A21B), a leading reasoning and age…

中文: RT @TencentHunyeuan:👋 Hi /haɪ/，我们是腾讯 Hy /haɪ/ 团队🐧 今天，我们开源 Hy3 预览版(295B A21B)，这是一个领先的推理和年龄......

2026-04-27 14:18:08

merve

#33

stop spending money for your openclaw agent's memory search 💸 use local models on @huggingface with llama.cpp instead I use quantized Embedding Gemma @googlegemma, it can run on anything https://twitter.com/mervenoyann/status/2048724071936880697/photo/1

中文: 停止为你的开放小卡代理的内存搜索花钱 💸 使用 @huggingface 与 lalama.cpp 一起使用本地模型我使用量子化嵌入Gemma @googlegemma，它可以在任何平台上运行

2026-04-27 11:20:56

merve

#32

I have just crossed 10K friends on @huggingface 🤗💗 I try to make myself more and more useful for community and am always happy to be of service 🫡 https://twitter.com/mervenoyann/status/2048680605206856091/photo/1

中文: 我刚刚在@huggingface上结识了10K个朋友🤗💗 我努力让自己对社区越来越有用，并且总是乐于服务🫡

2026-04-27 08:28:13

merve

#31

it's only 48 hours: - Qwen3.6-27B - Tencent-Hy3-preview - DeepSeekv4 what's next? 👀

中文: 只有48小时: - Qwen3.6-27B - 腾讯-海伊3-预览 - 深度深度接下来会发生什么?👀

2026-04-24 13:10:17

merve

#30

RT @julien_c: This is where we are right now. And i’m not gonna lie it feels pretty magical 🧚‍♀️ Qwen3.6 27B running inside of Pi coding a…

中文: RT @julien_c:我们目前所处的位置。我不会撒谎，感觉相当神奇🧚 ♀️ 运行在Pi编程内的Qwen3.6 27B...

2026-04-24 13:05:56

merve

#29

read this deep dive from Ben on DSv4 release today ⬇️

中文: 今天在DSv4上阅读本的深度阅读⬇️

2026-04-24 12:48:51

merve

#28

if you were in a cave, DeepSeek v4 is out, and it's groundbreaking, here's why: it's the first open model to have solved long context: your agentic setup (OpenClaw, coding agents) need many agents, hybrid attention by DSv4 compresses KV-cache, allowing more overhead memory on…

中文: 如果你在洞穴里，DeepSeek v4 已经出局，而且具有开创性，原因就是: 这是首个解决长上下文的开放式模型:您的代理设置(OpenClaw、编码代理)需要多种代理，而 DSv4 的混合注意力则压缩了 KV 缓存，从而实现了更多的管理内存上......

2026-04-24 08:34:37

merve

#27

RT @ben_burtenshaw: deepseek-v4 is out and solves context rot at 1M tokens by taking on attention for the kv cache. It's big at 1T Params…

中文: RT @ben_burtenshaw: deepseek-v4 已经退出，通过关注 kv 缓存，解决 1M 代币的上下文问题。在1T Params上很大......

2026-04-24 08:12:26

merve

#26

DSv4 genuinely shines in 1M context window and peak efficiency to run many agents/users 😍 shortly coming to transformers and we're making sure you get all the peak efficiency 🔥 @art_zucker https://twitter.com/mervenoyann/status/2047568003093471283/photo/1

中文: DSv4 在 1M 环境窗口中真正大放异彩，并实现高效运行，可运行多种代理/用户 😍 即将进入变压器，我们确保获得所有最高能效 [EE] @art_zucker

2026-04-24 06:47:08

merve

#25

DeepSeek v4 is out with 1M context window 🥵🔥 > Pro (13B/284B) & Flash (49B/1.6T) > hybrid attention, needs 27% flops & 10% kv cache compared to V3.2 > reasoning effort: non-think, think high, think max > MIT licensed 💗 https://twitter.com/mervenoyann/status/2047547789601595772/photo/1

中文: DeepSeek v4 已关闭，具有 1M 上下文窗口 🥵🔥 >专业(13B/284B)和快速;闪光灯(49B/1.6T) 混合注意力，需要27%的空转;与V3.2相比，缓存为10% 推理能力:不思考，高思考，最大思考麻省理工学院持牌💗

2026-04-24 05:26:49

merve

#24

RT @deepseek_ai: 🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSe…

中文: RT @deepseek_ai:🚀 DeepSeek-V4 预览版正式上线，即开源!欢迎来到具有成本效益的1M环境长度时代。 🔹 深度......

2026-04-24 05:09:37

merve

#23

RT @gabriberton: I miss ConvNets Much simpler and more intuitive than transformers Early layers would always converge to the same feature…

中文: RT @gabrierton:我想念ConvNets 比变压器简单得多，也更直观早期层次总能汇聚到相同的特征上......

2026-04-23 06:26:36

merve

#22

OpenAI just released Privacy Filter > multilingual PII redaction with 128k context window 🤯 only 1B params > fine-tunable > redact variety of things: including emails, address, names, secrets (best for platform/agent logs) > transformers & ONNX weights 🤗 https://twitter.com/mervenoyann/status/2046980302002602473/photo/1

中文: OpenAI 刚刚发布了隐私筛选器附加语言PII编辑，包含128k个上下文窗口🤯，仅支持1B参数可调和删减多种内容:包括电子邮件、地址、姓名、秘密(最适合平台/代理日志) 设备:变压器和放大器;ONNX 重量 🤗

2026-04-22 15:51:49

merve

#21

RT @Alibaba_Qwen: 🚀 Meet Qwen3.6-27B, our latest dense, open-source model, packing flagship-level coding power! Yes, 27B, and Qwen3.6-27B…

中文: RT @Alibaba_Qwen:🚀 与Qwen3.6-27B会面，这是我们最新的密集开源机型，采用旗舰级编程功能! 是的，27B，Qwen3.6-27B...

2026-04-22 15:00:21

merve

#20

RT @stevibe: Which LLMs actually love to think? Tested 7 models on 5 math problems, measured reasoning length. The think winners: both Qw…

中文: RT @stevibe:哪些LLMs真的喜欢思考? 测试了7个关于5个数学问题的模型，测量了推理长度。获胜的思想家:两者兼有......

2026-04-22 12:25:21

merve

#19

RT @googlegemma: What does it take to run 3, 5, or even 10 concurrent instances of Gemma 4 locally? We've open-sourced a demo letting you…

中文: RT @googlegemma:在本地运行3、5甚至10个并发的Gemma 4实例需要什么? 我们开源了一个演示，让你......

2026-04-21 17:39:57

merve

#18

RT @akseljoonas: Introducing ml-intern, the agent that just automated the post-training team @huggingface It's an open-source implementati…

中文: RT @akseljoonas:介绍 mls-intern，这位刚刚将训练后团队自动化的代理人 @huggingface 这是一个开源的实现......

2026-04-21 16:09:36

merve

#17

bad thing about having your OC agent on local model is having to maintain your mlx/llama-server but also even by then it sometimes goes down and it's not the llama server does anyone have any tips https://twitter.com/mervenoyann/status/2046620360351613187/photo/1

中文: 让你的OC代理使用本地模型，就是必须维护你的mlx/llama-服务器但即便如此，它有时也会下降，而它并非“骆驼”服务器有谁有建议吗

2026-04-21 16:01:32

merve

#16

RT @ben_burtenshaw: 2 days until we will hold this deep dive workshop on everything RL for agents with some of the best names in the game.…

中文: RT @ben_burtenshaw:距离我们举办本次深潜研讨会前，将为拥有游戏中一些最知名球员的经纪人提供所有RL课程。......

2026-04-21 12:49:53

merve

#15

no shade but I hope one day god gives me the confidence of an average random AI strategy/innovation person on LI

中文: 没有遮阳，但希望有一天，上帝能让我在LI上给出一个普通的随机人工智能策略/创新人的自信

2026-04-20 19:56:15

merve

#14

kimi k2.6 is out: open source coding sota 🔥 > 32B/1T MoE with 256k context > long horizon coding + better website design > most interesting: agent swarms (300 subagents can do 4k steps) & Claw groups (multiple self improving agents) https://twitter.com/mervenoyann/status/2046254380102373739/photo/1

中文: kimi k2.6 已发布:开源编码 sota 🔥 加法;32B/1T 闺號带 256k 上下文 > 长地平线编程 + 更完善的网站设计最有趣的是:代理群(300 个亚中介可以完成 4K 步骤)和草皮组(多个自我提升代理)

2026-04-20 15:47:16

merve

#13

RT @Kimi_Moonshot: Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench…

中文: RT @Kimi_Moonshot:认识 Kimi K2.6:推进开源编码 🔹 开源SOTA与HLE的工具(54.0)、SWE-Bench Pro(58.6)、SWE-bench...

2026-04-20 15:32:22

merve

#12

this model is an underrated gem and the results are very strong 🙌🏼

中文: 这个模型是一颗被低估的宝石，结果非常强劲🙌🏼

2026-04-20 15:18:13

merve

#11

RT @LysandreJik: We're opening a Hugging Face office in Tokyo! Our goal: help open-source AI develop in Japan and grow the local communit…

2026-04-20 13:35:23

merve

#10

RT @NielsRogge: We've added support for SAM-3 Lite-Text in the Transformers library! 🔥 > replaces the heavy text encoder in SAM-3 with a c…

中文: RT @NielsRogge:我们已在 Transformers 库中添加了对 SAM-3 Lite-Text 的支持!🔥 将 SAM-3 中的重文本编码器替换为 c...

2026-04-20 12:59:21

merve

#9

RT @stevibe: Which local models can actually handle tool calling? I built a framework to find out. 15 scenarios. 12 tools. Mocked respons…

中文: RT @stevibe:哪些本地模型实际上可以处理工具调用? 我建立了一个框架来查明事实。 15个场景。12个工具。模拟反应......

2026-04-19 15:52:01

merve

#8

RT @onusoz: Who is running local models on GPUs on OpenClaw? I have started benchmarking different models this week. I am working on impro…

中文: RT @onusoz:谁在 OpenClaw 上使用 GPU 运行本地模型? 我本周开始对不同模型进行基准测试。我正在做 impro 的工作......

2026-04-19 14:35:04

merve

#7

tried my openclaw intern with various open models recently, currently using Qwen3.6 with Q6_K vibe: GLM-5 and Minimax sounded a bit more witty and friendly whereas Qwen seems to forget the character a bit Q6_K without reasoning is still surprisingly accurate though

中文: 最近，我试用了各种开放式型号的 openclaw 实习生，目前使用 Qwen3.6 和 Q6_K 氛围:GLM-5和Minimax听起来更幽默、更友善，而Qwen似乎有点忘记了这个角色毫无理由仍然出人意料地准确

2026-04-19 11:37:52

merve

#6

if you're using Replit, Antigravity or other vibe-building tools 👋🏻 simply adding @huggingface Skills to your setup gives your agent access to ~3M open models, 500k+ local AI apps and ~1M datasets agent will pick and build with the best model for your use case and hardware https://twitter.com/mervenoyann/status/2045816220679537088/photo/1

中文: 如果你正在使用 Replit、Antigravity 或其他氛围构建工具 👋🏻 只需在设置中添加 @huggingface Skills，即可让您的代理能够访问 ~3M 的开放模型、500k+ 本地人工智能应用程序以及 ~1M 数据集代理将采用最适合您使用的用例和硬件的型号进行选择和构建

2026-04-19 10:46:11

merve

#5

RT @prithivMLmods: HY-World-2.0 Demo is now live on @huggingface Spaces for 3D world reconstruction and simulation with Gradio and Server m…

中文: RT @prithivMLmods:HY-World-2.0 演示现已在 @huggingface Spaces 上实时上线，通过 Gradio 和 Server m.

2026-04-18 15:50:31

merve

#4

no shade but it pains me to know that AI isn't replacing hard labor where kids in developing countries are dying in press machines or adults die in mines but rather all the investment is for creative jobs and the companies doing this claim to change the world yeah go off

中文: 没有遮蔽，但让我痛心地知道，人工智能并不能取代那些发展中国家儿童在印刷机上或成年人在矿井中死亡的辛苦劳动，而所有投资都用于创造性工作而那些声称要改变世界的公司，是的

2026-04-18 11:32:42

merve

#3

use GLM-5.1 or MiniMax or Gemma-4 you can't be banned from your servers

中文: 使用 GLM-5.1 或 MiniMax 或 Gemma-4 你的服务器不能被禁止

2026-04-18 11:15:39

merve

#2

we met @swyx 🐐 @andimarafioti https://twitter.com/mervenoyann/status/2042004372704424284/photo/1

中文: 我们遇到了 @swyx 🐐 @andimarafioti

2026-04-08 22:19:15

merve

#1

fun blog on fine-tuning gemma 4 and my failures and vibe tests incoming 🔜

中文: 关于微调 gemma 4 的有趣博客，以及我的失败与氛围测试

2026-04-04 17:03:54