08:57 · Apr 16, 2025 · Wed https://x.com/karminski3/status/1912287572065415582?t=4kgi73oZCfIWD3KOiLAkJA&s=35 X (formerly Twitter) karminski-牙医 (@karminski3) on X 微软研究院整了个活,发布了个原生 1-bit 的大语言模型 —— bitnet-b1.58-2B-4T有啥意义吗?有的,这个模型虽然将权重量化到超低精度(实际是1.58位,权重只有{-1, 0, +1}三个值),但它在性能上几乎能与其它2B参数规模的全精度模型相媲美。与传统模型相比,这个1-bit模型带来了惊人的效率提升:-
11:25 · Apr 15, 2025 · Tue https://s3.cn-north-1.amazonaws.com.cn/sides-share/%E6%B7%B1%E5%85%A5%E6%8E%A2%E8%AE%A8+Amazon+ElastiCache+.pdf
09:52 · Apr 15, 2025 · Tue https://x.com/HiTw93/status/1911921912399421851?t=tdK8eRDyQsU9Ae0hHdYuag&s=35 X (formerly Twitter) Tw93 (@HiTw93) on X 这个开源的小说转语音智能解决方案 EasyVoice,展示效果做得挺有趣,支持处理大型文本文件,轻松将超长小说转换为语音,这里比较难处理的是如何转成很流畅的对话、多音字处理、分割等地方。https://t.co/JKW7I7MCUe
23:43 · Apr 14, 2025 · Mon https://x.com/Aurimas_Gr/status/1911403414094758029?t=OSEfMcJs_6ZHYW2RliHDng&s=35 X (formerly Twitter) Aurimas Griciūnas (@Aurimas_Gr) on X 𝗠𝗖𝗣 and 𝗔𝟮𝗔: Friends or Foes? In my latest Newsletter episode I talk about both protocols. Could A2A eat up MCP in the long term?I have been asked multiple times why I think the two protocols could become competitive in the future. I tried to outline my…
23:43 · Apr 14, 2025 · Mon https://x.com/Aurimas_Gr/status/1910671639869530502?t=dT4EUKh1soARHDrWFR4DaA&s=35 X (formerly Twitter) Aurimas Griciūnas (@Aurimas_Gr) on X 𝗠𝗖𝗣 𝘃𝘀. 𝗔𝟮𝗔Two days ago Google announced an open A2A (Agent2Agent) protocol in an attempt to normalise how we implement multi-Agent system communication.As always, social media is going crazy about it, but why?Let’s review the differences and how both…
23:35 · Apr 14, 2025 · Mon https://x.com/xu_paco/status/1911689824949715009?t=Au-d_RQG0OqUV1ehNVs_lw&s=35 X (formerly Twitter) paco xu (@xu_paco) on X https://t.co/Yam7NRK85j 看到评论很多人对 cka、ckad、cks 还是很认可的。截图是我的想法。
23:33 · Apr 14, 2025 · Mon https://x.com/ayakaneko/status/1911704595052810675?t=MiVnzj-uceMTZg7kIPrJTA&s=35
23:31 · Apr 14, 2025 · Mon https://x.com/op7418/status/1911755915864523177?t=YUjzs93SqTSwr1fgGwCPeQ&s=35 X (formerly Twitter) 歸藏(guizang.ai) (@op7418) on X 字节确实变了居然先发布了新版 Seaweed 视频模型的论文和演示除了常规文生、图生视频外还支持:- 音视频同步生成- 长镜头与多镜头叙事- 高分辨率超分与实时生成- 世界建模与相机控制下面的论文页面有更多演示
23:29 · Apr 14, 2025 · Mon 终于把 DeepSeek 这个系列的 DeepSeek V2 里的 MLA 写完了,我之前以为这个是最难理解的,因为涉及到一些纯线性代数的推导。不过回过头来看其实里面数学的难度不大,但这个过程却十分精彩,有时候我都觉得这里面有剧本。DeepSeek 系列的论文我其实已经都扫过一遍了,但最喜欢的还是 V2 这篇论文。https://oilbeater.com/2025/04/14/deepseek-mla/ Oilbeater 的自习室 DeepSeek MLA -- 为成本优化而生的注意力机制 | Oilbeater 的自习室
23:29 · Apr 14, 2025 · Mon https://x.com/ZHO_ZHO_ZHO/status/1911692518644723788?t=m-IOZCgW_sWUCHUcjIvvEw&s=35 X (formerly Twitter) -Zho- (@ZHO_ZHO_ZHO) on X 如果变成蒙德里安我是真喜欢啊啊啊啊啊(GPT 4o 懂我
23:28 · Apr 14, 2025 · Mon https://x.com/Kimi_Moonshot/status/1911805130099638683?t=LwX7wGkoFuhJw5f6UUdQSw&s=35
23:26 · Apr 14, 2025 · Mon https://x.com/karminski3/status/1911791924648051034?t=1q4YAbuzn2EiMS8zfGIgJA&s=35 X (formerly Twitter) karminski-牙医 (@karminski3) on X 速报——智谱好像要发 GLM4看上去模型大小分32B和9B,然后不同参数量大小还有衍生模型,比如 GLM-4-32B-0414 是基座模型,GLM-4-32B-Chat-0414 是 Chat 模型,GLM-Z1-32B-0414 是思考模型,GLM-4-Z1-Rumination-32B-0414 (Rumination 反刍/沉思?不知道是不是前几天那个沉思),GLM-4V-9B
23:25 · Apr 14, 2025 · Mon https://x.com/keepeetron/status/1911422467068883163?t=JNTxtKbPK9xh6dCss8a6YA&s=35 X (formerly Twitter) keepee⚫ (@keepeetron) on X wishlist sobo now... or don't, it's your lifehttps://t.co/KHnNhDovG9
23:23 · Apr 14, 2025 · Mon https://x.com/karminski3/status/1911831173342597462?t=ABaOKdPY97A-udeQ0-IkUA&s=35 X (formerly Twitter) karminski-牙医 (@karminski3) on X GPT-4.1 API 价格公布
23:22 · Apr 14, 2025 · Mon https://x.com/invisal89/status/1911692246182928753?t=EkkmqO_AUihYPrbAGerFhw&s=35 X (formerly Twitter) Visal In (@invisal89) on X SQLite Internal — a "very human" SQLite file format viewer — is now available! It's still a work in progress, but I’m releasing it early to gather feedback.https://t.co/2kNFq9yKhr