开源项目
最新开源 AI 模型、工具、框架与社区项目推荐。
摩尔线程MTT S5000全面适配阿里Qwen3.5三款新模型
36氪获悉,2月26日,摩尔线程官微宣布已在AI训推一体全功能GPU MTT S5000上,完成对阿里三款全新模型的全方位适配。据介绍,继开源Qwen3.5-397B-A17B之后,阿里宣布开源千问3.5最新三款中等规模模型Qwen3.5-...
Hyperbolic Busemann Neural Networks
arXiv:2602.18858v2 Announce Type: replace-cross Abstract: Hyperbolic spaces provide a natural geometry for representing ...
Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers
arXiv:2602.18292v2 Announce Type: replace-cross Abstract: Decoding sits between a language model and everything we do wi...
Diffusion Language Models Know the Answer Before Decoding
arXiv:2508.19982v4 Announce Type: replace-cross Abstract: Diffusion language models (DLMs) have recently emerged as an a...
Modular Deep Learning for Multivariate Time-Series: Decoupling Imputation and Downstream Tasks
arXiv:2411.03941v3 Announce Type: replace-cross Abstract: Missing values are pervasive in large-scale time-series data, ...
Spilled Energy in Large Language Models
arXiv:2602.18671v2 Announce Type: replace Abstract: We reinterpret the final Large Language Model (LLM) softmax classifi...
Spurious Rewards: Rethinking Training Signals in RLVR
arXiv:2506.10947v2 Announce Type: replace Abstract: We show that reinforcement learning with verifiable rewards (RLVR) c...
Temporal Knowledge-Graph Memory in a Partially Observable Environment
arXiv:2408.05861v4 Announce Type: replace Abstract: Agents in partially observable environments require persistent memor...
Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes
arXiv:2602.22197v1 Announce Type: cross Abstract: Advances in Generative AI (GenAI) have led to the development of vario...
On Imbalanced Regression with Hoeffding Trees
arXiv:2602.22101v1 Announce Type: cross Abstract: Many real-world applications provide a continuous stream of data that ...
DualWeaver: Synergistic Feature Weaving Surrogates for Multivariate Forecasting with Univariate Time Series Foundation Models
arXiv:2602.22066v1 Announce Type: cross Abstract: Time-series foundation models (TSFMs) have achieved strong univariate ...
RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models
arXiv:2602.22026v1 Announce Type: cross Abstract: Metro trains often operate in highly complex environments, characteriz...
xai-cola: A Python library for sparsifying counterfactual explanations
arXiv:2602.21845v1 Announce Type: cross Abstract: Counterfactual explanation (CE) is an important domain within post-hoc...
Learning from Yesterday's Error: An Efficient Online Learning Method for Traffic Demand Prediction
arXiv:2602.21757v1 Announce Type: cross Abstract: Accurately predicting short-term traffic demand is critical for intell...
ECHOSAT: Estimating Canopy Height Over Space And Time
arXiv:2602.21421v1 Announce Type: cross Abstract: Forest monitoring is critical for climate change mitigation. However, ...
Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages
arXiv:2602.21374v1 Announce Type: cross Abstract: Extracting clinical information from medical transcripts in low-resour...
又快又省?仅5%参数、训练快4倍!ArcFlow用「非线性」魔法实现FLUX/Qwen推理40倍加速
在生成式 AI 的浪潮中,我们见证了从 Stable Diffusion 到 FLUX、Qwen-Image 等大规模扩散模型的画质飞跃。然而,这种飞跃并非没有代价。为了从纯噪声中 “雕刻” 出清晰的图像,这些模型通...
DeepMind药物衍生公司的独家新AI,堪称AlphaFold 4的专有药物设计引擎
编辑丨&在谷歌 DeepMind 发布了针对药物发现的更新版 AlphaFold3 近两年后,其生物制药衍生公司 Isomorphic Labs 宣布了更强大的人工智能模型—&md...
最强Coding Plan上线!阿里云上线Qwen3.5、GLM-5、MiniMax M2.5、Kimi K2.5四大顶尖开源模型
2月25日,阿里云百炼推出包含Qwen3.5、GLM-5、MiniMax M2.5、Kimi K2.5四大顶尖开源模型API服务的最强Coding Plan。用户订阅套餐后不再受限于单一模型,可实现多模型自由切换,享受更稳定、Tokens额...
消费级显卡可跑!刚刚,阿里Qwen3.5又开源3款新模型
刚过完年,阿里又卷起来了。2 月 25 日,继除夕开源 Qwen3.5-397B-A17B 之后,阿里继续开源千问 3.5 系列模型,而且是一口气开源三款中等规模的新模型,分别是 Qwen3.5-35B-A3B、Qwen3.5-1...
PMG: Parameterized Motion Generator for Human-like Locomotion Control
arXiv:2602.12656v2 Announce Type: replace-cross Abstract: Recent advances in data-driven reinforcement learning and moti...
UI-Venus-1.5 Technical Report
arXiv:2602.09082v2 Announce Type: replace-cross Abstract: GUI agents have emerged as a powerful paradigm for automating ...
GOT-Edit: Geometry-Aware Generic Object Tracking via Online Model Editing
arXiv:2602.08550v3 Announce Type: replace-cross Abstract: Human perception for effective object tracking in 2D video str...
MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation
arXiv:2510.18316v3 Announce Type: replace-cross Abstract: Imitation learning from large-scale, diverse human demonstrati...
LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts
arXiv:2509.25684v2 Announce Type: replace-cross Abstract: Recent studies have shown that combining parameter-efficient f...
CONTINA: Confidence Interval for Traffic Demand Prediction with Coverage Guarantee
arXiv:2504.13961v2 Announce Type: replace-cross Abstract: Accurate short-term traffic demand prediction is critical for ...
Hidden Dynamics of Massive Activations in Transformer Training
arXiv:2508.03616v2 Announce Type: replace Abstract: We present the first comprehensive analysis of massive activation de...
Efficient Hierarchical Any-Angle Path Planning on Multi-Resolution 3D Grids
arXiv:2602.21174v1 Announce Type: cross Abstract: Hierarchical, multi-resolution volumetric mapping approaches are widel...
SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery
arXiv:2602.21136v1 Announce Type: cross Abstract: Qualitative insights from user experiences are critical for informing ...
MIP Candy: A Modular PyTorch Framework for Medical Image Processing
arXiv:2602.21033v1 Announce Type: cross Abstract: Medical image processing demands specialized software that handles hig...
Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression
arXiv:2602.20650v1 Announce Type: cross Abstract: Large-scale image datasets are fundamental to deep learning, but their...
POMDPPlanners: Open-Source Package for POMDP Planning
arXiv:2602.20810v1 Announce Type: new Abstract: We present POMDPPlanners, an open-source Python package for empirical ev...
Train AI models with Unsloth and Hugging Face Jobs for FREE
GGML and llama.cpp join HF to ensure the long-term progress of Local AI
Deploying Open Source Vision Language Models (VLM) on Jetson
Spanish ‘soonicorn’ Multiverse Computing releases free compressed AI model
Spanish startup Multiverse Computing has released a new version of its HyperNova 60B model on Hugging Face that, it says...