开源项目

最新开源 AI 模型、工具、框架与社区项目推荐。

开源

摩尔线程MTT S5000全面适配阿里Qwen3.5三款新模型

36氪获悉,2月26日,摩尔线程官微宣布已在AI训推一体全功能GPU MTT S5000上,完成对阿里三款全新模型的全方位适配。据介绍,继开源Qwen3.5-397B-A17B之后,阿里宣布开源千问3.5最新三款中等规模模型Qwen3.5-...

开源

Hyperbolic Busemann Neural Networks

arXiv:2602.18858v2 Announce Type: replace-cross Abstract: Hyperbolic spaces provide a natural geometry for representing ...

开源

Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

arXiv:2602.18292v2 Announce Type: replace-cross Abstract: Decoding sits between a language model and everything we do wi...

开源

Diffusion Language Models Know the Answer Before Decoding

arXiv:2508.19982v4 Announce Type: replace-cross Abstract: Diffusion language models (DLMs) have recently emerged as an a...

开源

Modular Deep Learning for Multivariate Time-Series: Decoupling Imputation and Downstream Tasks

arXiv:2411.03941v3 Announce Type: replace-cross Abstract: Missing values are pervasive in large-scale time-series data, ...

开源

Spilled Energy in Large Language Models

arXiv:2602.18671v2 Announce Type: replace Abstract: We reinterpret the final Large Language Model (LLM) softmax classifi...

开源

Spurious Rewards: Rethinking Training Signals in RLVR

arXiv:2506.10947v2 Announce Type: replace Abstract: We show that reinforcement learning with verifiable rewards (RLVR) c...

开源

Temporal Knowledge-Graph Memory in a Partially Observable Environment

arXiv:2408.05861v4 Announce Type: replace Abstract: Agents in partially observable environments require persistent memor...

开源

Off-The-Shelf Image-to-Image Models Are All You Need To Defeat Image Protection Schemes

arXiv:2602.22197v1 Announce Type: cross Abstract: Advances in Generative AI (GenAI) have led to the development of vario...

开源

On Imbalanced Regression with Hoeffding Trees

arXiv:2602.22101v1 Announce Type: cross Abstract: Many real-world applications provide a continuous stream of data that ...

开源

DualWeaver: Synergistic Feature Weaving Surrogates for Multivariate Forecasting with Univariate Time Series Foundation Models

arXiv:2602.22066v1 Announce Type: cross Abstract: Time-series foundation models (TSFMs) have achieved strong univariate ...

开源

RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models

arXiv:2602.22026v1 Announce Type: cross Abstract: Metro trains often operate in highly complex environments, characteriz...

开源

xai-cola: A Python library for sparsifying counterfactual explanations

arXiv:2602.21845v1 Announce Type: cross Abstract: Counterfactual explanation (CE) is an important domain within post-hoc...

开源

Learning from Yesterday's Error: An Efficient Online Learning Method for Traffic Demand Prediction

arXiv:2602.21757v1 Announce Type: cross Abstract: Accurately predicting short-term traffic demand is critical for intell...

开源

ECHOSAT: Estimating Canopy Height Over Space And Time

arXiv:2602.21421v1 Announce Type: cross Abstract: Forest monitoring is critical for climate change mitigation. However, ...

开源

Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages

arXiv:2602.21374v1 Announce Type: cross Abstract: Extracting clinical information from medical transcripts in low-resour...

又快又省?仅5%参数、训练快4倍!ArcFlow用「非线性」魔法实现FLUX/Qwen推理40倍加速
开源

又快又省?仅5%参数、训练快4倍!ArcFlow用「非线性」魔法实现FLUX/Qwen推理40倍加速

在生成式 AI 的浪潮中,我们见证了从 Stable Diffusion 到 FLUX、Qwen-Image 等大规模扩散模型的画质飞跃。然而,这种飞跃并非没有代价。为了从纯噪声中 “雕刻” 出清晰的图像,这些模型通...

DeepMind药物衍生公司的独家新AI,堪称AlphaFold 4的专有药物设计引擎
开源

DeepMind药物衍生公司的独家新AI,堪称AlphaFold 4的专有药物设计引擎

编辑丨&在谷歌 DeepMind 发布了针对药物发现的更新版 AlphaFold3 近两年后,其生物制药衍生公司 Isomorphic Labs 宣布了更强大的人工智能模型—&md...

最强Coding Plan上线!阿里云上线Qwen3.5、GLM-5、MiniMax M2.5、Kimi K2.5四大顶尖开源模型
开源

最强Coding Plan上线!阿里云上线Qwen3.5、GLM-5、MiniMax M2.5、Kimi K2.5四大顶尖开源模型

2月25日,阿里云百炼推出包含Qwen3.5、GLM-5、MiniMax M2.5、Kimi K2.5四大顶尖开源模型API服务的最强Coding Plan。用户订阅套餐后不再受限于单一模型,可实现多模型自由切换,享受更稳定、Tokens额...

消费级显卡可跑!刚刚,阿里Qwen3.5又开源3款新模型
开源

消费级显卡可跑!刚刚,阿里Qwen3.5又开源3款新模型

刚过完年,阿里又卷起来了。2 月 25 日,继除夕开源 Qwen3.5-397B-A17B 之后,阿里继续开源千问 3.5 系列模型,而且是一口气开源三款中等规模的新模型,分别是 Qwen3.5-35B-A3B、Qwen3.5-1...

开源

PMG: Parameterized Motion Generator for Human-like Locomotion Control

arXiv:2602.12656v2 Announce Type: replace-cross Abstract: Recent advances in data-driven reinforcement learning and moti...

开源

UI-Venus-1.5 Technical Report

arXiv:2602.09082v2 Announce Type: replace-cross Abstract: GUI agents have emerged as a powerful paradigm for automating ...

开源

GOT-Edit: Geometry-Aware Generic Object Tracking via Online Model Editing

arXiv:2602.08550v3 Announce Type: replace-cross Abstract: Human perception for effective object tracking in 2D video str...

开源

MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation

arXiv:2510.18316v3 Announce Type: replace-cross Abstract: Imitation learning from large-scale, diverse human demonstrati...

开源

LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts

arXiv:2509.25684v2 Announce Type: replace-cross Abstract: Recent studies have shown that combining parameter-efficient f...

开源

CONTINA: Confidence Interval for Traffic Demand Prediction with Coverage Guarantee

arXiv:2504.13961v2 Announce Type: replace-cross Abstract: Accurate short-term traffic demand prediction is critical for ...

开源

Hidden Dynamics of Massive Activations in Transformer Training

arXiv:2508.03616v2 Announce Type: replace Abstract: We present the first comprehensive analysis of massive activation de...

开源

Efficient Hierarchical Any-Angle Path Planning on Multi-Resolution 3D Grids

arXiv:2602.21174v1 Announce Type: cross Abstract: Hierarchical, multi-resolution volumetric mapping approaches are widel...

开源

SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery

arXiv:2602.21136v1 Announce Type: cross Abstract: Qualitative insights from user experiences are critical for informing ...

开源

MIP Candy: A Modular PyTorch Framework for Medical Image Processing

arXiv:2602.21033v1 Announce Type: cross Abstract: Medical image processing demands specialized software that handles hig...

开源

Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression

arXiv:2602.20650v1 Announce Type: cross Abstract: Large-scale image datasets are fundamental to deep learning, but their...

开源

POMDPPlanners: Open-Source Package for POMDP Planning

arXiv:2602.20810v1 Announce Type: new Abstract: We present POMDPPlanners, an open-source Python package for empirical ev...

开源

Train AI models with Unsloth and Hugging Face Jobs for FREE

开源

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

开源

Deploying Open Source Vision Language Models (VLM) on Jetson

开源

Spanish ‘soonicorn’ Multiverse Computing releases free compressed AI model

Spanish startup Multiverse Computing has released a new version of its HyperNova 60B model on Hugging Face that, it says...