(for LLM & AI Agent in Time Series and Education domains, check AI for Time Series, AI for Education)
Algorithm & Benchmark & Data & Code:
- [KDD'26] Evaluating RAG Robustness to Symbolic Perturbations, KDD 2026.
- [AAAI'26] Assemble Your Crew: Automatic Multi-agent Communication Topology Design via Autoregressive Graph Generation, AAAI 2026. [arXiv] (AAAI Oral, Top 5%)
- [AAAI'26] SafeSieve: From Heuristics to Experience in Progressive Pruning for LLM-based Multi-Agent Communication, AAAI 2026. [arXiv]
- [NeurIPS'25] Improving Nonlinear RNN with Closed-loop Control, NeurIPS 2025. [arXiv] (NeurIPS Spotlight, Top 3.5%)
- [NeurIPS'25] Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs, NeurIPS 2025.
- [NeurIPS'25] SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders, NeurIPS 2025.
- [ICLR'25] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? ICLR 2025. [arXiv] [code]
- [NeurIPS'24] AutoSurvey: Large Language Models Can Automatically Write Surveys, NeurIPS 2024 [arXiv] [code]
- [ACL'25] NetSafe: Exploring the Topological Safety of Multi-agent Networks, ACL 2025. [arXiv]
- [EMNLP'25] DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition, EMNLP 2025. [arXiv]
- [AAAI'25] UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction, AAAI 2025 [paper]
- [MM'25] The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework, ACM MM 2025. [arXiv]
- [MM'25] Debiasing Multimodal Large Language Models via Penalization of Language Priors, ACM MM 2025. [arXiv] [code]
- [WWW'24] UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web, WWW '24 [arXiv]
- [EMNLP'24] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation, EMNLP 2024 (Demo Track) [arXiv] [code]
- [arXiv'24] Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models, arXiv 2024. [arXiv] [code]
- [arXiv'25] LLM-Virus: Evolutionary Jailbreak Attack on Large Language Models, arXiv 2025. [arXiv]
- [arXiv'25] AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management, arXiv 2025. [arXiv]
- [arXiv'25] Automating Personalization: Prompt Optimization for Recommendation Reranking, arXiv 2025. [arXiv]
- [arXiv'25] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models, arXiv 2025. [arXiv]
- [arXiv'25] Backdoor Attribution: Elucidating and Controlling Backdoor in Language Models, arXiv 2025. [arXiv]
- [arXiv'25] ARM2: Adaptive Reasoning Model with Vision Understanding and Executable Code, arXiv 2025. [arXiv]