List of Papers

List of Papers

Defining Trust in AI
1. A Survey on Trustworthy LLM Agents: Threats and Countermeasures
2. Agentic AI Systems: Opportunities, Challenges, and Trustworthiness
Agency vs. Tool Use
Uncertainty and Robustness
1. Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games
2. Are You Sure You’re Positive? Consolidating Chain-of-Thought Agents with Uncertainty Quantification for Aspect-Category Sentiment Analysis
Interpretability and Transparency
1. Because we have LLMs, we Can and Should Pursue Agentic Interpretability
2. LLM-Driven Social Influence for Cooperative Behavior in Multi-Agent Systems
Multi-Agent Safety / Cross Agent Trust Networks
Security by Design for Agentic Systems
Human-Agent Collaboration and Oversight
Trust in Browser and Web Agents
1. Aligned LLMs Are Not Aligned Browser Agents
2. Context manipulation attacks : Web agents are susceptible to corrupted memory
Long-term agent memory
Deployment and Monitoring of Agents
1. Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition
2. A Survey on AgentOps: Categorization, Challenges, and Future Directions
Societal Implications of Agentic AI
1. Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents
2. Can Large Language Model Agents Simulate Human Trust Behavior?

Privacy Policy | Legal Notice
If you encounter technical problems, please contact the administrators.