AI Research Trends 

Creative Ownership in the Age of AI

This paper examines the copyright-law challenges posed by generative AI and proposes a new infringement criterion based on training-corpus dependence.

Agentic Test-Time Scaling for WebAgents

This research introduces CATTS, a technique for dynamically allocating compute in multi-step tasks that improves both performance and efficiency.

AttentionRetriever: Attention Layers are Secretly Long Document Retrievers

This paper presents AttentionRetriever, a model for retrieving context from long documents that outperforms existing models while remaining efficient.

CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use

CM2 is a reinforcement learning framework for optimizing agent tool use, offering strong improvements over supervised approaches.

Think like a Scientist: Physics-guided LLM Agent for Equation Discovery

This study introduces KeplerAgent, a framework that uses LLMs for scientific reasoning to improve accuracy in symbolic equation discovery.

Automated Test Suite Enhancement Using Large Language Models with Few-shot Prompting

This paper explores few-shot prompting as a way to improve the quality and usability of LLM-generated unit tests, emphasizing the effective integration of human examples.

ExtractBench: A Benchmark and Evaluation Methodology for Complex Structured Extraction

ExtractBench offers a comprehensive framework for evaluating structured extraction from PDFs, emphasizing reliability and standardization.

When Scaffolding Breaks: Investigating Student Interaction with LLM-Based Writing Support in Real-Time K-12 EFL Classrooms

This study of LLMs in classroom settings reveals the challenges that arise when lower-proficiency students rely on AI scaffolding, and it derives design guidelines for inclusivity.

Towards Autonomous Mathematics Research

This paper introduces Aletheia, a math research agent that advances AI capabilities in solving problems and generating research papers through autonomous reasoning.
