The current generation of LLM-based research agents, while adept at orchestration, has largely failed to capitalize on the structured nature of scientific knowledge. Existing approaches often distill papers into superficial elements like abstracts and citation links, missing the granular details, entities, claims, evidence, mechanisms, and method lineages, crucial for robust scientific reasoning. This oversight represents a significant bottleneck in advancing AI's capability for scientific discovery.
Beyond Abstracts: A Multimodal Knowledge Extraction Pipeline
To address this gap, the researchers introduce Agents-K1, an end-to-end pipeline designed to transform raw scientific documents into agent-native scientific knowledge graphs. Unlike prior methods, Agents-K1 employs a multimodal parser with a five-module schema that captures entities, multimodal evidence, citations, and typed inter-entity relations across the entirety of a paper, not just its abstract. This comprehensive approach is powered by a 4B parameter information-extraction backbone, trained using GRPO with a rule-based reward mechanism, ensuring high fidelity in knowledge capture.