OPINE-World: نموذج عالم ذكي يركز على أخطاء الأونتولوجيا

يقدّم بحث OPINE-World نموذج عالم تفاعلي قائم على البرمجة يركّز على اكتشاف الأخطاء في أونتولوجيا النموذج لتحسين التعلّم من التفاعل مع البيئة.

ملخص الذكاء الاصطناعي

يقدّم بحث OPINE-World نموذج عالم تفاعلي قائم على البرمجة يركّز على اكتشاف الأخطاء في أونتولوجيا النموذج لتحسين التعلّم من التفاعل مع البيئة.
التركيز على أخطاء الأونتولوجيا يحسّن كفاءة التعلّم من التفاعل المحدود
arXiv:2607.01531v1 Announce Type: new Abstract: Learning how an environment behaves from interaction is central to building agents that adapt to unfamiliar tasks. World models learned with deep networks are flexible but data-hungry and transfer poorly beyond their training distribution. Program-synthesized world models, written as source code by LLMs and refined through counterexample-guided inductive synthesis (CEGIS), are instead data-efficient and reusable, yet they have been demonstrated mainly on structured-state worlds with a given object vocabulary, and a single program search does not scale to pixel-rendered environments whose object structure must be hypothesized flexibly. We introduce OPINE-World, an LLM agent that learns an object-centric programmatic world model online from interaction. OPINE-World couples two cooperating agents in a loop of hypothesis and test, one acting in the environment and one synthesizing the model in code with replay verification and model-based planning, and it steers exploration with a Bayesian measure of object-type adequacy we call ontology error. We evaluate OPINE-World on ARC-AGI-3, a benchmark for skill-acquisition efficiency in which the object vocabulary, the goal, and the action semantics are withheld. OPINE-World solves 20 of 25 games without per-game training and reaches an action-efficiency score of 78.4 against the human baseline.

arXiv:2607.01531v1 Announce Type: new Abstract: Learning how an environment behaves from interaction is central to building agents that adapt to unfamiliar tasks. World models learned with deep networks are flexible but data-hungry and transfer poorly beyond their training distribution. Program-synthesized world models, written as source code by LLMs and refined through counterexample-guided inductive synthesis (CEGIS), are instead data-efficient and reusable, yet they have been demonstrated mainly on structured-state worlds with a given object vocabulary, and a single program search does not scale to pixel-rendered environments whose object structure must be hypothesized flexibly.

We introduce OPINE-World, an LLM agent that learns an object-centric programmatic world model online from interaction. OPINE-World couples two cooperating agents in a loop of hypothesis and test, one acting in the environment and one synthesizing the model in code with replay verification and model-based planning, and it steers exploration with a Bayesian measure of object-type adequacy we call ontology error.

We evaluate OPINE-World on ARC-AGI-3, a benchmark for skill-acquisition efficiency in which the object vocabulary, the goal, and the action semantics are withheld. OPINE-World solves 20 of 25 games without per-game training and reaches an action-efficiency score of 78.4 against the human baseline.

الوسوم: #نموذج العالم #أونتولوجيا #التعلّم التفاعلي #الذكاء الاصطناعي

المصدر الأصلي: exportarxiv

ملخص الذكاء الاصطناعي

● اقرأ أيضا

سباق الذكاء الاصطناعي يُهدد الأهداف المناخية

Hawk: توليد نوى NPUs ذكية بالاعتماد على العتاد

نظام EO-Agents لتوليد الفرضيات العلمية من بيانات رصد الأرض