نظام EO-Agents لتوليد الفرضيات العلمية من بيانات رصد الأرض

يقدم البحث نظام EO-Agents المكوّن من ثلاثة وكلاء لغويين كبيرين لتوليد فرضيات علمية في مجال رصد الأرض باستخدام بيانات مرئية وبنية مُنظمة بدلًا من النصوص غير المنظمة فقط.

ملخص الذكاء الاصطناعي

يقدم البحث نظام EO-Agents المكوّن من ثلاثة وكلاء لغويين كبيرين لتوليد فرضيات علمية في مجال رصد الأرض باستخدام بيانات مرئية وبنية مُنظمة بدلًا من النصوص غير المنظمة فقط.
دمج البيانات المرئية والهيكلية يعزز دقة الفرضيات العلمية
arXiv:2607.01584v1 Announce Type: new Abstract: Large language models have recently been explored for scientific hypothesis generation, but most prior work relies on unstructured literature and free-form textual claims. We present a pipeline for Earth observation that grounds hypothesis generation directly in the NASA Earth Observation Knowledge Graph. A heterogeneous graph neural network trained on historical co-usage relations ranks candidate dataset pairings, and a three-agent LLM pipeline filters, generates, and evaluates structured research hypotheses. Applied to 1,475 NASA datasets, the system produces 160 hypotheses spanning multiple Earth-science domains, including ecohydrology, glaciology, aerosol--cloud interactions, vegetation phenology, and stratospheric chemistry. Model-predicted novel dataset pairings are rated nearly as plausible as held-out real co-usages from the literature, indicating that the pipeline surfaces scientifically coherent yet unexplored combinations. A 2*2*2 factorial experiment across GPT-5.2 and Claude Sonnet 4.6 shows that hypothesis rankings remain stable, while absolute scores depend strongly on judge identity, highlighting limitations of single-judge LLM evaluation.

arXiv:2607.01584v1 Announce Type: new Abstract: Large language models have recently been explored for scientific hypothesis generation, but most prior work relies on unstructured literature and free-form textual claims. We present a pipeline for Earth observation that grounds hypothesis generation directly in the NASA Earth Observation Knowledge Graph.

A heterogeneous graph neural network trained on historical co-usage relations ranks candidate dataset pairings, and a three-agent LLM pipeline filters, generates, and evaluates structured research hypotheses. Applied to 1,475 NASA datasets, the system produces 160 hypotheses spanning multiple Earth-science domains, including ecohydrology, glaciology, aerosol--cloud interactions, vegetation phenology, and stratospheric chemistry.

Model-predicted novel dataset pairings are rated nearly as plausible as held-out real co-usages from the literature, indicating that the pipeline surfaces scientifically coherent yet unexplored combinations. A 2*2*2 factorial experiment across GPT-5.2 and Claude Sonnet 4.6 shows that hypothesis rankings remain stable, while absolute scores depend strongly on judge identity, highlighting limitations of single-judge LLM evaluation.

الوسوم: #EO-Agents #توليد الفرضيات #رصد الأرض #نماذج لغوية كبيرة

المصدر الأصلي: exportarxiv

ملخص الذكاء الاصطناعي

● اقرأ أيضا

سباق الذكاء الاصطناعي يُهدد الأهداف المناخية

Hawk: توليد نوى NPUs ذكية بالاعتماد على العتاد

تقنية SOLiD لمراقبة الكذب في الذكاء الاصطناعي