Artificial Intelligence #llm#artificial intelligence
SciText2Eq Study: LLMs Show Limited Accuracy in Generating Equations from Scientific Text for Enterprise AI
A new paper, SciText2Eq, evaluates large language models (LLMs) on generating mathematical equations from scientific texts. The study constructed a dataset from AI research papers and introduced a multi-faceted evaluation protocol. Results show that LLMs achieve only moderate lexical similarity and suffer from poor semantic accuracy, and that LLM-based evaluations correlate poorly with human judgments, highlighting challenges for reliable AI in technical domains.
Jun 16, 2026 2 sources