Part 8/9:
Several educators, including those teaching graduate-level courses, have been evaluating LLM performance through standard exams. This analysis illustrates the rapid improvements seen in LLM responses over just a few years. Tasks that once stumped LLMs are now being handled with increasing sophistication, demonstrating a marked increase in their ability to tackle complex subjects such as general relativity.
However, this achievement does not necessarily imply that LLMs can solve original research problems; they may excel at asking and solving problems already present in their training data but may struggle with non-standard queries that require genuine creativity or rigorous cross-discipline application of knowledge.