Improving GPT-4 performance for domain tasks

Hi, I wanted to share this ICML '24 paper ([2402.10980] ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback) that focuses on designing compound AI systems and shows an application to a scientific problem that combines linguistic reasoning with 3D geometrical reasoning. I would appreciate hearing your feedback on the promise of such techniques.

ChemReasoner seems like a really interesting use of LLM agents in scientific discovery. The topic is outside my core area of expertise, though. I look forward to hearing about future advances here. Thanks for sharing.

Thanks Vu - I appreciate you taking the time to read through. Basically, there are two key lessons or insights:

Lesson 1) Learn how to probe a model and go beyond single-step prompting for applications that can tolerate the extra latency. Everyone is focused on training a better model - we are focused on eliciting information from the AI model. Knowing how to probe the model is an art, and heuristic search and using feedback are key tools in that art.
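To make the "heuristic search with feedback" idea concrete, here is a minimal toy sketch of the pattern: an expansion step (standing in for an LLM proposing refinements) and a feedback scorer (standing in for quantum-chemical evaluation in ChemReasoner's case) drive a beam search over candidates. Both stub functions are hypothetical placeholders, not the paper's actual implementation.

```python
import heapq

# Hypothetical stand-ins: in a real system, propose_candidates would call
# an LLM to refine a query, and feedback_score would run a domain
# evaluator (e.g., a quantum-chemistry model) to rate each candidate.
def propose_candidates(prompt, k=3):
    """Toy 'LLM': expand a prompt into k refined candidate prompts."""
    return [f"{prompt}+{i}" for i in range(k)]

def feedback_score(candidate):
    """Toy reward: here it simply counts refinement steps taken."""
    return candidate.count("+")

def heuristic_search(root, depth=3, beam_width=2):
    """Beam search: expand every candidate, keep only the top scorers,
    and repeat for a fixed number of refinement rounds."""
    beam = [root]
    for _ in range(depth):
        expansions = [c for p in beam for c in propose_candidates(p)]
        beam = heapq.nlargest(beam_width, expansions, key=feedback_score)
    return max(beam, key=feedback_score)

best = heuristic_search("query")
print(best)  # a candidate refined over `depth` rounds
```

The point of the pattern is that the model is queried many times under search control, with external feedback pruning the frontier, rather than relying on a single prompt-response step.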

Lesson 2) Bridging language and geometry is a new quest. Learning a correspondence between concepts and 3D structural orientation is a big deal for many scientific disciplines like biology and chemistry. Sora-like models are starting to focus on contours and colors, while designing proteins and materials requires reasoning about what's under the surface - how to pack molecules in 3D space in a way that leads to desired behavior. Most work before us bridging molecular structures and LLMs focused on string representations of molecules, not 3D ones.
