Natural Language Processing

NLP is changing how we interact with machines, enabling more fluid communication and better understanding of human language.

Social Science Moves In Silico
Katharine Miller
Jul 25, 2025
News
Topics: Generative AI, Natural Language Processing, Sciences (Social, Health, Biological, Physical)

Despite limitations, advances in AI offer social science researchers the ability to simulate human subjects.

The Promise and Perils of Artificial Intelligence in Advancing Participatory Science and Health Equity in Public Health
Abby C King, Zakaria N Doueiri, Ankita Kaulberg, Lisa Goldman Rosas
Feb 14, 2025
Research
Topics: Foundation Models, Generative AI, Machine Learning, Natural Language Processing, Sciences (Social, Health, Biological, Physical), Healthcare

Current societal trends reflect an increased mistrust in science and a lowered civic engagement that threaten to impair research that is foundational for ensuring public health and advancing health equity. One effective countermeasure to these trends lies in community-facing citizen science applications to increase public participation in scientific research, making this field an important target for artificial intelligence (AI) exploration. We highlight potentially promising citizen science AI applications that extend beyond individual use to the community level, including conversational large language models, text-to-image generative AI tools, descriptive analytics for analyzing integrated macro- and micro-level data, and predictive analytics. The novel adaptations of AI technologies for community-engaged participatory research also bring an array of potential risks. We highlight possible negative externalities and mitigations for some of the potential ethical and societal challenges in this field.

Mind the (Language) Gap: Mapping the Challenges of LLM Development in Low-Resource Language Contexts
Juan Pava, Haifa Badi Uz Zaman, Caroline Meinhardt, Toni Friedman, Sang T. Truong, Daniel Zhang, Elena Cryst, Vukosi Marivate, Sanmi Koyejo
Apr 22, 2025
White Paper (Deep Dive)
Topics: International Affairs, International Security, International Development, Natural Language Processing

This white paper maps the LLM development landscape for low-resource languages, highlighting challenges, trade-offs, and strategies to increase investment; prioritize cross-disciplinary, community-driven development; and ensure fair data ownership.

Christopher Manning
Person
Topics: Natural Language Processing

New Large Language Model Helps Patients Understand Their Radiology Reports
Vignesh Ramachandran
Jun 23, 2025
News
Topics: Healthcare, Natural Language Processing

‘RadGPT’ cuts through medical jargon to answer common patient questions.

LABOR-LLM: Language-Based Occupational Representations with Large Language Models
Susan Athey, Herman Brunborg, Tianyu Du, Ayush Kanodia, Keyon Vafa
Dec 11, 2024
Research
Topics: Foundation Models, Natural Language Processing

Vafa et al. (2024) introduced a transformer-based econometric model, CAREER, that predicts a worker’s next job as a function of career history (an “occupation model”). CAREER was initially estimated (“pre-trained”) using a large, unrepresentative resume dataset, which served as a “foundation model,” and parameter estimation was continued (“fine-tuned”) using data from a representative survey. CAREER had better predictive performance than benchmarks. This paper considers an alternative where the resume-based foundation model is replaced by a large language model (LLM). We convert tabular data from the survey into text files that resemble resumes and fine-tune the LLMs using these text files with the objective to predict the next token (word). The resulting fine-tuned LLM is used as an input to an occupation model. Its predictive performance surpasses all prior models. We demonstrate the value of fine-tuning and further show that by adding more career data from a different population, fine-tuning smaller LLMs surpasses the performance of fine-tuning larger models.

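To make the data step above concrete, here is a minimal sketch of how tabular career records might be serialized into resume-style text for ordinary next-token fine-tuning. The field names and template are hypothetical illustrations, not the paper's actual format.

```python
# Minimal sketch: serialize tabular career records into resume-style text
# for next-token fine-tuning. The field names and template are hypothetical,
# not the exact format used in the LABOR-LLM paper.

from dataclasses import dataclass


@dataclass
class JobSpell:
    year: int
    occupation: str   # e.g., an occupation title taken from the survey codebook
    industry: str


def spells_to_resume_text(worker_id: str, spells: list[JobSpell]) -> str:
    """Render one worker's career history as a short, resume-like document."""
    lines = [f"Worker {worker_id} career history:"]
    for s in sorted(spells, key=lambda s: s.year):
        lines.append(f"{s.year}: worked as {s.occupation} in {s.industry}.")
    return "\n".join(lines)


# Each serialized history becomes one training document; fine-tuning with a
# plain next-token objective then teaches the model to continue a history
# with a plausible next job.
print(spells_to_resume_text(
    "00123",
    [JobSpell(2015, "retail salesperson", "retail trade"),
     JobSpell(2018, "customer service representative", "finance"),
     JobSpell(2021, "claims adjuster", "insurance")],
))
```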

All Work Published on Natural Language Processing

MedArena: Comparing LLMs for Medicine in the Wild
Eric Wu, Kevin Wu, James Zou
Apr 24, 2025
News
Topics: Healthcare, Natural Language Processing, Generative AI

Stanford scholars leverage physicians to evaluate 11 large language models in real-world settings.

Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs
Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab
Nov 14, 2024
Research
Topics: Natural Language Processing

Language Model Programs, i.e. sophisticated pipelines of modular language model (LM) calls, are increasingly advancing NLP tasks, but they require crafting prompts that are jointly effective for all modules. We study prompt optimization for LM programs, i.e. how to update these prompts to maximize a downstream metric without access to module-level labels or gradients. To make this tractable, we factorize our problem into optimizing the free-form instructions and few-shot demonstrations of every module and introduce several strategies to craft task-grounded instructions and navigate credit assignment across modules. Our strategies include (i) program- and data-aware techniques for proposing effective instructions, (ii) a stochastic mini-batch evaluation function for learning a surrogate model of our objective, and (iii) a meta-optimization procedure in which we refine how LMs construct proposals over time. Using these insights we develop MIPRO, a novel algorithm for optimizing LM programs. MIPRO outperforms baseline optimizers on five of seven diverse multi-stage LM programs using a best-in-class open-source model (Llama-3-8B), by as high as 13% accuracy. We have released our new optimizers and benchmark in DSPy at http://dspy.ai.

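Since the abstract notes that the optimizer ships in DSPy, here is a rough sketch of how a MIPRO-style optimizer is typically invoked there. The signature, metric, and training examples are toy stand-ins, and optimizer names and arguments may differ across DSPy releases.

```python
# Rough sketch of compiling a small LM program with a MIPRO-style optimizer
# in DSPy (http://dspy.ai). The signature, metric, and examples are toy
# stand-ins; names and arguments may vary across DSPy versions.

import dspy

dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))  # any supported LM backend


class AnswerQuestion(dspy.Signature):
    """Answer the question concisely."""
    question: str = dspy.InputField()
    answer: str = dspy.OutputField()


program = dspy.ChainOfThought(AnswerQuestion)  # a one-module LM program


def exact_match(example, prediction, trace=None):
    # Downstream metric used to score candidate instructions/demonstrations.
    return example.answer.strip().lower() == prediction.answer.strip().lower()


trainset = [
    dspy.Example(question="What is 2 + 2?", answer="4").with_inputs("question"),
    dspy.Example(question="Capital of France?", answer="Paris").with_inputs("question"),
]

# MIPROv2 proposes instructions and few-shot demos for each module and scores
# candidate combinations with mini-batch evaluations of the metric.
optimizer = dspy.MIPROv2(metric=exact_match, auto="light")
optimized_program = optimizer.compile(program, trainset=trainset)

print(optimized_program(question="What is 3 + 4?").answer)
```
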
Percy Liang
Person
Associate Professor of Computer Science, Stanford University | Director, Stanford Center for Research on Foundation Models | Senior Fellow, Stanford HAI
Topics: Foundation Models, Generative AI, Machine Learning, Natural Language Processing

Language Models in the Classroom: Bridging the Gap Between Technology and Teaching
Instructors and students of CS293
Apr 09, 2025
News
Topics: Education, Skills, Generative AI, Natural Language Processing

Instructors and students from Stanford class CS293/EDUC473 address the failures of current educational technologies and outline how to empower both teachers and learners through collaborative innovation.

ReMix: Optimizing Data Mixtures for Large Scale Imitation Learning
Joey Hejna, Chethan Anand Bhateja, Yichen Jiang, Karl Pertsch, Dorsa Sadigh
Sep 05, 2024
Research
Topics: Computer Vision, Robotics, Natural Language Processing

Increasingly large robotics datasets are being collected to train larger foundation models in robotics. However, despite the fact that data selection has been of utmost importance to scaling in vision and natural language processing (NLP), little work in robotics has questioned what data such models should actually be trained on. In this work we investigate how to weigh different subsets or "domains" of robotics datasets during pre-training to maximize worst-case performance across all possible downstream domains using distributionally robust optimization (DRO). Unlike in NLP, we find that these methods are hard to apply out of the box due to varying action spaces and dynamics across robots. Our method, ReMix, employs early stopping and action normalization and discretization to counteract these issues. Through extensive experimentation on both the Bridge and OpenX datasets, we demonstrate that data curation can have an outsized impact on downstream performance. Specifically, domain weights learned by ReMix outperform uniform weights by over 40% on average and human-selected weights by over 20% on datasets used to train the RT-X models.

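The reweighting idea at the core of this abstract, upweighting whichever data "domains" the current model finds hardest, can be illustrated with a generic group-DRO-style exponentiated-gradient update on domain mixture weights. This is an illustrative sketch under that assumption, not the ReMix procedure itself, and the loss values are invented.

```python
# Illustrative group-DRO-style update on domain mixture weights: domains with
# higher current loss are exponentially upweighted, so pre-training focuses on
# worst-case domains. A generic sketch, not the actual ReMix algorithm; the
# per-domain losses below are invented numbers.

import math


def update_domain_weights(weights: dict[str, float],
                          domain_losses: dict[str, float],
                          step_size: float = 0.1) -> dict[str, float]:
    """Exponentiated-gradient step: w_d <- w_d * exp(eta * loss_d), renormalized."""
    unnormalized = {d: w * math.exp(step_size * domain_losses[d])
                    for d, w in weights.items()}
    total = sum(unnormalized.values())
    return {d: v / total for d, v in unnormalized.items()}


# Start from a uniform mixture over hypothetical robotics data domains.
weights = {"bridge_kitchen": 1 / 3, "bridge_tabletop": 1 / 3, "openx_misc": 1 / 3}

# Pretend per-domain training losses from the current model.
losses = {"bridge_kitchen": 0.9, "bridge_tabletop": 0.4, "openx_misc": 1.4}

for _ in range(10):
    weights = update_domain_weights(weights, losses)

print(weights)  # the hardest domain ("openx_misc") ends up with the largest weight
```
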
An Open-Source AI Agent for Doing Tasks on the Web
Katharine Miller
Mar 27, 2025
News
Topics: Machine Learning, Natural Language Processing

NNetNav learns how to navigate websites by mimicking childhood learning through exploration.