View all topics

Foundation Models

Foundation models can be applied across domains and tasks. But there are challenges to scalability, and how AI is applied in specific use cases. At IBM Research, we create new foundation models for business, integrating deep domain expertise, a focus on responsible AI, and a commitment to open-source innovation.

Overview

Modern AI models can learn from millions of examples to help find new solutions to difficult problems. But building new systems tends to take time — and lots of data. The next wave in AI will replace task-specific models with ones that are trained on a broad set of unlabeled data that can be used for different tasks — with minimal fine-tuning. These are called foundation models. They can be the foundation for many applications of the AI model. Using self-supervised learning and fine-tuning, the model can apply information it has learned in general to a specific task. We believe that foundation models will dramatically accelerate AI adoption in business. Reducing time spent labeling data and programming models will make it much easier for businesses to dive in, allowing more companies to deploy AI in a wider range of mission-critical situations. Our goal is to bring the power of foundation models to every enterprise in a frictionless hybrid-cloud environment. Learn more about foundation models

Our work

Meet IBM’s new family of AI models for materials discovery
News
Kim Martineau
20 Dec 2024
Photos: How IBM and NASA's new geospatial model is changing our view of the world
News
Kim Martineau
06 Dec 2024
An IBM-led team is exploring how AI can prepare the electrical grid for the low-carbon era
News
Peter Hess
05 Dec 2024
IBM Granite has new experimental features for developers to test
News
Kim Martineau
19 Nov 2024
From surf to satellites: Campbell Watson is bringing AI to Earth science
Deep Dive
Peter Hess
08 Nov 2024
Serving customized AI models at scale with LoRA
Research
Kim Martineau
07 Nov 2024
See more of our work on Foundation Models

Publications

Can LLMs Replace Manual Annotation of Software Engineering Artifacts?
- - Toufique Ahmed
  - Premkumar Devanbu
  - et al.
- 2025
- MSR 2025
Preparing Good Data for Generative AI: Challenges and Approaches (Good-Data)
- - David Vazquez
  - Laure Berti-equille
  - et al.
- 2025
- AAAI 2025
Agentic AI for Digital Twin
- - Alexander Timms
  - Abigail Langbridge
  - et al.
- 2025
- AAAI 2025
Enhancing Decision Making through the Integration of Large Language Models and Operations Research Optimization - Bridge Talk
- - Segev Wasserkrug
  - Léonard Boussioux
  - et al.
- 2025
- AAAI 2025
Workshop on Planning in the Era of LLMs
- - Michael Katz
  - Jiayuan Mao
  - et al.
- 2025
- AAAI 2025
Usage Governance Advisor: From Intent to AI Governance
- - Elizabeth Daly
  - Sean Rooney
  - et al.
- 2025
- AAAI 2025

View all publications

Tools + code

Related topics

Neuro-symbolic AI Conversational AI Natural Language Processing Explainable AI