What is Models-as-a-Service?

URL 복사

Models-as-a-Service (MaaS) is an approach to delivering AI models as shared resources, allowing users within an organization to access them on demand. MaaS offers a ready-to-go AI foundation—in the form of application programming interface (API) endpoints—that encourages private and faster AI at scale. 

Watch Red Hat's MaaS webinar

Many want to use AI, but taking advantage of private models for the enterprise is where most organizations get stuck. Pretrained AI models from public sources like Hugging Face are becoming more accessible. And if you have the right hardware—like extensive graphical processing units (GPUs)—getting your models up and running can seem like a realistic goal. But here’s the issue: Once the model is up and running, who can use it? Furthermore, who can scale it? 

To scale a private AI model, you need to make a single model instance accessible to multiple users and multiple applications at once. Otherwise, the models are accessible only to the user who created it, which is really inefficient. That’s where MaaS comes in. 

Models-as-a-Service allows privately shared model access across 1 organization’s teams and applications, without sacrificing control over data. 

As organizations begin to adopt AI, it’s common to start with easy-to-use tools and interfaces. But as use grows, the focus shifts from experimenting with a couple of models to running AI at scale. You might start with a few specific models in production, but over time you’ll likely run many different types: language models, image models, audio models, and more—often with multiple versions and use cases.

This means moving from an “artisan” approach―doing everything manually―to more of a “factory” approach, where you manage models efficiently and consistently.

Managing all of this in a reliable, scalable way is what Models-as-a-Service is all about.

You don’t need public AI providers to explore AI patterns, such as retrieval-augmented generation (RAG)agentic, and coding assistants. Private AI models can power these same tools without impacting ease of use for the end user.

Models-as-a-Service is meant to support the use of openly available large language models (LLMs) like Mistral, Llama, DeepSeek, and more. It also isn’t limited to pretrained foundation models. MaaS can serve fine-tuned models or even built-from-scratch, predictive AI models, all on the same fully supported platform. 

In a typical MaaS implementation, an IT or AI platform engineering team makes AI models available through API endpoints for internal customers like developers and business users. Generally, MaaS environments are built on hybrid cloud AI platforms with API gateways to simplify integration across teams and operations. Key components of MaaS include models, a scalable AI platform, an AI orchestration system, and API management. All these moving parts equip your Models-as-a-Service to support a scalable AI strategy. 

View a MaaS demo 

AI 기술 구현의 4가지 핵심 고려 사항

블로그

Artificial Intelligence (AI)

See how our platforms free customers to run AI workloads and models anywhere

Red Hat과 함께하는 AI 여정: 조직의 AI 여정을 위한 전문성, 교육 및 지원

Red Hat만의 AI 포트폴리오를 살펴보세요. Red Hat AI를 통해 인공지능(AI)을 활용하여 비즈니스 및 IT 목표를 달성할 수 있습니다.

추가 자료

에이전틱 AI란? 자율적 상호작용을 위한 소프트웨어 시스템

에이전틱 AI (Agentic AI)는 인간의 개입을 최소화하며 데이터와 도구를 자율적으로 활용해 작업을 수행하도록 설계된 지능형 소프트웨어 시스템입니다.

공공 부문에서 AI의 역할

공공 부문에서 트랜스포메이션과 현대화를 이끌어내는 툴인 AI의 개발과 적용 양상을 살펴봅니다.

머신 러닝(Machine Learning): 패턴 인식과 예측 기술

머신 러닝 학습은 명시적인 프로그래밍 없이 데이터에서 패턴을 학습하고 예측하는 AI 기술입니다. 머신 러닝의 원리, 핵심 기능, 그리고 다양한 활용 사례를 알아보세요.

AI/ML 리소스

관련 기사