Trying to figure out MCP by actually building an app from scratch with open source and SLMs
Julien Simon, Chief Evangelist
julien@arcee.ai
https://www.julien.org
https://github.com/juliensimon/smolagents-mcp-demo
Just another remote procedure call protocol 👴
1982: UNIX RPC – First mainstream RPC, used in NFS.
1994: CORBA – Early cross-language object middleware.
2000: REST – Dominant HTTP API style.
2000: SOAP – Enterprise XML-based APIs.
2007: Thrift – Multi-language RPC.
2010: WebSockets – Real-time, bidirectional communication.
2014: GraphQL – Client-driven data queries.
2015: gRPC – Protobuf + HTTP/2, high-performance.
2024: MCP – Standardizes AI data integration ⬅ YOU ARE HERE
Food for thought
Security, Trust, Auditing - CRITICAL for enterprise adoption
• What measures are in place to authenticate and validate the servers your application communicates with?
• How do you ensure each function is accessible only to authorized users under appropriate conditions?
• How do you establish clear identity management to attribute actions accurately within MCP systems?
Discoverability & Routing
• How does your application discover and connect to remote servers dynamically?
• How do you determine the most suitable server and function(s) for each task within your application?
Versioning & Compatibility
• How can you ensure that updates do not disrupt existing functionalities for users?
• How does MCP live alongside other protocols (REST, OpenAI-style function calling, etc.)?
Documentation & Usability
• How do you ensure that function descriptions are detailed and understandable for models?
Performance & Cost Management
• How do you optimize latency and token consumption in agentic systems?
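One way to start answering the authorization and auditing questions above is to gate every tool call behind an allowlist and record each attempt. The sketch below is a minimal illustration in plain Python, not part of MCP or of any MCP SDK: the role names, tool names, `ALLOWED_TOOLS`, `AuditLog`, and `call_tool` are all hypothetical.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Hypothetical role-to-tool allowlist; names are illustrative only.
ALLOWED_TOOLS = {
    "analyst": {"search_docs", "summarize"},
    "admin": {"search_docs", "summarize", "delete_doc"},
}


@dataclass
class AuditLog:
    """In-memory audit trail; a real system would persist this."""
    entries: list = field(default_factory=list)

    def record(self, user, role, tool, allowed):
        self.entries.append({
            "time": datetime.now(timezone.utc).isoformat(),
            "user": user,
            "role": role,
            "tool": tool,
            "allowed": allowed,
        })


def call_tool(user, role, tool, handler, log, **kwargs):
    """Run handler only if the role may use the tool; log every attempt."""
    allowed = tool in ALLOWED_TOOLS.get(role, set())
    log.record(user, role, tool, allowed)
    if not allowed:
        raise PermissionError(f"{role!r} may not call {tool!r}")
    return handler(**kwargs)


# Usage: an allowed call returns the handler's result and is logged.
log = AuditLog()
result = call_tool("alice", "analyst", "summarize",
                   lambda text: text.upper(), log, text="hi")
```

Denied calls are logged before the exception is raised, so the audit trail attributes both successful and rejected actions to a user.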
Arcee AI - Post-trained models
State-of-the-art tech stack based on open-source libraries
Spectrum (continuous pre-training), MergeKit (merging), DistillKit (distillation), EvolKit (dataset improvement)
Best-in-class models based on open-source architectures
Hugging Face OpenLLM Leaderboard benchmarks
Llama 3.1 70B: 🥇 Best 70B model
Qwen2 1.5B: 🥇 Best 1.5B model
Llama 3.1 8B: 🥇 Best 8B model
Qwen2.5 14B: 🥇 Best 14B model
Qwen2 72B: 🥇 Best Arabic model
Arcee Foundation Models (AFM)
https://www.together.ai/models/afm-4-5b-preview
https://www.arcee.ai/blog/announcing-the-arcee-foundation-model-family
https://www.arcee.ai/blog/deep-dive-afm-4-5b-the-first-arcee-foundational-model
https://www.arcee.ai/blog/extending-afm-4-5b-to-64k-context-length
AFM-4.5B-Preview
https://api.together.ai/models/arcee-ai/AFM-4.5B-Preview
AFM-4.5B-Preview vs. Qwen-3-4B
AFM wins 8/10 industries. Tie on Industrials, loss on Communication Services.
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
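The per-industry scores on these comparison slides can be read as a tally over judged questions. The sketch below shows one plausible way to turn per-question judge verdicts into an "N/10"-style score. It is an assumption for illustration, not the logic of the radar-evaluator repository: the verdict labels (`"A"`, `"B"`, `"tie"`), the majority rule, and the example data are all hypothetical.

```python
from collections import Counter


def industry_outcome(verdicts):
    """Return 'win', 'tie', or 'loss' for model A in one industry.

    Each verdict is 'A', 'B', or 'tie' (hypothetical judge labels).
    Assumption: the industry goes to whichever model wins more questions.
    """
    counts = Counter(verdicts)
    if counts["A"] > counts["B"]:
        return "win"
    if counts["A"] < counts["B"]:
        return "loss"
    return "tie"


def scorecard(verdicts_by_industry):
    """Return (industries won by model A, per-industry outcomes)."""
    outcomes = {
        industry: industry_outcome(verdicts)
        for industry, verdicts in verdicts_by_industry.items()
    }
    wins = sum(1 for outcome in outcomes.values() if outcome == "win")
    return wins, outcomes


# Made-up verdicts for three industries, 20 questions each.
example = {
    "Industrials": ["A", "B"] * 10,
    "Communication Services": ["B"] * 12 + ["A"] * 8,
    "Healthcare": ["A"] * 15 + ["B"] * 3 + ["tie"] * 2,
}
wins, outcomes = scorecard(example)
print(f"{wins}/{len(example)}", outcomes)
```

A majority rule is the simplest choice; averaging numeric judge scores per industry would be an equally reasonable alternative and could change which industries count as ties.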
AFM-4.5B-Preview vs. Google Gemma-3n-E4B-it
AFM wins 8/10 industries. Tie on Healthcare, loss on IT.
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
AFM-4.5B-Preview vs. Llama-3.2-8B
10/10 😃
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
AFM-4.5B-Preview vs. Mixtral-8x7B-Instruct
Almost tied (4/10) with 8% of Mixtral’s size
200 questions generated by Claude Sonnet 3.7
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
Julien Simon, Chief Evangelist
julien@arcee.ai
https://www.julien.org
https://github.com/juliensimon/smolagents-mcp-demo
Models on Hugging Face;
OpenRouter and Together AI
Chat with AFM
AFM blog post
