Trying to figure out MCP by actually building an app from scratch with open source and SLMs
1. Trying to figure out MCP by actually building an app from scratch with open source and SLMs
Julien Simon, Chief Evangelist
julien@arcee.ai
https://www.julien.org
https://github.com/juliensimon/smolagents-mcp-demo
2. Just another remote procedure call protocol 👴
1982: UNIX RPC – First mainstream RPC, used in NFS.
1994: CORBA – Early cross-language object middleware.
2000: REST – Dominant HTTP API style.
2000: SOAP – Enterprise XML-based APIs.
2007: Thrift – Multi-language RPC.
2010: WebSockets – Real-time, bidirectional communication.
2014: GraphQL – Client-driven data queries.
2015: gRPC – Protobuf + HTTP/2, high-performance.
2024: MCP – Standardizes AI data integration ⬅ YOU ARE HERE
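MCP sits squarely in this lineage: under the hood it is JSON-RPC 2.0 carried over stdio or HTTP. A minimal sketch of what a tool invocation looks like on the wire (the `tools/call` method name comes from the MCP specification; the tool name and arguments are invented for illustration):

```python
import json

# MCP messages are JSON-RPC 2.0, just like several of the protocols above.
# This request asks an MCP server to invoke one of its tools. The tool
# "get_stock_price" and its arguments are hypothetical placeholders.
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "get_stock_price",        # hypothetical tool name
        "arguments": {"ticker": "ACME"},  # hypothetical arguments
    },
}

# Serialize to the JSON string actually sent to the server.
wire_message = json.dumps(request)
print(wire_message)
```

The client/server framing, request IDs, and error objects all follow standard JSON-RPC 2.0; what MCP adds is the agreed-upon vocabulary of methods (`tools/list`, `tools/call`, and so on) and the schemas of their payloads.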
3. Food for thought
Security, Trust, Auditing - CRITICAL for enterprise adoption
• What measures are in place to authenticate and validate the servers your application communicates with?
• How do you ensure each function is accessible only to authorized users under appropriate conditions?
• How do you establish clear identity management to attribute actions accurately within MCP systems?
Discoverability & Routing
• How does your application discover and connect to remote servers dynamically?
• How do you determine the most suitable server and function(s) for each task within your application?
Versioning & Compatibility
• How can you ensure that updates do not disrupt existing functionalities for users?
• How does MCP live alongside other protocols (REST, OpenAI-style function calling, etc.)?
Documentation & Usability
• How do you ensure that function descriptions are detailed and understandable for models?
Performance & Cost Management
• How do you optimize latency and token consumption in agentic systems?
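The documentation point is very concrete in MCP: servers advertise their tools via `tools/list`, and the model chooses among them based solely on the published name, description, and input schema. A sketch contrasting a vague and a precise tool description (the field names `name`/`description`/`inputSchema` follow the MCP tool listing format; the tools themselves are invented):

```python
# A model routing between tools sees only this metadata, so the
# description must say what the tool does, when to use it, and what
# input it expects. The tools below are made-up examples.
vague_tool = {
    "name": "lookup",
    "description": "Looks stuff up.",  # too vague: causes wrong/missed calls
    "inputSchema": {"type": "object", "properties": {}},
}

precise_tool = {
    "name": "get_industry_peers",
    "description": (
        "Return the S&P 500 companies in the same GICS industry as the "
        "given ticker. Use for questions about competitors or sector "
        "peers. Input: a valid US ticker symbol, e.g. 'AAPL'."
    ),
    "inputSchema": {
        "type": "object",
        "properties": {
            "ticker": {
                "type": "string",
                "description": "US stock ticker symbol",
            }
        },
        "required": ["ticker"],
    },
}
print(precise_tool["name"])
```

A precise description also helps with the performance question: fewer retries and clarification turns mean fewer tokens spent per task.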
4. Arcee AI - Post-trained models
State-of-the-art tech stack based on open-source libraries
Spectrum (continuous pre-training), MergeKit (merging), DistilKit (distillation), EvolKit (dataset improvement)
Best-in-class models based on open-source architectures
Hugging Face OpenLLM Leaderboard benchmarks
Llama 3.1 70B 🥇 Best 70B model
Qwen2 1.5B 🥇 Best 1.5B model
Llama 3.1 8B 🥇 Best 8B model
Qwen2.5 14B 🥇 Best 14B model
Qwen2 72B 🥇 Best Arabic model
7. AFM-4.5B-Preview vs. Qwen3-4B
8/10, tie on Industrials, loss on Communication Services.
200 questions generated by Claude 3.7 Sonnet
20 questions for each one of the top 10 industries in the S&P 500
Judge: DeepSeek-R1 (670B)
https://github.com/juliensimon/radar-evaluator
8. AFM-4.5B-Preview vs. Google Gemma-3n-E4B-it
8/10, tie on Healthcare, loss on IT
9. AFM-4.5B-Preview vs. Llama-3.2-8B
10/10 😃
10. AFM-4.5B-Preview vs. Mixtral-8x7B-Instruct
Almost tied (4/10) with 8% of Mixtral’s size
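The "X/10" scores above can be read as a per-industry pairwise tally: the judge model picks a winner (or a tie) for each question, and the model winning more questions takes the industry. A toy sketch of that aggregation, with fabricated verdicts rather than real evaluation data:

```python
from collections import Counter

def industry_winner(verdicts):
    """verdicts: list of 'A', 'B', or 'tie' judge decisions for one industry."""
    counts = Counter(verdicts)
    if counts["A"] > counts["B"]:
        return "A"
    if counts["B"] > counts["A"]:
        return "B"
    return "tie"

# Two toy industries with 5 questions each (the real setup used 20
# questions per industry across the top 10 S&P 500 industries).
results = {
    "Industrials": ["A", "B", "tie", "A", "B"],  # 2-2 on decided questions
    "IT": ["A", "A", "B", "A", "A"],             # 4-1 for model A
}
winners = {industry: industry_winner(v) for industry, v in results.items()}
print(winners)  # {'Industrials': 'tie', 'IT': 'A'}
```

Counting industry wins out of 10 then gives the headline score; a tie on decided questions leaves that industry out of both columns.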
11. Julien Simon, Chief Evangelist
Models on Hugging Face, OpenRouter, and Together AI
Chat with AFM
AFM blog post