An MCP (Model Context Protocol) server for managing multiple Kubernetes clusters simultaneously. Provides 60+ tools covering cluster operations, resource management, monitoring, RBAC, storage, networking, and more -- all accessible through AI assistants like Claude Desktop.
Quick Start
Installing via Smithery
hljs language-bash
npx -y @smithery/cli install @razvanmacovei/k8s-multicluster-mcp --client claude
Directory containing kubeconfig files. Each file can contain one or more contexts.
Place your kubeconfig files in the configured directory. The server discovers all contexts across all files automatically.
Prerequisites
Python 3.11 or higher
One or more kubeconfig files with valid cluster credentials
metrics-server installed in clusters for k8s_top_* and k8s_cluster_info tools
Multi-Cluster Management
This server is purpose-built for managing multiple Kubernetes clusters:
Automatic Discovery: Scans all kubeconfig files in KUBECONFIG_DIR and aggregates contexts
Partial Name Matching: Use prod instead of arn:aws:eks:us-east-1:123456:cluster/prod-cluster
Cross-Cluster Operations: Compare resources, health, and configurations across clusters
Context Caching: Efficient 30-second TTL cache avoids re-reading kubeconfig files on every call
Centralized Management: Manage dev, staging, and production from a single interface
Tools Reference (60+ tools)
Cluster & Context Management (6 tools)
Tool
Description
k8s_get_contexts
List all available contexts across all kubeconfig files
k8s_get_namespaces
List all namespaces in a cluster
k8s_create_ns
Create a new namespace with optional labels/annotations
k8s_delete_ns
Delete a namespace (protects system namespaces)
k8s_get_nodes
List all nodes with status, roles, capacity, and version
k8s_cluster_info
Comprehensive cluster health summary (nodes, pods, deployments, warnings)
Resource Discovery & Inspection (7 tools)
Tool
Description
k8s_get_resources
List resources of any kind (pods, deployments, services, ingress, etc.)
k8s_get_resource
Get the complete definition of a single resource
k8s_get_pod_logs
Get pod logs with duration filtering and container selection
k8s_get_events
List cluster events in a namespace, sorted by most recent
k8s_describe
Detailed resource description similar to kubectl describe
k8s_apis
List all available API groups and resources in the cluster
k8s_crds
List all Custom Resource Definitions with versions and scope
Metrics & Monitoring (3 tools)
Tool
Description
k8s_top_nodes
Display node CPU/memory usage alongside capacity
k8s_top_pods
Display pod/container CPU/memory usage
k8s_diagnose_application
Comprehensive app diagnostics with issue detection and recommendations
Rollout Management (6 tools)
Tool
Description
k8s_rollout_status
Get rollout status for deployment/statefulset/daemonset
k8s_rollout_history
Get revision history
k8s_rollout_undo
Roll back to a previous revision
k8s_rollout_restart
Trigger a rolling restart
k8s_rollout_pause
Pause an in-progress rollout
k8s_rollout_resume
Resume a paused rollout
Scaling (3 tools)
Tool
Description
k8s_scale_resource
Scale deployment/statefulset/replicaset to N replicas
k8s_autoscale_resource
Configure Horizontal Pod Autoscaler (HPA)
k8s_update_resources
Update CPU/memory requests and limits for a container
Resource CRUD (6 tools)
Tool
Description
k8s_create_resource
Create a resource from YAML/JSON content
k8s_apply_resource
Apply configuration (create or update, like kubectl apply)
k8s_delete_resource
Delete any resource type with optional force/grace period
k8s_patch_resource
Update specific fields with strategic merge patch
k8s_label_resource
Add or update labels on a resource
k8s_annotate_resource
Add or update annotations on a resource
Workload Management (3 tools)
Tool
Description
k8s_expose_resource
Expose a deployment/pod as a new Service
k8s_run_pod
Create and run a pod with a specified image
k8s_set_resources_for_container
Set resource limits/requests for containers
Node Management (5 tools)
Tool
Description
k8s_cordon_node
Mark a node as unschedulable
k8s_uncordon_node
Mark a node as schedulable
k8s_drain_node
Drain a node by evicting pods (for maintenance)
k8s_taint_node
Add taints to a node
k8s_untaint_node
Remove taints from a node
Pod Operations (1 tool)
Tool
Description
k8s_pod_exec
Execute a command in a container (supports quoted arguments)
Secrets & ConfigMaps (6 tools)
Tool
Description
k8s_list_secret
List secrets with metadata (values hidden for security)
k8s_get_secret_detail
Get a secret; optionally decode values
k8s_create_secret_resource
Create a new secret (auto base64-encodes values)
k8s_list_configmap
List ConfigMaps with their key names
k8s_get_configmap_detail
Get a ConfigMap with full data contents
k8s_create_configmap_resource
Create a new ConfigMap
RBAC (5 tools)
Tool
Description
k8s_get_roles
List RBAC Roles with permission rules
k8s_get_clusterroles
List RBAC ClusterRoles
k8s_get_rolebindings
List RoleBindings (who has what role)
k8s_get_clusterrolebindings
List ClusterRoleBindings
k8s_get_service_accounts
List ServiceAccounts
Storage (3 tools)
Tool
Description
k8s_get_pvcs
List PersistentVolumeClaims with status, capacity, and storage class
k8s_get_pvs
List PersistentVolumes with capacity and bound claims
k8s_get_storage_classes
List StorageClasses with provisioner and default status
Networking (1 tool)
Tool
Description
k8s_get_network_policies
List NetworkPolicies with pod selectors and rule counts
Job Management (2 tools)
Tool
Description
k8s_get_jobs
List Jobs with completion status and timing
k8s_get_cronjobs
List CronJobs with schedule, suspend status, and last run
Usage Examples
Cluster Health Check
hljs language-css
Give me a health overview of my production cluster.
Multi-Cluster Comparison
hljs language-arduino
Compare the number of pods running in the 'backend'namespace between my 'prod'and'staging' contexts.
Diagnose Application Issues
hljs language-arduino
My deployment 'my-app' in the 'production'namespace is having issues. Can you diagnose what's wrong?
Scale Resources
hljs language-arduino
Scale the 'backend' deployment in the 'default'namespace to 5 replicas.
Resource Management
hljs language-arduino
Delete the failed job 'data-migration-v1' in the 'batch'namespace.
hljs language-sql
Create a new namespace called'staging'with the label environment=staging.
Secret Management
hljs language-sql
List all secrets in the 'production' namespace andshow me the keys in the 'api-credentials' secret.
RBAC Inspection
hljs language-sql
Show me all role bindings in the 'default' namespace -- who has access to what?
Storage Overview
hljs language-sql
List all PVCs in the cluster andshow which ones are Pending.
Node Maintenance
hljs language-csharp
Cordon node 'worker-3', drain it ignoring DaemonSets, then uncordon when maintenance is complete.
Rollback Deployment
hljs language-arduino
Roll back the 'api-gateway' deployment in the 'services'namespace to the previous version.
Execute in Pod
hljs language-arduino
Run 'ls -la /app/config' inside the 'app' container of pod 'web-app-xyz' in the 'default'namespace.
Create Resources
hljs language-yaml
Create a new deployment with this YAML:apiVersion:apps/v1kind:Deploymentmetadata:name:nginx-deploymentnamespace:defaultspec:replicas:2selector:matchLabels:app:nginxtemplate:metadata:labels:app:nginxspec:containers:-name:nginximage:nginx:1.27ports:-containerPort:80