How to Deploy an ML Model on Kubernetes
If you’ve ever trained a machine learning model and wondered, “How do I deploy it so others can actually use […]
Cloud-based AI tools like ChatGPT have become extremely popular, but they come with limitations such as internet dependency, privacy concerns, […]
Running AI models in a Kubernetes cluster provides scalability and flexibility. If you’ve ever wanted to deploy AI models on […]
How to Deploy Ollama on Kubernetes | AI Model Serving on k8s
Have you ever wanted to chat with your PDFs? Imagine an AI-powered PDF search engine that can extract, index, and […]
AI-Powered PDF Search with LlamaIndex, Ollama & DeepSeek
This guide demonstrates a step-by-step process for deploying DeepSeek on a virtual machine (VM) using Ollama and Open WebUI. By […]
Deploying DeepSeek Locally Using Ollama and Open WebUI
Maximize GPU utilization and reduce infrastructure costs. The process of GPU MIG (Multi-Instance GPU) partitioning is a vital step in […]
AI workloads require significant computing power, especially for machine learning (ML) and deep learning models. GPUs accelerate these workloads. However, […]
Kubernetes vs Traditional Infrastructure for AI Workloads
In modern AI and machine learning (ML) workloads, NVIDIA GPUs play a crucial role in accelerating both training and inference […]