Blog

20 valid responses

Version Control with GIT

This document provides a practical overview for data science teams implementing Git and GitLab for software project management. It emphasizes the role of version control as a critical safeguard, enabling collaborative development through branching. The guide details essential concepts such as branching, which promotes independent development of features.

RAG Simulator with AWS PartyRock

This interactive app illustrates the core principles of Retrieval-Augmented Generation (RAG). It demonstrates a systematic process where users upload their own documents, which are then broken down into smaller, manageable chunks through a process called chunking. The app then leverages Embeddings – essentially, converting text into numerical vectors – to understand the semantic relationships between these chunks, allowing for effective retrieval. The process begins with a user asking a question, and the app then simulates a retrieval process, displaying the most relevant chunks based on the question’s meaning.

Docker Desktop - Zero to Hero

This guide provides a comprehensive walkthrough for setting up and utilizing Docker Desktop on Windows and macOS systems. It details specific hardware requirements, such as virtualization support and memory needs, alongside step-by-step instructions for graphical and command-line installations.

Mastering Secure Shell is Key

A comprehensive guide to Secure Shell (SSH), a vital protocol for establishing encrypted connections over networks. It details the mechanics of key-based authentication, explaining how a private and public key pair allows for secure access without traditional passwords.

Knowledge Graphs for Agents

Modern AI agents often generate fluent but shallow responses. Knowledge graphs (KGs) introduce structured, semantically grounded knowledge that enables agents to reason, check consistency, and retrieve facts — all grounded in explicit relationships. In agentic systems, KGs play a role similar to a semantic memory: a persistent, queryable structure that captures entities, relationships, and context over time.

Pizza Ontology

Below is the complete pizza ontology in Turtle syntax (a readable OWL serialization), with every construct labeled:

PDF Document Processing Pipeline with n8n

I got into the habit of browsing my LinkedIn feed on my smart phone. Every time I see a post about an interesting paper, I download it onto a cloud drive for later reading. Many of these papers were posted on arXiv.org, the open-access digital archive and distribution server for scholarly, non-peer-reviewed preprints (or postprints) in physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, and related fields.

Document Processing Pipeline with Python

This tutorial walks through two complementary approaches for transforming raw documents—PDFs, Word files, PowerPoints, images, and plain text—into structured, machine-readable formats suitable for AI systems such as Retrieval-Augmented Generation (RAG), knowledge bases, and document summarization engines.

Ollama LLM Inference Local and Cloud

Ollama is a tool designed to host large language models (LLMs) locally on macOS, Linux, and Windows systems, offering OpenAI-compatible APIs for seamless integration with existing applications. By running models locally, Ollama eliminates the need for cloud-based services, ensuring privacy and reducing costs.

Structured Output from Large Language Models

Large language models are remarkably good at producing human-readable prose, but production software systems rarely consume prose. They consume JSON payloads, SQL queries, API responses, and typed data structures. The gap between what an LLM naturally produces and what a downstream system can parse is the central engineering challenge of structured output.

Data Science for Agentic AI: Managing, Monitoring, and Evaluating Intelligent Agent Systems

An agentic AI system is a business process. It receives inputs, executes a sequence of operations, consumes resources, produces outputs, and fails in predictable, measurable ways. Every data science technique you have learned — from funnel analysis to queueing theory to A/B testing — applies directly. You do not need new methods. You need new mappings.

Generating SQL from Natural Language with Ollama

Natural language to SQL (text-to-SQL) is one of the most practical applications of local LLMs. This article walks through building a robust, production-quality pipeline using Ollama — from prompt construction to structured output validation and self-correction.

Natural Language to Graph Query Translation with Ollama

Knowledge graphs are a cornerstone of modern agentic AI systems, providing structured, semantically rich representations of domain knowledge. Two dominant paradigms exist: RDF/SPARQL, rooted in W3C semantic web standards with formal ontological reasoning, and LPG (Labeled Property Graph)/Cypher, popularized by Neo4j for its intuitive, developer-friendly model. In both cases, querying these systems requires either expert knowledge of the query language or a translation layer that converts natural language questions into executable queries.