Writing on search engines, machine learning, and data processing.

My thoughts on search technology, machine learning applications, data processing techniques, and software engineering practices, collected in chronological order.

Parse HTML with Newspaper3k on macOS

A practical guide to setting up and using the Newspaper3k library for HTML parsing and content extraction on macOS systems.

Install Solr on AWS: Automated Setup Guide

A comprehensive guide to automatically installing and configuring Apache Solr on AWS infrastructure using Infrastructure as Code.