Search Engine Architect, Cloud Engineer, and Machine Learning Researcher.

I am Jason Stubblefield, a search engine architect specialized in enterprise search systems, indexing, and big data. I do R&D work in SOLR and OpenSearch architecture, machine learning, NLP and applied LLMs. On this website you can find information about my various projects and other interesting links.

Parse HTML with Newspaper3k on macOS

A practical guide to setting up and using the Newspaper3k library for HTML parsing and content extraction on macOS systems.

Areas of Expertise

  • Enterprise SOLR (and OpenSearch) Search Engine Architecture
  • Machine learning, NLP and Applied LLMs
  • Software Engineering and Development
  • Business Hospitality and Leadership
  • Restaurant and Menu Development
  • International Business

The views and opinions expressed here are solely my own and do not reflect the views or policies of my employer or clients.