I love mind mapping. It helps arrange and rearrange new information so that a brief glance is enough to spur the memory of the content. Chapter 2 goes over tooling and setting up the book’s companion repository. Chapter 3 goes over the Data Engineering workflow with emphasis on chapter 5
All posts in Uncategorized
scripts folder
You should write scripts for your computer. They can be in your favorite language. Put aliases in your .zshrc to call them. Here is “lsp.py”
Data Ownership
Who owns data on you? Google Maps has tracked nearly every location you’ve been in the last decade. But you cannot query it in any way. Have you ever wanted to know: “How many times have I been to the gym this week?” or “How many days did I spend in California this year?” You…
python reference sheet
[list] -> [list] [list] -> num
How to avoid overwriting files in python
Strategy 1: increment a suffix, like “file.pdf(1)”. Version: Simple. Version: Guards against race conditions. The simple version is safe if only one process is writing to the location, but otherwise it can hit a race condition. The safer version actually “opens” the file. Strategy 2: attach a timestamp.
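The original snippets did not survive in this excerpt, so here is a minimal sketch of both strategies under some assumptions: the function names (`next_free_path`, `open_without_overwrite`, `timestamped_path`) and the timestamp layout are made up for illustration. The race-safe version relies on Python's exclusive-creation mode `"x"`, which raises `FileExistsError` instead of overwriting.

```python
from datetime import datetime
from pathlib import Path


def next_free_path(path):
    """Strategy 1, simple: first 'file.pdf(n)' variant that does not exist yet.

    Another process could still create the file between this check and
    the moment you open it, so this is only safe with a single writer.
    """
    path = Path(path)
    candidate = path
    counter = 1
    while candidate.exists():
        candidate = path.with_name(f"{path.name}({counter})")
        counter += 1
    return candidate


def open_without_overwrite(path):
    """Strategy 1, race-safe: actually open the file in exclusive mode.

    Mode "x" creates the file or fails if it already exists, so the
    existence check and the creation happen as one step.
    """
    path = Path(path)
    candidate = path
    counter = 1
    while True:
        try:
            return open(candidate, "x")
        except FileExistsError:
            candidate = path.with_name(f"{path.name}({counter})")
            counter += 1


def timestamped_path(path, now=None):
    """Strategy 2: put a timestamp between the stem and the extension."""
    path = Path(path)
    stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
    return path.with_name(f"{path.stem}_{stamp}{path.suffix}")
```

Use it like `with open_without_overwrite("file.pdf") as f: f.write(...)`. Note that a timestamp with one-second resolution can still collide if two writes land in the same second, so combining it with `"x"` mode is worth considering.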
How to use the python rank-bm25 library
Note: this library is called rank-bm25 on pypi (pypi) and NOT bm25. Official docs (github) say: However, you often want to clean the corpus (lowercase, remove punctuation) before indexing it. Once you do that, you must keep the original corpus around so that results can index into the original unedited strings. This works for most cases! Note,…
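As a rough sketch of the "clean, index, then map back to the original strings" idea: `clean` and `top_matches` below are hypothetical helper names, and the cleaning rule (lowercase, strip ASCII punctuation, split on whitespace) is just one reasonable choice. `BM25Okapi` and `get_scores` are the library's own calls.

```python
import string


def clean(text):
    """Lowercase, strip ASCII punctuation, and split on whitespace."""
    table = str.maketrans("", "", string.punctuation)
    return text.lower().translate(table).split()


def top_matches(query, corpus, n=3):
    """Return up to n ORIGINAL strings from corpus, best BM25 score first."""
    # Imported here so clean() stays usable without the package installed.
    from rank_bm25 import BM25Okapi  # pip install rank-bm25

    # Index the cleaned tokens; the list order matches `corpus`.
    bm25 = BM25Okapi([clean(doc) for doc in corpus])
    scores = bm25.get_scores(clean(query))

    # Rank positions by score, then look up the unedited strings.
    ranked = sorted(range(len(corpus)), key=lambda i: scores[i], reverse=True)
    return [corpus[i] for i in ranked[:n]]
```

For example, `top_matches("windy London", ["Hello there good man!", "It is quite windy in London", "How is the weather today?"], n=1)` scores the cleaned tokens but hands back the untouched sentence, capital letters and punctuation included.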
How to invert docker ps output
How Step: Create a new bash script (“.docker_ps_invert.sh”). Step: allow executing. Why? Alternative: You can choose specific columns to show (like below)… But what if you want to see every column? That’s where the script above shines.
install nginx on mac with homebrew
Q: How do I install nginx? Q: Where is nginx configured? Q: Where is nginx hosted on homebrew?
local inference with llama-cpp-python
Step: install the package. Step: download a model. “TheBloke” on huggingface (link) has a ton of models in “GGUF” format (a format introduced by llama.cpp). Click on any of the different quantized (reduced precision) models and find the “download” link. Put them in your project somewhere, like a “models” directory. Step: create your python (.py) or…
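The final step is cut off in this excerpt, so the following is only a sketch of what a small script could look like, assuming the model files sit in a “models” directory as described. `find_gguf_models` and `complete` are hypothetical helper names, and `max_tokens=64` is an arbitrary choice. `Llama(model_path=...)` and calling the resulting object with a prompt are llama-cpp-python's own API.

```python
from pathlib import Path


def find_gguf_models(models_dir="models"):
    """List the .gguf files downloaded into the models directory."""
    return sorted(Path(models_dir).glob("*.gguf"))


def complete(model_path, prompt, max_tokens=64):
    """Load a GGUF model and return the generated text for one prompt."""
    # Imported here so the file listing works before the package is installed.
    from llama_cpp import Llama  # pip install llama-cpp-python

    llm = Llama(model_path=str(model_path), verbose=False)
    result = llm(prompt, max_tokens=max_tokens)
    # The result is a completion-style dict; the text is in the first choice.
    return result["choices"][0]["text"]
```

A typical call would be `complete(find_gguf_models()[0], "Q: Name the planets in the solar system. A:")`. Loading the model is the slow part, so for more than one prompt it is worth creating the `Llama` object once and reusing it.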
How to convert mov to mp4 with ffmpeg
When you record with QuickTime, the recording is saved as .mov, which has a SUPER huge file size. Notes:
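The exact command from the post is not shown in this excerpt, so this is a sketch of the plainest form of the conversion, `ffmpeg -i input.mov output.mp4`, wrapped in Python. The helper names are hypothetical, and it assumes `ffmpeg` is already on your PATH. The `-n` flag tells ffmpeg not to overwrite an existing output file, and `-y` does the opposite.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(src, overwrite=False):
    """Build the ffmpeg argument list for converting src to an .mp4 next to it."""
    src = Path(src)
    dst = src.with_suffix(".mp4")
    # -n: never overwrite an existing output file; -y: overwrite without asking.
    return ["ffmpeg", "-y" if overwrite else "-n", "-i", str(src), str(dst)]


def convert_mov_to_mp4(src, overwrite=False):
    """Run ffmpeg and return the path of the .mp4 it was asked to write."""
    subprocess.run(build_ffmpeg_command(src, overwrite), check=True)
    return Path(src).with_suffix(".mp4")
```

With no codec options, ffmpeg picks its default encoders for the .mp4 container, which is usually enough to shrink a screen recording. Add explicit codec or quality flags only if the defaults do not suit you.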