One of the things that lights my brain up is bring able to process, analyze, and visualize cool datasets. I’ve been incredibly fortunate to be able to make a living out of that. This tag is a catch-all for projects related to data processing, statistics, and data engineering.

Graph View

Recent Posts

  • APL: Pattern 3; City Country Fingers

    — City, with natural stripes — or nature, with city stripes? Annotations for A Pattern Language, and tracking public opinion of where Americans want to live over the past 50 years. (4 min read)

  • Privacy Policy

    — When it comes to collecting data, there's a fine line between making a good site and making an invasive product. To make this site more useful, I want to collect enough data to improve, but never enough to undermine anybody's privacy. (3 min read)

  • Ants in the Neighborhood

    — Is Jane Street run by soccer-loving ants? Inconclusive. We *can* conclude that they're fans of Markov Chains, though — an invaluable tool for understanding complex data structures. (1 min read)

  • A Pattern Language, and a Beautiful Day in the Neighborhood

    — The beginning of a topological review of the 1977 urban design and architecture reference book A Pattern Language, and a journey to understand Earth's greatest graph: the Earth, itself. (12 min read)

  • The Advantages of Advantage: Intro to Probability Convolutions

    — What *is* the advantage of rolling with advantage? Exploring dice roll combinations and probability convolutions. (8 min read)

Show 8 more posts
  • Dice Roll Distribution Widget

    — Discover the probability distributions of different dice roll combinations (1 min read)

  • Some initial thoughts about time tracking

    — TODO. (1 min read)

  • Don't Double Down: Structured Streaming to Wrangle Data

    — Stop reprocessing your entire dataset every time new data arrives. A practical guide to Spark Structured Streaming with code examples and cost logic. (9 min read)

  • Littlefield Simulator: The Art of Laziness

    — Efficiency: spending six hours building a web scraper to avoid five minutes of daily work. Automating a business simulation because checking in is for chumps. (8 min read)

  • Entries to SOME 1

    — Stable matching, optimal stopping, and Bachelor analysis - visualized with some care. (1 min read)

  • Utah Office Consult

    — People problems are the hardest engineering problems. A consult on thinking out-of-the-box to explain morale problems in small companies.

  • College Admissions and the Stability of Marriage

    — My college presentation on the Gale-Shapley paper, recorded on an iPad, like a true professional. Non-market environments, matchmaking lattices, and gratitude for good professors. (1 min read)

  • What *are* Supply Chains, Anyway?

    — A first foray into network visualization: messy graphs, abject terror. Early data viz experiments searching for supply loops. Bad graphs; interesting questions. (2 min read)