Tag: python

  • Medieval Buzzfeed – Debugging Dodgy Datetimes in Pandas and Parquet

    Medieval Buzzfeed – Debugging Dodgy Datetimes in Pandas and Parquet

    I was recently attempting to cache the results of a long-running SQL query to a local parquet file using SQL via a workflow like this: This ended up yielding the following slightly cryptic error message: So obviously there is an issue with my published_at timestamp column. Googling didn’t help me very much, lots of people…

    Read More

  • Parsing Ingredient Strings with SpaCy PhraseMatcher

    Parsing Ingredient Strings with SpaCy PhraseMatcher

    As part of my work on Gastronaut, I’m building a form that allows users to create recipes and which will attempt to parse ingredients lists and find a suitable stock photo for each item the user adds to their recipe. As well as being cute and decorative, this step is important for later when we…

    Read More

  • Gitea Actions and PDM

    Gitea Actions and PDM

    Gitea actions is the new Github-compatible CI/automation pipeline feature that ships with Gitea and Forgejo. In theory it is interoperable with Github actions but there are still a few rough edges and for that reason, the feature is still disabled by default. I have been trying to get a django project that uses PDM for…

    Read More