Tag: python
-
Medieval Buzzfeed – Debugging Dodgy Datetimes in Pandas and Parquet
I was recently attempting to cache the results of a long-running SQL query to a local Parquet file using SQL, and the workflow ended up yielding a slightly cryptic error message. So obviously there is an issue with my published_at timestamp column. Googling didn’t help me very much; lots of people…
-
Parsing Ingredient Strings with SpaCy PhraseMatcher
As part of my work on Gastronaut, I’m building a form that lets users create recipes and that attempts to parse ingredient lists and find a suitable stock photo for each item the user adds to their recipe. As well as being cute and decorative, this step is important for later when we…
-
Gitea Actions and PDM
Gitea Actions is the new GitHub-compatible CI/automation pipeline feature that ships with Gitea and Forgejo. In theory it is interoperable with GitHub Actions, but there are still a few rough edges, and for that reason the feature is still disabled by default. I have been trying to get a Django project that uses PDM for…