I have recently been attending a course called Managing People for Growth, run by Scottish Enterprise. It was a hybrid online and in-person course…
Category: Problem Solving
The usual approach to prediction problems these days is to create a machine learning model. However, machine learning models can struggle to train on sparse…
Sometimes you need to prove something for a project mathematically. However, what if there might be an error in your reasoning that means that your…
We often find that we want to combine the results of several calculations into a single column. For instance, perhaps we have various data…
Azure Data Studio is a great application for working with databases on Azure. Here are a few hints to improve quality of life on Azure…
Today I want to take a look at a neat feature of PySpark called Pandas User Defined Functions. As the name suggests, a Pandas UDF…
I recently revisited a project I had been working on to familiarise myself with OpenCV and tidied it up for release. In essence, the program…
I recently needed to handle some arguments for a command-line Python program. I considered using the built-in sys.argv approach, but decided that it…
StackOverflow is a wonderful source of solutions to common yet tricky programming issues. However, there are certainly a few things to be aware of when…
PySpark schemas can be laborious to write. One approach to this issue was discussed previously. However, DDL definitions may not meet all needs. Particularly where…