Content tagged with "Machine Learning"

As part of my PhD I’m currently interested in topic models that can take into account the dialect of the writing. That is, how can we build a model that compares topics discussed in different dialectal styles, such as scientific papers versus newspaper articles? If you’re new to the concept of topic modelling, this article can give you a quick primer.

Vanilla LDA

A diagram of how the latent variables in an LDA model are connected

Vanilla topic models such as Blei’s LDA are great, but they start to fall down when the wording around one particular concept varies too much. In a scientific paper you might expect to find words like “gastroenteritis”, “stomach” and “virus”, whereas in newspapers discussing the same topic you might find “tummy”, “sick” and “bug”. A vanilla LDA implementation might struggle to understand that these concepts are linked unless the contextual information around the words is similar (e.g. both articles mention “uncooked meat” and “symptoms last 24 hours”).
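To make the limitation concrete, here is a minimal sketch (assuming scikit-learn is installed; the toy documents are my own illustrative examples, not from the original post) that fits a vanilla LDA model over the two registers and prints the top words per topic. Because the model has no idea that “stomach” and “tummy” are related, it can only group them when they share context words.

# Vanilla LDA on toy documents: a "scientific" and a "newspaper" description
# of the same illness, plus one unrelated document.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "gastroenteritis stomach virus uncooked meat symptoms last 24 hours",  # scientific register
    "tummy bug sick uncooked meat symptoms last 24 hours",                 # newspaper register
    "election vote parliament minister policy debate",                     # unrelated topic
]

vectoriser = CountVectorizer()
counts = vectoriser.fit_transform(docs)

lda = LatentDirichletAllocation(n_components=2, random_state=0)
lda.fit(counts)

# Print the five highest-weighted terms for each learned topic.
terms = vectoriser.get_feature_names_out()
for topic_idx, weights in enumerate(lda.components_):
    top_terms = [terms[i] for i in weights.argsort()[::-1][:5]]
    print(f"Topic {topic_idx}: {', '.join(top_terms)}")

With so little shared vocabulary, the two registers only end up in the same topic because of the overlapping context terms (“uncooked”, “meat”, “symptoms”); remove those and the link disappears.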

Read more...

Thomas Hobbes, perhaps most famous for his thinking on Western politics, was also thinking about how the human mind “computes things” nearly 400 years ago.

A recent opinion piece I read on Wired called for us to stop labelling our current, task-specific machine learning models as AI because they are not intelligent. I respectfully disagree.

Read more...

It is a testament to marketers around the world that the myth persists that their AI platform X, Y or Z can solve all your problems with no effort. Perhaps it is this, combined with developers and data scientists often being hidden out of sight and out of mind, that leads people to think this way.

Unfortunately, the truth of the matter is that ML and AI involve blood, sweat and tears – especially if you are building things from scratch rather than using APIs. Even if you are using third-party APIs, there are still challenges. The biggest players in the API space also have large pools of money – money that can be spent on marketing literature to convince you that their product will solve all your problems with no effort required. I think this is dishonest, and it is one of the reasons I have so many conversations like the one below.

Read more...

EDIT: Hello readers, these articles are now 4 years old and many of the Watson services and APIs have moved or been changed. The concepts discussed in these articles are still relevant but I am working on 2nd editions of them.

Last time we discussed some good practices for collecting data and then splitting it into training and test sets in order to create a ground truth for your machine learning system. We then talked about calculating accuracy using test and blind data sets.
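As a rough sketch of that split (assuming scikit-learn; the tiny example data and the 60/20/20 proportions are illustrative assumptions, not figures from the article), you would hold back a test set for tuning and a further blind set that is never inspected during development:

from sklearn.model_selection import train_test_split

# Tiny illustrative ground truth: question texts and their intent labels.
texts = ["reset my password", "forgot password", "update billing address",
         "change payment card", "cancel my account", "close account",
         "password not working", "card declined", "delete my profile",
         "how do I pay"]
labels = ["password", "password", "billing", "billing", "account", "account",
          "password", "billing", "account", "billing"]

# First split off 40% of the data, then halve that into test and blind sets,
# giving roughly 60% train / 20% test / 20% blind.
train_x, rest_x, train_y, rest_y = train_test_split(
    texts, labels, test_size=0.4, random_state=42)
test_x, blind_x, test_y, blind_y = train_test_split(
    rest_x, rest_y, test_size=0.5, random_state=42)

print(len(train_x), len(test_x), len(blind_x))  # 6 2 2

The blind set is only scored once you believe the system is finished, which keeps that accuracy figure honest.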

Read more...

EDIT: Hello readers, these articles are now 4 years old and many of the Watson services and APIs have moved or been changed. The concepts discussed in these articles are still relevant but I am working on 2nd editions of them.


This article has a slant towards the IBM Watson Developer Cloud Services, but the principles and rules of thumb expressed here are applicable to most cognitive/machine learning problems.

Introduction

Quality assurance is arguably one of the most important parts of the software development lifecycle. In order to release a product that is production ready, it must be put through, and pass, a number of tests – these include unit testing, boundary testing, stress testing and other practices that many software testers are no doubt familiar with. The ways in which traditional software is tested are relatively clear. In a normal system, developers write deterministic functions; that is, if you put an input parameter in then, unless there is a bug, you will always get the same output back. This principle makes it... well, not easy... but less difficult to write good test scripts and to know that there is a bug or regression in your system if these scripts get a different answer back than usual.
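As a minimal sketch of that distinction (the function name, test names and the 85% accuracy threshold below are illustrative assumptions, not anything from the original articles), a deterministic function can be pinned down with exact assertions, whereas a machine learning component can usually only be held to an aggregate score over a test set:

# Deterministic function: the same input always yields the same output,
# so an exact-match assertion is a reliable regression test.
def add_vat(net_price: float, rate: float = 0.2) -> float:
    return round(net_price * (1 + rate), 2)

def test_add_vat():
    assert add_vat(100.0) == 120.0

# Machine learning component: individual answers may legitimately change
# after retraining, so we assert an accuracy threshold over a test set
# rather than exact outputs for every example.
def test_classifier_accuracy(classify, test_set, threshold=0.85):
    correct = sum(1 for text, expected in test_set if classify(text) == expected)
    assert correct / len(test_set) >= threshold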

Read more...