Content tagged with "AI"

The recently released Phi 3.5 model series includes a mixture-of-experts model featuring 16 expert models of 3.8 billion parameters each. It activates two experts at a time, giving strong performance while only 6.6 billion parameters are active per token. I recently wanted to try running Phi-3.5-MoE on my MacBook but was blocked from using my usual method while support is still being added to llama.cpp and, in turn, ollama.
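
In the meantime, one possible workaround is to skip llama.cpp entirely and load the model with Hugging Face transformers. This is a minimal, untested sketch rather than a recipe: it assumes the official microsoft/Phi-3.5-MoE-instruct weights and enough memory to hold the full model, which is a big ask for a MoE of this size.

```python
# Sketch: running Phi-3.5-MoE via Hugging Face transformers
# instead of llama.cpp/ollama. Assumes the unquantised weights
# fit in memory.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-3.5-MoE-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",
    device_map="auto",        # places layers on the Mac's GPU (mps) where it can
    trust_remote_code=True,   # older transformers versions need the remote MoE code
)

prompt = "Explain mixture-of-experts models in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```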

Read more...

The title and framing of this talk are weird and it's bugging me

The question could be paraphrased as "Why would we need to efficiently store and retrieve data in a deterministic way when we have GenAI?" This is like asking "Why do we need cars when we have speedboats?" or "Why do we need butter knives now that we've invented the chainsaw?"

Read more...

Introduction

In the world of AI assistants, subscription services like ChatGPT Plus, Claude Pro and Google One AI have become increasingly popular amongst knowledge workers. However, these subscriptions may not be the most cost-effective or flexible option for everyone, and their not-insignificant fees encourage users to stick with the one model they've already paid for rather than trying out different options.
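
To make the cost argument concrete, here is a back-of-the-envelope comparison of pay-per-use API pricing against a flat monthly subscription. The per-token prices and usage figures below are illustrative assumptions, not current rates.

```python
# Rough sketch: flat subscription vs pay-per-use API for one month.
# All numbers are assumptions for illustration only.
subscription_per_month = 20.00  # e.g. a ChatGPT Plus style fee

# assumed API pricing (USD per 1M tokens) and monthly usage
price_in, price_out = 2.50, 10.00
tokens_in, tokens_out = 1_500_000, 500_000

api_cost = (tokens_in / 1e6) * price_in + (tokens_out / 1e6) * price_out
print(f"API cost: ${api_cost:.2f} vs subscription: ${subscription_per_month:.2f}")
# -> API cost: $8.75 vs subscription: $20.00 for this usage pattern
```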

Read more...

Knowing when to fine-tune LLMs and when to use an off-the-shelf model is a tricky question. New research can help shed light on when each approach makes more sense, and on how to eke more performance out of off-the-shelf models without fine-tuning them.

When Fine-Tuning Beats GPT-4

Modern LLMs show impressive performance on a range of tasks out of the box. Even small models like the recent Llama 3 8B perform remarkably well on unseen tasks. However, a recent preprint from the research team at Predibase shows that small models can match and even outperform GPT-4 when they are fine-tuned for specific tasks. Figure 5 of the paper lists the tasks that were evaluated and the relative performance difference versus GPT-4.
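
The fine-tuning in question is typically lightweight and adapter-based. The sketch below shows what task-specific LoRA fine-tuning looks like with the Hugging Face peft library; the base model and hyperparameters here are my own assumptions, not the paper's exact configuration.

```python
# Minimal LoRA setup sketch with Hugging Face peft: attach small
# trainable adapters to a frozen base model for one specific task.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
lora_config = LoraConfig(
    r=16,                                 # low-rank adapter dimension
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # adapters on the attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically <1% of the base model's weights
```

Because only the adapter weights are trained, a task-specific fine-tune like this is feasible on modest hardware, which is what makes the "small fine-tuned model beats GPT-4" result practically interesting.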

Read more...

Small Large Language Model might sound like a bit of an oxymoron. However, I think it perfectly describes the class of LLMs in the 1-10 billion parameter range, like Llama 3 and Phi-3. In the last few days, Meta and Microsoft have both released these open(ish) models, which can happily run on normal hardware. Both models perform surprisingly well for their size, competing with much larger models like GPT-3.5 and Mixtral. But how well do they generalise to new, unseen tasks? Can they do biology?

Read more...

Self-hosting Llama 3 as your own ChatGPT replacement service using a 10-year-old graphics card and open source components.

Last week Meta launched Llama 3, the latest in their open source LLM series. Llama 3 is particularly interesting because the 8 billion parameter model, which is small enough to run on a laptop, performs as well as models ten times its size. For many use cases, the responses it provides are comparable to GPT-4's.
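
As a taste of what self-hosting looks like from the client side, here is a minimal sketch that queries a locally hosted model through an OpenAI-compatible endpoint (ollama, one common open source server, exposes one at /v1). The port, model tag and server choice are assumptions, not necessarily the setup described in the post.

```python
# Sketch: talking to a self-hosted Llama 3 server through the
# OpenAI-compatible API that ollama exposes at /v1.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1",  # local server, no cloud involved
    api_key="ollama",                      # any placeholder string works locally
)
response = client.chat.completions.create(
    model="llama3:8b",
    messages=[{"role": "user", "content": "Summarise the Llama 3 launch."}],
)
print(response.choices[0].message.content)
```

A nice side effect of the OpenAI-compatible interface is that existing ChatGPT-style tooling can often be pointed at the local server just by changing the base URL.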

Read more...

Quite unusually for me, this essay started its life as a scribble in a notebook rather than something I typed into a markdown editor. A few weeks ago, Tiago Forte made a video suggesting that people can use GPT-4 to capture their handwritten notes digitally. I've been looking for a "smart" OCR that can process my terribly scratchy, spidery handwriting for many years, but none has quite cut the mustard. I thought: why not give it a go? To my absolute surprise, GPT did a reasonable job of parsing my scrawl and capturing the text. I was seriously impressed.
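
For anyone wanting to try the same trick, the workflow boils down to sending a photo of the page to a vision-capable model and asking for a transcription. Below is a minimal sketch using the OpenAI Python client; the model name and file path are placeholders rather than my exact setup.

```python
# Sketch: transcribing a photo of handwritten notes with a
# vision-capable GPT-4-class model. File path and model are placeholders.
import base64
from openai import OpenAI

client = OpenAI()
with open("notebook_page.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Transcribe the handwritten text in this image."},
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
        ],
    }],
)
print(response.choices[0].message.content)
```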

Read more...

I enjoyed this recent blog post by acclaimed technologist Terence Eden proposing a thought experiment about the ethics of open-sourcing a hypothetical LLM classifier trained on benefits sanction appeal letters.

Eden, himself a huge open source advocate, argues quite compellingly that such a model should be kept closed, to prevent the leakage of potentially confidential information from the training data, or the probing of the model by those looking to abuse it.

However, as some of the post's commenters point out, there is a bigger question at play here: where is it appropriate to be using this kind of tech at all?

One of the key issues in my mind is the treatment of the end user and the power dynamic at play. If you're making life-and-death decisions (tw: suicide) about people who have few resources with which to challenge those decisions, then you should have appropriate systems in place to make sure that those decisions are fair, explainable and rational. You must provide mechanisms that allow the party with everything to lose to understand what is happening and why. Finally, there must always be an adequate escape hatch: a mechanism for recourse when the computer gets it wrong.

Read more...