ChatGPT from Scratch: How to Train an Enterprise AI Assistant

Published
Author
Dr. Phil Winder
CEO

This is a video of a presentation investigating how large language models are built and how to use them, inspired by our large language model consulting work. First presented at GOTO Copenhagen in 2023, the video investigates the history, the technology, and the use of large language models. The demo at the end is borderline cringe, but it’s a fun and demonstrates how you would fine-tune a language model on your proprietary data.

Read more

Part 6: Useful ChatGPT Libraries: Productization and Hardening

Published
Author
Natalia Kuzminykh
Associate Data Science Content Editor

LangChain and LlamaIndex streamline ChatGPT and LLM application development. Boost your project’s efficiency with LangChain’s tools and modules, and LlamaIndex’s advanced document handling. Discover the future of language model orchestration today.

Read more

Part 5: How to Monitor a Large Language Model

Published
Author
Natalia Kuzminykh
Associate Data Science Content Editor

The article explores the complexities and nuances of monitoring and evaluating Large Language Models (LLMs) like ChatGPT in business applications. It emphasizes the insufficiency of traditional metrics, the importance of real-time tracking, human feedback, and specialized evaluation methods to ensure model safety, efficiency, and performance optimization.

Read more

Part 4: How to Deploy a ChatGPT Model or LLM

Published
Author
Natalia Kuzminykh
Associate Data Science Content Editor

In our previous articles, you learned how to build and train your personal ChatGPT model (large-language model). However, it’s important to understand that these models are merely components within a larger software landscape. After achieving adequate performance in a controlled environment, the next step is to integrate it into your broader system.

Read more

Part 3: Training Custom ChatGPT and Large Language Models

Published
Author
Natalia Kuzminykh
Associate Data Science Content Editor

In just a few years since the transformer architecture was first published, large language models (LLMs) have made huge strides in terms of performance, cost, and potential. In the previous two parts of this series, we’ve already explored the fundamental principles of such models and the intricacies of the development process.

Yet, before an AI product can reach its users, the developer must make yet more key decisions. Here, we’re going to dig into whether you should train your own ChatGPT model with custom data.

Read more

Part 2: An Overview of LLM Development & Training

Published
Author
Natalia Kuzminykh
Associate Data Science Content Editor

The premise of LLMs is beautifully exemplified by products like ChatGPT, that use these models to power conversational interfaces, offering a seamless and engaging chat user experience. In this second part of our series on ChatGPT, we provide an overview of what it’s like to develop against commercial LLM offerings and what it takes to begin developing your bespoke model.

Read more

Part 1: Introduction to Large Language Models and ChatGPT

Published
Author
Natalia Kuzminykh
Associate Data Science Content Editor

The launch of ChatGPT by OpenAI has attracted a lot of interest from people around the world, sparking curiosity not only about artificial intelligence in general but also about the underlying technology that powers this AI chatbot. While fancy terms like Large Language Models (LLMs) and Generative Models may sound complex, at their core, they’re really about helping computers understand and generate human-like text in almost any language (including programming languages).

Read more
}